Hugging Face agents

I’ve been working through the Hugging Face agents course, and I’m enjoying it quite a bit. Highly recommended! First, it’s rounding out my knowledge of LLMs, transformers, and AI in general. Second, it paints a very clear picture of what agentic AI is all about—while staying away from the hype. I’ll try to summarize here, but I really recommend checking out the full course.

This is not a formal definition, but I think the crucial feature of agents is the ability to use tools to interact with the environment. Instead of relying solely on the knowledge of the model itself, agents can search the web, access web pages, and use Unix commands like find, ls, and grep to help answer your questions. Another key characteristic is that this all happens in a loop, which lets the agent course-correct when things don’t go as planned and keep working toward its goal.

This is known as the ReAct (Reason + Act) loop, and it looks something like this:

  stateDiagram-v2 
  direction LR
    [*] --> Prompt
    Prompt --> Think
    Think --> Act
    Act --> Observe
    Observe --> Think
    Observe --> [*]

It all starts with a prompt that gives the agent a task to complete. The agent “thinks” by using an LLM to decide which tools it needs to complete the task. It acts by executing those tools and collecting the observations. It then decides whether the task is complete; if not, it goes back for another iteration and repeats until it solves the problem. Of course, this is the happy path, and many things can go wrong here. But conceptually, that’s what the agent does.
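The loop described above can be sketched in a few lines of plain Python. This is only a toy under stated assumptions: think is a scripted stand-in for the LLM call, and the lookup tool, its canned coordinates, and run_agent are hypothetical names made up for illustration, not part of any framework.

```python
from typing import Callable

# Hypothetical tool registry; a real agent would wrap a search API, a shell, etc.
TOOLS: dict[str, Callable[[str], str]] = {
    "lookup": lambda city: {"Toronto": "43.70, -79.42"}.get(city, "unknown"),
}


def think(task: str, observations: list[str]) -> tuple[str, str]:
    """Scripted stand-in for the LLM: choose the next action from what is known."""
    if not observations:
        return ("lookup", "Toronto")     # nothing observed yet, so use a tool
    return ("finish", observations[-1])  # enough information, so stop


def run_agent(task: str, max_steps: int = 5) -> str:
    observations: list[str] = []         # the agent's memory
    for _ in range(max_steps):           # bounded, so a confused agent cannot loop forever
        action, argument = think(task, observations)  # Think
        if action == "finish":
            return argument
        result = TOOLS[action](argument)              # Act
        observations.append(result)                   # Observe
    raise RuntimeError("step limit reached without an answer")


print(run_agent("Where is Toronto?"))  # prints: 43.70, -79.42
```

The step limit is the important design detail: without it, an agent that never decides it is done would keep calling tools indefinitely.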

First agent using smolagents

If that seems too abstract, let me walk you through a real example using a simple agent that uses a couple of tools to calculate the distance between two cities. Allow me to dive into the code.

This example is based on the smolagents framework from Hugging Face. It focuses on the CodeAgent class, a special kind of agent that writes its actions as Python code and runs that code to complete a task. I’ll be using uv to manage the dependencies for this tutorial, so if you don’t have it already, install it first by following its installation steps.

Create a new project and add the required dependencies:

uv init ai-agent
cd ai-agent
uv add "smolagents[openai,toolkit]"

Update your main.py file:

import math
from smolagents import CodeAgent, WebSearchTool, OpenAIServerModel, tool


@tool
def distance(point1: tuple[float, float], point2: tuple[float, float]) -> float:
    """
    Haversine formula to calculate distance between two points on Earth

    Args:
        point1: a tuple containing the latitude and longitude of point 1 in decimal format
        point2: a tuple containing the latitude and longitude of point 2 in decimal format
    Output:
        Returns the distance in km between the 2 points.
    """

    # Convert decimal degrees to radians
    lat1, lon1, lat2, lon2 = map(math.radians, [*point1, *point2])

    # Haversine formula
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    a = (
        math.sin(dlat / 2) ** 2
        + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    )
    c = 2 * math.asin(math.sqrt(a))

    # Radius of earth in kilometers
    r = 6371

    return c * r


model = OpenAIServerModel(
    api_base="http://localhost:11434/v1",
    model_id="qwen3-coder:30b",
    api_key="ollama",
)

agent = CodeAgent(tools=[WebSearchTool(), distance], model=model)

res = agent.run("""
    I want to calculate the distance between Toronto and New York City.
    - You should find the geo coordinates for Toronto
    - Find the geo coordinates for New York City
    - Calculate the distance between those 2 coordinates
""")

Let me break down the code:

Imports: bring in the smolagents classes and functions that are used to implement the agent.
Tool definition: defines a tool that calculates the distance between two coordinates using the Haversine formula.
Model API: I’m using Ollama to run this test on an RTX 4060 Ti with 16 GB of VRAM on a Debian 13 system. The qwen3-coder:30b model was the best performing option that would run on my hardware, but your mileage may vary. You can also use the InferenceClient and the Hugging Face API to run these tests. I had to find an alternative solution because I ran out of credits.
Instantiate the agent: here the agent is created with all the necessary tools and also a reference to the model.
Run the prompt: agent.run hands the task to the agent, and you can follow its progress in the logs.
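Before involving the model at all, the math inside the tool can be checked on its own. Here is a minimal sketch, assuming the same Haversine formula as the distance tool but without the @tool decorator, so it runs without smolagents installed. The name haversine_km and the rounded coordinates are illustrative choices, not values from the course.

```python
import math


def haversine_km(point1: tuple[float, float], point2: tuple[float, float]) -> float:
    """Great-circle distance in km between two (latitude, longitude) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, [*point1, *point2])
    a = (
        math.sin((lat2 - lat1) / 2) ** 2
        + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2
    )
    return 2 * 6371 * math.asin(math.sqrt(a))  # 6371 km is the mean Earth radius


# Rounded coordinates, used here only for the sanity check
toronto = (43.70, -79.42)
nyc = (40.71, -74.01)

print(haversine_km(toronto, toronto))  # a point is 0 km away from itself
print(haversine_km(toronto, nyc))      # should land in the mid-500s of km
```

A zero distance for identical points and the same result in both directions are cheap checks that catch most mistakes in the formula.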

Execute the file by running:

uv run main.py

Example output:

╭─────────────────────────────────── New run ────────────────────────────────────╮
│                                                                                │
│ I want to calculate the distance between Toronto and New York City.            │
│     - You should find the geo coordinates for Toronto                          │
│     - Find the geo coordinates for New York City                               │
│     - Calculate the distance between those 2 coordinates                       │
│                                                                                │
╰─ OpenAIServerModel - qwen3-coder:30b ──────────────────────────────────────────╯

The agent loads the model and executes the prompt. It tries to break the task into a series of steps that it can more easily reason about.

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Step 1 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
 ─ Executing parsed code: ───────────────────────────────────────────────────────
  toronto_coords = web_search(query="Toronto latitude longitude")
  print("Toronto coordinates:", toronto_coords)
 ────────────────────────────────────────────────────────────────────────────────
Execution logs:
Toronto coordinates: ## Search Results

[Latitude and longitude of Toronto, Canada - GPS
Coordinates](https://latlong.info/canada/ontario/toronto)
What is the latitude and longitude code of Toronto ? The latitude of Toronto ,
Canada is 43.70011000, and the longitude is -79.41630000. Toronto is located at
Canada country in the states place category with the gps coordinates of 43° 42'
0.396'' N and -79° 24' 58.68 E. Geographic coordinates are a way of specifying the
location of a place on Earth, using a pair of numbers to represent a ...

(more results) ...

Out: None
[Step 1: Duration 4.40 seconds| Input tokens: 2,110 | Output tokens: 77]
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Step 2 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
 ─ Executing parsed code: ───────────────────────────────────────────────────────
  nyc_coords = web_search(query="New York City latitude longitude")
  print("New York City coordinates:", nyc_coords)
 ────────────────────────────────────────────────────────────────────────────────
Execution logs:
New York City coordinates: ## Search Results

[New York City Latitude and Longitude Map - Maps of
World](https://www.mapsofworld.com/lat_long/new-york-city.html)
 Latitude and longitude of New  York  City is 40.71278 N and -74.00594 E. Map
showing the geographic coordinates of New  York  City , in United States.

(more results) ...

Out: None
[Step 2: Duration 4.79 seconds| Input tokens: 5,342 | Output tokens: 127]

In steps 1 and 2, the agent uses the web_search tool to search the web for the geographic coordinates of the two cities of interest. That information gets added to the agent’s memory and is used in the following steps.

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Step 3 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
 ─ Executing parsed code: ───────────────────────────────────────────────────────
  toronto = (43.70011, -79.4163)
  nyc = (40.712775, -74.005973)
  distance = distance(toronto, nyc)
  print("Distance between Toronto and New York City:", distance, "km")
 ────────────────────────────────────────────────────────────────────────────────
Code execution failed at line 'distance = distance(toronto, nyc)' due to:
InterpreterError: Cannot assign to name 'distance': doing this would erase the
existing tool!
[Step 3: Duration 7.77 seconds| Input tokens: 9,382 | Output tokens: 231]

Armed with the coordinates from steps 1 and 2, the agent now tries to use the provided distance tool to perform the calculation. Pay close attention to the error at the end of the step, though. The agent named its result variable distance, the same name as the tool, and the interpreter rejected the assignment with an InterpreterError. This gets corrected in the next step. That speaks to the ability of the ReAct loop to self-correct and find the right answer, as long as you provide the right feedback mechanisms. In this case, the feedback was the interpreter’s error message.
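To make that feedback mechanism concrete, here is one way such a guard could work. This is a hedged sketch built on Python’s standard ast module, not the way smolagents actually implements its check (I have not read that code). ToolOverwriteError, check_no_tool_overwrite, and feedback_for are hypothetical names.

```python
import ast


class ToolOverwriteError(Exception):
    """Hypothetical error raised when generated code rebinds a tool's name."""


def check_no_tool_overwrite(code: str, tool_names: set[str]) -> None:
    """Reject code that assigns to a variable sharing a name with a tool."""
    for node in ast.walk(ast.parse(code)):
        # ast.Name nodes in a Store context are assignment targets
        if isinstance(node, ast.Name) and isinstance(node.ctx, ast.Store):
            if node.id in tool_names:
                raise ToolOverwriteError(
                    f"Cannot assign to name '{node.id}': it would erase the tool"
                )


def feedback_for(code: str, tool_names: set[str]) -> str:
    """Return the observation the agent would see for this snippet."""
    try:
        check_no_tool_overwrite(code, tool_names)
    except ToolOverwriteError as error:
        return f"Code execution failed: {error}"
    return "ok"


tools = {"distance", "web_search"}
print(feedback_for("distance = distance(toronto, nyc)", tools))         # failure message
print(feedback_for("distance_result = distance(toronto, nyc)", tools))  # ok
```

The point is that the error comes back to the agent as text, which is what gives the model something to reason about in the next iteration.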

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Step 4 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
 ─ Executing parsed code: ───────────────────────────────────────────────────────
  toronto = (43.70011, -79.4163)
  nyc = (40.712775, -74.005973)
  distance_result = distance(toronto, nyc)
  print("Distance between Toronto and New York City:", distance_result, "km")
 ────────────────────────────────────────────────────────────────────────────────
Execution logs:
Distance between Toronto and New York City: 555.6065996863686 km

Out: None
[Step 4: Duration 9.93 seconds| Input tokens: 12,736 | Output tokens: 348]

The corrected code now runs and produces the distance. All that is left is for the agent to return it as the final answer.

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Step 5 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
 ─ Executing parsed code: ───────────────────────────────────────────────────────
  final_answer(555.61)
 ────────────────────────────────────────────────────────────────────────────────
Final answer: 555.61
[Step 5: Duration 2.67 seconds| Input tokens: 16,369 | Output tokens: 396]

Conclusion

I was very skeptical of the current AI hype cycle, so I was mostly following it from a safe distance, for lack of a better phrase. But since I started experimenting with Claude Code and similar tools, something clicked. The multi-step approach combined with tool usage and verification is really interesting. As a software engineer, this resonates with me because I can now build tools and hand them to the LLM so it can complete tasks and verify its own outputs.

I haven’t completed the course yet, and I am curious about other tools like LlamaIndex and how RAG plays a role in the agent’s workflow. The course also clearly favors code agents over JSON-based tool calling, but it does very little to compare the two approaches. I can only imagine the kinds of attacks a code-generating agent would be susceptible to, so I’m not convinced that it is a clear winner. Also, the fact that most other players, like Anthropic, prefer tool-calling agents gives me pause. I might do a follow-up article as I advance through the course, but that is it for now.
