Inference
Inference = using an already-trained model to get answers. No learning happens at this stage; the model just applies what it has already learned.
Analogy: A student studies for months (training). You ask them a question in an exam (inference).
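To make that split concrete, here's a toy sketch in plain Python (a made-up one-parameter model, not an LLM; the rule y = 2x is assumed just for illustration): training fits the parameter once, and inference merely reuses it.

# Toy model: learn the slope w in y = w * x from examples (training),
# then reuse w to answer new questions (inference)
def train(xs, ys):
    # Least-squares slope through the origin: w = sum(x*y) / sum(x*x)
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

def infer(w, x):
    # No learning here, just apply the stored parameter
    return w * x

w = train([1, 2, 3], [2, 4, 6])   # "studying for months"
print(infer(w, 10))               # "the exam question" -> 20.0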
Mini-task: Install Ollama or Hugging Face Transformers → run a local LLM → ask it “What’s 2+2?”. That’s inference.
Option 1: Using Ollama (Easiest, GUI + CLI)
4. Pull the model:
ollama pull llama2
(This downloads the LLaMA 2 weights, several GB, to your computer.)
5. Run the model:
ollama run llama2
6. Ask a question:
What’s 2+2?
7. See the answer. 🎉
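Ollama also runs a local REST API (on http://localhost:11434 by default), so you can get the same answer from Python. A minimal sketch using the requests library; the model name llama2 assumes you pulled it in the step above:

import requests

# Ask the locally running Ollama server for a completion.
# stream=False returns a single JSON object instead of a token stream.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama2", "prompt": "What is 2+2?", "stream": False},
)
print(resp.json()["response"])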
Option 2: Using Hugging Face Transformers (the Python way)
3. Install the libraries:
pip install transformers torch
4. Write a small Python script (llm_test.py):
from transformers import pipeline

# Load an instruction-tuned model (Falcon-7B needs roughly 16 GB of
# RAM/VRAM; swap in a smaller model like 'distilgpt2' if it doesn't fit)
generator = pipeline('text-generation', model='tiiuae/falcon-7b-instruct')

# Ask a question; max_new_tokens limits the answer itself, whereas
# max_length would count the prompt too and cut the answer short
output = generator("What is 2+2?", max_new_tokens=20)
print(output[0]['generated_text'])
5. Run your script:
python llm_test.py
6. See the answer. 🎉
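One gotcha: by default the pipeline echoes your prompt back in front of the answer. Text-generation pipelines accept a return_full_text flag to return only the completion; a small variation, reusing the generator from llm_test.py:

# Return only the newly generated text, not prompt + answer together
output = generator("What is 2+2?", max_new_tokens=20, return_full_text=False)
print(output[0]['generated_text'])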
That's it: two ways to run inference on a local LLM. This is awesome 👌