Saturday, March 21, 2026

AI | Foundation Model and LLM

Foundation Model (FM) = the broader term. An FM can work with:
  • Text (LLM)
  • Image
  • Audio
  • Video

LLM: a deep learning model trained on huge amounts of text data to understand and generate human-like language.

A foundation model that works with textual content is considered an LLM.

Breaking down the term
  • Large → Trained on massive datasets (books, websites, code, etc.)
  • Language → Works with text (and sometimes speech)
  • Model → Mathematical system that predicts and generates words

How an LLM works

An LLM works by:

  • Reading input text (your prompt)
  • Predicting the most likely next word (more precisely, the next token)
  • Appending that word to the text and repeating the prediction to form sentences

Example:

Input: "The sky is"
LLM predicts → "blue"
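The predict-and-repeat loop above can be sketched with a toy model. This is only an illustration under stated assumptions: the tiny corpus and the names `train_bigrams`, `predict_next`, and `generate` are made up for this example, and counting word pairs stands in for the neural network a real LLM uses.

```python
from collections import Counter, defaultdict

# Tiny made-up corpus (an assumption for illustration only).
CORPUS = "the sky is blue . the sky is clear . the sea is blue ."

def train_bigrams(text):
    """Count, for each word, which words follow it in the corpus."""
    counts = defaultdict(Counter)
    words = text.split()
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1
    return counts

def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]

def generate(counts, prompt, max_new_words=3):
    """Repeatedly predict the next word and append it to the text."""
    words = prompt.lower().split()
    for _ in range(max_new_words):
        nxt = predict_next(counts, words[-1])
        if nxt is None or nxt == ".":  # stop at an unknown word or sentence end
            break
        words.append(nxt)
    return " ".join(words)

model = train_bigrams(CORPUS)
print(generate(model, "The sky is"))
```

In this toy corpus "blue" follows "is" more often than "clear", so it is chosen as the next word. A real LLM makes the same kind of choice, but it scores every token in its vocabulary using the whole preceding text rather than only the last word.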

Behind the scenes, it uses the Transformer architecture, which helps it capture context and relationships between words.
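The core of how a Transformer relates words to each other is the attention operation. The following is a minimal sketch of scaled dot-product attention in plain Python. The 2-dimensional word vectors are invented for illustration; a real model learns its vectors and also applies learned projections, multiple attention heads, and many stacked layers, none of which are shown here.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    dim = len(query)
    # Similarity of the query to each key, scaled by sqrt(dimension).
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
        for key in keys
    ]
    weights = softmax(scores)
    # Output is the weighted average of the value vectors.
    output = [
        sum(w * value[i] for w, value in zip(weights, values))
        for i in range(len(values[0]))
    ]
    return weights, output

# Hypothetical vectors for the words "the", "sky", "is".
words = ["the", "sky", "is"]
vectors = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
query = [0.2, 1.0]  # hypothetical query vector for the current position

weights, output = attention(query, vectors, vectors)
for word, weight in zip(words, weights):
    print(f"{word}: {weight:.2f}")
```

With these made-up numbers the query is most similar to the vector for "sky", so "sky" receives the largest weight. This is the sense in which attention lets each position draw on the most relevant earlier words.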


Key Features of LLMs

1. Text Generation ✍️
  • Write essays, emails, code
2. Understanding Language 📖
  • Answer questions
  • Summarize content
3. Context Awareness 🧠
  • Takes previous messages in a conversation into account (as long as they fit in its context window)
4. Multi-task Capability 🔄
  • Translation
  • Coding
  • Chatting
  • Reasoning
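For the Context Awareness feature above, one common pattern is that the chat application sends the earlier messages along with each new one. The sketch below assumes that pattern; `build_prompt`, the `role: text` layout, and the `max_messages` limit are hypothetical names chosen for illustration, not any vendor's API.

```python
def build_prompt(history, new_message, max_messages=4):
    """Combine recent conversation turns with the new user message.

    history: list of (role, text) tuples, oldest first.
    max_messages: hypothetical stand-in for a context-window limit.
    """
    turns = history + [("user", new_message)]
    recent = turns[-max_messages:]  # older turns fall out of the window
    return "\n".join(f"{role}: {text}" for role, text in recent)

history = [
    ("user", "My name is Asha."),
    ("assistant", "Nice to meet you, Asha!"),
]
prompt = build_prompt(history, "What is my name?")
print(prompt)
```

Because the earlier turn is part of the text the model reads, it can answer the follow-up question. Once a conversation grows past the limit, the oldest turns are dropped and the model can no longer use them.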

Examples of LLMs

Some popular LLMs include:

  • GPT models
  • Claude models
  • Llama models
  • Amazon Titan

Where LLMs are used
  • Chatbots (like ChatGPT)
  • Virtual assistants
  • Content writing tools
  • Customer support automation
  • Code generation
  • Search engines


Model Family          Is it an LLM?   Notes
Llama (Meta)          Yes             Pure LLM
GPT (OpenAI)          Yes             Pure LLM
Claude (Anthropic)    Yes             Pure LLM
DeepSeek              Yes             Pure LLM
Titan / Nova (AWS)    Partly          Mix of LLM + other models
