Models

Model tracker

Track leading models by provider, capabilities, release history, and practical use cases.

New Ranking Method

Traceable Capability Density ranks LLMs by how complete, current, and operationally legible they are.

Instead of chasing one benchmark, LLM-Docs uses a bespoke metric that rewards five things at once: concrete model identity, specification depth, workload reach, release freshness, and deployment confidence. The result is a conference-style grade system built for serious model tracking rather than hype cycles.

Identity Precision

/20

Rewards entries that clearly name a real model family or checkpoint instead of a generic article or use-case page.

Specification Depth

/25

Counts structured signals such as context window, modalities, pricing, tags, and use-case coverage.

Workload Reach

/20

Measures how much practical surface area the model exposes across modality support, context scale, and workload breadth.

Temporal Momentum

/15

Gives more credit to recent releases so the ranking stays tied to the current frontier.

Deployment Confidence

/20

Rewards active, clearly tracked entries and penalizes weak or auto-detected records with thin operational detail.

Conference-style grades

A*: exceptional across almost every tracked signal; clear model identity, high metadata depth, and strong operational clarity.

A: strong and credible, with only one weaker area preventing top-tier status.

B: useful model entry, but not yet elite on clarity, breadth, or deployment evidence.

C: low-confidence or thinly specified entry; present in tracking, but not yet strong enough for serious ranking trust.

How the metric works

The page computes a 100-point score for each tracked entry. High scores come from being a clearly named model with rich specs and current release signals. Weak scores usually mean the entry behaves more like a generic announcement, workflow guide, or topic page than a properly specified LLM profile. That keeps the leaderboard from confusing content noise with model quality.

This is intentionally not a raw intelligence benchmark. It is a structured ranking of model seriousness and traceability: the models you can reason about, compare, and operationalize with confidence.

Leaderboard

TCD leaderboard

Ranked from the current tracked set. Eligible entries: 12.

RankModelProviderGradeTCDWhy it lands here
1Reference Frontier ModelExample AI LabA72Broad workload reach, fresh release signal, high tracking confidence, but weak model identity hold this entry back.
2Reference Open ModelOpen Model CommunityB67Fresh release signal, high tracking confidence, but weak model identity hold this entry back.
3Gemma 4: Byte for byte, the most capable open modelsGoogle DeepMindC39Fresh release signal, but thin metadata hold this entry back.
4Gemini 3.1 Flash Live: Making audio AI more natural and reliableGoogle DeepMindC39Fresh release signal, but thin metadata hold this entry back.
5What 81,000 people want from AIAnthropicC39Fresh release signal, but thin metadata hold this entry back.
6Gemini 3.1 Flash-Lite: Built for intelligence at scaleGoogle DeepMindC36Fresh release signal, but thin metadata hold this entry back.
7Nano Banana 2: Combining Pro capabilities with lightning-fast speedGoogle DeepMindC36Fresh release signal, but thin metadata hold this entry back.
8Introducing Claude Sonnet 4.6AnthropicC36Fresh release signal, but thin metadata hold this entry back.
9Introducing Claude Opus 4.6AnthropicC36Fresh release signal, but thin metadata hold this entry back.
10Enterprises power agentic workflows in Cloudflare Agent Cloud with OpenAIOpenAIC34Fresh release signal, but thin metadata hold this entry back.
11ChatGPT for researchOpenAIC34Fresh release signal, but thin metadata hold this entry back.
12Writing with ChatGPTOpenAIC34Fresh release signal, but thin metadata hold this entry back.
Example AI LabactiveA · 72

Reference Frontier Model

Template-style entry for tracking a flagship commercial model.

Context window: 256K

Broad workload reach, fresh release signal, high tracking confidence, but weak model identity hold this entry back.

Open Model CommunityactiveB · 67

Reference Open Model

Template-style entry for tracking an open-weight or open-source model.

Context window: 128K

Fresh release signal, high tracking confidence, but weak model identity hold this entry back.

Google DeepMindauto-detectedC · 39

Gemma 4: Byte for byte, the most capable open models

Gemma 4: Our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Google DeepMindauto-detectedC · 39

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Anthropicauto-detectedC · 39

What 81,000 people want from AI

We invited Claude.ai users to share how they use AI, what they dream it could make possible, and what they fear it might do. Nearly 81,000 people participated—the largest and most multilingual qualitative study of its k…

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Google DeepMindauto-detectedC · 36

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Gemini 3.1 Flash-Lite is our fastest and most cost-efficient Gemini 3 series model yet.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Google DeepMindauto-detectedC · 36

Nano Banana 2: Combining Pro capabilities with lightning-fast speed

Our latest image generation model offers advanced world knowledge, production ready specs, subject consistency and more, all at Flash speed.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Anthropicauto-detectedC · 36

Introducing Claude Sonnet 4.6

Sonnet 4.6 delivers frontier performance across coding, agents, and professional work at scale.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Anthropicauto-detectedC · 36

Introducing Claude Opus 4.6

We’re upgrading our smartest model. Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is an industry-leading model, often by wide margin.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 34

Enterprises power agentic workflows in Cloudflare Agent Cloud with OpenAI

Cloudflare brings OpenAI’s GPT-5.4 and Codex to Agent Cloud, enabling enterprises to build, deploy, and scale AI agents for real-world tasks with speed and security.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 34

ChatGPT for research

Learn how to use ChatGPT for research to gather sources, analyze information, and create structured, citation-backed insights.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 34

Writing with ChatGPT

Learn how to use ChatGPT for writing to draft, revise, and refine content with clear structure, tone, and intent.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 34

CyberAgent moves faster with ChatGPT Enterprise and Codex

CyberAgent uses ChatGPT Enterprise and Codex to securely scale AI adoption, improve quality, and accelerate decisions across advertising, media, and gaming.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Meta AIauto-detectedC · 34

Introducing Muse Spark: MSL’s First Model, Purpose-Built to Prioritize People

Muse Spark is Meta's most powerful model yet — it currently powers the Meta AI app and website, and will be rolling out to WhatsApp, Instagram, Facebook, Messenger, and AI glasses in the coming weeks. The post Introduci…

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 34

Codex now offers more flexible pricing for teams

Codex now includes pay-as-you-go pricing for ChatGPT Business and Enterprise, providing teams a more flexible option to start and scale adoption.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 34

Gradient Labs gives every bank customer an AI account manager

Gradient Labs uses GPT-4.1 and GPT-5.4 mini and nano to power AI agents that automate banking support workflows with low latency and high reliability.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

Anthropicauto-detectedC · 31

Claude is a space to think

We’ve made a choice: Claude will remain ad-free. We explain why advertising incentives are incompatible with a genuinely helpful AI assistant, and how we plan to expand access without compromising user trust.

Context window: Not set

Fresh release signal, but thin metadata hold this entry back.

OpenAIauto-detectedC · 24

AI fundamentals

Learn what AI is, how it works, and how tools like ChatGPT use large language models. A clear, beginner-friendly guide to understanding artificial intelligence.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Analyzing data with ChatGPT

Learn how to analyze data with ChatGPT by exploring datasets, generating insights, creating visualizations, and turning findings into actionable decisions.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Applications of AI at OpenAI

Explore how OpenAI products like ChatGPT, Codex, and APIs bring AI into real-world use for work, development, and everyday tasks.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Brainstorming with ChatGPT

Learn how to use ChatGPT to brainstorm ideas, organize thinking, and turn rough concepts into structured, actionable plans.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

ChatGPT for customer success teams

Learn how customer success teams use ChatGPT to manage accounts, improve communication, reduce churn, and drive adoption and renewals.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

ChatGPT for finance teams

Learn how finance teams use ChatGPT to streamline reporting, analyze data, improve forecasts, and communicate insights more clearly.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

ChatGPT for marketing teams

Learn how marketing teams use ChatGPT to plan campaigns, generate content, analyze performance, and move from ideas to execution faster.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

ChatGPT for operations teams

Learn how operations teams use ChatGPT to streamline workflows, improve coordination, standardize processes, and drive faster execution.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

ChatGPT for sales teams

Learn how sales teams use ChatGPT to research accounts, personalize outreach, manage deals, and improve pipeline and conversion.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Creating images with ChatGPT

Learn how to create and refine images with ChatGPT using clear prompts, iterate on designs, and generate high-quality visuals in minutes.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Financial services

Explore AI resources for financial services, including prompt packs, GPTs, guides, and tools to help institutions deploy and scale AI securely.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Getting started with ChatGPT

Learn how to use ChatGPT, start your first conversation, and discover simple ways to write, brainstorm, and solve problems with AI.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Healthcare

Explore how clinicians use ChatGPT to support diagnosis, documentation, and patient care with secure, HIPAA-compliant AI tools.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Prompting fundamentals

Learn prompting fundamentals and how to write clear, effective prompts to get better, more useful responses from ChatGPT.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Research with ChatGPT

Learn how to research with ChatGPT using search and deep research to find up-to-date information, analyze sources, and generate structured insights.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Responsible and safe use of AI

Learn how to use AI responsibly with best practices for safety, accuracy, and transparency when using tools like ChatGPT.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Using custom GPTs

Learn how to build and use custom GPTs to automate workflows, maintain consistent outputs, and create purpose-built AI assistants.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Using projects in ChatGPT

Learn how to use orojects in ChatGPT to organize chats, files, and instructions, manage ongoing work, and collaborate more effectively.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Using skills

Learn how to create and use ChatGPT skills to build reusable workflows, automate recurring tasks, and ensure consistent, high-quality outputs.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

OpenAIauto-detectedC · 24

Working with files in ChatGPT

Learn how to upload and work with files in ChatGPT to analyze data, summarize documents, and generate content from PDFs, spreadsheets, and more.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.

Hugging Faceauto-detectedC · 24

Multimodal Embedding & Reranker Models with Sentence Transformers

Automatic model update from Hugging Face.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata hold this entry back.

OpenAIauto-detectedC · 24

The next phase of enterprise AI

OpenAI outlines the next phase of enterprise AI, as adoption accelerates across industries with Frontier, ChatGPT Enterprise, Codex, and company-wide AI agents.

Context window: Not set

Fresh release signal, but weak model identity and thin metadata and article-like title hold this entry back.