TUI and CLI for browsing AI models, benchmarks, coding agents, and provider statuses.
- ~4,000+ models across 85+ providers from models.dev — filter by capability, price, context, and provider category
- ~1,000 benchmark entries across 4 data sources — Artificial Analysis, Epoch AI, Arena, and LLM Stats — compare models head-to-head with scatter plots and radar charts
- 12+ coding agents tracked with version detection, changelogs, and GitHub integration
- 22 provider statuses monitored live across 7 status page platforms
- Richer model details — the Models tab detail panel now surfaces the model's description, a Structured Output capability (Yes/No/—), and reasoning controls (budget range, effort levels, toggle) alongside the existing capabilities.
- Deeper pricing — thinking-token, audio in/out, and tiered ("over N tokens") prices show as conditional rows when a model carries them.
- Field glossary: press `i` in the Models tab for a scrollable breakdown of what each detail field means (mirrors the Benchmarks glossary).
- CLI parity: `models show` and `--json` include the new description, structured-output, reasoning-controls, and pricing fields.
- Add agents in-app: press `A` to add a custom coding agent by name + `owner/repo`; it's tracked and fetched immediately, no config editing.
- In-app self-updates: update a tracked agent (`u`), all agents with updates available (`U`), or cancel an in-flight update (`x`), all from the Agents tab. The update command is derived from how each agent was actually installed (bun/npm/brew/cargo/AUR/apt/dnf, etc.), with an interactive suspend-and-run mode (`i`) for updaters that need a prompt or sudo.
- Mouse support — click a row to select it, click a panel to focus it, click a header tab to switch tabs, and scroll with the wheel. Popups (sort, columns, glossary, trackers, help) scroll and accept row clicks too.
- Multi-source benchmarks — 4 switchable data sources (Artificial Analysis, Epoch AI, Arena, LLM Stats) with state-preserving switching, in-app refresh, and a benchmark glossary
- Column picker — choose which metric columns are visible, persisted per source in config.toml
- Comparator column — cycle field average / peer average / rank alongside every score in the detail panel
- Refresh keys everywhere — refresh benchmarks, models.dev data, and agent GitHub data without restarting
- `default_tab` config: launch the TUI straight into your tab of choice
- Prebuilt Nix binaries: builds are pushed to a public Cachix cache; `nix run` downloads instead of compiling

`brew install models`

`cargo install modelsdev`

`nix run github:reyamira/models`

`nix profile install github:reyamira/models`

The flake is available directly from GitHub tags and branches; it is not currently published to FlakeHub or nixpkgs.

`scoop install extras/models`

`paru -S models-bin`

Pre-built binaries, .deb, and .rpm packages are available on GitHub Releases. See the Installation wiki page for all methods, shell completions, and command aliases.
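
If you are unsure which of the install commands above applies to your machine, a small script can check which package manager is on your `PATH` and print the matching command. This is only a sketch: the function names `install_cmd_for` and `suggest_install` are made up for illustration, and the lookup order (Cargo last, since it compiles from source) is an arbitrary choice, not something the project prescribes.

```shell
#!/bin/sh
# Map a package manager name to the install command listed in this README.
install_cmd_for() {
  case "$1" in
    brew)  echo "brew install models" ;;
    cargo) echo "cargo install modelsdev" ;;
    nix)   echo "nix profile install github:reyamira/models" ;;
    scoop) echo "scoop install extras/models" ;;
    paru)  echo "paru -S models-bin" ;;
    *)     return 1 ;;  # not one of the managers listed above
  esac
}

# Print the command for the first listed package manager found on PATH.
suggest_install() {
  for pm in brew paru nix scoop cargo; do
    if command -v "$pm" >/dev/null 2>&1; then
      install_cmd_for "$pm"
      return 0
    fi
  done
  echo "No listed package manager found; see GitHub Releases for pre-built binaries." >&2
  return 1
}

suggest_install || true
```

The script only prints a suggestion and never installs anything, so it is safe to run before deciding on a method.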
Navigate with arrow keys, switch tabs with [/], search with /, and press ? for context-aware help. Mouse works too — click a row to select it, click a panel to focus it, and scroll with the wheel. See Getting Started for a full walkthrough.
Three-column layout with providers, model list, and rich detail panel. RTFO capability indicators, 6 filter keys, sort by name/date/cost/context, cross-provider search, and copy-to-clipboard.
Models wiki page • CLI: models list, models show, models search, models providers
Curated catalog of 12+ agents with automatic version detection, GitHub release tracking, styled changelogs with search and match navigation, and live service health from provider status pages. Add your own agents without leaving the TUI (A), and update installed ones in-app (u one / U all) — the update command is derived from how each tool was actually installed (npm, bun, Homebrew, uv, pipx, or a system package manager like pacman/AUR, apt, dnf).
Agents wiki page • CLI: agents status, agents <tool>, agents latest, agents list-sources
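
The Agents section above says the update command is derived from how each tool was installed. The README does not describe how that detection works, so the following is only a rough sketch of the general idea, not the project's logic: guess the install method from the resolved path of a binary, then map it to an update command. The function names, the path patterns, and the package name `example-agent` are all assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical helper: guess the install method from a binary's resolved path.
# The path patterns are common defaults, not an exhaustive or authoritative list.
detect_install_method() {
  case "$1" in
    */.bun/*)                              echo bun ;;
    */.cargo/bin/*)                        echo cargo ;;
    */Cellar/*|*/homebrew/*|*/linuxbrew/*) echo brew ;;
    */node_modules/*|*/.npm-global/*|*/.nvm/*) echo npm ;;
    *)                                     echo unknown ;;
  esac
}

# Hypothetical helper: map an install method and package name to an update command.
update_command() {
  method=$1
  pkg=$2
  case "$method" in
    bun)   echo "bun add -g $pkg@latest" ;;
    npm)   echo "npm install -g $pkg@latest" ;;
    brew)  echo "brew upgrade $pkg" ;;
    cargo) echo "cargo install $pkg" ;;
    *)     return 1 ;;  # unknown method: no command to suggest
  esac
}

# Example: a binary living under ~/.cargo/bin is treated as a Cargo install.
path=/home/me/.cargo/bin/example-agent
method=$(detect_install_method "$path")
update_command "$method" example-agent   # prints: cargo install example-agent
```

System package managers such as apt, dnf, or pacman would need a different check (for example, asking the package database which package owns the file), which this sketch leaves out.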
~1,000 entries across 4 switchable data sources (Artificial Analysis, Epoch AI, Arena, LLM Stats) with quality indexes, Elo ratings, speed, and pricing. Compare mode with head-to-head tables, scatter plots, and radar charts. Choose visible metric columns, cycle a field-average/peer-average/rank comparator in the detail panel, and refresh any source in-app. Filter by creator, region, type, reasoning, and open/closed source.
Benchmarks wiki page • CLI: models benchmarks list, models benchmarks show
Live health monitoring for 22 AI providers across 7 status page platforms. Overall dashboard with health gauge, incident and maintenance cards. Provider detail with grouped services, incidents, and scheduled maintenance.
Status wiki page • CLI: models status list, models status show, models status status
Full documentation lives in the wiki:
| Page | Description |
|---|---|
| Installation | All install methods, shell completions, command aliases |
| Getting Started | First launch, navigation, basic usage |
| Models | Models tab and CLI commands |
| Agents | Agents tab and CLI commands |
| Benchmarks | Benchmarks tab and CLI commands |
| Status | Status tab and CLI commands |
| Configuration | Config file, custom agents, tracked providers |
| Data Sources | Where the data comes from |
| Architecture | Internal design for contributors |
| Contributing | How to contribute |
- Models: models.dev by SST
- Benchmarks: Artificial Analysis, Epoch AI (CC-BY), Arena, LLM Stats
- Agents: Curated catalog in `data/agents.json`; contributions welcome!
- Status: Official provider status pages (Statuspage, BetterStack, Instatus, incident.io, and more)
Contributions are welcome! Please read the Contributing Guide before submitting a PR.
This project follows the Contributor Covenant Code of Conduct.
MIT



