| OpenRouter |
Unified API gateway for many frontier and open models with routing and pricing controls. |
Freemium |
Docs |
AI, LLM, router |
Indie favorite |
| Hugging Face Inference API |
Hosted inference for thousands of models across NLP, vision, and audio. |
Freemium |
Docs |
AI, Open Source, inference |
Largest community |
| Replicate |
Run open-source models in the cloud with simple APIs and deployments. |
Paid |
Docs |
models, images, video |
Creator ecosystem |
| Ollama |
Run and manage LLMs locally with one-line model pulls. |
Free |
Docs |
local, privacy, LLM |
Local dev |
| Mistral API |
High-performance open-weight and commercial LLMs with competitive pricing. |
Freemium |
Docs |
LLM, text-generation |
EU favorite |
| Gemini API |
Google's multimodal models for text, image, and tooling. |
Freemium |
Docs |
multimodal, text, vision |
Google scale |
| Together AI |
Inference and fine-tuning platform for open models with competitive throughput. |
Paid |
Docs |
LLM, inference, finetune |
Production ready |
| Groq API |
Ultra-low-latency LPU inference for Llama and Mixtral models. |
Freemium |
Docs |
latency, LLM |
Blazing fast |
| LM Studio (local) |
Desktop app to run, compare, and serve local LLMs. |
Free |
Docs |
local, desktop |
User friendly |
| GPT4All (local) |
Open-source local LLM ecosystem with desktop apps and SDKs. |
Open Source |
Docs |
local, open source |
Privacy focused |
| KoboldCpp |
Fast CPU/GPU backend for running LLMs locally with UI integrations. |
Open Source |
Docs |
local, cpp, open source |
Active OSS |
| Text Generation Web UI |
Popular web UI for running and chatting with local text-generation models. |
Open Source |
Docs |
local, UI, open source |
Large community |
| Perplexity API |
Answer engine API combining retrieval with strong model responses. |
Paid |
Docs |
RAG, search |
Emerging API |
| Google Colab |
Hosted Jupyter notebooks with free GPUs for ML. |
Free |
Docs |
notebooks, gpu, ml |
Widely used |
| Claude |
AI assistant for chat and reasoning, hosted by Anthropic. |
Freemium |
Docs |
AI, assistant, chat |
Widely used |
| ChatGPT |
Conversational AI by OpenAI, useful for general-purpose chat, QA, and integration. |
Freemium |
Docs |
AI, chat, LLM |
Mainstream |
| AutoGPT |
Open-source autonomous agent framework that orchestrates LLMs to perform multi-step tasks. |
Open Source |
Docs |
agent, LLM, automation |
Well-known in agent community |
| AutoGen (Microsoft) |
Multi-agent orchestration framework from Microsoft for building LLM-powered workflows and applications. |
Open Source |
Docs |
agent, orchestration, LLM |
Emerging enterprise interest |
| LangChain |
Framework for building applications with LLMs by chaining prompts, agents, and memory together. |
Open Source |
Docs |
framework, LLM, agents |
Very high |
| LlamaIndex (formerly GPT-Index) |
Indexing and data-connector library to connect LLMs to external data sources for retrieval-augmented generation. |
Open Source |
Docs |
RAG, indexing, LLM |
High among RAG builders |
| VLLM |
High-performance LLM inference engine for faster CPU/GPU batched inference. |
Open Source |
Docs |
inference, LLM, performance |
Popular for inference optimization |
| GooseAI |
Managed service and model host providing access to various foundation models and inference endpoints. |
Freemium |
Docs |
inference, models, host |
Niche / growing |
| Langflow |
Visual builder for LLM chains and flows — drag-and-drop interface for LangChain-compatible components. |
Open Source |
Docs |
visual, workflow, LLM |
Indie favorite |
| Letta |
Conversational AI framework designed to build chat experiences and conversational agents. |
Open Source |
Docs |
conversational, AI, framework |
Emerging |
| Apidog |
API testing platform (with AI-assisted features) for designing, testing, and documenting APIs. |
Freemium |
Docs |
API, testing, automation |
Indie dev favorite |
| Qodo |
AI assistant focused on code completion, refactorings and code understanding (developer-facing). |
Freemium |
Docs |
AI, code, assistant |
Emerging |
| DeepSeek |
AI-powered semantic search and reasoning product that enables building powerful search experiences. |
Freemium |
Docs |
semantic search, AI, retrieval |
Niche |
| Google Gemini |
Google’s family of large multimodal models and APIs for chat, images, and coding assistance. |
Freemium |
Docs |
multimodal, Google, LLM |
Mainstream |
| NotebookLM |
Google’s research assistant that helps you ask questions and get answers from documents and notes. |
Freemium |
Docs |
research, notebook, LLM |
Google product audience |