Niche — AI Infrastructure
AI infrastructure startups powering the LLM era
AI infrastructure: inference, GPU orchestration, MLOps, evals, and deployment platforms.
- Cohere
Enterprise AI for every business
- Groq
The fastest inference on the planet
- Cognition
The world's first AI software engineer
- Replicate
Run AI models with a cloud API
- Together AI
Fast inference for open-source AI models
- Weights & Biases
The AI developer platform for ML experiments
- Copy.ai
AI-powered marketing copy and workflows
- Parallel
A parallel web, for AIs
- Braintrust
AI evaluation and experimentation platform
- Fireworks AI
Fast, affordable LLM inference API
- Letta
Memory-first infrastructure for AI agents
- Vellum AI
LLM product development and evaluation platform
- Lambda Labs
GPU cloud built for AI training and inference
- Etched AI
Transformer chips faster than GPUs
- Baseten
ML model deployment platform for production teams
- Qdrant
High-performance vector search engine
- BentoML
Unified AI model serving and deployment framework
- Oway
AI Infrastructure for America's Empty Truck Space
- Writer
Full-stack generative AI for enterprises
- Roboflow
Build and deploy computer vision in minutes
- CrewAI
Multi-agent framework for collaborative AI automation
- Lepton AI
Pythonic cloud for AI applications
- Anyscale
Scale Python and AI applications on any infrastructure
- Cerebras Systems
The fastest AI inference in the world
- Labelbox
AI training data platform for the full ML pipeline
- AnythingLLM
The all-in-one open source AI desktop application
- Beam
Serverless infrastructure for AI and ML workloads
- fal.ai
Serverless GPU inference for AI models
- Traceloop
Open-source LLM observability with OpenTelemetry
- Portkey
AI gateway and LLMOps platform for production
- Humanloop
Prompt management and LLM evaluation platform
- Dust
AI agents and workflows for enterprise teams
- Voiceflow
Platform to design and deploy AI agents
- Aleph Alpha
European sovereign AI models and infrastructure
- Predibase
Fine-tune and serve LLMs at scale
- Argilla
Open-source data labeling platform for AI
- Vapi
Voice AI infrastructure for developers
- Bland AI
AI phone agents at enterprise scale
- Retell AI
Build, test, and deploy voice AI agents
- LlamaIndex
Data framework for building LLM applications
- Nomic AI
Open-source embeddings and data visualisation
- Unstructured
Parse and prepare any document for LLMs
- Lovable
Build full-stack web apps by describing them in natural language
- Bolt
AI full-stack web development in the browser
- Conductor
Run a team of AI coding agents on your Mac
- Induced AI
AI agents that autonomously browse the web
- Watney
Intelligent robots, today
- Applied Compute
Specific intelligence for enterprises
- Sycamore
The trusted agent OS for the enterprise
- Normal
AI for our most pressing crises in silicon
- Fauna Robotics (acquired by Amazon)
Robots for everyone
- Cake AI
- Thread AI
Mission critical AI infrastructure
- Ceramic
Redefining AI infrastructure
- Magic
Building an AI software engineer
- Bland
The enterprise platform for AI phone calls
- Wordware
AI agent orchestration platform
- Adaptive ML
AI, tuned to production
- TensorZero
Open-source LLM infra
- Liminal
Horizontal security for GenAI
- StackAI
No-code platform to build AI agents
- Meter
Internet infrastructure for the enterprise
- Eudia
Augment your legal team with AI
- Hyperbolic
Building the open-access AI cloud
- Pienso
No-code tools for training AI models
- Osmosis
Create task-specific models that beat foundation models
- Finch (Legal)
Pre-litigation, automated
- Good Start Labs
Building games to make AI better
- Seneca
Firefighting drones
- Tensormesh
The caching layer built for LLM inference
- Poolside
Frontier research to operational intelligence
- Relace
LLMs for code generation
- Fastino
Task-specific Language Models
- Netic
The AI revenue engine for SMBs
- GigaML
Voice AI agents for B2C companies
- Nuraline
Enabling AI systems to self-improve
- The General Intelligence Company
Autonomous agents for startups
- Contextual AI
Build specialized RAG agents
- Mithril
The AI omnicloud
- Shaped
Real-time retrieval engine
- LLMArena
Find the best AI for you
- Skild AI
Building general purpose robotic intelligence
- Inferact
The world's AI inference engine
- Terranova
Terraforming robots
- Adaline
Iterate, evaluate, deploy, and monitor LLMs
- Decart
A new era of real-time generative experiences
- Dedalus Labs
Multi-modal, multi-tool agents in minutes
- Doubleword
InferenceOps platform
- Modular
VMware for the AI era
- Etched
Building the hardware for superintelligence
- Positron
Energy-efficient AI chips
- Bedrock Robotics
Advanced autonomy for the built world
- Oxide Computer Company
On-prem cloud computer
- Braintrust
The observability layer for production AI
- Bretton AI
Trusted AI agents for financial crime
- Atoms
Physical automation to move the world
- Gimlet Labs
Applied AI research lab