Niche — AI Infrastructure
AI infrastructure startups powering the LLM era
AI infrastructure: inference, GPU orchestration, MLOps, evals, and deployment platforms.
- Cohere
Enterprise AI platform built for the workplace.
- Harvey
AI for elite law firms.
- Replicate
Run open-source machine learning models with one API call.
- Lambda Labs
GPU cloud built for AI training and inference.
- Baseten
ML infrastructure for fast, scalable, cost-efficient inference.
- Scale AI
Data labeling and evaluation for the AI era.
- Weights & Biases
The AI developer platform.
- Groq
The fastest LLM inference, by a wide margin.
- Cerebras Systems
The world's largest computer chip.
- CoreWeave
The AI hyperscaler.
- Together AI
The AI Acceleration Cloud.
- Fireworks AI
Productive AI platform with the fastest inference.
- Lovable
Idea to app in seconds.
- Bolt (StackBlitz)
Prompt, run, edit, deploy full-stack apps.
- Anomalo
Automated data quality monitoring.
- Fal.ai
The fastest cloud for generative media.
- Lakera
Security for LLM apps.
- Figure
General-purpose humanoid robots for the workforce.
- Agility Robotics
Digit — the bipedal robot working in warehouses today.
- Cognition
The world's first AI software engineer
- Copy.ai
AI-powered marketing copy and workflows
- Parallel
A parallel web, for AIs
- Braintrust
AI evaluation and experimentation platform
- Letta
Memory-first infrastructure for AI agents
- Vellum AI
LLM product development and evaluation platform
- Etched AI
Transformer chips faster than GPUs
- Qdrant
High-performance vector search engine
- BentoML
Unified AI model serving and deployment framework
- Oway
AI Infrastructure for America's Empty Truck Space
- Roboflow
Build and deploy computer vision in minutes
- CrewAI
Multi-agent framework for collaborative AI automation
- Lepton AI
Pythonic cloud for AI applications
- Anyscale
Scale Python and AI applications on any infrastructure
- Cerebras Systems
The fastest AI inference in the world
- Labelbox
AI training data platform for the full ML pipeline
- AnythingLLM
The all-in-one open source AI desktop application
- Beam
Serverless infrastructure for AI and ML workloads
- Traceloop
Open-source LLM observability with OpenTelemetry
- Portkey
AI gateway and LLMOps platform for production
- Humanloop
Prompt management and LLM evaluation platform
- Dust
AI agents and workflows for enterprise teams
- Voiceflow
Platform to design and deploy AI agents
- Aleph Alpha
European sovereign AI models and infrastructure
- Predibase
Fine-tune and serve LLMs at scale
- Argilla
Open-source data labeling platform for AI
- Vapi
Voice AI infrastructure for developers
- Bland AI
AI phone agents at enterprise scale
- Retell AI
Build, test, and deploy voice AI agents
- LlamaIndex
Data framework for building LLM applications
- Nomic AI
Open-source embeddings and data visualisation
- Unstructured
Parse and prepare any document for LLMs
- Conductor
Run a team of AI coding agents on your Mac
- Induced AI
AI agents that autonomously browse the web
- Watney
Intelligent robots, today
- Applied Compute
Specific intelligence for enterprises
- Sycamore
The trusted agent OS for the enterprise
- Normal
AI for our most pressing crises in silicon
- Fauna Robotics (acquired by Amazon)
Robots for everyone
- Cake AI
Cake AI
- Thread AI
Mission critical AI infrastructure
- Ceramic
Redefining AI infrastructure
- Magic
Building an AI software engineer
- Bland
The enterprise platform for AI phone calls
- Wordware
AI Agent orchestration platform
- Adaptive ML
AI, tuned to production
- TensorZero
Open-source LLM infra
- Liminal
Horizontal security for GenAI
- StackAI
No-code platform to build AI agents
- Meter
Internet infrastructure for the enterprise
- Eudia
Augment your legal team with AI
- Hyperbolic
Building the open-access AI cloud
- Pienso
No-code tools for training AI models
- Osmosis
Create task-specific models that beat foundation models
- Finch (Legal)
Pre-litigation, automated
- Good Start Labs
Building games to make AI better
- Seneca
Firefighting drones
- Tensormesh
The caching layer built for LLM inference
- Poolside
Frontier research to operational intelligence
- Relace
LLMs for code generation
- Fastino
Task-specific Language Models
- Netic
The AI revenue engine for SMBs
- GigaML
Voice AI agents for B2C companies
- Nuraline
Enabling AI systems to self-improve
- The General Intelligence Company
Autonomous agents for startups
- Contextual AI
Build specialized RAG agents
- Mithril
The AI omnicloud
- Shaped
Real-time retrieval engine
- LLMArena
Find the best AI for you
- Inferact
The world's AI inference engine
- Terranova
Terraforming robots
- Adaline
Iterate, evaluate, deploy, and monitor LLMs
- Decart
A new era of real-time generative experiences
- Dedalus Labs
Multi-modal, multi-tool agents in minutes
- Doubleword
InferenceOps platform
- Modular
VMware for the AI era
- Etched
Building the hardware for superintelligence
- Positron
Energy-efficient AI chips
- Bedrock Robotics
Advanced autonomy for the built world
- Oxide Computer Company
On-prem cloud computer
- Braintrust
The observability layer for production AI
- Bretton AI
Trusted AI agents for financial crime
- Atoms
Physical automation to move the world
- Gimlet Labs
Applied AI research lab