boxlite
boxlite-ai
Compute substrate for AI agents: lightweight enough to live on your laptop, elastic enough to scale into the cloud and unleash unlimited resources.
Benchmarks we run. Prices we verify daily. Field evidence we curate from postmortems, dev posts, and vendor retros: terse, dated, honest.
Fast, low-cost inference for open and proprietary models with native function calling.
Sub-100ms LPU-based inference for Llama, Mixtral, and other open models.
SGLang is a high-performance serving framework for large language models and multimodal models.
Inference and fine-tuning across 200+ open-source LLMs with serverless and dedicated endpoints.
A high-throughput and memory-efficient inference and serving engine for LLMs.