Vendor
Groq
Ultra-low-latency inference on custom LPU hardware.
1 tool tracked
- Groq
Sub-100ms LPU-based inference for Llama, Mixtral, and other open models.
Vendor
Ultra-low-latency inference on custom LPU hardware.
Sub-100ms LPU-based inference for Llama, Mixtral, and other open models.