Deploy and scale LLM inference—serve models like Llama 4 and DeepSeek
on our low-cost GPU platform.
Pay-as-you-go APIs mean you pay only for the inference you use.
All Available Models
Instant access to high-performance models—no queues, no GPUs to reserve.
Just straightforward model options you can build on.
Llama-4-Maverick-17B-128E-Instruct
$0.10 in | $0.35 out | 36.00K Context
Maverick beats GPT‑4o on coding, vision, and reasoning while remaining lightweight enough for efficient local deployment.
DeepSeek-V3
$0.15 in | $0.40 out | 163.84K Context
Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.
DeepSeek-R1
$0.15 in | $0.40 out | 163.84K Context
Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.
DeepSeek-R1-0528
$0.25 in | $1.00 out | 32.77K Context
May 28th update to the original DeepSeek R1. Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens.
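To see what pay-as-you-go pricing means for a given workload, the listed "in" and "out" prices can be turned into a rough cost estimate. The sketch below is illustrative only: it assumes the prices are quoted in USD per 1 million tokens (the page does not state the unit), and `PRICES_USD` and `estimate_cost` are hypothetical names, not part of any CloudRift API.

```python
# Prices copied from the model list above, as (input, output) in USD.
# ASSUMPTION: prices are per 1 million tokens; the page does not state the unit.
TOKENS_PER_PRICE_UNIT = 1_000_000

PRICES_USD = {
    "Llama-4-Maverick-17B-128E-Instruct": (0.10, 0.35),
    "DeepSeek-V3": (0.15, 0.40),
    "DeepSeek-R1": (0.15, 0.40),
    "DeepSeek-R1-0528": (0.25, 1.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload (hypothetical helper)."""
    price_in, price_out = PRICES_USD[model]
    total = input_tokens * price_in + output_tokens * price_out
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Example workload: 2M input tokens and 1M output tokens on each model.
    for name in PRICES_USD:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 1_000_000):.2f}")
```

Under that unit assumption, output tokens dominate the bill for generation-heavy workloads, so the "out" price is usually the one to compare first.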
Get in Touch
We're here to support your compute and AI needs. Let us know if you're looking to:
- Find an affordable GPU provider
- Sell your compute online
- Manage on-prem infrastructure
- Build a hybrid cloud solution
- Optimize your AI deployment
Businesses of any size are welcome.
CloudRift Inc., a Delaware corporation
PO Box 1224, Santa Clara, CA 95052, USA
+1 (831) 534-3437