Limited-time GPU firepower Dirt-cheap LLM Inference: Llama 4, DeepSeek 0528

4 months ago 28

Deploy and scale LLM inference—serve models like Llama 4 and DeepSeek
on our low-cost GPU platform.
Pay-as-you-go APIs mean you pay only for the inference you use.

Llama-4-Maverick-17B-128E-Instruct

$0.10 in | $0.35 out36.00K Context

Maverick beats GPT‑4o on coding, vision, reasoning and remains lightweight for efficient local deployment.

DeepSeek-V3

$0.15 in | $0.40 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

DeepSeek-R1

$0.15 in | $0.40 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

Quick Start

import OpenAI from "openai"; const openai = new OpenAI({ baseURL: "https://inference.cloudrift.ai/v1", apiKey: "YOUR_RIFT_API_KEY", }); const completion = await openai.chat.completions.create({ model: "llama4:maverick", messages: [ { role: "user", content: "What is the meaning of life?" } ], stream: true, }); for await (const chunk of completion) { process.stdout.write(chunk.choices[0]?.delta.content as string); }

All Available Models

Instant access to high-performance models—no queues, no GPUs to reserve.
Just straightforward model options you can build on.

Llama-4-Maverick-17B-128E-Instruct

$0.10 in | $0.35 out36.00K Context

Maverick beats GPT‑4o on coding, vision, reasoning and remains lightweight for efficient local deployment.

DeepSeek-V3

$0.15 in | $0.40 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

DeepSeek-R1

$0.15 in | $0.40 out163.84K Context

Rivals closed models on math and code; open weights, vetted safety, and low pricing simplify enterprise deployment.

DeepSeek-R1-0528

$0.25 in | $1.00 out32.77K Context

May 28th update to the original DeepSeek R1 Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens.

Get in Touch

We're here to support your compute and AI needs. Let us know if you're looking to:

Find an affordable GPU provider
Sell your compute online
Manage on-prem infrastructure
Build a hybrid cloud solution
Optimize your AI deployment

Businesses of any size are welcome.

CloudRift Inc., a Delaware corporation
PO Box 1224, Santa Clara, CA 95052, USA
+1 (831) 534-3437

Read Entire Article

Limited-time GPU firepower Dirt-cheap LLM Inference: Llama 4, DeepSeek 0528

Llama-4-Maverick-17B-128E-Instruct

DeepSeek-V3

DeepSeek-R1

Quick Start

All Available Models

Llama-4-Maverick-17B-128E-Instruct

DeepSeek-V3

DeepSeek-R1

DeepSeek-R1-0528

Get in Touch

Related

Carding, Sabotage and Survival: A Darknet Market Veteran's S...

Spec-Driven Development

How Much Does AI Cost?