Inference.net

4 months ago 14

Inference.net is a global network of data centers serving fast, scalable, pay-per-token APIs for models like DeepSeek V3 and Llama 3.3. Connect in minutes. Scale forever.

import OpenAI from "openai"; const openai = new OpenAI({ baseURL: "https://api.inference.net/v1", apiKey: process.env.INFERENCE_API_KEY, }); const completion = await openai.chat.completions.create({ model: "deepseek/deepseek-r1-0528/fp-8", messages: [ { role: "user", content: "What is the meaning of life?" } ], stream: true, }); for await (const chunk of completion) { process.stdout.write(chunk.choices[0]?.delta.content as string); }
Read Entire Article