Skip to main content

June 5, 2026

New Models: Whisper-Large-v3 and E5-Mistral-7B-Instruct
  • Whisper-Large-v3 is now available for audio transcription and translation - OpenAI’s state-of-the-art ASR model, EU-hosted with full data sovereignty
  • E5-Mistral-7B-Instruct is now available for generating embeddings - high-quality vector representations for RAG, search, and classification workflows
  • Both models run on EU sovereign infrastructure with no data leaving EU jurisdiction
  • Rate limits updated for both models
  • Model overview updated with new embedding and audio model sections

May 24, 2026

Anthropic SDK Compatibility
  • New /v1/messages endpoint for Anthropic SDK compatibility
  • Use the Anthropic Python SDK with Infercom models - just change the base URL
  • Supports streaming, system prompts, multi-turn conversations, and tool use
  • Documentation

May 22, 2026

New Model: MiniMax-M2.7
  • MiniMax-M2.7 is now available as an EU-hosted model with 192K context window
  • Recommended for agentic coding workflows - replaces M2.5 as the default recommendation
  • MiniMax-M2.5 (160K context) remains available but will be deprecated in a future release
  • Rate limits updated for MiniMax-M2.7

May 15, 2026

Responses API
  • New POST /v1/responses endpoint for agentic workflows - supports function tools, streaming, and reasoning effort control
  • Compatible with OpenAI Responses API standard
  • Supported models: MiniMax-M2.7, MiniMax-M2.5, gpt-oss-120b
  • Documentation
Model Update
  • gemma-3-12b-it moved to EU sovereign infrastructure - first vision-capable model with full EU data sovereignty
  • Vision guide
New Integration
  • Codex CLI now supported via Responses API
  • Updated Cline and OpenCode guides with Plan/Execute configuration pattern

April 24, 2026

Model Catalog Updates
  • DeepSeek-V3.1 moved to Global Model Catalog: Now served via proxy (US region) instead of EU-hosted infrastructure
  • gemma-3-12b-it added: Google’s Gemma 3 12B model now available via Global Model Catalog (Japan). This is the first vision-capable model on Infercom — see Vision guide
  • DeepSeek-V3.2 context length corrected: Updated from 8k to 32k tokens
  • EU-hosted models: MiniMax-M2.5 and gpt-oss-120b remain as the sovereign EU options
Documentation Updates

April 6, 2026

New Documentation: Agentic Coding

April 3, 2026

Global Model Catalog Streamlined
  • Reduced Global Model Catalog to 2 models: Meta-Llama-3.3-70B-Instruct (128k) and DeepSeek-V3.2 (8k)
  • 8 models deprecated: Qwen3-32B, Qwen3-235B, DeepSeek-R1-0528, DeepSeek-R1-Distill-Llama-70B, DeepSeek-V3-0324, DeepSeek-V3.1-Terminus, Llama-4-Maverick-17B-128E-Instruct, Meta-Llama-3.1-8B-Instruct
  • See deprecations for migration guidance
  • Fixed MiniMax-M2.5 context length to 160k (was incorrectly listed as 164k)

March 12, 2026

Model Update: MiniMax M2.5 replaces Meta-Llama-3.3-70B-Instruct
  • MiniMax M2.5 is now available as an EU-hosted model on Infercom’s sovereign infrastructure in Germany
  • Meta-Llama-3.3-70B-Instruct has been deprecated and removed from the platform. See deprecations for migration guidance
  • Rate limits updated for MiniMax M2.5

February 27, 2026

Global Model Catalog Launch
  • 9 new models available through the Global Model Catalog, including DeepSeek-R1-0528, Qwen3-235B, Llama-4-Maverick, and more
  • EU-hosted and globally-routed models are now clearly separated in the model overview
  • Rate limits updated for all 12 models across Free and Developer tiers
  • API sn_metadata now returns is_external and region fields for all models

February 14, 2026

New Documentation
  • Added Performance & Latency guide covering connection pooling, response performance metadata, streaming optimization, and best practices for minimizing latency

February 5, 2026

Documentation for Global Model Catalog
  • Added documentation for the Global Model Catalog, covering EU-hosted and globally-routed models
  • Documented how to identify model regions via the API (sn_metadata.region) and the Playground
  • Updated API reference with sn_metadata schema and ?verbose=true query parameter
  • Reviewed and qualified sovereignty claims across documentation to reflect Global Model Catalog

January 23, 2026

Model Update
  • Updated DeepSeek model from DeepSeek-V3-0324-cb to DeepSeek-V3.1 with expanded 128k context window
  • The previous model has been deprecated and is listed on the deprecations page
  • Updated rate limits for all models

January 13, 2026

New Improvements
  • Documentation cleanup and link fixes

December 30, 2025

Updated the model catalog to reflect currently available models on Infercom Inference Service. The model list and rate limits documentation have been updated accordingly.

November 17, 2025

We are pleased to announce the launch of Infercom’s documentation portal. This comprehensive documentation has been created based on SambaNova’s documentation (dated October 7, 2025) and adapted for Infercom’s EU sovereign AI inference platform. Key features
  • Complete API reference documentation with OpenAI-compatible endpoints.
  • Developer guides for integrating with Infercom’s inference platform.
  • Model catalog and configuration documentation for all supported models.
  • Platform architecture and deployment guides.
  • Usage examples in Python and TypeScript, powered by OpenAI-compatible SDKs.
All documentation will be continuously updated to reflect Infercom-specific features, European data sovereignty capabilities, and platform enhancements as they become available.