Skip to content

Releases & Roadmap

stdapi.ai is under active development with regular feature releases. Nine major releases delivered since launch (v1.0-v1.9) with continuous improvements.

Recent Releases

See Release History below for the full changelog of all releases.

Latest: v1.9 – Files API & Images API JSON Body (OpenAI and Anthropic Files API backed by Amazon S3, multipart uploads, JSON body for image edits and variations)


Planned Features

The following features may be implemented in future releases based on community demand and feedback. Implementation priority is determined by user requests and use case requirements.

Want a feature? Submit feedback on GitHub Issues.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/completions Amazon Bedrock Amazon Bedrock - foundation models
OpenAI OpenAI /v1/responses Amazon Bedrock Amazon Bedrock - foundation models
Ollama Ollama /api/generate Amazon Bedrock Amazon Bedrock - foundation models
Ollama Ollama /api/chat Amazon Bedrock Amazon Bedrock - foundation models
Cohere Cohere /v1/chat Amazon Bedrock Amazon Bedrock - foundation models
Qwen Qwen /v1/chat/completions translation_options Amazon Translate Amazon Translate

Translation

Provider Endpoint/Feature AWS Backend
DeepL /v2/translate Amazon Translate Amazon Translate

Embeddings

Provider Endpoint/Feature AWS Backend
Ollama Ollama /api/embeddings Amazon Bedrock Amazon Bedrock - embedding models
Cohere Cohere /v1/embed Amazon Bedrock Amazon Bedrock - embedding models

Semantic Search & Ranking

Provider Endpoint/Feature AWS Backend
Cohere Cohere /v1/rerank Amazon Bedrock Amazon Bedrock - rerank models

Model Discovery

Provider Endpoint/Feature AWS Backend
Ollama Ollama /api/tags Amazon Bedrock Amazon Bedrock - model listing
Ollama Ollama /api/show Amazon Bedrock Amazon Bedrock - model details
Stability AI Stability AI /v1/engines/list Amazon Bedrock Amazon Bedrock - model catalog
Model selection wildcards (To automatically latest model versions) Amazon Bedrock Amazon Bedrock - model catalog

Speech & Audio

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/realtime/sessions Amazon Nova Sonic Amazon Nova Sonic
OpenAI OpenAI /v1/realtime/transcription_sessions Amazon Nova Sonic Amazon Nova Sonic
Transcriptions with Nova Sonic Amazon Nova Sonic Amazon Nova Sonic
Translations with Nova Sonic Amazon Nova Sonic Amazon Nova Sonic
Long-form speech (async) Amazon Polly Amazon Polly - async tasks
Streaming transcription Amazon Transcribe Amazon Transcribe - streaming
Custom vocabularies Amazon Transcribe Amazon Transcribe - custom vocab

Image Generation

Provider Endpoint/Feature AWS Backend
Stability AI Stability AI /v1/generation/text-to-image Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v1/generation/image-to-image Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v1/generation/image-to-image/masking Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v2/stable-image/generate Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v2/stable-image/upscale Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v2/stable-image/edit Amazon Bedrock Amazon Bedrock - image models

Model-Specific Features

Provider Endpoint/Feature AWS Backend
Running Provisioned throughput Amazon Bedrock Amazon Bedrock - provisioned models

AWS Bedrock Advanced Features

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/agents Amazon Bedrock Amazon Bedrock - Agents
OpenAI OpenAI /v1/vector_stores Amazon Bedrock Amazon Bedrock - Knowledge Bases
OpenAI OpenAI /v1/evals Amazon Bedrock Amazon Bedrock - Model Evaluation
OpenAI OpenAI /v1/fine_tuning/jobs Amazon Bedrock Amazon Bedrock - custom models

Batch & Async Processing

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/batches Amazon Bedrock Amazon Bedrock - batch inference
Anthropic Anthropic /v1/messages/batches Amazon Bedrock Amazon Bedrock - batch inference

Content Safety & Moderation

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/moderations Amazon Comprehend Amazon Comprehend - toxicity

Usage & Analytics

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/usage Amazon CloudWatch Amazon CloudWatch
OpenAI OpenAI /v1/organization/usage Amazon CloudWatch Amazon CloudWatch

Authentication & Access Control

Provider Endpoint/Feature AWS Backend
User authentication Amazon Cognito Amazon Cognito
Multi-tenant API keys Amazon DynamoDB Amazon DynamoDB
API key rotation AWS Secrets Manager AWS Secrets Manager
Rate limiting Amazon DynamoDB Amazon DynamoDB
AWS Bedrock API keys AWS Bedrock AWS Bedrock

Release History

v1.9.0 – Files API & Images API JSON Body

This release introduces a Files API backed by Amazon S3, available through both the OpenAI-compatible and Anthropic-compatible interfaces. Files uploaded via either API share the same S3 storage and can be referenced across both interfaces. Large files can be uploaded incrementally using the OpenAI multipart uploads API. Stored files can be referenced by ID directly in image edit and variation requests (JSON body), as well as in chat completion messages as document or image inputs. The image edits endpoint now also accepts an application/json body as an alternative to multipart form-data, making it easier to chain pipeline steps without re-uploading files.

New Required Configuration

Files API requires AWS_S3_BUCKET to be configured (shared with the image URL response feature). The S3 prefix for stored files defaults to files/ and is configurable via AWS_S3_FILES_PREFIX. Ensure your IAM role includes read, write, delete, and list permissions on the files prefix in addition to the existing S3 permissions for presigned URLs.

Files & Storage

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/files – CRUD operations Amazon S3 Amazon S3
OpenAI OpenAI /v1/uploads – multipart uploads Amazon S3 Amazon S3
Anthropic Anthropic /v1/files – CRUD operations Amazon S3 Amazon S3

Image Generation

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/images/edits – JSON body with images/mask referencing Files API IDs or URLs Amazon Bedrock Amazon Bedrock - image models
OpenAI OpenAI /v1/images/variations – JSON body with image referencing a Files API ID or URL Amazon Bedrock Amazon Bedrock - image models

Chat Completions & Messages

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI Files API file IDs usable as document/image inputs in chat completions Amazon Bedrock Amazon Bedrock - foundation models
Anthropic Anthropic Files API file IDs usable as document/image inputs in messages Amazon Bedrock Amazon Bedrock - foundation models

Fixes

  • Document inputs via S3 URLs are not supported as Bedrock Converse API inputs for some models (e.g., Claude) — now properly detected and handled

v1.8.0 – Broader Model Compatibility & Structured Output

This release focuses on improving reliability and compatibility across a wide variety of models. Structured response formats (JSON object and JSON schema) are now supported on OpenAI chat completions, and request metadata can be forwarded to Bedrock. Tool handling has been significantly improved—both for model-specific system tools and for Amazon Nova's grounding tool, including multi-turn support. Region routing is now more robust, correctly enforcing non-global inference profiles for region-restricted models and handling edge cases gracefully.

New Required IAM Permissions

v1.8.0 requires two new IAM permissions to attach request metadata tags to jobs:

  • bedrock:TagResource on arn:aws:bedrock:*:*:async-invoke/* — needed for Bedrock asynchronous invocation jobs (see IAM Permissions). The twelvelabs.marengo-embed-3-0-v1:0 and twelvelabs.marengo-embed-2-7-v1:0 models rely on asynchronous invocation and will fail with an access denied error if this permission is missing.
  • transcribe:TagResource on arn:aws:transcribe:*:*:transcription-job/* — needed for Amazon Transcribe transcription jobs (see IAM Permissions). The amazon.transcribe model will fail with an access denied error if this permission is missing.

Ensure your IAM role or user policy includes both statements before upgrading to v1.8.0.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI response_format – JSON object and JSON schema structured output Amazon Bedrock Amazon Bedrock - foundation models
OpenAI OpenAI metadata – request metadata forwarding to Bedrock Amazon Bedrock Amazon Bedrock - foundation models
Amazon Nova Amazon Nova Nova Code Interpreter global profile support Amazon Bedrock Amazon Bedrock - Nova models

Messages (Anthropic-Compatible)

Provider Endpoint/Feature AWS Backend
Amazon Nova Amazon Nova nova_grounding responses mapped to web_search content blocks Amazon Bedrock Amazon Bedrock - Nova models
Amazon Nova Amazon Nova Multi-turn conversation support with nova_grounding Amazon Bedrock Amazon Bedrock - Nova models

Platform Features

Feature Description
Non-global profiles for region-restricted models Region-restricted models are now always assigned non-global inference profiles, preventing requests from bypassing configured region restrictions
Region routing edge case handling Region routing gracefully handles cases where no usable regions are available
ECS-based server ID When running on ECS, server_id in logs is set to task_id.container_name for precise instance identification across tasks and containers
Request metadata tagging stdapi.ai request context (request_id, server_id, user_id) is automatically attached as tags to every Bedrock and Amazon Transcribe job, making it easy to trace API calls across AWS service logs

Fixes

  • Fix systemTool_ prefix handling: removed broken auto-promotion logic; system tools require specific tool output handling not compatible with generic tool forwarding
  • AWS_BEDROCK_LEGACY default changed from true to false to prevent access denied errors on legacy models that have not been actively used recently
  • Bedrock read timeouts are now handled as standard model errors (503) instead of unhandled exceptions, and are properly retried across regions when multi-region routing is enabled

v1.7.0 – Automatic Region Routing, Deprecated Model Fallback & Resilience Improvements

The headline feature of v1.7 is automatic multi-region routing: stdapi.ai now intelligently distributes requests across your configured AWS regions, failing over automatically on quota limits or unavailability—and because each region carries its own independent quota, adding regions directly multiplies your effective tokens-per-minute and daily limits. Alongside this, deprecated model IDs are transparently redirected to their replacements so clients survive AWS model retirements without any code changes. This release also adds S3 URL support for file inputs across all relevant endpoints, a configurable AI response timeout, and memory efficiency improvements.

Platform Features

Feature Description
Automatic region routing with configurable strategies Intelligently distributes Bedrock requests across configured AWS regions with automatic failover on quota limits or unavailability; supports ordered, lowest_latency, and round_robin strategies
Deprecated model fallback Transparently reroute deprecated model IDs to their replacements; extend or override the built-in mapping; warns on legacy model usage
AI response timeout Configurable timeout for AI model responses to prevent indefinitely hanging requests
Expanded file input support File inputs (images, documents, audio) now support S3 URLs in addition to HTTP URLs, data URIs, and plain base64 across all relevant endpoints; improves memory efficiency by releasing file data as early as possible
Model lifecycle timestamps Model created/updated timestamps now derived from lifecycle data (startOfLifeTime, endOfLifeTime)

Fixes

  • Fix SSE stream error handling in monitoring to handle specific API and AWS client errors gracefully
  • Fix audio MIME type detection failure when libmagic's in-memory buffer path silently returns application/octet-stream; fall back to file-based detection to ensure correct format is sent to Bedrock

v1.6.0 – Anthropic API Compatibility & Advanced Claude Capabilities

Introduces a full Anthropic-compatible API layer, enabling direct use of the Anthropic SDK and Claude-native tools with AWS Bedrock. Adds Claude server tools support via OpenAI chat completions, token count estimation, automatic Anthropic beta flag filtering, and configurable route prefixes.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/chat/completions Claude server tools (bash, str_replace_based_edit_tool, computer, memory) Claude Claude models on Amazon Bedrock

Messages (Anthropic-Compatible)

Provider Endpoint/Feature AWS Backend
Anthropic Anthropic /v1/messages – Full Anthropic Messages API Amazon Bedrock Amazon Bedrock - Converse API
Anthropic Anthropic /v1/messages/count_tokens – Token counting Amazon Bedrock Amazon Bedrock - CountTokens API
Claude Claude Claude server tools (bash, text editor, computer, memory) Amazon Bedrock Amazon Bedrock - Claude models
Amazon Nova Amazon Nova Web search tool (web_searchnova_grounding) Amazon Bedrock Amazon Bedrock - Nova models

Model Discovery (Anthropic-Compatible)

Provider Endpoint/Feature AWS Backend
Anthropic Anthropic /v1/models – List models (Anthropic format) Amazon Bedrock Amazon Bedrock - model catalog
Anthropic Anthropic /v1/models/{model_id} – Get model details Amazon Bedrock Amazon Bedrock - model catalog

Platform Features

Feature Description
ANTHROPIC_ROUTES_PREFIX configuration Configurable base path prefix for Anthropic-compatible routes (default: /anthropic)
OPENAI_ROUTES_PREFIX configuration Configurable base path prefix for OpenAI-compatible routes
Token count estimation (TOKENS_ESTIMATION) Estimate token counts via tiktoken when models don't provide them; configurable encoding via TOKENS_ESTIMATION_DEFAULT_ENCODING
Anthropic beta flag filtering (ANTHROPIC_BETA_FILTER) Automatically filter unsupported anthropic-beta flags to prevent Bedrock ValidationException errors; extensible via ANTHROPIC_BETA_ALLOWLIST
Claude model name aliases Use official Anthropic model names (e.g., claude-opus-4-6) auto-resolved to AWS Bedrock identifiers

v1.5.0 – Advanced Reasoning & Model Compatibility (with v1.5.1–v1.5.2 maintenance updates)

Introduces advanced reasoning capabilities with Amazon Nova 2 and Anthropic Claude 4.6+ adaptive reasoning, enhanced system prompt handling for broader model compatibility.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI System prompt handling for unsupported models Amazon Bedrock Amazon Bedrock - foundation models
Amazon Nova Amazon Nova Nova 2 chat model reasoning implementation Amazon Bedrock Amazon Bedrock - foundation models
Claude Claude Claude 4.6+ adaptive reasoning configuration Amazon Bedrock Amazon Bedrock - Claude models

Fixes & Maintenance (v1.5.1–v1.5.2)

v1.5.2

  • Add "/" route to avoid 404 errors on root endpoint
  • Fix empty system content block handling (improves AWS Bedrock Converse API compatibility)

v1.5.1

  • Fix Amazon Nova Canvas image editing to fall back to TEXT_IMAGE task type when no mask is provided

v1.4.0 – Audio Enhancements & Model Compatibility

Expands audio capabilities with Mistral Voxtral support, speaker diarization, audio formats for chat completions, and introduces prompt caching TTL and model aliasing for better OpenAI compatibility.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/chat/completions audio format support Amazon Bedrock Amazon Bedrock - foundation models
OpenAI OpenAI /v1/chat/completions extended Bedrock finish reasons mapping Amazon Bedrock Amazon Bedrock
OpenAI OpenAI Prompt caching TTL support Amazon Bedrock Amazon Bedrock - prompt caching

Speech & Audio

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/audio/transcriptions diarized_json format Amazon Transcribe Amazon Transcribe
Mistral Mistral Voxtral audio model Amazon Bedrock Amazon Bedrock - foundation models

Platform Features

Feature Description
Model alias support Seamless OpenAI compatibility via model name aliasing

Fixes

  • Fix chat completion file input handling and refactor base64 decoding and MIME handling for file processing.
  • Re-raise startup exceptions and disable botocore logging to improve error visibility

v1.3.0 – Image Editing & Variation Support (with v1.3.1–v1.3.5 maintenance updates)

Adds support for OpenAI's image editing and variation endpoints, enabling image manipulation capabilities backed by Amazon Bedrock. Includes maintenance updates for content block handling, tool call validation, streaming fixes, and TTS optimization.

Image Generation

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/images/edits Amazon Bedrock Amazon Bedrock - image models
OpenAI OpenAI /v1/images/variations Amazon Bedrock Amazon Bedrock - image models

Speech & Audio (v1.3.2)

Feature Description
DEFAULT_TTS_LANGUAGE setting Configurable default language for TTS to optimize performance

Fixes & Maintenance (v1.3.1–v1.3.5)

v1.3.5

  • Refactor content block handling to skip empty entries in assistant responses

v1.3.4

  • Handle invalid tool call arguments with robust JSON content validation
  • Add deprecation mapping for amazon.titan-image-generator-v2:0amazon.nova-canvas-v1:0

v1.3.3

  • Remove premature stop condition for contentBlockStop in streaming chat completions

v1.3.2

  • Support image[] array-style notation for OpenAI image edits
  • Handle empty audio segments in transcription duration calculation

v1.3.1

  • Improve JSON parsing for tool arguments and results
  • Correct exampleexamples in OpenAPI model path parameter

v1.2.0 – Service Tiers, System Tools & Performance Enhancements

Introduces service tiers and latency headers for all Bedrock routes, Bedrock-specific system tools (Nova grounding), GPT5.2 API compatibility, configurable guardrail overrides, and Python 3.14 optimization.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/chat/completions service_tier parameter Amazon Bedrock Amazon Bedrock - service tiers
OpenAI OpenAI /v1/chat/completions Bedrock-specific system tools (Nova grounding) Amazon Bedrock Amazon Bedrock - system tools
OpenAI OpenAI /v1/chat/completions GPT5.2 API update (reasoning_effort=xhigh)

Content Safety & Moderation

Feature AWS Backend
Configuration flag for guardrail override allow Amazon Bedrock Amazon Bedrock Guardrails

Platform Features

Feature AWS Backend / Description
Service tiers and latency headers (all Bedrock routes) Amazon Bedrock Amazon Bedrock - service tiers
Python 3.14 support Upgraded to Python 3.14 with performance optimization
Dependency update Direct aiobotocore usage (replaced aioboto3)

Fixes

  • Fix warnings for duplicated FastAPI routes (/docs and /openapi.json).

v1.1.0 – Embeddings Enhancement, Prompt Caching & Advanced Routing

Expands multimodal embedding capabilities, adds prompt caching support, and introduces advanced routing with application inference profiles and prompt routers.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI Prompt caching /v1/chat/completions prompt_cache_key Amazon Bedrock Amazon Bedrock - prompt caching
OpenAI OpenAI /v1/chat/completions GPT5.1 API update (reasoning_effort=none)

Embeddings

Provider Endpoint/Feature AWS Backend
Intelligent S3 multimodal upload Amazon S3 Amazon S3
Intelligent Sync/async Bedrock invocation Amazon Bedrock Amazon Bedrock
Amazon Nova Amazon Nova Multimodal embeddings models
Twelve Labs Twelve Labs Marengo V3 models

Advanced Routing

Feature AWS Backend
Application inference profiles Amazon Bedrock Amazon Bedrock - application inference profiles
Prompt routers Amazon Bedrock Amazon Bedrock - prompt routers
Server-side ARN mapping Amazon Bedrock Amazon Bedrock
Client-side ARN passing (optional) Amazon Bedrock Amazon Bedrock

Fixes

  • /v1/chat/completions: Fix default value passed to the converse API for tools without parameters.
  • stdapi-ai Terraform module: Fix error if alarms_enabled = true but sns_topic_arn undefined.

v1.0.0 – Foundation Release

The initial release establishes core OpenAI API compatibility with AWS Bedrock backing.

Chat Completions

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/chat/completions Amazon Bedrock Amazon Bedrock - foundation models
All models supporting Converse/ConverseStream APIs Amazon Bedrock Amazon Bedrock - Converse API
Deepseek Deepseek /v1/chat/completions reasoning_content Amazon Bedrock Amazon Bedrock - foundation models
Qwen Qwen enable_thinking + thinking_budget parameter Amazon Bedrock Amazon Bedrock - foundation models
Qwen Qwen top_k parameter Amazon Bedrock Amazon Bedrock - foundation models

Embeddings

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/embeddings Amazon Bedrock Amazon Bedrock - embedding models
Cohere Cohere Embed V3 & V4 models
Twelve Labs Twelve Labs Marengo V2 models
Amazon Amazon Titan Embed V1 & V2 models

Speech & Audio

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/audio/speech Amazon Polly Amazon Polly + Amazon Comprehend
OpenAI OpenAI /v1/audio/transcriptions Amazon Transcribe Amazon Transcribe
OpenAI OpenAI /v1/audio/translations Amazon Transcribe Amazon Transcribe + Amazon Translate

Image Generation

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/images/generations Amazon Bedrock Amazon Bedrock - image models
Amazon Nova Amazon Nova Canvas V1 models
Amazon Amazon Titan Image Generator V1 & V2 models
Stability AI Stability AI Image Core, Ultra et SD3.5 Large models

Model Discovery

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/models Amazon Bedrock Amazon Bedrock - model catalog

Platform Features

Feature AWS Backend
Bedrock Features
Content filtering and safety Amazon Bedrock Amazon Bedrock Guardrails
Cross-region inference Amazon Bedrock Amazon Bedrock - global/regional
Application inference profiles Amazon Bedrock Amazon Bedrock - inference profiles
Model parameters (temperature, top_p, etc.) Amazon Bedrock Amazon Bedrock - native parameters
Multi-region failover Amazon Bedrock Amazon Bedrock - multi-region
Bedrock guardrails Amazon Bedrock Amazon Bedrock Guardrails
AWS Services
File storage Amazon S3 Amazon S3 - presigned URLs, Transfer Acceleration
Authentication
Static token authentication AWS Systems Manager AWS SSM Parameter Store / AWS Secrets Manager Secrets Manager
Development mode (no auth)
Observability
Distributed tracing AWS X-Ray AWS X-Ray + OpenTelemetry
Structured logging Amazon CloudWatch Amazon CloudWatch (When running on ECS/EKS)
Health check endpoint
HTTP/Security
CORS support
Trusted host validation
Proxy headers (X-Forwarded-*)
GZip compression
📚 Documentation
Interactive API docs & OpenAPI schema
🔌 Compatibility
Provider-specific parameters