Skip to content

Roadmap & ChangelogΒΆ

What's NextΒΆ

The following features may be implemented in future releases based on community demand and feedback. Implementation priority is determined by user requests, use case requirements, and alignment with the project's goals. All features can be implemented using AWS services as backends. If you need a specific feature, submit feedback or contribute to discussions.

πŸ’¬ Chat CompletionsΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/completions Amazon Bedrock Amazon Bedrock - foundation models
OpenAI OpenAI /v1/responses Amazon Bedrock Amazon Bedrock - foundation models
Claude Claude /v1/messages Amazon Bedrock Amazon Bedrock - foundation models
Claude Claude Extended thinking mode Amazon Bedrock Amazon Bedrock - Claude thinking
Ollama Ollama /api/generate Amazon Bedrock Amazon Bedrock - foundation models
Ollama Ollama /api/chat Amazon Bedrock Amazon Bedrock - foundation models
Cohere Cohere /v1/chat Amazon Bedrock Amazon Bedrock - foundation models
Qwen Qwen /v1/chat/completions translation_options Amazon Translate Amazon Translate

🌐 Translation¢

Provider Endpoint/Feature AWS Backend
DeepL /v2/translate Amazon Translate Amazon Translate

🧠 Embeddings¢

Provider Endpoint/Feature AWS Backend
Ollama Ollama /api/embeddings Amazon Bedrock Amazon Bedrock - embedding models
Cohere Cohere /v1/embed Amazon Bedrock Amazon Bedrock - embedding models

πŸ” Semantic Search & RankingΒΆ

Provider Endpoint/Feature AWS Backend
Cohere Cohere /v1/rerank Amazon Bedrock Amazon Bedrock - rerank models

πŸ“‹ Model DiscoveryΒΆ

Provider Endpoint/Feature AWS Backend
Ollama Ollama /api/tags Amazon Bedrock Amazon Bedrock - model listing
Ollama Ollama /api/show Amazon Bedrock Amazon Bedrock - model details
Stability AI Stability AI /v1/engines/list Amazon Bedrock Amazon Bedrock - model catalog
Model selection wildcards (To automatically latest model versions) Amazon Bedrock Amazon Bedrock - model catalog

πŸŽ™οΈ Speech & AudioΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/realtime/sessions Amazon Nova Sonic Amazon Nova Sonic
OpenAI OpenAI /v1/realtime/transcription_sessions Amazon Nova Sonic Amazon Nova Sonic
Transcriptions with Nova Sonic Amazon Nova Sonic Amazon Nova Sonic
Translations with Nova Sonic Amazon Nova Sonic Amazon Nova Sonic
Long-form speech (async) Amazon Polly Amazon Polly - async tasks
Streaming transcription Amazon Transcribe Amazon Transcribe - streaming
Speaker diarization Amazon Transcribe Amazon Transcribe - diarization
Custom vocabularies Amazon Transcribe Amazon Transcribe - custom vocab

🎨 Image Generation¢

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/images/edits Amazon Bedrock Amazon Bedrock - image models
OpenAI OpenAI /v1/images/variations Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v1/generation/text-to-image Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v1/generation/image-to-image Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v1/generation/image-to-image/masking Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v2/stable-image/generate Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v2/stable-image/upscale Amazon Bedrock Amazon Bedrock - image models
Stability AI Stability AI /v2/stable-image/edit Amazon Bedrock Amazon Bedrock - image models

🎯 Model-Specific Features¢

Provider Endpoint/Feature AWS Backend
Running Provisioned throughput Amazon Bedrock Amazon Bedrock - provisioned models

πŸ€– AWS Bedrock Advanced FeaturesΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/agents Amazon Bedrock Amazon Bedrock - Agents
OpenAI OpenAI /v1/vector_stores Amazon Bedrock Amazon Bedrock - Knowledge Bases
OpenAI OpenAI /v1/evals Amazon Bedrock Amazon Bedrock - Model Evaluation
OpenAI OpenAI /v1/fine_tuning/jobs Amazon Bedrock Amazon Bedrock - custom models

πŸ“¦ Batch & Async ProcessingΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/batches Amazon Bedrock Amazon Bedrock - batch inference
Claude Claude /v1/messages/batches Amazon Bedrock Amazon Bedrock - batch inference

πŸ›‘οΈ Content Safety & ModerationΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/moderations Amazon Comprehend Amazon Comprehend - toxicity

πŸ“ Files & StorageΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/files Amazon S3 Amazon S3

πŸ“Š Usage & AnalyticsΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/usage Amazon CloudWatch Amazon CloudWatch
OpenAI OpenAI /v1/organization/usage Amazon CloudWatch Amazon CloudWatch

πŸ” Authentication & Access ControlΒΆ

Provider Endpoint/Feature AWS Backend
User authentication Amazon Cognito Amazon Cognito
Multi-tenant API keys Amazon DynamoDB Amazon DynamoDB
API key rotation AWS Secrets Manager AWS Secrets Manager
Rate limiting Amazon DynamoDB Amazon DynamoDB
AWS Bedrock API keys AWS Bedrock AWS Bedrock

✨ Release History¢

v1.2.0 – Service Tiers, System Tools & Performance EnhancementsΒΆ

Introduces service tiers and latency headers for all Bedrock routes, Bedrock-specific system tools (Nova grounding), GPT5.2 API compatibility, configurable guardrail overrides, and Python 3.14 optimization.

πŸ’¬ Chat CompletionsΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/chat/completions service_tier parameter Amazon Bedrock Amazon Bedrock - service tiers
OpenAI OpenAI /v1/chat/completions Bedrock-specific system tools (Nova grounding) Amazon Bedrock Amazon Bedrock - system tools
OpenAI OpenAI /v1/chat/completions GPT5.2 API update (reasoning_effort=xhigh)

πŸ›‘οΈ Content Safety & ModerationΒΆ

Feature AWS Backend
Configuration flag for guardrail override allow Amazon Bedrock Amazon Bedrock Guardrails

Platform FeaturesΒΆ

Feature AWS Backend / Description
Service tiers and latency headers (all Bedrock routes) Amazon Bedrock Amazon Bedrock - service tiers
Python 3.14 support Upgraded to Python 3.14 with performance optimization
Dependency update Direct aiobotocore usage (replaced aioboto3)

FixesΒΆ

  • Fix warnings for duplicated FastAPI routes (/docs and /openapi.json).

v1.1.0 – Embeddings Enhancement, Prompt Caching & Advanced RoutingΒΆ

Expands multimodal embedding capabilities, adds prompt caching support, and introduces advanced routing with application inference profiles and prompt routers.

πŸ’¬ Chat CompletionsΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI Prompt caching /v1/chat/completions prompt_cache_key Amazon Bedrock Amazon Bedrock - prompt caching
OpenAI OpenAI /v1/chat/completions GPT5.1 API update (reasoning_effort=none)

🧠 Embeddings¢

Provider Endpoint/Feature AWS Backend
Intelligent S3 multimodal upload Amazon S3 Amazon S3
Intelligent Sync/async Bedrock invocation Amazon Bedrock Amazon Bedrock
Amazon Nova Amazon Nova Multimodal embeddings models
Twelve Labs Twelve Labs Marengo V3 models

🎯 Advanced Routing¢

Feature AWS Backend
Application inference profiles Amazon Bedrock Amazon Bedrock - application inference profiles
Prompt routers Amazon Bedrock Amazon Bedrock - prompt routers
Server-side ARN mapping Amazon Bedrock Amazon Bedrock
Client-side ARN passing (optional) Amazon Bedrock Amazon Bedrock

FixesΒΆ

  • /v1/chat/completions: Fix default value passed to the converse API for tools without parameters.
  • stdapi-ai Terraform module: Fix error if alarms_enabled = true but sns_topic_arn undefined.

v1.0.0 – Foundation ReleaseΒΆ

The initial release establishes core OpenAI API compatibility with AWS Bedrock backing.

πŸ’¬ Chat CompletionsΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/chat/completions Amazon Bedrock Amazon Bedrock - foundation models
All models supporting Converse/ConverseStream APIs Amazon Bedrock Amazon Bedrock - Converse API
Deepseek Deepseek /v1/chat/completions reasoning_content Amazon Bedrock Amazon Bedrock - foundation models
Qwen Qwen enable_thinking + thinking_budget parameter Amazon Bedrock Amazon Bedrock - foundation models
Qwen Qwen top_k parameter Amazon Bedrock Amazon Bedrock - foundation models

🧠 Embeddings¢

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/embeddings Amazon Bedrock Amazon Bedrock - embedding models
Cohere Cohere Embed V3 & V4 models
Twelve Labs Twelve Labs Marengo V2 models
Amazon Amazon Titan Embed V1 & V2 models

πŸŽ™οΈ Speech & AudioΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/audio/speech Amazon Polly Amazon Polly + Amazon Comprehend
OpenAI OpenAI /v1/audio/transcriptions Amazon Transcribe Amazon Transcribe
OpenAI OpenAI /v1/audio/translations Amazon Transcribe Amazon Transcribe + Amazon Translate

🎨 Image Generation¢

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/images/generations Amazon Bedrock Amazon Bedrock - image models
Amazon Nova Amazon Nova Canvas V1 models
Amazon Amazon Titan Image Generator V1 & V2 models
Stability AI Stability AI Image Core, Ultra et SD3.5 Large models

πŸ“‹ Model DiscoveryΒΆ

Provider Endpoint/Feature AWS Backend
OpenAI OpenAI /v1/models Amazon Bedrock Amazon Bedrock - model catalog

Platform FeaturesΒΆ

Feature AWS Backend
πŸ€– Bedrock Features
Content filtering and safety Amazon Bedrock Amazon Bedrock Guardrails
Cross-region inference Amazon Bedrock Amazon Bedrock - global/regional
Application inference profiles Amazon Bedrock Amazon Bedrock - inference profiles
Model parameters (temperature, top_p, etc.) Amazon Bedrock Amazon Bedrock - native parameters
Multi-region failover Amazon Bedrock Amazon Bedrock - multi-region
Bedrock guardrails Amazon Bedrock Amazon Bedrock Guardrails
☁️ AWS Services
File storage Amazon S3 Amazon S3 - presigned URLs, Transfer Acceleration
πŸ” Authentication
Static token authentication AWS Systems Manager AWS SSM Parameter Store / AWS Secrets Manager Secrets Manager
Development mode (no auth)
πŸ“Š Observability
Distributed tracing AWS X-Ray AWS X-Ray + OpenTelemetry
Structured logging Amazon CloudWatch Amazon CloudWatch (When running on ECS/EKS)
❀Health check endpoint
πŸ”’ HTTP/Security
CORS support
Trusted host validation
Proxy headers (X-Forwarded-*)
GZip compression
πŸ“š Documentation
Interactive API docs & OpenAPI schema
πŸ”Œ Compatibility
Provider-specific parameters