Overview
Fano Speech API service (2026 version)
Cantonese Speech-to-Text on AWS Marketplace with automatic code-switching (Cantonese/Mandarin/English) and speaker diarization (up to 8 speakers) for calls, meetings, and voice applications.
Fano Speech API is an API-first Speech-to-Text service for teams building reliable voice workflows on AWS. It supports both Streaming (real-time) STT and Async (batch) STT, so you can transcribe live audio streams and process recordings at scale using one consistent platform.
Streaming Speech-to-Text (Released Dec 2025)
Transcribe audio in real time with low latency, ideal for:
- Live captioning and real-time transcription;
- Voice agents and conversational interfaces;
- Immediate command recognition and in-the-moment workflows.
Key capabilities include:
- Seamless multilingual code-switching: Accurately transcribes Cantonese, Mandarin, and English within a single audio stream, without manually switching language codes;
- Keyword biasing: Provide a list of keywords (brand names, product terms, industry jargon) to boost recognition accuracy for your domain;
- Automatic punctuation: Output includes intelligent punctuation so transcripts are immediately readable and ready for downstream use.
Async Speech-to-Text (Released Nov 2025)
Process recordings efficiently with asynchronous jobs, optimized for scale and operational stability.
Key capabilities include:
- High-accuracy monolingual models: Dedicated models optimized for English, Mandarin, and Cantonese for single-language audio;
- Multilingual ASR model (yue-x-auto): Designed for Hong Kong and Singapore language mixing, with support for Cantonese/Mandarin/English code-switching without manual language tags;
- Default speaker diarization: Automatically labels up to 8 speakers, optimized for call logs, meetings, interviews, and multi-speaker recordings;
- Keyword biasing: Improve accuracy on domain terminology by passing a keyword list in the request;
- Webhook callback for completed jobs (Enhanced Dec 2025): Receive a callback notification when transcription is finished, so there is no need to poll job status repeatedly.
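The Async flow described above (submit a job with a keyword list, then wait for a webhook instead of polling) might look roughly like the sketch below. This listing does not document the request or callback schema, so every field name here (`model`, `audio_url`, `keywords`, `callback_url`, `status`, `job_id`) and both helper functions are hypothetical placeholders; only the model name yue-x-auto comes from the listing. Check the vendor documentation for the real shapes.

```python
import json


def build_job_request(audio_url, callback_url, keywords=(), model="yue-x-auto"):
    """Assemble a JSON body for an async transcription job.

    All field names are hypothetical; the real API may use different ones.
    """
    return json.dumps({
        "model": model,                # multilingual model named in the listing
        "audio_url": audio_url,        # assumed: where the recording lives
        "keywords": list(keywords),    # assumed: keyword-biasing list
        "callback_url": callback_url,  # assumed: where the webhook is sent
    })


def handle_callback(raw_body, on_done):
    """Parse a webhook body and call on_done(job_id) if the job has finished.

    Returns True when the callback reported completion, False otherwise.
    The "status" and "job_id" keys are assumptions, not documented fields.
    """
    event = json.loads(raw_body)
    if event.get("status") == "completed":
        on_done(event["job_id"])
        return True
    return False
```

In a real service, `handle_callback` would be called from whatever HTTP endpoint is registered as the callback URL, and `on_done` would fetch or store the finished transcript.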
Common use cases include:
- Contact center transcription, QA, and analytics;
- Real-time agent assist and voicebot experiences;
- Meeting and interview transcription with speaker separation;
- Compliance and audit workflows that need readable, structured transcripts;
- Domain-specific transcription where product names and jargon must be recognized correctly, supported by keyword biasing.
Highlights
- Low-latency Streaming STT (real-time transcription) for live captioning, voice agents, and instant command recognition
- Cantonese, Mandarin, and English code-switching (no manual language switching) for seamless multilingual transcription in a single stream or job
- Keyword biasing and readable output: boost domain terms and get cleaner transcripts with automatic punctuation (plus speaker diarization for up to 8 speakers in Async)
Details
Pricing
| Dimension | Description | Cost/12 months |
|---|---|---|
| USD 1,000 (Enterprise Plan) | USD 1,000 Plan Balance (Usage Credits). Prepaid balance of USD 1,000 in usage credits for Fano Speech API. Credits are consumed based on metered transcription minutes: Async STT at $0.012/min and Real-time (Streaming) STT at $0.024/min. Use credits across both modes until the balance is exhausted. | $1,000.00 |
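To make the credit arithmetic concrete, here is a small sketch that estimates how far the USD 1,000 balance goes at the two per-minute rates quoted in the table. It assumes credits are deducted in direct proportion to minutes; any rounding or minimum-charge rules are not stated in this listing and are not modeled. The function names are illustrative only.

```python
from decimal import Decimal

# Per-minute rates quoted in the pricing table (USD).
ASYNC_RATE = Decimal("0.012")
STREAMING_RATE = Decimal("0.024")


def credits_used(async_minutes, streaming_minutes):
    """USD credits consumed by a mix of Async and Streaming minutes."""
    return (Decimal(str(async_minutes)) * ASYNC_RATE
            + Decimal(str(streaming_minutes)) * STREAMING_RATE)


def remaining_balance(balance, async_minutes, streaming_minutes):
    """Balance left after the given usage (may go negative if overspent)."""
    return Decimal(str(balance)) - credits_used(async_minutes, streaming_minutes)


def max_whole_minutes(balance, rate):
    """Largest whole number of minutes that fits in the balance at one rate."""
    return int(Decimal(str(balance)) // rate)
```

Under these assumptions, the full balance covers about 83,333 Async minutes or about 41,666 Streaming minutes, and a mixed workload of 50,000 Async minutes plus 10,000 Streaming minutes would consume USD 840, leaving USD 160.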
Vendor refund policy
Contact support@fano.ai
Custom pricing options
Legal
Vendor terms and conditions
Content disclaimer
Delivery details
Software as a Service (SaaS)
SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.
Support
Vendor support
AWS infrastructure support
AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.