Curated weekly · 19 tools · 30 categories
The AI tool landscape,
curated & ranked.
Each entry includes pricing, use cases, limitations, and an AIMenta editorial verdict — so you can spend less time evaluating and more time deploying.
19 matching tools for "voice"
Descript
· Descript
Edit video and podcasts by editing the transcript. Industry-defining tool for podcasters and content creators; AI features include voice cloning, eye contact correction, and studio sound.
ElevenLabs
· ElevenLabs
The category-defining voice AI. Highest-quality TTS, voice cloning from 30 seconds of audio, and an expanding library of conversational voice models. The default for production voice.
ABBYY Vantage
· ABBYY
ABBYY Vantage is an enterprise intelligent document processing (IDP) platform combining OCR, machine learning document classification, and data extraction into a low-code platform. Unlike cloud-native services (AWS Textract, Azure Document Intelligence), ABBYY Vantage supports on-premises deployment and provides 150+ pre-built skills for common document types: invoices, purchase orders, contracts, ID documents, bank statements, and customs forms. For APAC enterprises in regulated sectors (financial services, healthcare, government, logistics) where data sovereignty requires on-premises deployment or where document complexity exceeds cloud API capabilities, ABBYY Vantage is the enterprise IDP recommendation.
AWS Textract
· Amazon Web Services
AWS Textract is a fully managed machine learning document processing service that automatically extracts text, handwriting, tables, and form data from scanned documents and images. Unlike simple OCR, Textract understands document structure: it can identify form fields, table cells, and key-value pairs without requiring custom templates. For APAC enterprises on AWS running high-volume document processing workflows, such as KYC document extraction (passports, identity documents), invoice and purchase order processing, contract data extraction, and insurance claims processing, Textract provides a scalable, API-accessible intelligent document processing (IDP) layer that integrates natively with AWS storage, Lambda, and downstream business applications.
Azure Document Intelligence
· Microsoft
Azure Document Intelligence (formerly Form Recognizer) is Microsoft's AI document processing service, offering pre-built extraction models for common document types (invoices, receipts, ID documents, contracts) and a custom model builder for organisation-specific document types. For APAC enterprises on Azure or Microsoft 365 (the majority of large APAC financial institutions, professional services firms, and multinationals), Document Intelligence is the natural document AI choice: it integrates natively with Power Automate for workflow automation, Logic Apps for process orchestration, and Copilot Studio for document-driven conversational AI.
Coupa
· Coupa Software Inc.
Coupa is the leading AI-powered business spend management (BSM) platform that unifies procurement, supplier management, invoicing, contract management, and expense management in a single cloud platform, with AI capabilities that surface savings opportunities, automate risk monitoring, and provide predictive spend analytics across the enterprise. Coupa is widely deployed at large APAC enterprises in financial services, technology, manufacturing, and retail: organisations that manage hundreds of millions of dollars in indirect spend across multiple Asian markets and supplier networks. Coupa's Community.ai leverages anonymised spend data from its entire customer network to provide benchmarking and savings recommendations specific to spend category, industry, and geography, including APAC market-specific insights on supplier pricing and category benchmarks. For APAC finance and procurement leaders, Coupa provides the spend visibility and AI-driven control needed to reduce maverick spend, accelerate invoice processing, and manage supplier risk across complex Asian supply chains.
Deepgram
· Deepgram
Speech-to-text API focused on accuracy, latency, and customization. Nova-3 leads on real-time streaming for voice agents and call analytics.
ERNIE
· Baidu
ERNIE (Enhanced Representation through kNowledge IntEgration) is Baidu's large language model family, powering the Wenxin Yiyan (文心一言) consumer AI product. As China's dominant search engine operator, Baidu has embedded ERNIE across its ecosystem: Maps, the DuerOS voice assistant, cloud services, and enterprise AI products. ERNIE 4.5 (2025) demonstrates competitive Chinese-language performance and is the preferred model for enterprises with established Baidu Cloud relationships or state-sector compliance requirements.
Genesys Cloud CX
· Genesys
Genesys Cloud CX is an enterprise contact centre as a service (CCaaS) platform that integrates AI across the entire contact centre operation: intelligent routing, IVR, real-time agent assistance, workforce engagement management, and analytics. Genesys has deep APAC deployments in telecommunications (Telstra, Singtel, SoftBank), financial services (major APAC banks and insurers), and retail enterprises that run contact centres of 500–10,000+ agents. Genesys AI capabilities include: AI-powered routing that matches each interaction to the best-fit agent based on skills, customer history, and predicted outcomes; real-time agent copilot that provides live suggestions and knowledge articles during calls; automatic speech recognition and NLP in major APAC languages; sentiment analysis for real-time coaching triggers; and predictive engagement that identifies and intervenes with website visitors likely to need support. For APAC enterprises with large contact centre operations, Genesys Cloud represents the consolidation of voice, chat, email, social, and messaging channels on a single AI-powered platform.
Jasper
· Jasper
Marketing-focused AI writing platform with brand voice training, campaign workflows, and a library of marketing-specific templates.
Jasper
· Jasper AI Inc.
Jasper is an AI content generation platform targeting marketing teams, with strength in long-form marketing content: blog posts, ad copy, email campaigns, landing pages, and social media content. Jasper's brand voice feature allows teams to define and enforce a consistent writing style across all AI-generated content, a key differentiator versus using ChatGPT or Claude directly. For APAC content marketing teams managing high volumes of blog, email, and social content production, Jasper provides structured AI workflows above the raw capability of general-purpose LLMs.
Medallia AI
· Medallia Inc.
Medallia AI is the artificial intelligence and machine learning capability layer embedded across the Medallia Experience Cloud platform, covering customer experience (CX), employee experience (EX), and contact centre analytics. The AI capabilities include text analytics on open-ended survey responses, social feedback, and contact centre recordings; sentiment scoring and topic classification; predictive NPS and attrition modelling; and AI-generated action recommendations. For APAC enterprises already on Medallia for their Voice of Customer or employee listening programmes (common in large financial services, telecommunications, retail, and hospitality companies in Singapore, Hong Kong, Australia, and Japan), Medallia AI represents an incremental capability upgrade that improves the signal quality from existing survey investments.
Murf
· Murf AI
Studio-style voice generator with 120+ voices in 20+ languages. Strong UX for non-technical users producing e-learning, IVR, and explainer audio.
OpenAI Voice
· OpenAI
OpenAI's TTS and Realtime voice models. The Realtime API enables genuine voice agents with sub-second latency; TTS HD is a strong, less expensive alternative to ElevenLabs for narration.
Traydstream
· Traydstream
Traydstream is an AI-powered trade finance document digitisation and compliance checking platform that addresses one of APAC's most costly operational problems: Letter of Credit discrepancies. The platform uses optical character recognition and AI to extract data from trade documents (Bills of Lading, Commercial Invoices, Certificates of Origin, Packing Lists), cross-checks documents against LC terms and UCP 600 rules, and flags discrepancies before bank submission. Processing 8M+ trade finance documents per month across APAC, Europe, and the Middle East, Traydstream is deployed by DBS, HSBC, Standard Chartered, and hundreds of corporates across the Singapore-Hong Kong trade finance corridor.
UiPath (AI and Document Understanding)
· UiPath Inc.
UiPath is the leading enterprise RPA platform globally, with a deep install base across APAC in financial services, shared services, manufacturing, and BPO. UiPath AI adds Document Understanding (intelligent document processing for invoices, purchase orders, contracts, and customs forms), AI Center (an MLOps platform for deploying ML models into UiPath workflows), Autopilot (AI-assisted bot creation), and Communications Mining. For APAC enterprises with existing UiPath automation programmes, these AI features represent the upgrade path from rule-based RPA to AI-augmented intelligent automation without platform migration.
Voiceflow
· Voiceflow
No-code conversational AI platform enabling APAC enterprise teams to design and deploy AI chatbots and agents across web, WhatsApp, LINE, and messaging channels.
Writer
· Writer
Enterprise writing platform with proprietary Palmyra LLMs, brand-voice enforcement, and on-prem deployment options. Targets regulated industries.
Writer
· Writer
Enterprise AI writing platform with brand voice enforcement, style guide compliance, and team-wide content governance for APAC regulated organisations.
Need help choosing the right stack?
We help APAC enterprises design AI tool stacks that match their data, compliance, and budget realities — not vendor decks.