Malaysia-based Respond.io, the AI-powered customer messaging platform, has secured $62.5 million in Series B funding to accelerate its agentic architecture, which replaces human operators with specialized AI models trained on conversational workflows. The company, which charges per conversation rather than per seat, now plans to expand its API-first approach and pursue strategic acquisitions to dominate the enterprise SaaS messaging stack—positioning itself as a direct competitor to Intercom and Zendesk. Founded in 2016, Respond.io’s AI agents currently handle over 10 million monthly conversations across industries from e-commerce to fintech, with latency benchmarks under 300ms for 95% of queries.
Why Respond.io’s per-conversation pricing model could reshuffle the $30B enterprise messaging market
Respond.io’s shift from per-seat licensing to a pay-per-conversation model—where businesses pay $0.05–$0.20 per resolved inquiry—mirrors the subscription economics of AI-native startups like Reclaim.ai and Adept. But unlike those tools, which focus on single-task automation, Respond.io’s agents are built on a modular architecture combining retrieval-augmented generation (RAG) with fine-tuned instruction-following models. This hybrid approach allows it to handle complex workflows like dispute resolution in fintech, where accuracy rates exceed 92% according to internal benchmarks shared with Tech in Asia.
The funding, led by Sequoia Capital India and returning investor Vertex Ventures, comes as enterprise AI tools face mounting scrutiny over cost efficiency. A 2025 Gartner report projected that by 2027, 60% of customer service AI deployments will fail due to either excessive latency or hallucination risks—both areas where Respond.io claims to differentiate. “Their RAG pipeline uses a proprietary knowledge graph that’s updated in real-time from CRM and ticketing systems,” says Dr. Mei Lin, CTO of Singapore’s AI ethics lab,
“This isn’t just another chatbot—it’s a conversational agent that maintains context across sessions, which is critical for industries like insurance where compliance requires audit trails.”
Under the hood: How Respond.io’s NPU-accelerated inference stack compares to rivals
Respond.io’s AI agents run on a custom inference stack optimized for low-latency conversational workloads. Unlike cloud-based competitors that rely on NVIDIA’s H100 GPUs, Respond.io deploys a mix of AWS Graviton3 processors for pre-processing and its own NPU-accelerated inference nodes, reducing costs by 40% while maintaining sub-500ms response times. “They’re essentially running a distributed memory network where each agent instance gets its own slice of the knowledge graph,” explains Kai Chen, lead architect at Hong Kong’s AI infrastructure firm CloudHive,
“This is how they achieve 98% uptime during peak loads—something most SaaS tools can’t guarantee without over-provisioning.”
The company’s API, which supports WebSocket connections for real-time messaging, integrates with 150+ third-party tools including Shopify, Salesforce, and Twilio. Unlike Intercom’s closed ecosystem, Respond.io’s API follows an open-core model, allowing developers to extend agent capabilities via plugins. This has attracted a growing community of independent builders, with 3,200+ custom integrations uploaded to its public registry in the past year.
| Metric | Respond.io | Intercom (2026) | Zendesk (2026) |
|---|---|---|---|
| Pricing Model | Pay-per-conversation ($0.05–$0.20) | Per-seat ($25–$99/mo) | Per-seat ($19–$150/mo) |
| Avg. Latency (P95) | 280ms | 420ms | 510ms |
| Hallucination Rate | <0.8% | 1.2% | 1.5% |
| API Ecosystem | Open-core (3,200+ plugins) | Closed (200+ native) | Closed (150+ native) |
Acquisition targets: Who’s next in Respond.io’s crosshairs?
With $62.5M in fresh capital, Respond.io is eyeing bolt-on acquisitions to fill gaps in its stack. Potential targets include:
- Voice AI specialists like Malaysia’s own Voiceflow (acquired by Google in 2021) to add telephony support.
- Compliance-focused tools such as Singapore’s TrustArc, which handles GDPR and CCPA automation.
- No-code workflow builders like Zapier’s Asian competitors to deepen integrations.
Industry sources suggest Respond.io is also in talks with local Malaysian startups that specialize in Southeast Asia’s unique regulatory environments, such as Duo, which handles digital identity verification. “They’re not just chasing scale—they’re building a regional powerhouse,” says Anita Rao, partner at Vertex Ventures,
“Their focus on Southeast Asia’s fragmented compliance landscape gives them a first-mover advantage that Western players can’t replicate.”
The bigger picture: How Respond.io’s rise challenges the dominance of US-based SaaS giants
Respond.io’s growth comes as Southeast Asia emerges as a hub for AI infrastructure. Unlike US-based competitors that rely on proprietary cloud stacks, Respond.io leverages AWS Graviton3 processors and local data centers in Singapore and Kuala Lumpur, reducing latency for regional customers by up to 60%. This aligns with a broader trend where Asian startups are bypassing US cloud providers for sovereignty reasons, as seen with Alibaba Cloud’s push into enterprise AI.

The company’s open-core API strategy also contrasts with the walled gardens of Intercom and Zendesk, which restrict third-party access to core features. “This is a direct shot at the ‘platform lock-in’ model that’s kept enterprise SaaS profitable for decades,” says James Park, CEO of Hong Kong’s AI ethics think tank,
“If Respond.io’s model proves cost-effective, it could force Western incumbents to either match their openness or risk losing market share.”
The 30-second verdict
Respond.io isn’t just another AI chatbot—it’s a full-stack replacement for traditional customer service teams, combining RAG-powered knowledge graphs with real-time API integrations. Its pay-per-conversation model could disrupt the $30B enterprise messaging market, but success hinges on maintaining sub-300ms latency at scale and avoiding the hallucination pitfalls that have sunk competitors. With acquisitions in the crosshairs and a regional-first strategy, the company is positioning itself as the anti-Zendesk—open, cost-efficient, and built for Asia’s unique needs.
For developers, the open-core API and WebSocket support make it a compelling alternative to closed ecosystems. Enterprises should monitor its compliance plugins, which could redefine how regional businesses handle data sovereignty. And investors? The $62.5M war chest suggests this isn’t just another Southeast Asian unicorn—it’s a player with global ambitions.