Voice is a super interesting modality right now - maybe the first modality we're seeing move to open-source models across a number of scale-ups and enterprises. Reliability concerns, high costs, and the improving performance of open-source models are pushing engineers to do their own fine-tuning rather than relying on third-party vendors of proprietary models. Many of these orgs have already been collecting their own first-party data, and with data vendors like Extrian, David AI, etc., they can now train really high-quality models. RL has been insanely hyped, but it's been unclear how long it would take scale-ups and enterprises to actually lean in. Voice AI might be hitting that inflection point faster than expected.
The harness conversation is the interesting one right now, in my opinion. Models are converging; the differentiator is what sits around them. Fine-tuning optimizes the model. The harness optimizes the relationship between a specific person and a specific model. Both matter, but only one compounds with use. I'm really curious to see how that extends to voice.
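To make the distinction concrete, here's a toy sketch (every name in it is invented, not from any real library): the model stays frozen, and the harness accumulates user-specific context around it, which is why the harness compounds with use while a one-off fine-tune doesn't:

```python
# Toy sketch: the model is a frozen black box; the harness accumulates
# user-specific context that gets injected on every call.
# All names (Harness, call_model, learn) are invented for illustration.

class Harness:
    def __init__(self, call_model):
        self.call_model = call_model  # any fixed chat model, treated as a black box
        self.memory = []              # durable per-user preferences and corrections

    def learn(self, note: str) -> None:
        # The compounding step: feedback persists across sessions,
        # so the same frozen model gets more useful over time.
        self.memory.append(note)

    def run(self, prompt: str) -> str:
        # Fine-tuning would change call_model itself; the harness instead
        # changes what the model sees, based on what it knows about this user.
        context = "\n".join(self.memory)
        return self.call_model(f"{context}\n\nUser: {prompt}")

# Stub model so the sketch runs end to end.
h = Harness(call_model=lambda p: f"model saw:\n{p}")
h.learn("Prefers terse answers; codebase is in Go.")
print(h.run("How do I parse JSON?"))
```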
The whole RL fine-tuning space is really fascinating, I think. At OpsCompanion, we're slowly stumbling onto something that could be quite powerful: a full eval system for the entire SDLC.
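Roughly the shape of it, heavily simplified and with hypothetical names: the same scored cases gate deploys in CI and can double as reward signals for RL fine-tuning.

```python
# Heavily simplified sketch (all names hypothetical): one set of scored
# eval cases serves both CI gating and RL fine-tuning rewards.

from dataclasses import dataclass
from typing import Callable

@dataclass
class EvalCase:
    prompt: str
    score: Callable[[str], float]  # 1.0 = pass, 0.0 = fail

def run_evals(model: Callable[[str], str], cases: list[EvalCase]) -> float:
    # In CI this average gates a deploy; during training, the per-case
    # scores can be fed back as rewards for RL fine-tuning.
    return sum(case.score(model(case.prompt)) for case in cases) / len(cases)

cases = [
    EvalCase(
        prompt="Summarize: the deploy failed at step 3",
        score=lambda out: float("step 3" in out),
    ),
]

# Stub model so the sketch runs end to end.
print(run_evals(lambda p: f"Noted: {p}", cases))
```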
Check out Attention Labs. They're helping voice models and agents hear and understand reliably.
I don't think open-source voice is a thing in any meaningful way. Current open-source models don't scale, sound awful, and fine-tuning isn't straightforward due to voice consent issues. Open source is a much bigger deal for LLMs, with Mistral, DeepSeek, or Llama.