techcrunch.com
Open in
urlscan Pro
2a00:1288:110:c204::c000
Public Scan
Submitted URL: https://techcrunch.com/2024/12/12/cartesia-claims-its-ai-is-efficient-enough-to-run-pretty-much-anywhere/)
Effective URL: https://techcrunch.com/2024/12/12/cartesia-claims-its-ai-is-efficient-enough-to-run-pretty-much-anywhere/
Submission: On December 13 via api from BE — Scanned from ES
Effective URL: https://techcrunch.com/2024/12/12/cartesia-claims-its-ai-is-efficient-enough-to-run-pretty-much-anywhere/
Submission: On December 13 via api from BE — Scanned from ES
Form analysis
2 forms found in the DOMGET https://techcrunch.com/
<form role="search" method="get" action="https://techcrunch.com/" class="wp-block-search__button-inside wp-block-search__text-button wp-block-search"><label class="wp-block-search__label screen-reader-text"
for="wp-block-search__input-1">Search</label>
<div class="wp-block-search__inside-wrapper "><input autocomplete="off" data-wp-on--keydown="actions.queryAutoSearch" class="wp-block-search__input" id="wp-block-search__input-1" placeholder="Search articles or topics" value="" type="search"
name="s" required=""><button aria-label="Submit" class="wp-block-search__button wp-element-button" type="submit">Submit</button></div>
</form>
POST /
<form method="POST" action="/" class="newsletter-signup-compact__form validated-form" aria-busy="false" novalidate="">
<div class="d-flex gap-30 g-col-8 flex-column flex-md-row">
<div class="d-grid gap-8 g-col-8">
<div class="newsletter-signup-item g-col-12 g-col-sm-6 g-col-lg-3 "><input type="checkbox" id="newsletter-techcrunch-daily-news" name="newsletters[]" tabindex="-1" class="visually-hidden newsletter-signup-item__control" value="2725851"><button
class="newsletter-signup-item__select" type="button">
<h5 class="newsletter-signup-item__title" aria-hidden="true">TechCrunch Daily News</h5>
<div class="newsletter-signup-item__status-icon"></div>
<p class="newsletter-signup-item__description" aria-hidden="true">Every weekday and Sunday, you can get the best of TechCrunch’s coverage.</p><label class="visually-hidden newsletter-signup-item__action-label"
for="newsletter-techcrunch-daily-news">Add TechCrunch Daily News to your subscription choices</label>
</button></div>
<div class="newsletter-signup-item g-col-12 g-col-sm-6 g-col-lg-3 "><input type="checkbox" id="newsletter-techcrunch-ai" name="newsletters[]" tabindex="-1" class="visually-hidden newsletter-signup-item__control" value="2758392"><button
class="newsletter-signup-item__select" type="button">
<h5 class="newsletter-signup-item__title" aria-hidden="true">TechCrunch AI</h5>
<div class="newsletter-signup-item__status-icon"></div>
<p class="newsletter-signup-item__description" aria-hidden="true">TechCrunch's AI experts cover the latest news in the fast-moving field.</p><label class="visually-hidden newsletter-signup-item__action-label"
for="newsletter-techcrunch-ai">Add TechCrunch AI to your subscription choices</label>
</button></div>
<div class="newsletter-signup-item g-col-12 g-col-sm-6 g-col-lg-3 "><input type="checkbox" id="newsletter-techcrunch-space" name="newsletters[]" tabindex="-1" class="visually-hidden newsletter-signup-item__control" value="2725844"><button
class="newsletter-signup-item__select" type="button">
<h5 class="newsletter-signup-item__title" aria-hidden="true">TechCrunch Space</h5>
<div class="newsletter-signup-item__status-icon"></div>
<p class="newsletter-signup-item__description" aria-hidden="true">Every Monday, gets you up to speed on the latest advances in aerospace.</p><label class="visually-hidden newsletter-signup-item__action-label"
for="newsletter-techcrunch-space">Add TechCrunch Space to your subscription choices</label>
</button></div>
<div class="newsletter-signup-item g-col-12 g-col-sm-6 g-col-lg-3 "><input type="checkbox" id="newsletter-startups-weekly" name="newsletters[]" tabindex="-1" class="visually-hidden newsletter-signup-item__control" value="2725843"><button
class="newsletter-signup-item__select" type="button">
<h5 class="newsletter-signup-item__title" aria-hidden="true">Startups Weekly</h5>
<div class="newsletter-signup-item__status-icon"></div>
<p class="newsletter-signup-item__description" aria-hidden="true">Startups are the core of TechCrunch, so get our best coverage delivered weekly.</p><label class="visually-hidden newsletter-signup-item__action-label"
for="newsletter-startups-weekly">Add Startups Weekly to your subscription choices</label>
</button></div>
</div>
<div class="g-col-4 d-flex flex-column gap-8 justify-content-center flex-basis-30 flex-shrink-0">
<legend role="status" aria-live="polite" class="newsletter-signup-compact__status">No newsletters selected.</legend>
<fieldset>
<div class="form-input d-flex gap-8 flex-column ">
<div class="d-flex flex-column gap-8 form-input__wrapper"><input value="" class="form-input__input flex-grow-1 " type="email" id="email" name="email" placeholder="Email address" required="">
<div class="d-flex gap-16">
<div class="btn"><button type="submit" class="btn__link is-disabled force-small">Subscribe</button></div>
<p class="newsletter-signup-compact__compliance">By submitting your email, you agree to our <a href="https://legal.yahoo.com/us/en/yahoo/terms/otos/index.html">Terms</a> and
<a href="https://legal.yahoo.com/us/en/yahoo/privacy/index.html">Privacy Notice</a>.</p>
</div>
</div>
</div>
</fieldset>
</div>
</div>
</form>
Text Content
We, TechCrunch, are part of the Yahoo family of brands. When you use our sites and apps, we use cookies to: * provide our sites and apps to you * authenticate users, apply security measures, and prevent spam and abuse, and * measure your use of our sites and apps If you click 'Accept all', we and our partners including 234 who are part of the IAB Transparency & Consent Framework, will also store and/or access information on a device (in other words, use cookies) and use precise geolocation data and other personal data such as IP address and browsing and search data, for personalised advertising and content, advertising and content measurement, and audience research and services development. If you do not want us and our partners to use cookies and personal data for these additional purposes, click 'Reject all'. If you would like to customise your choices, click 'Manage privacy settings'. You can change your choices at any time by clicking on the 'Privacy & cookie settings' or 'Privacy controls' link on our sites and apps. Find out more about how we use your personal data in our privacy policy and cookie policy. View our 234 partners Manage privacy settingsReject allAccept all Skip to content TechCrunch Desktop Logo TechCrunch Mobile Logo * Latest * Startups * Venture * Apple * Security * AI * Apps * Events * Podcasts * Newsletters Sign In Search Submit Site Search Toggle Mega Menu Toggle TOPICS Latest AI Amazon Apps Biotech & Health Climate Cloud Computing Commerce Crypto Enterprise EVs Fintech Fundraising Gadgets Gaming Google Government & Policy Hardware Instagram Layoffs Media & Entertainment Meta Microsoft Privacy Robotics Security Social Space Startups TikTok Transportation Venture MORE FROM TECHCRUNCH Events Startup Battlefield StrictlyVC Newsletters Podcasts Videos Partner Content TechCrunch Brand Studio Crunchboard Contact Us Sign In Image Credits:v_alex / Getty Images Apps CARTESIA CLAIMS ITS AI IS EFFICIENT ENOUGH TO RUN PRETTY MUCH ANYWHERE Kyle Wiggers 4:00 AM PST · December 12, 2024 It’s becoming increasingly costly to develop and run AI. OpenAI’s AI operations costs could reach $7 billion this year, while Anthropic’s CEO recently suggested that models costing over $10 billion could arrive soon. So the hunt is on for ways to make AI cheaper. Some researchers are focusing on techniques to optimize existing model architectures — i.e. the structure and components that make models tick. Others are developing new architectures they believe have a better shot of scaling up affordably. Karan Goel is in the latter camp. At the startup he helped co-found, Cartesia, Goel’s working on what he calls state space models (SSMs), a newer, highly efficient model architecture that can handle large amounts of data — text, images, and so on — at once. “We believe new model architectures are necessary to build truly useful AI models,” Goel told TechCrunch. “The AI industry is a competitive space, both commercial and open source, and building the best model is crucial to success.” ACADEMIC ROOTS Before joining Cartesia, Goel was a PhD candidate in Stanford’s AI lab, where he worked under the supervision of computer scientist Christopher Ré, among others. While at Stanford, Goel met Albert Gu, a fellow PhD candidate in the lab, and the two sketched out what would become the SSM. Goel eventually took part-time jobs at Snorkel AI, then Salesforce, while Gu became assistant professor at Carnegie Mellon. But Gu and Goel went on studying SSMs, releasing several pivotal research papers on the architecture. In 2023, Gu and Goel — along with two of their former Stanford peers, Arjun Desai and Brandon Yang — decided to join forces to launch Cartesia to commercialize their research. Cartesia’s founding team. From left to right: Brandon Yang, Karan Goel, Albert Gu, and Arjun Desai. Image Credits:Cartesia Cartesia, whose founding team also includes Ré, is behind many derivatives of Mamba, perhaps the most popular SSM today. Gu and Princeton professor Tri Dao started Mamba as an open research project last December, and continue to refine it through subsequent releases. Cartesia builds on top of Mamba in addition to training its own SSMs. Like all SSMs, Cartesia’s give AI something like a working memory, making the models faster — and potentially more efficient — in how they draw on computing power. SSMS VS. TRANSFORMERS Most AI apps today, from ChatGPT to Sora, are powered by models with a transformer architecture. As a transformer processes data, it adds entries to something called a “hidden state” to “remember” what it processed. For instance, if the model is working its way through a book, the hidden state values might be representations of words in the book. The hidden state is part of the reason transformers are so powerful. But it’s also the cause of their inefficiency. To “say” even a single word about a book a transformer just ingested, the model would have to scan through its entire hidden state — a task as computationally demanding as rereading the whole book. In contrast, SSMs compress every prior data point into a sort of summary of everything they’ve seen before. As new data streams in, the model’s “state” gets updated, and the SSM discards most previous data. The result? SSMs can handle large amounts of data while outperforming transformers on certain data generation tasks. With inference costs going the way they are, that’s an attractive proposition indeed. ETHICAL CONCERNS Cartesia operates like a community research lab, developing SSMs in partnership with outside organizations as well as in-house. Sonic, the company’s latest project, is an SSM that can clone a person’s voice or generate a new voice and adjust the tone and cadence in the recording. Goel claims that Sonic, which is available through an API and web dashboard, is the fastest model in its class. “Sonic is a demonstration of how SSMs excel on long-context data, like audio, while maintaining the highest performance bar when it comes to stability and accuracy,” he said. Cartesia’s Sonic model can customize speech to a fair degree, including the PROSODY. Image Credits:Cartesia While Cartesia has managed to ship products quickly, it’s stumbled into many of the same ethical pitfalls that’ve plagued other AI model-makers. Cartesia trained at least some of its SSMs on The Pile, an opendata set known to contain unlicensed copyrighted books. Many AI companies argue that fair-use doctrine shields them from infringement claims. But that hasn’t stopped authors from suing Meta and Microsoft, plus others, for allegedly training models on The Pile. And Cartesia has few apparent safeguards for its Sonic-powered voice cloner. A few weeks back, I was able to create a clone of Vice President Kamala Harris’ voice using campaign speeches (listen below). Cartesia’s tool only requires that you check a box indicating that you’ll abide by the startup’s ToS. Cartesia isn’t necessarily worse in this regard than other voice cloning tools on the market. With reports of voice clones beating bank security checks, however, the optics aren’t amazing. Goel wouldn’t say Cartesia is no longer training models on The Pile. But he did address the moderation issues, telling TechCrunch that Cartesia has “automated and manual review” systems in place and is “working on systems for voice verification and watermarking.” “We have dedicated teams testing for aspects like technical performance, misuse, and bias,” Goel said. “We’re also establishing partnerships with external auditors to provide additional independent verification of our models’ safety and reliability … We recognize this is an ongoing process that requires constant refinement.” BUDDING BUSINESS Goel says that “hundreds” of customers are paying for Sonic API access, Cartesia’s primary line of revenue, including automated calling app Goodcall. Cartesia’s API is free for up to 100,000 characters read aloud, with the most expensive plan topping out at $299 per month for 8 million characters. (Cartesia also offers an enterprise tier with dedicated support and custom limits.) By default, Cartesia uses customer data to train its models — a not-unheard-of policy, but one unlikely to sit well with privacy-conscious users. Goel notes that users can opt out if they wish, and that Cartesia offers custom retention policies for larger orgs. Cartesia’s data practices don’t appear to be hurting business, for what it’s worth — at least not while Cartesia has a technical advantage. Goodcall CEO Bob Summers says that he chose Sonic because it was the only voice generation model with a latency under 90 milliseconds. “[It] outperformed its next best alternative by a factor of four,” Summers added. Goodcall’s AI “agent” service relies on Cartesia’s Sonic API. Image Credits:Goodcall Today, Sonic’s being used for gaming, voice dubbing, and more. But Goel thinks it’s only scratching the surface of what SSMs can do. His vision is models that run on any device and understand and generate any modality of data — text, images, videos, and so on — almost instantly. In a small step toward this, Cartesia this summer launched a beta of Sonic On-Device, a version of Sonic optimized to run on phones and other mobile devices for applications like real-time translation. Alongside Sonic On-Device, Cartesia published Edge, a software library to optimize SSMs for different hardware configurations, and Rene, a compact language model. “We have a big, long-term vision of becoming the go-to multimodal foundation model for every device,” Goel said. “Our long-term roadmap includes developing multimodal AI models, with the goal of creating real-time intelligence that can reason over massive contexts.” If that’s to come to pass, Cartesia will have to convince potential new clients its architecture is worth suffering the learning curve. It’ll also have to stay ahead of other vendors experimenting with alternatives to the transformer. Startups Zephyra, Mistral, and AI21 Labs have trained hybrid Mamba-based models. Elsewhere, Liquid AI, led by robotics luminary Daniela Rus, is developing its own architecture. Goel asserts that 26-employee Cartesia is positioned for success, though — thanks in part to a new cash infusion. The company this month closed a $22 million funding round led by Index Ventures, bringing Cartesia’s total raised to $27 million. Shardul Shah, partner at Index Ventures, sees Cartesia’s tech one day driving apps for customer service, sales and marketing, robotics, security, and more. “By challenging the traditional reliance on transformer-based architectures, Cartesia has unlocked new ways to build real-time, cost-effective, and scalable AI applications,” he said. “The market is demanding faster, more efficient models that can run anywhere — from data centers to devices. Cartesia’s technology is uniquely poised to deliver on this promise and drive the next wave of AI innovation.” A* Capital, Conviction, General Catalyst, Lightspeed, and SV Angel also participated in San Francisco-based Cartesia’s latest funding round. TechCrunch has an AI-focused newsletter! Sign up here to get it in your inbox every Wednesday. Topics AI, Apps, Cartesia, edge AI, Enterprise, Exclusive, Funding, Generative AI, Index Ventures, speech generation, SSM, Startups, state space models, voice generation MOST POPULAR * CARTA IS MAKING IT TOO DIFFICULT TO CANCEL SUBSCRIPTIONS, SOME FOUNDERS SAY * Charles Rollet * TESLA’S LOSS IS ZOOX’S GAIN * Kirsten Korosec * CHATGPT NOW UNDERSTANDS REAL-TIME VIDEO, SEVEN MONTHS AFTER OPENAI FIRST DEMOED IT * Kyle Wiggers * IT SURE LOOKS LIKE OPENAI TRAINED SORA ON GAME CONTENT — AND LEGAL EXPERTS SAY THAT COULD BE A PROBLEM * Kyle Wiggers * GOOGLE UNVEILS PROJECT MARINER: AI AGENTS TO USE THE WEB FOR YOU * Maxwell Zeff * RESEARCHERS UNCOVER CHINESE SPYWARE USED TO TARGET ANDROID DEVICES * Carly Page * GOOGLE SAYS ITS NEW QUANTUM CHIP INDICATES THAT MULTIPLE UNIVERSES EXIST * Julie Bort Kyle Wiggers Senior Reporter, Enterprise @Kyle_L_Wiggers Kyle Wiggers is a senior reporter at TechCrunch with a special interest in artificial intelligence. His writing has appeared in VentureBeat and Digital Trends, as well as a range of gadget blogs including Android Police, Android Authority, Droid-Life, and XDA-Developers. He lives in Brooklyn with his partner, a piano educator, and dabbles in piano himself. occasionally — if mostly unsuccessfully. View Bio NEWSLETTERS See More Subscribe for the industry’s biggest tech news TECHCRUNCH DAILY NEWS Every weekday and Sunday, you can get the best of TechCrunch’s coverage. Add TechCrunch Daily News to your subscription choices TECHCRUNCH AI TechCrunch's AI experts cover the latest news in the fast-moving field. Add TechCrunch AI to your subscription choices TECHCRUNCH SPACE Every Monday, gets you up to speed on the latest advances in aerospace. Add TechCrunch Space to your subscription choices STARTUPS WEEKLY Startups are the core of TechCrunch, so get our best coverage delivered weekly. Add Startups Weekly to your subscription choices No newsletters selected. Subscribe By submitting your email, you agree to our Terms and Privacy Notice. RELATED * AI GOOGLE GEMINI: EVERYTHING YOU NEED TO KNOW ABOUT THE GENERATIVE AI MODELS * Kyle Wiggers * Maxwell Zeff 14 hours ago * AI CHATGPT: EVERYTHING YOU NEED TO KNOW ABOUT THE AI-POWERED CHATBOT * Kyle Wiggers * Cody Corrall * Alyssa Stringer 16 hours ago * Government & Policy IT SURE LOOKS LIKE OPENAI TRAINED SORA ON GAME CONTENT — AND LEGAL EXPERTS SAY THAT COULD BE A PROBLEM * Kyle Wiggers 2 days ago LATEST IN APPS See More * Apps WHATSAPP LETS YOU SELECT SPECIFIC PEOPLE WITHIN A GROUP TO START A GROUP CALL WITHOUT DISTURBING ANYONE * Ivan Mehta 3 hours ago * AI GOOGLE GEMINI: EVERYTHING YOU NEED TO KNOW ABOUT THE GENERATIVE AI MODELS * Kyle Wiggers * Maxwell Zeff 14 hours ago * In Brief NEARLY HALF OF US TEENS ARE ONLINE ALMOST CONSTANTLY, PEW STUDY FINDS * Aisha Malik 16 hours ago * X * LinkedIn * Facebook * Instagram * youTube * Mastodon * Threads * Bluesky * TechCrunch * Staff * Contact Us * Advertise * Crunchboard Jobs * Site Map * Terms of Service * Privacy Policy * RSS Terms of Use * Privacy & Cookie Settings * Code of Conduct * About Our Ads * Project Mariner * OpenAI’s Sora * Gemini 2.0 * Cruise Robotaxis * 12 Days of OpenAI * Tech Layoffs * ChatGPT © 2024 Yahoo. Some areas of this page may shift around if you resize the browser window. Be sure to check heading and document order.