Home Startups Sesame Debuts Base AI Model Behind the Hit Virtual Assistant Maya

Sesame Debuts Base AI Model Behind the Hit Virtual Assistant Maya

by Biz Recap Team
Sesame debuts base ai model behind the hit virtual assistant

Sesame Unveils CSM-1B Model for AI Voice Assistant Maya

Introduction to the CSM-1B Model

AI company Sesame has introduced the CSM-1B, a powerful base model that serves as the foundation for its remarkably lifelike voice assistant, Maya. With a size of 1 billion parameters, this model is licensed under Apache 2.0, allowing for commercial applications with minimal restrictions.

Technical Specifications and Features

The CSM-1B model generates audio codes using a technique known as residual vector quantization (RVQ). This approach enables the encoding of audio into distinct tokens, a method employed by modern AI audio technologies, including those from tech giants like Google and Meta.

Utilizing a model from Meta’s Llama family as its backbone, CSM-1B is complemented by an audio decoder that enhances its functionality. The fine-tuned version of this base model is what primarily powers Sesame’s voice assistant, Maya, showcasing its adaptability and performance capabilities.

Model Capabilities and Limitations

According to Sesame, the model is versatile and can produce various voices; however, it has not been specifically fine-tuned for any individual voice. It possesses a limited ability to handle non-English languages, largely due to the nature of the training data. Notably, the specifics of the training dataset remain undisclosed by the company.

Sound Safeguards and Ethical Use

It’s important to highlight that CSM-1B currently lacks robust safeguards against misuse. Sesame has implemented an honor system, appealing to developers and users not to use the model for unethical purposes—such as mimicking someone’s voice without consent, creating deceptive content, or engaging in harmful activities.

A demonstration on the Hugging Face platform revealed how quickly CSM-1B can clone a voice, sparking concerns over its potential misuse in generating controversial speech or misinformation.

Industry Concerns and Developer Community Feedback

Consumer Reports has raised alarms regarding the lack of meaningful safeguards in many AI voice cloning technologies available on the market today. The concerns echo a broader apprehension within the industry about the implications of voice synthesis capabilities.

Company Background and Future Prospects

Co-founded by Brendan Iribe, a pioneer behind Oculus, Sesame gained significant attention in February with its voice assistants that nearly transcend the uncanny valley. Maya and another assistant named Miles exhibit human-like characteristics, such as breathing and natural speech patterns, which allow for interactive dialogues.

Beyond voice technology, Sesame is also exploring the development of AI glasses intended for daily use, which will incorporate their proprietary models. The company has secured financial backing from notable investors, including Andreessen Horowitz, Spark Capital, and Matrix Partners.

Conclusion

The introduction of CSM-1B marks a significant advancement in AI voice technology. While it demonstrates impressive capabilities, it also calls for a thoughtful approach to ensure its responsible use in an increasingly AI-driven world.

Source link

You may also like

About Us

Welcome to BizRecap, your ultimate destination for comprehensive business and market news. At BizRecap, we believe that staying informed is the cornerstone of success in today’s fast-paced world. Our mission is to deliver accurate, insightful, and timely updates across all topics related to the business and financial landscape.

Copyright ©️ 2024 BizRecap | All rights reserved.