- The AI Update with Kevin Davis
- Posts
- New Open Source AI Voice Cloning Tech Released...
New Open Source AI Voice Cloning Tech Released...
We're Back! Great to be back in the saddle sending out a newsletter this morning. Looking forward to another stellar year for AI.
BREAKING NEWS
OpenVoice: A Game-Changer in AI Voice Cloning or a Pandora's Box?
In a move that's set Silicon Valley abuzz, MyShell, along with researchers from MIT and Tsinghua University, has flung open the doors to voice cloning technology with its open-source platform, OpenVoice.
Today, we proudly open source our OpenVoice algorithm, embracing our core ethos - AI for all.
Experience it now: app.myshell.ai/bot/z6Bvua/170…. Clone voices with unparalleled precision, with granular control of tone, from emotion to accent, rhythm, pauses, and intonation, using just a… twitter.com/i/web/status/1…
— MyShell (@myshell_ai)
2:00 PM • Jan 2, 2024
This tool promises unprecedented precision in replicating human voices, complete with nuanced emotional inflections and accents, all from a mere snippet of audio. But is this a breakthrough for inclusivity in AI, or are we teetering on the edge of a disconcerting precipice?
OpenVoice isn't just another voice cloning tool; it's a statement – "AI for All." This ethos, while noble, raises questions about the potential misuse of such technology. Could we soon see a surge in deepfakes or a new era of cyber impersonation? The implications are vast and varied.
The mechanics behind OpenVoice are as intriguing as they are concerning. It utilizes a dual-model system: a text-to-speech engine paired with a tone converter.
The former imbibes style and language nuances, trained on a diverse set of audio samples, while the latter adapts to the user's unique vocal attributes. The result? A voice clone that's not just a robotic echo but a dynamic doppelgänger.
Reported testing of OpenVoice yielded a voice clone that, while still betraying its synthetic origins, was impressively swift and malleable. The ability to tweak emotional tones added layers to the clone that other platforms simply lacked.
But let's not gloss over who's steering this ship. MyShell, barely a year old and flush with seed money, is already commanding an impressive user base. The startup's model is intriguing – a decentralized AI app platform that's part subscription-based, part promotional hub for third-party bot creators.
It's a clever monetization strategy for an open-source tool, but one that doesn't fully allay concerns over the potential commodification of personal voices.
As we stand at the threshold of this new frontier, we must ask: How will MyShell and the wider industry navigate the ethical minefields that lie ahead?
The company's commitment to supporting open-source research with grants and computing power is commendable, but the stewardship of such potent technology will be its true test.
In the end, OpenVoice is not just a technical marvel; it's a mirror reflecting our collective anxiety and excitement about AI's role in our future. It's a reminder that with great power comes great responsibility – a mantra that all players in the AI arena would do well to remember.
The Year Ahead
Our Long Break From Publishing
Some of you may have noticed the last issue was over 2 weeks ago on December 20th. There are a few reasons for that, but the main one was taking a break and focusing on myself and my family.
As some of you know I was diagnosed with Cancer last year and have been continuing the fight this year after it metastasized in my liver and lungs.
Treatments have been going well, tumors have been shrinking in my liver, and my CEA number which measures cancer cells in my bloodstream has improved dramatically.
The second reason is the news for December seemed like a snoozefest most of the month. In light of that, I am planning on making some dramatic changes this year.
What I Have In Mind
Membership Training within Skool or Circle with both paid and free levels.
Every issue highlights at least one useful prompt from my archive of 12+ months of using ChatGPT and MidJourney
Expanding content onto other platforms like YouTube
Providing an option for weekday, weekly, or both newsletter subscriptions.
More… What would make this newsletter better for you that other people are not doing?
OTHER NEWS
Midjourney's AI Video Ambition: A Leap into Uncharted Waters
Silicon Valley's Midjourney is charting a new course, one that could redefine the generative video industry. The company, known for its Discord-based image generation tool, is now training its sights on a "text to video" model.
David Holz, CEO, announced during a Discord session that Midjourney's video model training kicks off in January, with a product launch expected within months. The move is strategic, leveraging their mature image model to stir the market's competitive pot.
Yet, Midjourney is late to the party. Stability AI's Stable Video Diffusion, Meta's EMU, and other established players like Pika and Runway ML have already staked claims in the video generation domain.
Even lesser-known entities like Leonardo AI boast video capabilities. Midjourney's entry isn't just a splash—it's a dive into a crowded pool.
The company's v6 update, with enhanced image realism and prompt adherence, underscores its bid to remain a contender. Success hinges on model cohesion—crucial in a nascent yet imperfect field.
But this isn't merely a tech race. The ripple effects of AI video generation are profound. The creative and media landscapes are poised for upheaval.
Imagine entertainers and advertisers effortlessly crafting content or the potential shifts in our reality perception.
Midjourney's quest is ambitious, its implications staggering. As they and others refine their AI tools, we're on the cusp of a new creative era. The question isn't if the industry will transform—it's how and when.
SOCIAL MEDIA
Image Challenge: Masks
Inviting you all to my first challenge of 2024 🥳🥂
AI art challenge #140:"MASKS"!!!
Base Prompts in ALT and in text (half &half because ChatGPT DALL-E 3 prompts are too long for ALT text)🤗
Like & RT ♻️
Share your prompt in ALT so we can all learn together 🫱🏻🫲🏼
Excited to… twitter.com/i/web/status/1…
— 4rtofficially1ntelligent (@4rtofficial)
1:00 AM • Jan 3, 2024
FEEDBACK LOOP
Sincerely, How Did We Do With This Issue?I would really appreciate your feedback to make this newsletter better... |
LIKE IT, SHARE IT
That’s all for today.