- The AI Update with Kevin Davis
AI Update: Is GPT-4 Dumber? CEO Replaces 90% Of Support Staff With AI, Code Interpreter Hack For Multiple Files
Are you seeing worse results with GPT-4? Today we look at changes to GPT-4, AI replacing support and a way to upload multiple files at once into Code Interpreter.
The Rise and Fall of GPT-4: Is OpenAI's Chatbot Getting "Lazier" and "Dumber"?
Industry insiders speculate on a radical redesign of OpenAI's powerful AI model
Imagine having a high-performance sports car one day, only to find it transformed into a clunky old pickup truck the next. Well, that's how some users of OpenAI's GPT-4 chatbot are feeling lately. The once-powerful AI model has been experiencing a decline in performance, leaving users frustrated and questioning its capabilities. But what's behind this sudden change?
Rumors are swirling in the AI community about a radical redesign of the system by OpenAI. Let's dive into the details and see what's really going on.
From Ferrari to Pickup Truck:
Late last year, OpenAI wowed the world with ChatGPT, the chatbot powered by GPT-3.5. The model was hailed as groundbreaking, with impressive reasoning capabilities and uncanny answers.
The launch of GPT-4 in March created even more excitement, as it was considered the most powerful AI model available. Developers and tech insiders flocked to try it out, eager to harness its multimodal capabilities.
However, the initial excitement quickly turned to shock when users received their bills for using GPT-4. The model was slow but accurate, and its cost was astronomical. Sharon Zhou, CEO of Lamini, a startup that works with large language models, described the new model as a high-performance but expensive machine.
A Radical Redesign:
But then, something changed. GPT-4 became faster, but its performance noticeably declined. Users took to Twitter and OpenAI's developer forum to express their frustrations. They reported weakened logic, more errors, trouble following instructions, and a general decline in the quality of responses.
It was as if the AI had become "lazier" and "dumber." Industry insiders started speculating about a major redesign of GPT-4 by OpenAI.
One theory suggests that OpenAI is creating smaller expert models within GPT-4, known as a Mixture of Experts (MOE) approach. These expert models would specialize in different subjects, such as biology, physics, chemistry, and more. When a user asks a question, the system would know which expert model to consult for the best answer. This approach could reduce costs while maintaining performance.
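To make the routing idea concrete, here is a toy sketch in Python of how a Mixture of Experts system might pick which experts to consult. It is an illustration under stated assumptions, not OpenAI's implementation: the expert names, the hand-written gate scores, and the `route` function are all made up for this example. In a real system the scores would come from a learned gating network, and the experts would not map neatly onto school subjects.

```python
import math

# Hypothetical expert names, for illustration only.
EXPERTS = ["biology", "physics", "chemistry", "general"]


def softmax(scores):
    """Turn raw gate scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route(gate_scores, k=2):
    """Pick the k highest-scoring experts and renormalize their weights."""
    probs = softmax(gate_scores)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    top = ranked[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


# Assumed gate scores for a physics-flavored question (made up, not learned).
scores = [0.1, 2.0, 0.5, 1.9]
for index, weight in route(scores, k=2):
    print(f"{EXPERTS[index]}: weight {weight:.2f}")
```

Only the selected experts do any work for a given question, which is where the rumored cost savings would come from: most of the model sits idle on each request.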
A "Ship of Theseus" Situation:
This redesign has been compared to the philosophical thought experiment known as the "Ship of Theseus." In the experiment, parts of a ship are gradually replaced over time, raising the question of when it becomes a whole new ship.
If the rumors are right, OpenAI is essentially turning GPT-4 into a fleet of smaller expert models, which some argue is a new model altogether.
Experts have shared details of GPT-4's architecture on Twitter, suggesting a MOE model with 16 experts. This approach could lead to better-generated responses and faster, cheaper results. However, there may be a tradeoff between cost and quality, as OpenAI might be sacrificing some performance to reduce expenses.
The Future of GPT-4:
OpenAI has remained tight-lipped about the redesign, leaving the AI community to speculate. Some experts believe that the decline in GPT-4's performance is temporary and that the system is collecting data to improve and learn. Only time will tell if this radical overhaul will lead to a more efficient and effective AI model.
OpenAI's GPT-4 chatbot has undergone a significant transformation, leaving users disappointed with its declining performance. Industry insiders suspect a radical redesign of the system, possibly involving a Mixture of Experts approach.
While the current state of GPT-4 may be underwhelming, the potential for improved performance and cost-effectiveness is promising. As OpenAI continues to innovate, we can expect exciting developments in the field of AI and natural language processing.
The Double-edged Sword of AI: Efficiency Gains and Job Losses
In a move that has sparked widespread controversy, Suumit Shah, the CEO of Bengaluru-based Dukaan, has replaced 90% of his company's support staff with AI. Shah took to Twitter to celebrate the decision, touting the efficiency and cost-saving benefits of the AI chatbot that now handles customer queries.
We had to layoff 90% of our support team because of this AI chatbot.
Tough? Yes. Necessary? Absolutely.
The results?
Time to first response went from 1m 44s to INSTANT!
Resolution time went from 2h 13m to 3m 12s
Customer support costs reduced by ~85%
Here's how's we did it 🧵
— Suumit Shah (@suumitshah)
5:45 PM • Jul 10, 2023
The response from the public, however, was less than enthusiastic. Shah's celebratory tweets, which boasted about the chatbot's ability to respond to customer queries instantly and cut support costs by about 85%, were met with backlash for their insensitivity towards the 23 employees who lost their jobs as a result.
In a rather tone-deaf response to the criticism, Shah expressed annoyance at the "downer" on his "AI love fest." He added that he would discuss assistance for his laid-off staff on LinkedIn, where people are supposedly more interested in "profitability and not sympathy."
It's clear that Shah sees AI as the inevitable future of business, stating, "AI is taking our jobs. Over time, everybody will start doing this. It's not just us." However, his cavalier attitude towards the human cost of this transition is troubling.
The move also raises questions about the quality and safety of AI in customer service roles. The US National Eating Disorders Association recently replaced its entire helpline staff with an AI chatbot, which was quickly disabled after it gave out harmful information.
As AI continues to advance, companies like IBM have announced plans to stop hiring for jobs that could potentially be performed by AI. A Goldman Sachs report estimates that generative AI could affect as many as 300 million full-time jobs globally, which is adding to the stress and anxiety among tech workers.
Shah's actions and subsequent response serve as a stark reminder of the ethical implications of AI's encroachment into the job market. It's clear that while AI can bring efficiency and cost savings, it also brings a host of social and ethical challenges that we must navigate carefully.
As we move into this new era, let's hope that other leaders in the tech industry approach these challenges with a greater degree of empathy and foresight than Shah has demonstrated. After all, progress should never come at the expense of people's livelihoods.
Images Of The Day
Street art in the style of Banksy.
street art mural of street rat hipster and his t-bucket rat rod by graffiti artist Banksy --ar 2:1
street art mural of dystopian scene of children playing in war-torn Ukraine by graffiti artist Banksy --ar 2:1 --style raw
Code Interpreter Hack To Upload Multiple Files
Prompt Of The Day
One of the ways I tested Claude 2.0 was to have it compare the avatars I have for columnists and tech reporters, and find what they have in common in their writing style.
Prompt: describe the traits shared by all of the reporter and columnist avatars in the attached pdf
Result:
Based on reviewing the avatar descriptions in the attached PDF, here are some of the main shared traits of the reporter and columnist avatars:
- Authoritative tone - The avatars are described as authoritative, renowned, and respected voices in their field. Their writing comes across as confident and knowledgeable.
- Insightful analysis - The avatars provide thoughtful analysis and critical perspectives on issues, not just surface-level reporting. They aim to provide context, implications, and a deeper understanding.
- Clear, engaging writing - Their writing styles are described as clear, concise, approachable, and engaging. They can explain complex topics accessibly.
- Mix of information - Their articles/columns incorporate research, quotes, data, anecdotes, and personal experience to provide a well-rounded perspective.
- Thoughtful structure - Many start with a hook, provide background, analyze from multiple angles, and end with a strong conclusion. The structure aids understanding.
- Independent voice - They are portrayed as independent thinkers who aren't afraid to criticize industries or challenge conventional wisdom if warranted.
- Experience and expertise - Their authority stems from years of reporting experience and deep knowledge of their beats and specialty topics.
- Forward-thinking - They look at the implications of technologies and issues for the future, not just current events. Their work often aims to give readers a broader perspective.
In summary, the avatars are seasoned, authoritative voices who provide insightful and accessible analysis on complex topics for their readers. Their independence and engaging writing styles make them trusted guides on their subjects.
Thoughts: I have been thinking of creating hybrid writing avatars, and this exercise has given me some ideas of where to start as the foundation.
That’s all for today. Finally back home for a few days after 15 tests and meetings at the Mayo and back down there Monday.
The good news on the cancer side is we are seeing some shrinking in the cancer cells from the chemo I had while waiting for T-cells to be trained.
Until tomorrow,
Kevin Davis