- AIJ Newsletter
- Posts
- OpenAI's Voice Mode, Google's Gemini 1.5, Llama 3.2 Revolution, and Altman’s Equity Gain
OpenAI's Voice Mode, Google's Gemini 1.5, Llama 3.2 Revolution, and Altman’s Equity Gain
Happy Thursday, AI & Data Enthusiasts! We’ve big news regarding changes in OpenAI's leadership today. Mira, Bob, and Barret left the company the same day. Admist these changes, Sam Altman is set to restructure OpenAI’s from a nonprofit to a for-profit benefit corporation. Additionally, we have exciting updates on new features from OpenAI, Google's Gemini, and Llama. Let’s dig into these updates and more in our today’s newletter.
In today’s edition:
OpenAI Launches Advanced Voice Mode: New Voices, Faster Responses, and Customization
Google’s Gemini 1.5 Pro and Flash Models for Developers are Here!
Llama 3.2: Revolutionizing Edge AI with Open, Customizable Models
Sam Altman Set to Gain Equity Amid Leadership Changes
- Naseema Perveen
WHAT CAUGHT OUR ATTENTION MOST
OpenAI Launches Advanced Voice Mode: New Voices, Faster Responses, and Customization
OpenAI has officially launched Advanced Voice Mode (AVM) for ChatGPT, offering a more immersive, natural conversational experience. This exciting upgrade introduces five new voices, faster response times, and enhanced personalization features. Currently available to Plus and Teams subscribers, AVM will be accessible to Enterprise and Edu users starting next week.
New Voices for a Natural Experience: AVM introduces five new voices—Arbor, Maple, Sol, Spruce, and Vale—joining the original voices like Breeze and Juniper. These nature-inspired tones aim to make interactions feel more fluid and human-like.
Custom Instructions and Memory: Users can now personalize how ChatGPT responds to them through Custom Instructions, and AVM’s new Memory feature allows ChatGPT to recall previous interactions, providing a more tailored and seamless experience.
Improved Speed and Accent Recognition: AVM boasts faster response times and better understanding of accents, ensuring smoother conversations. These updates address previous issues, providing more reliable performance.
With the rollout of Advanced Voice Mode, OpenAI is taking conversational AI to the next level by offering more voices, personalized experiences, and faster interactions. While video and screen-sharing capabilities aren’t included yet, they’re set to arrive in future updates. AVM is a big step towards making AI-powered conversations feel more natural and engaging for users worldwide.
IN PARTNERSHIP WITH ELEKS
Power your competitive advantage with intelligent automation from ELEKS
ELEKS' intelligent automation service transforms your business operations through data-driven solutions. We automate complex tasks, streamlining processes to increase productivity and reduce operational costs. Our tailored solutions adapt to your changing needs and help you unlock new growth opportunities by freeing your team to focus on high-value tasks.
The result? Enhanced customer satisfaction, improved client retention, and a stronger market position.
KEEP YOUR EYE ON IT
Google’s Gemini 1.5 Pro and Flash Models for Developers are Here!
Google has released production-ready versions of its Gemini 1.5 API models, offering enhanced performance and reduced costs. These updates bring significant improvements for developers building apps and tools using Google’s AI technology.
Gemini 1.5 Pro and Flash: Both models deliver better performance in code generation, math, reasoning, and video analysis, with 50% lower costs for developers using Gemini 1.5 Pro.
Improved Features: Enhancements include better factual accuracy, reduced hallucinations, multilingual support, and optimized SQL and audio processing across 102 languages.
New Pricing: From October 1st, the API prices for both models will be reduced significantly, making AI development more affordable.
With Gemini 1.5, Google is streamlining AI app development by lowering costs and boosting performance, further pushing the boundaries of what’s possible in AI integration.
Llama 3.2: Revolutionizing Edge AI with Open, Customizable Models
Meta's latest release, Llama 3.2, takes edge AI to the next level with small and medium-sized vision models (11B, 90B) and lightweight text-only models (1B, 3B) designed for edge and mobile devices. These models are optimized for on-device use and pre-trained to handle tasks like summarization, instruction-following, and rewriting with minimal latency, making them ideal for applications running locally.
Vision models (11B, 90B): These models enable advanced image understanding and visual reasoning, outperforming closed models like Claude 3 Haiku on tasks such as image captioning, document-level understanding, and visual grounding.
Text-only models (1B, 3B): Lightweight models support 128K token context length for efficient summarization, rewriting, and tool-use scenarios, optimized for Qualcomm, MediaTek, and Arm processors.
Llama Stack: Meta introduced Llama Stack distributions to simplify model deployment across environments, whether it's on-prem, cloud, single-node, or on-device, offering easy integration with tools like retrieval-augmented generation (RAG).
With a focus on openness and modifiability, Meta continues to drive innovation by empowering developers to build custom applications and local deployments, further solidifying Llama’s place as a leader in the AI space.
Sam Altman Set to Gain Equity Amid Leadership Changes
Sam Altman, OpenAI's CEO, is reportedly set to receive equity in the company for the first time, following the departure of CTO Mira Murati. Following Mira’s departure, OpenAI’s chief research officer, Bob McGrew, and a research VP, Barret Zoph, left the company on Wedneday. Admist these leadership changes, Sam Altman plans to restrucutre OpenAI from a nonprofit to a for-profit benefit corporation to attract more investors. This move follows similar structures adopted by competitors like Anthropic and xAI.
The nonprofit board will lose majority control but retain a minority stake in the new for-profit entity. This restructuring is seen as a way to alleviate investor concerns over OpenAI's cap on returns, though it raises questions from the AI safety community about maintaining accountability in the company's push toward developing superintelligent AI. The details and timeline for the shift are still being finalized.
🎤 VOICE YOUR OPINION
Which biotech field will AI most significantly impact? |
ICYMI
Intel unveils next-generation AI solutions with the Launch of Xeon 6 and Gaudi 3.
OpenAI’s chief research officer has left following CTO Mira Murati’s exit.
World Economic Forum has predicted that AI will create around 97 million new jobs.
Reddit is bringing AI-powered, automatic translation to dozens of new countries.
Meta released its first open AI model that can process images.
MONEY MATTERS
AI agent platform Landbase came out of stealth with $12.5 million in funding.
OffDeal secures $4.7 Million in seed funding to revolutionize small business M&A with AI-powered investment bank.
InMobi secured $100 million for AI acquisition ahead of its IPO.
AI-powered compliance platform Datricks raised $15 million in a new round.
The AI SaaS market is predicted to amount to $1.5 trillion by 2030.
LINKS WE’RE LOVIN’
✅Podcast: Regulating AI for a Safer Future.
✅Cheat sheet: Cheat Sheets for AI, Neural Networks, Machine Learning, Deep Learning & Big Data.
✅Course: Introduction to Artificial Intelligence.
✅Webinar: 6 steps for zero downtime on a manufacturing line.
✅Watch: iPhone 16 Pro vs Pixel 9 Pro vs S24 Ultra.
SHARE THE NEWSLETTER & GET REWARDS
Your referral count: 0
Or copy & paste your referral link to others: https://youraiexperience.beehiiv.com/subscribe?ref=PLACEHOLDER
What do you think of the newsletter? |
That’s all for now. And, thanks for staying with us. If you have specific feedback, please let us know by leaving a comment or emailing us. We are here to serve you!
Join 130k+ AI and Data enthusiasts by subscribing to our LinkedIn page.
Become a sponsor of our next newsletter and connect with industry leaders and innovators.
Reply