The AI Digest
Elon Musk reveals driverless robotaxi
Welcome, AI Enthusiasts!
In today’s AI newsletter:
→ AI Newsflash
→ Elon Musk and Tesla unveil Robotaxi
→ Google rolls out two new Gemini 1.5 models
→ OpenAI makes 4 major announcements at DevDay
→ Microsoft Copilot gets voice, vision upgrade
→ OpenAI secures record-breaking $6.6B in funding
→ Meta unveils advanced AI video model
→ OpenAI seeks independence from Microsoft
→ Control object motion in AI videos
→ 5 Best AI Tools for Project Management
Reading time: 12 minutes
AI Newsflash
Apple is rumored to be preparing a more affordable version of its Vision headset, priced at $2,000, compared to the $3,500 Vision Pro, which has seen lower-than-expected sales.
Google has made its Imagen 3 image generator available to all Gemini users, though only Advanced subscribers ($19.99/month) can create images featuring people.
OpenAI Chairman Bret Taylor’s AI venture, Sierra, is reportedly in talks to raise hundreds of millions in funding, with a valuation exceeding $4 billion. The startup focuses on conversational AI agents for enterprise use.
Meta has expanded its Meta AI service to six additional countries, including the EU, but due to regulatory constraints, multimodal features won’t be available in the EU for users of Ray-Ban Meta smart glasses.
Anthropic introduced the Message Batches API, enabling developers to submit up to 10,000 queries for asynchronous processing within 24 hours, offering a 50% cost reduction compared to standard API calls.
Grindr is working on an AI-powered "wingman" feature that will help users scout potential matches, set up dates, and even interact with other AI systems to find ideal partners.
Apple’s new Apple Intelligence features are expected to launch on October 28, alongside the release of iOS 18.1, according to insider reports from Mark Gurman of Bloomberg.
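For developers curious about the Message Batches limit mentioned in the Anthropic item above, here is a minimal Python sketch of how a large workload could be split to respect the 10,000-query cap. It does not call any real API; the `chunk` helper and the sample queries are hypothetical and only illustrate the splitting step.

```python
from typing import Iterator, Sequence, TypeVar

T = TypeVar("T")

# Per-batch cap taken from the news item above.
MAX_BATCH_SIZE = 10_000


def chunk(items: Sequence[T], size: int = MAX_BATCH_SIZE) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of `items`, each holding at most `size` entries."""
    if size <= 0:
        raise ValueError("size must be positive")
    for start in range(0, len(items), size):
        yield items[start:start + size]


# Hypothetical workload: 25,000 placeholder queries.
queries = [f"query {i}" for i in range(25_000)]
batches = list(chunk(queries))

print([len(batch) for batch in batches])  # [10000, 10000, 5000]
```

Each resulting batch would then be submitted separately, with results collected once the asynchronous processing finishes.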
Elon Musk has officially introduced Tesla's highly anticipated Robotaxi, a sleek, two-door vehicle featuring futuristic gull-wing doors, and notably lacking a steering wheel or pedals. In addition to this unveiling, he surprised the audience with announcements about a larger Robovan and fresh updates on the Optimus humanoid robot.
Key Highlights:
The "Cybercab" Robotaxi is slated for production in 2026 and will be priced below $30,000, with projected operating costs as low as 20 cents per mile.
Tesla continues to focus on AI, cameras, and vast amounts of training data for its autonomous systems, avoiding the use of lidar technology, which is commonly employed by competitors.
A bigger, autonomous Robovan, which was introduced unexpectedly, is said to have the capacity to transport up to 20 passengers.
Musk forecasts that the price for Tesla’s Optimus robots could fall between $20,000 and $30,000, calling it "the biggest product of all time."
After years of anticipation, Tesla has finally pulled the curtain back on its fully autonomous Robotaxi, offering a competitive price under $30,000. With such low operating costs, the Robotaxi and the larger Robovan have the potential to dramatically transform the transportation landscape once they are fully deployed.
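To put the 20-cents-per-mile figure in perspective, here is a rough back-of-the-envelope sketch in Python. The per-mile cost comes from the highlights above; the trip length and annual mileage are illustrative assumptions, not Tesla figures.

```python
# Projected operating cost quoted in the highlights (USD per mile).
COST_PER_MILE = 0.20


def operating_cost(miles: float, cost_per_mile: float = COST_PER_MILE) -> float:
    """Return the projected operating cost in USD for a given distance."""
    if miles < 0:
        raise ValueError("miles must be non-negative")
    return miles * cost_per_mile


# Hypothetical distances, chosen only for illustration.
trip_miles = 10
annual_miles = 50_000

print(f"{trip_miles}-mile trip: ${operating_cost(trip_miles):.2f}")
print(f"{annual_miles:,} miles per year: ${operating_cost(annual_miles):,.2f}")
```

Under these assumptions, a 10-mile ride would cost about $2 to operate and a vehicle covering 50,000 miles a year about $10,000. These are operating costs only, not the fares riders would pay.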
Google has unveiled two upgraded versions of its Gemini 1.5 model in the Gemini API: 1.5-pro-002, which sets a new standard in math performance, and 1.5-flash-002, which excels at instruction following.
Key highlights:
The 1.5-pro-002 model delivers state-of-the-art results on advanced math benchmarks such as AMC + AIME 24 and MATH, showcasing significant improvements in solving complex mathematical problems.
The 1.5-flash-002 model demonstrates major progress in following instructions, making it a valuable tool for developers working on various applications.
Alongside the new models, improvements have been made to rate limits, pricing for 1.5 Pro, and default filter settings, aimed at enhancing the overall experience for developers.
Google's Gemini 1.5 models mark a leap forward in AI capabilities, particularly in areas like math and coding, which have traditionally posed challenges for AI systems. With these advancements, developers can tackle more complex problems and create even more innovative solutions, pushing the boundaries of what AI can achieve.
AI TRAINING
Unlock multiple ChatGPT tools in one chat
ChatGPT has introduced a new shortcut feature that allows users to seamlessly switch between image generation, web search, and advanced reasoning tools—all within a single chat, eliminating the need to start new conversations.
Step-by-step
Start a new chat in ChatGPT and type "/" in the input field.
Choose from three options:
Picture (DALL-E) for image generation.
Search (web) for performing web searches.
Reason (o1) for advanced reasoning tasks.
Use specific commands based on your needs:
For images, type "/picture [description]" (e.g., "/picture futuristic city").
For web searches, type "/search [query]" (e.g., "/search latest in AI").
For complex reasoning, type "/reason [task]" (e.g., "/reason Explain quantum computing").
This new shortcut feature in ChatGPT enhances productivity by allowing users to switch between different tools in a single chat. It streamlines processes such as generating images, searching the web, or performing complex reasoning, making it easier and faster to tackle various tasks in one place.
OpenAI recently hosted its DevDay 2024 event, unveiling a range of new API features and upgrades aimed at making its AI tools more accessible, efficient, and affordable for developers.
Key highlights:
Realtime API: Enables developers to build speech-to-speech applications using the same model behind Advanced Voice, offering a selection of six different voices.
Model Distillation: Simplifies the fine-tuning of smaller models by leveraging outputs from larger ones, making model training more approachable for developers.
Prompt Caching: Cuts costs by nearly 50% and speeds up response times by up to 80% by reusing recent input tokens in API calls.
New Vision Fine-Tuning: Allows models to be trained using both images and text, optimizing tasks such as image recognition and analysis.
Despite lacking the typical fanfare of past events, the updates from OpenAI's DevDay 2024 are set to make a significant impact. These enhancements open the door for creating innovative applications while also lowering development costs and simplifying processes, making the platform more accessible to developers at all levels.
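For a sense of how the Prompt Caching savings might add up, here is a minimal Python sketch. It assumes the discount applies only to the cached portion of the input; the price per million tokens and the token counts are hypothetical, and the `input_cost` helper is not part of any OpenAI library.

```python
def input_cost(total_tokens: int, cached_tokens: int,
               price_per_million: float, cached_discount: float = 0.5) -> float:
    """Estimate input-token cost in USD when part of the prompt is served from cache."""
    if not 0 <= cached_tokens <= total_tokens:
        raise ValueError("cached_tokens must be between 0 and total_tokens")
    uncached_tokens = total_tokens - cached_tokens
    discounted_price = price_per_million * (1 - cached_discount)
    total = uncached_tokens * price_per_million + cached_tokens * discounted_price
    return total / 1_000_000


# Assumed price in USD per million input tokens, for illustration only.
PRICE = 2.50

# Hypothetical 10,000-token prompt whose first 8,000 tokens repeat across calls.
without_cache = input_cost(10_000, 0, PRICE)
with_cache = input_cost(10_000, 8_000, PRICE)
savings = 1 - with_cache / without_cache

print(f"without cache: ${without_cache:.4f}")
print(f"with cache:    ${with_cache:.4f}")
print(f"saved:         {savings:.0%}")
```

Under these assumptions, the overall saving only approaches 50% when nearly the whole prompt is reused, so applications with long, stable prefixes (system prompts, shared documents) stand to benefit most.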
Microsoft has announced a series of major AI upgrades for its Copilot assistant on Windows PCs, introducing new vision and voice features, personalization updates, and the return of the Recall feature with enhanced privacy measures.
Key highlights:
Copilot Voice: Allows users to interact with the assistant through natural speech, making communication more conversational and intuitive, similar to OpenAI’s Voice Mode.
Copilot Vision: Enables the AI to understand and interact with web content that users are viewing, providing context-aware assistance directly within Microsoft Edge.
‘Think Deeper’: Enhances Copilot’s reasoning capabilities by using chain-of-thought reasoning powered by OpenAI’s o1 model, improving the assistant's decision-making.
Recall Feature: The controversial Recall feature is being re-released with upgraded privacy and security settings, requiring users to opt in.
Personalization: Copilot is being improved to act on users’ behalf, adapting to their personal preferences and needs, as highlighted by Microsoft AI CEO Mustafa Suleyman.
These upgrades bring Microsoft's Copilot in line with the latest AI advancements, enhancing its functionality and making it more personalized, intuitive, and capable. The improvements push Copilot closer to offering users a more agentic experience, where the assistant can proactively perform tasks based on individual preferences.
OpenAI has secured a massive $6.6 billion funding round, boosting its valuation to an astounding $157 billion and cementing its status as the best-funded AI startup globally.
Key highlights:
Led by Thrive Capital: The funding round included major investors such as Microsoft, Nvidia, SoftBank, and MGX.
Use of Funds: OpenAI plans to invest in expanding research efforts, increasing its computing power, and developing new AI tools.
Revenue Projections: Investor documents predict OpenAI’s revenue could reach $25 billion by 2026 and grow to $100 billion by 2029.
Exclusive Investor Arrangements: The company reportedly encouraged investors to avoid supporting competitors like Anthropic and xAI.
Corporate Restructure: OpenAI is set to transition into a for-profit entity, though this restructure won’t take effect until sometime next year, according to reports.
This long-anticipated funding round marks a significant moment for OpenAI, showcasing the company’s continued dominance despite internal leadership changes and fierce competition. The enormous valuation demonstrates that investors still view OpenAI as the leader in the AI space, with immense potential for future growth.
Meta has introduced Movie Gen, a new suite of AI models designed for generating and editing video and audio, directly positioning itself against competitors like OpenAI’s Sora and other industry leaders.
Key highlights:
Four Core Models: Movie Gen includes a 30B video generation model, a 13B audio model, a personalized video creation model, and a video editing model.
Capabilities: The system can generate high-definition videos up to 16 seconds long from text prompts, complete with synchronized audio such as sound effects and background music.
Video Editing: Users can edit videos using natural language prompts and upload reference images to create personalized videos, offering a more tailored experience.
Performance: Meta claims that Movie Gen surpasses competitors like Runway Gen3, Luma Labs, and OpenAI’s Sora in terms of human-rated video quality and consistency.
Launch on Instagram: Meta CEO Mark Zuckerberg announced that Movie Gen will be integrated into Instagram next year, showcasing some sample video generations in his post.
Movie Gen distinguishes itself from other AI video tools by combining text-based video generation with advanced editing capabilities. With plans to integrate into Instagram, this tool could revolutionize content creation, making it easier for users to create and edit high-quality videos with simple text prompts. This move could reshape how creators engage with video production on social media.
OpenAI is reportedly seeking to reduce its dependence on Microsoft for computing power and has begun exploring alternative options, including building its own data infrastructure and developing AI chips, according to a report by The Information.
Key highlights:
Microsoft Delays: OpenAI CFO Sarah Friar informed shareholders that Microsoft has been slow in providing the necessary computing power, prompting OpenAI to explore other avenues.
Data Center Lease: OpenAI is planning to lease an entire data center from Oracle in Abilene, TX. Although Oracle competes with Microsoft in cloud computing, the deal likely required Microsoft's approval.
AI Chip Development: OpenAI is also working on developing its own AI chips, which could help lower costs for future computing needs, as it currently relies heavily on renting resources from Microsoft.
Tensions Over Fairwater: Strains have reportedly emerged between OpenAI and Microsoft over the design and timeline of a joint data center project called ‘Fairwater.’
The relationship between OpenAI and Microsoft, while mutually beneficial in many ways, seems to be showing signs of tension. As OpenAI looks for greater independence in computing resources, this shift could have significant consequences for the future of the AI industry and the dynamics between key players like Microsoft and Oracle. How this evolving partnership unfolds could impact the broader AI landscape.
KLING AI
Control object motion in AI videos
Kling AI, a leading AI video generation platform, now offers the ability to add strategic movement to specific elements in videos, giving users greater control over their generated clips.
Step-by-step:
Select an Image: Choose a high-quality image that contains multiple elements you want to animate.
Upload to Kling AI: Open Kling AI's Image-to-Video tool and upload your selected image.
Apply Motion Brush: Use the Motion Brush to highlight areas in the image you want to animate, then set motion paths to control the direction and movement of each element.
Fine-Tune: Adjust settings, refine the animation with text prompts, and finalize your preferences before generating the video.
Pro Tip: For more realistic results, keep the movements subtle and natural. Experiment with various combinations of movement to see what works best for your image.
Kling AI’s new motion feature provides creators with more precise control over their video content, enhancing the flexibility and realism of AI-generated clips. This ability to strategically animate specific elements opens up new creative possibilities for personalized and engaging video projects.
5 Best AI Tools for Project Management
M1-Project: An AI-driven project management platform offering a 51% discount on all subscription plans for the first month with the promo code GENAI.
Monday.com: A versatile project management tool that leverages AI to assist teams in collaborating, tracking progress, and streamlining workflows.
ClickUp: A centralized platform powered by AI, combining project management, time tracking, and productivity features into one comprehensive tool.
Motion: AI-based project management software designed to automate scheduling, prioritize tasks, and enhance team collaboration.
Reclaim.ai: An AI-powered time management tool that automatically optimizes team schedules, tasks, and meetings for better efficiency.
That’s a wrap!
We had a lot to talk about, so let’s wrap it up. If you have any questions, feel free to shoot over an email and we will get back to you within 24 hours.
If you have specific feedback or anything interesting you’d like to share, please let us know by replying to this email.