Jony Ive and Sam Altman seek $1 billion for the 'iPhone of AI', and Robot dogs train for moon rescue missions
Plus: Prompt of the week, image upscaling, and this week's best tools and AI news.
Hello all, I'm thrilled to announce my live workshop on Interesting Engineering Academy. The 'AI Design Studio' is a fully interactive online course customized for non-coders and seasoned programmers eager to step up their interactions with AI in software development and design.
As a special treat for our AI Logs subscribers, we offer an exclusive discount. Use the code AILOGS50 to snag an extra 50 percent off. So, what are you waiting for? Get the first course now.
This past week has me thinking that we're potentially closer to both ChatGPT 4.5 and even AGI than most "experts" and enthusiasts alike seem to believe. I've said both here in previous issues of AI Logs, but I have more conviction than ever now. Here's why:
First things first, a new feature was released in ChatGPT: the ability to edit images generated in DALL·E, which is a huge deal. Beyond being an amazing piece of functionality, a new feature rollout typically indicates that a new model is not far behind. Secondly, with other AIs such as Claude reportedly outperforming ChatGPT in certain benchmarks, the industry leader, OpenAI, has motivation to show what it's really capable of; competition is a worthy motivator.
But finally, regarding the potential of GPT 4.5, I had a breakthrough in prompting this week that had me beyond stoked, although I haven't been able to replicate it since (which is further proof positive that an update may be imminent). I prompted ChatGPT to prompt itself, to deliver results while still working on other tasks, and to keep working autonomously, without me, for a full project lasting nearly an hour, and it DID. I was blown away. Autonomous agents have come a long way, and I've featured MultiOn and others here in previous issues, but seeing it inside of ChatGPT, and the results it was able to achieve, was unreal.
Although I've been unable to replicate it (it now tells me that isn't something it's ethically able to do), I have the thread and the final results, and I witnessed it with my own eyes. This, coupled with the above data points and the rumors I've seen circulating, has me feeling fairly confident that the only way we wouldn't see GPT 4.5 soon is if the board determines it's too powerful for the public, or if they're so close to releasing GPT 5 that a 4.5 model can't be justified, which I doubt.
So I've addressed the potential of ChatGPT 4.5 being near, but what about the claim of AGI being closer than some suspect? Well, regular readers know that I'm bullish on many models in addition to LLMs (language), such as LWM (world), LAM (action), LNM (nature), and more, and that I've highlighted Meta's Yann LeCun saying that "large language models alone" can't achieve AGI. But I speculate that these models together, particularly with autonomous agents and other recently launched models specific to robots, may get us really close; like, really, really close. Now we just need more compute, which Groq and its new LPU chips may well have solved for.
There's so much else to unpack this week that I could write another ten paragraphs here, but in the name of brevity, let's jump into this week's prompts, tools, tutorial, and news; I'll try to incorporate as many of the week's headlines there as possible.
IE+ SUPPORT INTERESTING ENGINEERING
Invest In Science And Engineering
Enjoy exclusive access to the forefront of AI content, highlighting trends and news that shape the future. Join a community passionate about AI, delve into the latest AI breakthroughs, and be informed with our AI-focused weekly premium newsletters. With IE+, AI reporting goes beyond the ordinary - and it is Ad-Free.
NEWS
Robot dogs drill to train for moon rescue missions, navigating craters
To make AI as smart as butterflies, a team of Penn State researchers created a multi-sensory AI platform.
China could deploy AI to disrupt elections in US, India: Microsoft warns
Presidential elections in Taiwan were a dry run and China could sharpen its attack in high profile elections lined up in 2024.
Why did Whisper take a million hours of YouTube videos?
According to reports, the OpenAI team illegally used more than one million hours of YouTube videos. Here is why.
MUST READ
In September 2023, reports emerged of former Apple design guru Jony Ive joining forces with OpenAI CEO Sam Altman to develop the "iPhone of AI."
The duo held brainstorming sessions at the designer's San Francisco studio about a new consumer product centered on OpenAI's technology.
SoftBank CEO Masayoshi Son was reportedly involved in some of the discussions, "pitching a central role for ARM, as well as offering financial backing." The Japanese multinational investment company holds a 90 percent stake in the chip manufacturer.
PROMPT OF THE WEEK
I'm tempted to share the prompt that got ChatGPT to run autonomously, but since it's no longer working and this section is meant to be actionable, I'll switch gears (many of the elements of the "autonomous" prompt are previous "prompts of the week" here anyhow).
So instead I'll offer an "anti-prompt": how to speak with your AI by voice, similarly to Hume or, until recently, Pi, rather than text-prompting it. It is remarkable and noteworthy how GREAT the results are getting when we speak to our AI (ChatGPT, Gemini, etc.), which renders the skill of "prompting" virtually obsolete, although the same underlying principles remain.
You can talk to most AIs using the microphone icon and have them read their responses back aloud after typing them, but several let you actually have a voice conversation, and ChatGPT is a great one for this. Click the headphone icon to the right of the chatbox, and you can start asking it questions, practicing sales pitches or interviews, having it effectively act as your teacher, and so much more.
A few great use cases for this are practicing a keynote and asking for feedback, learning a new language by speaking it at a level that pushes you to the next one, polishing a sales pitch, learning history, and much more.
Why this works: we often speak faster than we type and don't overthink it as much, and we also often learn better by hearing than by reading. It also makes your AI more like a companion or mentor. If you haven't tried this previously, I highly recommend it.
AI PICTURE OF THE WEEK
While this slide is not fully legible, it is one of the slides the autonomous mode of ChatGPT/DALL·E created for my stealth music AI platform, WeJ.
TUTORIAL
How to edit images on ChatGPT/DALL·E:
Once you've had DALL·E generate an image, click on the image itself to make it full screen, at which point you will see the "edit" button on the bottom right, underneath the generation. It will allow you to type desired edits on both mobile and desktop, but on the browser version you can select regions to micro-edit, and can keep editing until you achieve the desired results.
This is a great way to create slides, infographics, and still images to animate using a text-to-video platform like Runway ML, Pika, etc. (both featured in previous issues of AI Logs).
TOOLS OF THE WEEK
Lazy is an AI app builder that's surprisingly easy to use and delivers pretty good, although fairly basic, apps. It's free to play with; if you generate an app you actually like and want to promote and potentially even monetize, the cost is beyond reasonable at around $40/month with small usage costs that scale up. I've already played with this to spin out several types of apps, and am genuinely impressed.
OpenUI is an open-sourced W3C generative AI for UI (user interface; front-end dev) that is a great complement to the many g-AI development tools, many of which I've featured here previously, such as Open Devin and Devika. With AI now capable of both front- and back-end development, the software game, and its capability of turning ideas into digital reality, is definitely redefining itself.
Upmetrics is a great AI that can generate pitch decks, business plans, financial models, and much more. Highly editable and decent at market research and strategy, this tool can help non-experienced founders "fake it until they make it". It can also help seasoned founders save tons of time and get to market (or investable) much quicker.
LangFlow for LangChain is an AI tool to create and share AI research, workflows, autonomous agents and much more. Just acquired by DataStax, a RAG API and Database-as-a-Service provider, LangFlow could help level the playing field for many younger AI startups and help many leverage bleeding-edge technology in their current ventures. Not yet for purely non-technical AI users, this Python-based tool is definitely one everyone should at least be aware of, and one that could scale into something most impactful.
what else?
For IE's daily engineering, science & tech bulletin, subscribe to The Blueprint
For expert advice on engineering careers, subscribe to Engineer Pros
To explore the wonders of mechanical engineering, get your Mechanical
For a weekly round-up of our best science, tech & engineering videos, subscribe to IE Originals
For our weekly premium newsletter and an ad-free experience, sign up for IE+