• AI Logs
  • Posts
  • šŸ’° Jony Ive and Sam Altman seek $1 billion for the 'iPhone of AI', and Robot dogs train for moon rescue missions

šŸ’° Jony Ive and Sam Altman seek $1 billion for the 'iPhone of AI', and Robot dogs train for moon rescue missions

Plus: Prompt of the week, image upscaling, plus this weekā€™s best tools and AI news.

Hello all, I'm thrilled to announce my live workshop on Interesting Engineering Academy. The 'AI Design Studio' is a fully interactive online course customized for non-coders and seasoned programmers eager to step up their interactions with AI in software development and design.

As a special treat for our AI Logs subscribers, we offer an exclusive discount. Use the code AILOGS50 to snag an extra 50 percent off. So, what are you waiting for? Get the first course now.

This past week has me thinking that weā€™re potentially both closer to ChatGPT 4.5 and potentially even AGI than most ā€˜expertsā€™ and enthusiasts alike seem to believe ā€” and while Iā€™ve said both here in previous issues of AI Logs, I have more conviction than ever now. Hereā€™s why:

First things first, a new feature was released in ChatGPT, namely, the ability to edit images generated in DALLā€¢E, which is a huge deal. Beyond just being an amazing functionality, typically, when new features are being rolled out, itā€™s indicative of a new model not being far behind it. Secondly, with other AIs such as Claude reportedly outperforming ChatGPT in certain benchmarks, it gives the industry leader, OpenAI, motivation to show what itā€™s really capable of ā€” competition is a worthy motivator.

But finally, in regards to the potential of GPT 4.5, I had a breakthrough in prompting this week that had me beyond stoked, although I havenā€™t been able to replicate it since (which is further proof positive that an update may be imminent). I prompted ChatGPT to prompt itself, and to deliver results while still working on other tasks, and to prompt itself and to keep working autonomously, without me, and for a full project lasting nearly an hour ā€” it DID. I was blown away. Autonomous Agents have come a long way, and Iā€™ve featured MultiOn and others here in previous issues, but to see it inside of ChatGPT, and the results it was able to achieve, were unreal.

Although Iā€™ve been unable to replicate it (it now tells me that isnā€™t something itā€™s ethically able to do), I have the thread, the final results and witnessed it with my own eyes, and this, coupled with the above data points, rumors Iā€™ve seen circulating etc., have me feeling fairly confident that the only way we wouldnā€™t see GPT 4.5 soon is if the board determines itā€™s too powerful for the public, or that theyā€™re so close to releasing GPT 5 yet, which I doubt, to justify releasing a 4.5 model.

So Iā€™ve addressed the potential of ChatGPT 4.5 being near, but what about the claim of AGI being closer than some suspect? Well, regular readers know that Iā€™m bullish on many models in addition to LLMs (language) such as LWM (world), LAM (action), LNM (nature), and more, and highlighted Metaā€™s Yann LaCun saying that ā€œlarge language models aloneā€ canā€™t achieve AGI. But I speculate that these models together, particularly with autonomous agents and other models recently launched specific to robots, may get us really close; like: really, really close. Now we just need more compute, which Groq and their new LPU chips may well have solved for.

Thereā€™s so much else to unpack this week that I could write another ten paragraphs here, but in the name of brevity, letā€™s jump into this weekā€™s prompts, tools, tutorial, and news; Iā€™ll try to incorporate as much of the weekā€™s headlines there too as possible.

Did a friend forward this e-mail to you?

IE+ SUPPORT INTERESTING ENGINEERING
Invest In Science And Engineering

Enjoy exclusive access to the forefront of AI content, highlighting trends and news that shape the future. Join a community passionate about AI, delve into the latest AI breakthroughs, and be informed with our AI-focused weekly premium newsletters. With IE+, AI reporting goes beyond the ordinary - and it is Ad-Free.

NEWS

MUST READ

In September 2023, reports emerged of former Apple design guru Jony Ive joining forces with OpenAI CEO Sam Altman to develop the ā€œiPhone of AI.ā€

The duo held brainstorming sessions at the designerā€™s San Francisco studio about a new consumer product centered on OpenAIā€™s technology.

SoftBank CEO Masayoshi Son was reportedly involved in some of the discussions, ā€œpitching a central role for ARM, as well as offering financial backing.ā€ The Japanese multinational investment company holds a 90 percent stake in the chip manufacturer.

PROMPT OF THE WEEK

Iā€™m tempted to share the prompt that got ChatGPT to run autonomously, but since itā€™s no longer working and this section is meant to be actionable, Iā€™ll switch gears (many of the elements of the ā€œautonomousā€ prompt are previous ā€œprompts of the weekā€ here anyhow).

So instead Iā€™ll offer an ā€˜anti-promptā€™, as itā€™s how to voice-speak with your AI, similarly to Hume or, until recently, Pi, rather than text-prompt it. But it is remarkable and noteworthy how GREAT the results are getting when we speak to our AI (ChatGPT, Gemini, etc.), and renders the skill of ā€œpromptingā€ virtually obsolete, although the same underlying principles remain.

You can talk to most AIs using the microphone icon, and can have it read its responses back audibly after it types it, but several let you actually have a voice conversation, and ChatGPT is a great one for this. Click the headphone icon to the right of the chatbox, and you can start asking it questions, practicing sales pitches or interviews, having it effectively act as your teacher, or so much more.

A few great use cases for this are practicing a keynote and asking for feedback, learning a new language and speaking in that foreign language at a level that you pushes you to the next one, polishing a sales pitch, learning history and much more.

Why this works ā€” we often speak faster than we type and donā€™t overthink it as much, we also often learn better by hearing than by reading. It also makes your AI more like a companion or mentor. If you havenā€™t tried this previously, I highly recommend it.

AI PICTURE OF THE WEEK

While this slide is not fully legible, it is one of the slides the autonomous mode of ChatGPT/DALLā€¢E created for my stealth music AI platform, WeJ.

TUTORIAL

How to edit images on ChatGPT/DALLā€¢E:

Once youā€™ve had DALLā€¢E generate an image, click on the image itself to make it full screen, at which point you will see the ā€œeditā€ button on the bottom right, underneath the generation. It will allow you to type desired edits on both mobile and desktop, but on the browser version you can select regions to micro-edit, and can keep editing until you achieve the desired results.

This is a great way to create slides, infographics, and still images to animate using a text-to-video platform like Runway ML, Pika, etc. (both featured in previous issues of AI Logs).

TOOLS OF THE WEEK

šŸ—£ļøĀ Lazy is an AI app builder thatā€™s surprisingly easy and delivers pretty quality, although fairly basic, apps. Free to play with, if you generate an app you actually like and want to promote and potentially even monetize, the cost is beyond reasonable at around $40/month with small usage costs that scale up. I played with this to spin out several types of apps already, and am genuinely impressed.

šŸ¤–Ā OpenUI is an open-sourced W3C generative AI for UI (user interface; front-end dev) that is a great compliment to the many g-AI development tools, many of which Iā€™ve featured here previously such as Open Devin and Devika. With the capability of both front- and back-end development using AI, the software game and its capability of turning ideas into digital reality is definitely redefining itself.

šŸ§Ā Upmetrics is a great AI that can generate pitch decks, business plans, financial models, and much more. Highly editable and decent at market research and strategy, this tool can help non-experienced founders ā€œfake it until they make itā€. It can also help seasoned founders save tons of time and get to market (or investable) much quicker.

šŸŽ²Ā LangFlow for LangChain is an AI to create and share AI research, workflows, autonomous agents and much more. Just acquired by DataStax, a RAG API and Database-as-a-Servixe, LangFlow could help level the playing field for many younger AI startups and help many leverage bleeding edge technology in their current ventures. Not for the purely non-technical AI users yet, this Python based tool is definitely one everyone should at least be aware of, and that could scale into something most impactful.

Written by

Cory Warfield

LinkedIn Top Voice/Influencer in AI

what else?

šŸšØ For IEā€™s daily engineering, science & tech bulletin, subscribe to The Blueprint

šŸ§‘šŸ»ā€šŸ”§ For expert advice on engineering careers, subscribe to Engineer Pros

āš™ļø To explore the wonders of mechanical engineering, get your Mechanical

šŸŽ¬ For a weekly round-up of our best science, tech & engineering videos, subscribe to IE Originals

For our weekly premium newsletter and an ad-free experience, sign up for IE+


Give Feedback