Hi, I'm Mikhail Ilin. I'm a random stuff engineer, indie maker and
solopreneur. Most of the things I like are related to web technologies, music
or design. I've founded Lopaka.app.
My GitHub, Twitter and
LinkedIn
-
Apple Might Finally Build the Voice Assistant We Deserve
General ·The funny thing is, most tasks that the average person performs can already be automated one way or another, and all of this can technically be implemented for specific use cases.
Overall this resembles what a voice assistant was supposed to be. All these Alexas, Siris ā they were supposed to be exactly like this, and now weāre reaching the point where itās actually possible.
And it feels like Apple could release this very soon. Because they have the whole ecosystem tied to an Apple account, which includes photos, emails, messengers, the operating system, and everything else.
-
Why We Need a Voice OS: Beyond the Cursor Interface
General ·For the past couple of weeks Iāve been actively working on a speech recognition and dictation app. Along the way I keep getting ideas for new features. For example, I immediately added an output processing option where you can enter a prompt and itāll run your transcript through GPT.
Spinning up all these ideas in my head, seasoning them with content from my info feed, I came to the conclusion that the voice OS concept ā or a voice operating system ā isnāt actually crazy!
Iāve noticed that many people are starting to use Cursor as the central hub for organizing their life and business. They store information about themselves, their projects, connect various MCPs and agents, fully managing their lives through this interface. Example 1 and example two.
But Cursorās current interface is unnecessary. It shouldnāt be like this. Users donāt need all that scaffolding. You shouldnāt have to launch a separate project in Cursor to get started ā it rips you out of context.
When I started dictating more with my voice, I got the urge to press fewer buttons, got lazy about opening files and programs. From there comes the idea of a voice assistant built into the system. If you develop that thought, it becomes clear that we donāt really need the whole interface, except for situations that require visual interaction.
As a user, I want to always have access to my database and knowledge. A simple dictation and speech recognition app can easily become a tool where you can give commands and connect MCPs and agents. You can say āin such-and-such project add such-and-such taskā and overall the agent has enough intelligence and tools to find your project and make some changes there.
I want to automate managing various interfaces and agents and make it accessible through natural human interfaces. We canāt create universal physical buttons to control everything with our hands, but we can already control many tasks with our voice.
For example, Iād like to reply to emails by voice. Sure, I open the email client with a mouse, but then I donāt need to click through folders, search for emails. Itās enough to tell the voice agent to open an email from a specific person and draft a reply, schedule a meeting ā that would be perfect.
The Voice OS operating system concept fits this vision of the future. Curious what you think about this, cool or sketchy?
-
Programmers Then vs Now: The Real Reason We Can't Sleep
General ·Programmers back in the day: Canāt sleep because theyāre thinking about what algorithm and logic they need to implement
Programmers now: Canāt sleep because theyāre thinking about what prompt they need to write
-
Why Google Antigravity's Voice Input Changed My Vibe Coding Game
General ·Voice Input in Antigravity
For the last six months Iāve been deep into vibe coding: about once a month I sit down and spend several days grinding on some genius idea. Nothing solid has come out of it yet. Right now Iām working on a voice recognition tool so I can easily dictate messages whenever I want.
I realized I donāt like Cursor. I used it for quite a while, I had a paid subscription. Then I switched to OpenAI Codex and played around with it for a long time. I installed the plugin and canceled my Cursor subscription because I bought an OpenAI oneā¦
And now that Google Antigravity is out, itās an absolute beast. The default limits are super generous. Gemini Flash works great, and itās the cheapest and fastest model. Plus, it has built-in voice recognition, so you can dictate tasks to the agent right in the editor.
This is a straight-up killer feature because for some reason Cursor doesnāt have it. Without a voice recorder it becomes a huge barrier. I used to constantly stress about what to write to the agent⦠I tried writing in English, switched to Russian, and now in Antigravity I just hit the voice button.
I just dump all the chaos from my head ā and thatās enough: it understands everything perfectly. Nothing else needed. I can do other things on the fly, drink tea, play guitar at the same time. Thatās the huge value of Antigravity ā highly recommend trying it.
Right now Iām making a voice recognition app ā analogous to WhisperFlow or MacWhisper. There are Mac apps that can transcribe voice to text. And now Iām making a free alternative, want to open-source it.
I donāt know if something like this already exists ā I didnāt find anything in a quick search. When I was looking for a voice-to-file transcription tool, I got so fed up that I had to vibe-code my own on Lovable. Anyway, Iām working on it. Will share soon.
This text I dictated into my Swift app šŖ
-
I Can't Stay Silent: AI Slop Is Meme Culture 3.0
General ·I canāt take it anymore! You need to know this:
I love AI slop and everything the latest video models are putting out. This is literally memes 3.0!!11 An endless source of laughs! A breath of fresh air!
Iām watching all these videos with great interest.
Drop your favorite clips, please