This project demonstrates a sophisticated understanding of multimodal interaction by addressing specific ergonomic constraints in high-stakes professional environments. It successfully moves beyond generic AI concepts to provide a functional, context-aware solution for hands-free workflows.
Deep Dive
Mihara Dinusara Wickramarathna Dias Bandaranayaka | Voice Note AI | Rysera STEM
Hi, I am Mihara Bandaranayaka, and the purpose of my project is to break the barriers of traditional input and output methods. In fields like surgery, mechanical repair, cooking, or even chemical research,
using a mouse or a similar input method is impractical or sometimes even impossible. That's why I built this project: to achieve a seamless hands-free bridge between human intent and computer action.
In many high-stakes fields, from sterile operating rooms to busy mechanical workshops, the traditional keyboard and mouse aren't just inconvenient, they are a barrier. So I created Javis AI to break that barrier.
So let me show you. Hey Javis. With a custom offline, local wake-word listener, the system is always ready, protecting privacy while remaining instantly accessible.
Javis, I always prefer my system updates to be summarized in one short sentence.
Please remember this.
>> I've already saved your preference to summarize system updates in one short sentence. I'll be sure to do that for you.
So unlike standard chatbots that forget everything when you close the tab, Javis AI has a persistent JSON memory drive.
It remembers how you like your tasks done across sessions.
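The cross-session memory described here can be pictured with a minimal sketch. The transcript does not show how the JSON memory is laid out, so the `PreferenceMemory` class, its method names, and the file name are all hypothetical.

```python
import json
from pathlib import Path


class PreferenceMemory:
    """Tiny key-value memory persisted as JSON (hypothetical layout)."""

    def __init__(self, path):
        self.path = Path(path)
        # Load preferences saved by earlier sessions, if the file exists.
        if self.path.exists():
            self.data = json.loads(self.path.read_text(encoding="utf-8"))
        else:
            self.data = {}

    def remember(self, key, value):
        self.data[key] = value
        # Write through on every change so closing the app loses nothing.
        self.path.write_text(json.dumps(self.data, indent=2), encoding="utf-8")

    def recall(self, key, default=None):
        return self.data.get(key, default)
```

A second `PreferenceMemory("memory.json")` created later, for example after a restart, would read the same file and return the stored preference from `recall`.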
Show me my system's disk space and then create a folder called Assets on my desktop.
I have successfully created the folder named Assets on your desktop.
Inside that folder, create a text file and write all the currently running processes to it.
I have created a text file named running processes.txt inside the assets folder on your desktop and it now contains a list of all currently running processes.
Let's see if it is done.
Notice what happened here. The AI didn't just give me text. It took action: it reasoned that it needed PowerShell, executed the command, checked for success, and actually modified my file system, all hands-free.
And because of my preferences, it kept the confirmation brief.
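The execute-then-check step in that loop might look roughly like the sketch below. It is not the project's code: `run_and_verify` and `powershell_argv` are hypothetical helper names, and only the standard-library `subprocess.run` call and the PowerShell `-NoProfile` and `-Command` switches are real.

```python
import subprocess


def run_and_verify(argv, timeout=30):
    """Run one command and report whether it succeeded."""
    try:
        proc = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    except (OSError, subprocess.TimeoutExpired) as exc:
        # The program is missing, cannot start, or ran too long.
        return {"ok": False, "output": "", "error": str(exc)}
    return {
        "ok": proc.returncode == 0,  # exit code 0 is treated as success
        "output": proc.stdout.strip(),
        "error": proc.stderr.strip(),
    }


def powershell_argv(command):
    """Wrap a PowerShell command string in an argument list (Windows)."""
    return ["powershell", "-NoProfile", "-Command", command]
```

On Windows, `run_and_verify(powershell_argv("Get-Process"))` would return the process list in `output`. An agent could feed the `ok` flag and `error` text back into its next reasoning step before confirming to the user.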
There is also a powerful web search agent built right in. Simply asking Javis to search the web triggers an autonomous process that retrieves and summarizes the top results from the internet in real time, pulling in external knowledge. But what if you have to take some notes while your hands are busy?
A chatbot isn't the most efficient way.
That's why I added a note-taking feature to this AI.
Javis, I need to take some notes.
Sure. I'm opening the note-taking interface for you now. Memo. You can speak or type your notes there and the AI will structure and format them automatically.
Now let's look at the note module.
Traditional voice notes are just messy transcripts. This system uses Gemini to understand intent and structure.
I am going to use a trick here to make this quick. So photosynthesis is the process by which green plants make their own food using sunlight. This process mainly takes place in the leaves. Chlorophyll, the green pigment in plants, absorbs sunlight. Carbon dioxide from the air and water from the soil are used to produce glucose and oxygen. This process is essential for life on Earth because it provides oxygen for animals and humans.
Look at the result. It removed my filler words, recognized the priority, and applied the Heading 2 style automatically.
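To make the cleanup step concrete, here is a deliberately simple stand-in. The demo uses Gemini for this. The sketch below swaps in a regular expression and plain string handling instead, so it only shows the shape of the transformation. The filler list, the `structure_note` name, and the Markdown-style output are all assumptions.

```python
import re

# Hypothetical filler list; a language model would cover far more cases.
FILLERS = re.compile(r"\b(?:um+|uh+|you know|i mean)\b,?\s*", re.IGNORECASE)


def structure_note(title, transcript):
    """Strip filler words, then emit a heading and one bullet per sentence."""
    cleaned = FILLERS.sub("", transcript)
    cleaned = re.sub(r"\s+", " ", cleaned).strip()
    # Split after sentence-ending punctuation.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", cleaned) if s]
    lines = ["## " + title]
    lines += ["- " + s[0].upper() + s[1:] for s in sentences]
    return "\n".join(lines)
```

A rule-based pass like this cannot recognize priority or intent, which is the part the demo delegates to the model.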
It's an editor that listens. Beyond just smart transcription, the notes module is a complete interface. You can clear the current session to start fresh, save your notes directly to the local file system, and even browse or reload past saved notes seamlessly, all with a voice command or a single click, keeping your knowledge base organized without typing. Actually, what inspired me to add this feature is that in online lectures I want to focus on understanding the concepts better, but I can't do that while I am taking notes. So this will be one of my everyday tools.
We also have an offline background scheduler for scheduling time-precise tasks.
Javis, schedule a task for 1 minute from now to check if I am connected to Wi-Fi and what network it is.
I've scheduled a task to check your Wi-Fi connection and network name for 1 minute from now.
While we wait for that alarm to trigger, notice that the AI isn't just setting a timer prompt. It is actually scheduling an autonomous background task in its internal Windows engine. We don't even have to keep this window open. If we close it, it will automatically open up and inform us.
And there it is, a completely autonomous background execution of a task. You can even say every Sunday at 12:30 p.m. do this or do that. You can use this as an alarm, a reminder, or even a completely automated task like submitting a report.
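A recurring request such as "every Sunday at 12:30 p.m." has to be turned into a concrete next run time. The sketch below shows one way to compute it with the standard library. `next_weekly_run` is a hypothetical helper, not the project's scheduler, and it uses naive datetimes, so daylight-saving changes are ignored.

```python
from datetime import datetime, timedelta


def next_weekly_run(now, weekday, hour, minute):
    """Return the next matching time strictly after `now`.

    weekday follows datetime.weekday(): Monday is 0, Sunday is 6.
    """
    candidate = now.replace(hour=hour, minute=minute, second=0, microsecond=0)
    # Move forward to the requested day of the week (0 to 6 days ahead).
    candidate += timedelta(days=(weekday - now.weekday()) % 7)
    if candidate <= now:
        # Today is the right day but the time has passed: use next week.
        candidate += timedelta(days=7)
    return candidate


def one_shot(now, minutes_from_now):
    """Run time for a request like '1 minute from now'."""
    return now + timedelta(minutes=minutes_from_now)
```

A background worker would sleep until the returned time, run the task, and for recurring tasks call `next_weekly_run` again with the current time.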
But there is a little problem. Even if we can take notes, execute shell commands, and execute voice commands, there are some tasks we cannot do, or that are a little bit harder to do, with voice commands. So, Javis, activate hand gesture control.
Activating hand gesture control. Hand with fingers splayed. Your camera will open shortly and you'll be able to control your computer using hand gestures.
Finally, for the ultimate hands-free experience: Gesture OS. When voice isn't enough, we use computer vision.
So using MediaPipe and OpenCV, I can control the mouse with sub-pixel precision just by pointing. A simple pinch performs a click, and two fingers hold. But we can go a lot further. With this hand gesture system, I can define custom gestures for any industry-specific shortcut. One hand sign, and the system executes my pre-mapped actions. It transforms the air around the user into the control surface. The UI lets you specify if the gesture applies to the left, right, or any hand.
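As a rough illustration of pointing and pinching, the sketch below works on the 21 normalized hand landmarks that MediaPipe Hands produces, where index 4 is the thumb tip and index 8 is the index fingertip. The `is_pinch` and `to_screen` helpers and the 0.05 threshold are assumptions, not values from the demo. The sketch takes plain `(x, y)` pairs so it runs without a camera.

```python
import math

THUMB_TIP = 4  # MediaPipe Hands landmark index for the thumb tip
INDEX_TIP = 8  # MediaPipe Hands landmark index for the index fingertip


def is_pinch(landmarks, threshold=0.05):
    """True when thumb tip and index fingertip are closer than `threshold`.

    landmarks: sequence of 21 (x, y) pairs normalized to the range 0..1.
    The threshold is a hypothetical value that would need tuning.
    """
    tx, ty = landmarks[THUMB_TIP]
    ix, iy = landmarks[INDEX_TIP]
    return math.hypot(tx - ix, ty - iy) < threshold


def to_screen(landmarks, screen_w, screen_h):
    """Map the index fingertip to fractional (sub-pixel) screen coordinates."""
    x, y = landmarks[INDEX_TIP]
    x = min(max(x, 0.0), 1.0)  # clamp in case the hand leaves the frame
    y = min(max(y, 0.0), 1.0)
    return x * (screen_w - 1), y * (screen_h - 1)
```

Because the distance is measured in normalized image units, the same pinch looks smaller when the hand is far from the camera. A more robust version would likely scale the threshold by the apparent hand size.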
Oh, it also lets you configure whether the action is a PowerShell command, a cmd command, or a keyboard shortcut.
And let's say you are not a very technical person and you don't know the commands or the keyboard shortcuts to execute. You can use AI to suggest the correct command from a plain-English description of what you want to do.
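One plausible shape for such a gesture-to-action mapping is sketched below. The field names, gesture labels, and helper functions are hypothetical. Only the three action kinds and the left, right, or any hand choice come from the demo.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GestureAction:
    gesture: str  # hypothetical gesture label, e.g. "thumbs_up"
    hand: str     # "left", "right", or "any"
    kind: str     # "powershell", "cmd", or "shortcut"
    payload: str  # command text, or keys such as "ctrl+s"


def matches(action, gesture, hand):
    """True when a detected gesture on `hand` should trigger `action`."""
    return action.gesture == gesture and action.hand in ("any", hand)


def to_argv(action):
    """Build an argument list for commands, or a key list for shortcuts."""
    if action.kind == "powershell":
        return ["powershell", "-NoProfile", "-Command", action.payload]
    if action.kind == "cmd":
        return ["cmd", "/c", action.payload]
    if action.kind == "shortcut":
        return action.payload.lower().split("+")
    raise ValueError("unknown action kind: " + action.kind)
```

Keeping the mapping as plain data means a new industry-specific shortcut is a new record rather than new code.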
Javis AI and the gesture control OS aren't just tools. They are the future of how we interact with technology when our hands are busy, but our minds are busy. Thank you.