Show HN: We're building a desktop app for browser-based AI agents

by jawertyon 2/1/2025, 1:41 PMwith 56 comments

What's up HN!

This is Jared and Art. We met on HN and started building together.

Over the last few months we've been thinking a lot about how AI agents are going to impact the future. We want agents to be something that's actually useful for normal people as well as the 10x'ers. This lead us to building Meha over the last few months, our first swing at our vision! We saw OpenAI release Operators then we said f*k it let's post.

Meha is a desktop app that uses your Chrome browser to execute tasks in the background. It controls your installed Chrome browser and uses LLMs with playwright to plan and execute actions to accomplish your task. You get to see each planning step the bot is doing and have access to its long term memory.

Meha also uses its own file system and can export files for download. Another thing we've been focused on in multi-agent workflows and Meha can run many bots at the same time. One of the reasons why we can ship this for free in the mean time is because of how cheap the agents are. But we are planning to have a Pro version for power users. We prefer not to raise since we're against VC funding.

We have been influenced by a lot of concepts in probabilistic robotics and RL to develop a fairly robust 'agentic' framework. As well as an algorithm for efficiently converting/compressing large html pages into a semantic format. If you're interested we will open source this asap in an SDK (will work with all OpenAI API spec LLMs and with llama.cpp) let us know.

We're currently in beta and working on figuring out what this product will become and super stoked! Let us know what you think. To get access to Meha we have links on our discord to download (Both MacOS and Windows is available). Please give us all the feedback/criticism (even if you hate AI).

Link to Meha: https://meha.ai

by stormfatheron 2/2/2025, 3:09 AM

> As well as an algorithm for efficiently converting/compressing large html pages into a semantic format.

For the love of humanity please open source this. This seems tremendously useful by itself.

by skeeter2020on 2/2/2025, 12:47 AM

Looked through their privacy policy, and they state the collect and use basically everything they can from your browser & system metadata, to the content you share and/or create. Not that different from every other attempt in the frothy AI space, but a real turn-off and hard no for me.

by Kuinoxon 2/2/2025, 2:30 AM

I asked it to go on seloger.com, to find "some flats on paris below 400k". It went on some specific district of Paris, and didn't put a price citeria then responded how I could do it myself.

I then asked to create a CSV of the first 100 flats corresponding to my criteria, it created only 3 entries, purely hallucinated.

by arjunchinton 2/8/2025, 2:59 AM

Hey I am also building in the space and launched rtrvr.ai, but we went the route of a Chrome Extension so people don't have to worry about installing random software on their devices [also the reason that I am hesitant to try this out].

But let me know your thougths on rtrvr.ai, looks like we are targeting the same use cases of automation, scraping, research?

by artabraon 2/1/2025, 10:42 PM

Hi everyone, this is Art!

Happy to hear all the thoughts for those who try the app out! Even if you just have ideas about how agents might look in their final form, there's so many avenue's this tech can take and we have a ton of wild ideas we'll be building so stay tuned. :D

by iiJDSiion 2/2/2025, 12:08 AM

Very cool! Any video demos for sample tasks? I didn't come across any on the website (browsing on mobile).

by sky2224on 2/2/2025, 12:11 AM

Interesting idea. With the web scraping utility, do I need to specify which websites I wish for the api to scrape from or do I essentially just say, "hey I want this data, go get it"?

If it's the latter, how do you go about making sure you're not about to download malicious data to my machine?

by xnxon 2/2/2025, 11:34 PM

Cool. Is this a wrapper for https://github.com/browser-use/browser-use ?

by noahfkon 2/2/2025, 12:11 AM

it kept taking me to non existent websites that were a summary of what i asked it for

by pdntspaon 2/2/2025, 12:39 AM

I would be very interested in your research on compressing HTML pages!

by delducaon 2/1/2025, 10:47 PM

Is it a native app or electron based?

by xenaon 2/2/2025, 1:24 AM

How do I uniquely identify Meha to block it?

by phdelightfulon 2/2/2025, 1:56 AM

Since you asked for “all the feedback,” there’s a typo on your landing page:

“The Meha API utilizes it's home-grown” -> “its”

Also, I got a relay access denied error when I tried to email you at info@meha.ai

by anticorporateon 2/2/2025, 12:15 AM

The headline needs another word. Maybe browser-based agents. I assumed this was about browser user-agents.

by dotancohenon 2/2/2025, 12:48 AM

Irrespective of the product, I think that you could have posted without the pseudo cussing. Having a little respect for your audience, and trying to appear professional, goes a long way in attracting users.