Ordered a glasses mounted camera to feed to my local AI when I'm on the go

@ryanramage I'm giving the agent a tool call to use LLaVA to "see". I'm going to allow it access to the screen, an image file, or the camera.

I want a flow along the lines of:
- "Hey, save this text as a reminder for later"
- tool("see", "extract the text from the image") => take a pic and run through llava
- tool('save', "{summarized text}", ["reminder"]) => save to local database for later
- response: "Saved!"

- "What was the last reminder you saved?"
- tool('load', 'reminder', {limit: 1})

@ryanramage All within a few seconds with as little power or ram usage possible.

Follow

@ryanramage yeah feel free to follow the main repo: github.com/RangerMauve/mind-go

Gonna push my latest version in the next week or so.

Sign in to participate in the conversation
Mauvestodon

Escape ship from centralized social media run by Mauve.