**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 05, 2024, 19:56

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 05, 2024, 19:56

Mauve 👁💜 @mauve@mastodon.mauve.moe

Jun 05, 2024, 19:56

Mauve 👁💜 @mauve@mastodon.mauve.moe

Ordered a glasses mounted camera to feed to my local AI when I'm on the go

**ryan 𝕣𝕒 𝕞𝕒𝕘𝕖** @ryanramage@mastodon.online · Jun 05, 2024, 20:06

**ryan 𝕣𝕒 𝕞𝕒𝕘𝕖** @ryanramage@mastodon.online · Jun 05, 2024, 20:06

Jun 05, 2024, 20:06

ryan 𝕣𝕒 𝕞𝕒𝕘𝕖 @ryanramage@mastodon.online

@mauve ohhh - interesting! What will it do? augment things?

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 05, 2024, 21:33

**Mauve 👁💜** @mauve@mastodon.mauve.moe · Jun 05, 2024, 21:33

Jun 05, 2024, 21:33

Mauve 👁💜 @mauve@mastodon.mauve.moe

@ryanramage I'm giving the agent a tool call to use LLaVA to "see". I'm going to allow it access to the screen, an image file, or the camera.

I want a flow along the lines of:
- "Hey, save this text as a reminder for later"
- tool("see", "extract the text from the image") => take a pic and run through llava
- tool('save', "{summarized text}", ["reminder"]) => save to local database for later
- response: "Saved!"

- "What was the last reminder you saved?"
- tool('load', 'reminder', {limit: 1})