Rosie the Robot - Your Digital Desktop Maid

Speaker: James Coffey
Duration: 15 minutes
Time: 7:15 PM - 7:30 PM

Overview

James Coffey presented Rosie the Robot, a desktop organization assistant that tackles the universal problem of messy digital workspaces. As a newcomer to AIMUG (attending for just 2 months), James jumped right in to solve his own pain point: years of accumulated desktop chaos.

The Problem

Digital Chaos

2-3 years of accumulated junk on desktop
Vague file names
Duplicates everywhere
Random downloads
Must click files to discover contents

Why Not Traditional Methods?

Deterministic methods (e.g., K-means clustering) don't produce intuitive folder names
They don't structure files according to personal preferences
The task needs something smarter and more contextual

The Solution: Rosie the Robot

Vision

A digital maid that:

Asks a few questions
Proposes changes
Gets approval (human-in-the-loop)
Safely organizes your computer
Reports what it did
Doesn't need to run again

Current Implementation Status

CLI interface (command-line tool)
SQLite event store for state management
LangGraph-ready architecture
Deterministic core (no LLM yet - coming later)
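
To make the event-store idea concrete, here is a minimal sketch of an append-only SQLite log. The table layout and the names `open_store`, `append_event`, and `replay` are assumptions for illustration; the talk did not show Rosie's actual schema.

```python
import json
import sqlite3


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a hypothetical append-only event table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        " id INTEGER PRIMARY KEY AUTOINCREMENT,"
        " kind TEXT NOT NULL,"
        " payload TEXT NOT NULL)"
    )
    return conn


def append_event(conn: sqlite3.Connection, kind: str, payload: dict) -> int:
    """Record one event as JSON and return its row id."""
    cur = conn.execute(
        "INSERT INTO events (kind, payload) VALUES (?, ?)",
        (kind, json.dumps(payload)),
    )
    conn.commit()
    return cur.lastrowid


def replay(conn: sqlite3.Connection) -> list[tuple[str, dict]]:
    """Return all events in insertion order so state can be rebuilt."""
    rows = conn.execute("SELECT kind, payload FROM events ORDER BY id")
    return [(kind, json.loads(payload)) for kind, payload in rows]
```

Because events are only appended, the current state can be reconstructed by replaying the log, which pairs naturally with undo and checkpointing.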

Technical Architecture

Core Components

1. File Scan Tool

Walks through directories
Scans for all files
Builds inventory of chaos
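
A scan step like the one described could be sketched as follows. `FileRecord` and `scan_files` are hypothetical names, and the fields collected here (path, size, extension) are an assumption about what an inventory might hold.

```python
import os
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class FileRecord:
    path: Path   # full path to the file
    size: int    # size in bytes
    suffix: str  # lowercased extension, e.g. ".pdf"


def scan_files(root: Path) -> list[FileRecord]:
    """Walk `root` recursively and return one record per file found."""
    records = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            try:
                size = path.stat().st_size
            except OSError:
                # Skip files that vanish or cannot be read mid-scan.
                continue
            records.append(FileRecord(path, size, path.suffix.lower()))
    return records
```

Calling `scan_files(Path.home() / "Downloads")` would return the inventory as a plain list that later steps can cluster.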

2. Embeddings Tool with Fallbacks

Primary: HDBSCAN clustering
Fallback 1: Agglomerative clustering
Fallback 2: Cosine similarity
Naming: TF-IDF for cluster names
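
As an illustration of TF-IDF naming, the sketch below treats each cluster as one document built from its file-name tokens and labels it with the highest-scoring tokens. It is a standard-library approximation, not Rosie's code: `tokenize`, `name_clusters`, and the smoothed IDF formula are assumptions, and the talk did not say which text the real tool feeds into TF-IDF.

```python
import math
import re
from collections import Counter
from pathlib import PurePath


def tokenize(filename: str) -> list[str]:
    """Split a file name (without extension) into lowercase word tokens."""
    return re.findall(r"[a-z]+", PurePath(filename).stem.lower())


def name_clusters(clusters: dict[int, list[str]], top_k: int = 2) -> dict[int, str]:
    """Label each cluster with its `top_k` highest TF-IDF tokens."""
    # Treat each cluster as one "document" made of its file-name tokens.
    counts = {
        cid: Counter(tok for name in files for tok in tokenize(name))
        for cid, files in clusters.items()
    }
    n_docs = len(counts)
    # Number of clusters in which each token appears at least once.
    doc_freq = Counter(tok for c in counts.values() for tok in c)
    labels = {}
    for cid, c in counts.items():
        total = sum(c.values()) or 1
        scores = {
            # Smoothed IDF keeps weights positive even for shared tokens.
            tok: (n / total) * (math.log((1 + n_docs) / (1 + doc_freq[tok])) + 1)
            for tok, n in c.items()
        }
        # Highest score first; ties are broken alphabetically.
        best = sorted(scores, key=lambda tok: (-scores[tok], tok))[:top_k]
        labels[cid] = "_".join(best) if best else f"cluster_{cid}"
    return labels
```

Tokens that appear in every cluster receive a lower weight, so distinctive words tend to win the label.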

3. Tree Shaper

Keeps directory structure flat
Prevents overly deep nesting
Maintains accessibility
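
One way to keep a proposed tree flat is to cap the number of folder levels in each destination path. The sketch below assumes non-empty relative POSIX-style paths; `cap_depth` and the default of two levels are hypothetical choices, not values from the talk.

```python
from pathlib import PurePosixPath


def cap_depth(relative_path: str, max_depth: int = 2) -> str:
    """Keep at most `max_depth` folder levels, dropping the deeper ones."""
    parts = PurePosixPath(relative_path).parts
    *folders, filename = parts  # assumes a non-empty path
    return str(PurePosixPath(*folders[:max_depth], filename))
```

For example, `cap_depth("work/2023/q1/drafts/report.docx")` yields `work/2023/report.docx`. Flattening can make two files land on the same name, so a real tool would also need a collision check before moving anything.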

4. Checkpointing & File Ops

Safe operations with undo capability
Windows recycle bin integration
Non-destructive by default
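
Undo capability can be approximated with a journal that records each move and replays it in reverse. This is a sketch under assumptions: `MoveJournal` is a hypothetical class, it keeps its history in memory rather than in a checkpoint store, and it does not show the recycle bin integration.

```python
import shutil
from pathlib import Path


class MoveJournal:
    """Records file moves so they can be undone in reverse order."""

    def __init__(self) -> None:
        self._moves: list[tuple[Path, Path]] = []

    def move(self, src: Path, dst: Path) -> None:
        """Move `src` to `dst`, refusing to overwrite an existing file."""
        if dst.exists():
            raise FileExistsError(f"refusing to overwrite {dst}")
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(dst))
        self._moves.append((src, dst))

    def undo_all(self) -> None:
        """Move every file back to where it came from, newest move first."""
        while self._moves:
            src, dst = self._moves.pop()
            src.parent.mkdir(parents=True, exist_ok=True)
            shutil.move(str(dst), str(src))
```

Folders created during `move` are left in place after `undo_all`; removing empty ones would be a separate cleanup step.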

Three-Agent Architecture (Planned)

Planner Agent
- Scans everything
- Creates organization strategy
Reviewer Agent
- Human-in-the-loop interface
- Asks what to keep/rename
- Gathers user preferences
Executor Agent
- Combines algorithmic clustering
- Applies user preferences
- Performs actual file operations
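
The planned hand-off between the three agents can be pictured as three plain functions chained together. This sketch deliberately avoids LangGraph and any LLM; the `Move` type, the extension-based grouping in `planner`, and the `approve` callback are stand-ins chosen for illustration, not the planned design.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Move:
    src: str
    dst_folder: str
    approved: bool = False


def planner(files: list[str]) -> list[Move]:
    """Propose a destination folder per file (here: grouped by extension)."""
    moves = []
    for name in files:
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else "misc"
        moves.append(Move(src=name, dst_folder=ext))
    return moves


def reviewer(moves: list[Move], approve: Callable[[Move], bool]) -> list[Move]:
    """Ask the human (via the `approve` callback) about each proposal."""
    for move in moves:
        move.approved = bool(approve(move))
    return moves


def executor(moves: list[Move]) -> list[str]:
    """Return the operations that would run; only approved moves pass."""
    return [f"{m.src} -> {m.dst_folder}/{m.src}" for m in moves if m.approved]
```

A run looks like `executor(reviewer(planner(files), approve))`, where `approve` could prompt on the command line or read stored preferences.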

Live Demo

Dry Run Scan

James demonstrated scanning his downloads folder:

Less than 1 minute execution
Generated cluster proposals
Created directory structure plan
Showed confidence levels for moves

Key Features Shown

Two action tables: Create directories & Move files
Confidence levels (currently hardcoded at 0.6)
Largest folders analysis
Safety first: Won't execute without explicit "yes"
Recycle bin integration: All deletions recoverable
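
The explicit "yes" gate is simple to sketch. The function names and prompt text below are assumptions; the point is only that anything other than a typed "yes" leaves the plan unexecuted.

```python
from typing import Callable


def confirm(prompt: str, ask: Callable[[str], str] = input) -> bool:
    """Return True only when the user types "yes" (case-insensitive)."""
    return ask(prompt).strip().lower() == "yes"


def run_plan(actions: list[str], ask: Callable[[str], str] = input) -> list[str]:
    """Return the actions to execute, or an empty list if the user declines."""
    if not actions:
        return []
    if not confirm(f"Apply {len(actions)} actions? Type 'yes' to continue: ", ask):
        return []
    # A real tool would perform the file operations here.
    return list(actions)
```

Passing `ask` as a parameter keeps the gate testable without a terminal; shortcuts such as "y" are rejected on purpose.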

Developer Cleanup Mode

Special mode for developers:

Finds common caches
Identifies virtual environments
Safely removes with recycle bin backup
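
A detection pass for this mode might look like the sketch below. The cache folder names are an assumed starting list, and treating any directory that contains `pyvenv.cfg` as a Python virtual environment is a heuristic; neither is confirmed as what Rosie checks. The function only reports candidates and removes nothing.

```python
import os
from pathlib import Path

# Hypothetical list of cache folder names worth flagging.
CACHE_DIR_NAMES = {"__pycache__", ".pytest_cache", ".mypy_cache", "node_modules"}


def find_cleanup_candidates(root: Path) -> dict[str, list[Path]]:
    """Report cache folders and virtual environments under `root`."""
    found: dict[str, list[Path]] = {"caches": [], "venvs": []}
    for dirpath, dirnames, filenames in os.walk(root):
        here = Path(dirpath)
        if "pyvenv.cfg" in filenames:
            found["venvs"].append(here)
            dirnames[:] = []  # do not descend into the environment
            continue
        for name in list(dirnames):
            if name in CACHE_DIR_NAMES:
                found["caches"].append(here / name)
                dirnames.remove(name)  # do not descend into the cache
    return found
```

Pruning `dirnames` in place stops `os.walk` from descending into folders already flagged, so caches nested inside a virtual environment are not listed twice.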

Development Philosophy

Start Simple, Add Intelligence

Phase 1: Deterministic core (current)
Phase 2: Add LLM capabilities
Phase 3: Full LangGraph integration

Key Principles

One painful use case (messy desktop)
Clear definition of done (clean desktop)
Human-in-the-loop gates on risky steps
Build deterministic first, add AI later

Technical Details

Current Stack

Python
SQLite for event store
Pydantic (learning)
LangGraph skeleton (ready for integration)

Clustering Approach

HDBSCAN for primary clustering
TF-IDF for naming
Confidence scoring (to be refined)
Multiple fallback algorithms
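
The fallback idea can be sketched as a chain that tries each strategy in order and returns the first one that succeeds. Everything here is an assumption for illustration: `cluster_with_fallbacks` is a hypothetical helper, and `cosine_groups` is a deliberately simple greedy grouping standing in for the last-resort cosine-similarity step. An HDBSCAN or agglomerative function from a clustering library would be passed in ahead of it.

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]


def _cosine(a: Vector, b: Vector) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def cosine_groups(vectors: Sequence[Vector], threshold: float = 0.8) -> list[int]:
    """Greedy grouping: join the first group whose seed is similar enough."""
    labels: list[int] = []
    seeds: list[Vector] = []
    for vec in vectors:
        for gid, seed in enumerate(seeds):
            if _cosine(vec, seed) >= threshold:
                labels.append(gid)
                break
        else:
            # No existing group matched, so this vector seeds a new one.
            seeds.append(vec)
            labels.append(len(seeds) - 1)
    return labels


def cluster_with_fallbacks(
    vectors: Sequence[Vector],
    strategies: Sequence[tuple[str, Callable[[Sequence[Vector]], list[int]]]],
) -> tuple[str, list[int]]:
    """Try each (name, function) pair in order; return the first success."""
    errors = []
    for name, fn in strategies:
        try:
            labels = fn(vectors)
            if len(labels) != len(vectors):
                raise ValueError("label count does not match input size")
            return name, labels
        except Exception as exc:  # any failure moves on to the next strategy
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all clustering strategies failed: " + "; ".join(errors))
```

Returning the strategy name alongside the labels lets a report state which algorithm actually produced the proposal.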

Q&A Insights

Q: How is confidence calculated?

Currently hardcoded at 0.6
Planning to implement proper confidence scoring
Will use classification and assignment confidence
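
One possible shape for an assignment-confidence score, replacing the hardcoded 0.6, is the margin between a file's best and second-best cluster match. This is a guess at the direction rather than James's planned method; `assignment_confidence`, the centroid inputs, and the clamping to [0, 1] are all assumptions.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity, returning 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def assignment_confidence(
    vec: Vector, centroids: dict[str, Vector]
) -> tuple[str, float]:
    """Return the best-matching cluster and a margin-based confidence."""
    sims = sorted(
        ((cosine(vec, centroid), name) for name, centroid in centroids.items()),
        reverse=True,
    )
    best_sim, best_name = sims[0]
    runner_up = sims[1][0] if len(sims) > 1 else 0.0
    # A large gap to the runner-up means the assignment is unambiguous.
    confidence = max(0.0, min(1.0, best_sim - runner_up))
    return best_name, confidence
```

A file that sits equally close to two clusters scores near 0, which is the case where asking the user makes the most sense.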

Q: Is anyone building end-to-end applications?

James asked the community about their focus
Mix of internal dev tooling and end-user applications
Goal: Make this downloadable as .exe for non-developers

Q: Interesting approach combining traditional ML with LLMs

Using established clustering algorithms
LLMs will enhance, not replace, deterministic core
Best of both worlds approach

Roadmap

Next Steps

Implement actual confidence calculations
Add LangGraph agents
Build human-in-the-loop UI
Create executable for non-developers
Add LLM enhancement layer

Future Features

Smart naming based on content
Learning from user corrections
Pattern recognition for file types
Automated scheduling

Key Takeaways

Start with deterministic solutions before adding LLMs
Traditional data science still valuable in AI era
Human-in-the-loop essential for file operations
Safety first: Always provide undo options
Solve your own problems - best motivation

Development Approach

James emphasized getting feedback early:

"Instead of overbuilding before I get feedback, this is feedback"
Live production development
Community-driven improvements
Iterative enhancement

Resources

📺 Watch James's Rosie the Robot Talk on YouTube
Follow James on X: @jamescoffey
Project updates at next AIMUG meetings
Background: Data science, MLOps, developer advocacy

About the Speaker

James Coffey is new to the Austin AI community (2 months) but brings extensive experience in data science and MLOps. Previously a developer advocate, he's diving into AI engineering by solving his own messy desktop problem. This is his first LangChain/LangGraph project, demonstrating that the community welcomes all skill levels.
