Rosie the Robot - Your Digital Desktop Maid
Speaker: James Coffey
Duration: 15 minutes
Time: 7:15 PM - 7:30 PM
Overview
James Coffey presented Rosie the Robot, a desktop organization assistant that tackles the universal problem of messy digital workspaces. As a newcomer to AIMUG (attending for just 2 months), James jumped right in to solve his own pain point: years of accumulated desktop chaos.
📺 Watch the Talk
The Problem
Digital Chaos
- 2-3 years of accumulated junk on desktop
- Vague file names
- Duplicates everywhere
- Random downloads
- Must click files to discover contents
Why Not Traditional Methods?
- Deterministic methods (K-means clustering) don't provide intuitive naming
- Don't structure files according to personal preferences
- Need something smarter and more contextual
The Solution: Rosie the Robot
Vision
A digital maid that:
- Asks a few questions
- Proposes changes
- Gets approval (human-in-the-loop)
- Safely organizes your computer
- Reports what it did
- Doesn't need to run again
Current Implementation Status
- CLI interface (command-line tool)
- SQLite event store for state management
- LangGraph-ready architecture
- Deterministic core (no LLM yet - coming later)
Technical Architecture
Core Components
1. File Scan Tool
- Walks through directories
- Scans for all files
- Builds inventory of chaos
2. Embeddings Tool with Fallbacks
- Primary: HDB scan clustering
- Fallback 1: Agglomerative clustering
- Fallback 2: Cosine similarity
- Naming: TF-IDF for cluster names
3. Tree Shaper
- Keeps directory structure flat
- Prevents overly deep nesting
- Maintains accessibility
4. Checkpointing & File Ops
- Safe operations with undo capability
- Windows recycle bin integration
- Non-destructive by default
Three-Agent Architecture (Planned)
- Planner Agent
- Scans everything
- Creates organization strategy
- Reviewer Agent
- Human-in-the-loop interface
- Asks what to keep/rename
- Gathers user preferences
- Executor Agent
- Combines algorithmic clustering
- Applies user preferences
- Performs actual file operations
Live Demo
Dry Run Scan
James demonstrated scanning his downloads folder:
- Less than 1 minute execution
- Generated cluster proposals
- Created directory structure plan
- Showed confidence levels for moves
Key Features Shown
- Two action tables: Create directories & Move files
- Confidence levels (currently hardcoded at 0.6)
- Largest folders analysis
- Safety first: Won't execute without explicit "yes"
- Recycle bin integration: All deletions recoverable
Developer Cleanup Mode
Special mode for developers:
- Finds common caches
- Identifies virtual environments
- Safely removes with recycle bin backup
Development Philosophy
Start Simple, Add Intelligence
- Phase 1: Deterministic core (current)
- Phase 2: Add LLM capabilities
- Phase 3: Full LangGraph integration
Key Principles
- One painful use case (messy desktop)
- Clear definition of done (clean desktop)
- Human-in-the-loop gates on risky steps
- Build deterministic first, add AI later
Technical Details
Current Stack
- Python
- SQLite for event store
- Pydantic (learning)
- LangGraph skeleton (ready for integration)
Clustering Approach
- HDB scan for primary clustering
- TF-IDF for naming
- Confidence scoring (to be refined)
- Multiple fallback algorithms
Q&A Insights
Q: How is confidence calculated?
- Currently hardcoded at 0.6
- Planning to implement proper confidence scoring
- Will use classification and assignment confidence
Q: Is anyone building end-to-end applications?
- James asked the community about their focus
- Mix of internal dev tooling and end-user applications
- Goal: Make this downloadable as .exe for non-developers
Q: Interesting approach combining traditional ML with LLMs
- Using established clustering algorithms
- LLMs will enhance, not replace, deterministic core
- Best of both worlds approach
Roadmap
Next Steps
- Implement actual confidence calculations
- Add LangGraph agents
- Build human-in-the-loop UI
- Create executable for non-developers
- Add LLM enhancement layer
Future Features
- Smart naming based on content
- Learning from user corrections
- Pattern recognition for file types
- Automated scheduling
Key Takeaways
- Start with deterministic solutions before adding LLMs
- Traditional data science still valuable in AI era
- Human-in-the-loop essential for file operations
- Safety first: Always provide undo options
- Solve your own problems - best motivation
Development Approach
James emphasized getting feedback early:
- "Instead of overbuilding before I get feedback, this is feedback"
- Live production development
- Community-driven improvements
- Iterative enhancement
Resources
- 📺 Watch James's Rosie the Robot Talk on YouTube
- Follow James on X: @jamescoffey
- Project updates at next AIMUG meetings
- Background: Data science, MLOps, developer advocacy
About the Speaker
James Coffey is new to the Austin AI community (2 months) but brings extensive experience in data science and MLOps. Previously a developer advocate, he's diving into AI engineering by solving his own messy desktop problem. This is his first LangChain/LangGraph project, demonstrating that the community welcomes all skill levels.