3 Commits

Author SHA1 Message Date
8f25e30f52 Rewrite CLAUDE.md and clean project structure
- Rewrote CLAUDE.md with comprehensive development guide
- Archived 20 old docs to docs/archive/
- Added PROJECT_ROADMAP_2025.md with research learnings
- Added CLASSIFICATION_METHODS_COMPARISON.md
- Added SESSION_HANDOVER_20251128.md
- Added tools for analysis (brett_gmail/microsoft analyzers)
- Updated .gitignore for archive folders
- Config changes for local vLLM endpoint
2025-11-28 13:07:27 +11:00
10862583ad Add batch LLM classifier tool with prompt caching optimization
- Created standalone batch_llm_classifier.py for custom email queries
- Optimized all LLM prompts for caching (static instructions first, variables last)
- Configured rtx3090 vLLM endpoint (qwen3-coder-30b)
- Tested batch_size=4 optimal (100% success, 4.65 req/sec)
- Added comprehensive documentation (tools/README.md, BATCH_LLM_QUICKSTART.md)

Tool is completely separate from main ML pipeline - no interference.
Prerequisite: vLLM server must be running at rtx3090.bobai.com.au
2025-11-14 16:01:57 +11:00
22fe08a1a6 Add model integration tools and comprehensive completion assessment
Features:
- Created download_pretrained_model.py for downloading models from URLs
- Created setup_real_model.py for integrating pre-trained LightGBM models
- Generated MODEL_INFO.md with model usage documentation
- Created COMPLETION_ASSESSMENT.md with comprehensive project evaluation
- Framework complete: all 16 phases implemented, 27/30 tests passing
- Model integration ready: tools to download/setup real LightGBM models
- Clear path to production: real model, Gmail OAuth, and deployment ready

This enables:
1. Immediate real model integration without code changes
2. Clear path from mock framework testing to production
3. Support for both downloaded and self-trained models
4. Documented deployment process for 80k+ email processing

Generated with Claude Code

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-21 12:12:52 +11:00