Key improvements:
- Implement relaxed model matching to handle modern naming conventions (e.g., qwen3:4b-instruct-2507-q4_K_M)
- Add smart auto-selection prioritizing Qwen3 series over older models
- Replace rigid pattern matching with flexible base+size matching
- Add comprehensive logging for model resolution transparency
- Introduce new 'models' command for detailed model status reporting
- Improve pip installation feedback with progress indication
- Fix Python syntax warning in GitHub template script
The enhanced system now provides clear visibility into model selection
decisions and gracefully handles various model naming patterns without
requiring complex configuration.
- Applied Black formatter and isort across entire codebase for professional consistency
- Moved implementation scripts (rag-mini.py, rag-tui.py) to bin/ directory for cleaner root
- Updated shell scripts to reference new bin/ locations maintaining user compatibility
- Added comprehensive linting configuration (.flake8, pyproject.toml) with dedicated .venv-linting
- Removed development artifacts (commit_message.txt, GET_STARTED.md duplicate) from root
- Consolidated documentation and fixed script references across all guides
- Relocated test_fixes.py to proper tests/ directory
- Enhanced project structure following Python packaging standards
All user commands work identically while improving code organization and beginner accessibility.
Enable 'rag-mini check-update' and 'rag-mini update' commands
by routing them through to the Python script.
✅ Commands now work:
- rag-mini check-update (shows available updates)
- rag-mini update (installs updates with confirmation)
- Regular commands show discrete notifications
🔧 Fix: Shell wrapper now properly routes update commands
to rag-mini.py instead of showing 'unknown command' error.
Major fixes:
- Fix model selection to prioritize qwen3:1.7b instead of qwen3:4b for testing
- Correct context length from 80,000 to 32,000 tokens (proper Qwen3 limit)
- Implement content-preserving safeguards instead of dropping responses
- Fix all test imports from claude_rag to mini_rag module naming
- Add virtual environment warnings to all test entry points
- Fix TUI EOF crash handling with proper error handling
- Remove warmup delays that were causing startup lag and unwanted model calls
- Fix command mappings between bash wrapper and Python script
- Update documentation to reflect qwen3:1.7b as primary recommendation
- Improve TUI box alignment and formatting
- Make language generic for any documents, not just codebases
- Add proper folder names in user feedback instead of generic terms
Technical improvements:
- Unified model rankings across all components
- Better error handling for missing dependencies
- Comprehensive testing and validation of all fixes
- All tests now pass and system is deployment-ready
All major crashes and deployment issues resolved.
🔧 Script Handling Improvements:
- Fix infinite recursion in bash wrapper for index/search commands
- Improve embedding system diagnostics with intelligent detection
- Add timeout protection and progress indicators to installer test
- Enhance interactive input handling with graceful fallbacks
🎯 User Experience Enhancements:
- Replace confusing error messages with educational diagnostics
- Add RAG performance tips about model sizing (4B optimal, 8B+ overkill)
- Correct model recommendations (qwen3:4b not qwen3:3b)
- Smart Ollama model detection shows available models
- Clear guidance for next steps after installation
🛠 Technical Fixes:
- Add get_embedding_info() method to CodeEmbedder class
- Robust test prompt handling with /dev/tty input
- Path validation and permission fixing in test scripts
- Comprehensive error diagnostics with actionable solutions
Installation now completes reliably with clear feedback and guidance.