5 Commits

Author SHA1 Message Date
af4db45ce9 Implement global rag-mini command with transparent virtual environment handling
- Create global wrapper script in /usr/local/bin/rag-mini
- Automatically handles virtual environment activation
- Suppress virtual environment warnings when using global wrapper
- Update installation scripts to install global wrapper automatically
- Add comprehensive timing documentation (2-3 min fast, 5-10 min slow internet)
- Add agent warnings for background process execution
- Update all documentation with realistic timing expectations
- Fix README commands to use correct syntax (rag-mini init -p .)

Major improvements:
- Users can now run 'rag-mini' from anywhere without activation
- Installation creates transparent global command automatically
- No more virtual environment complexity for end users
- Comprehensive agent/CI/CD guidance with timeout warnings
- Complete documentation consistency across all files

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-06 21:15:56 +10:00
01ecd74983 Complete GitHub issue implementation and security hardening
Major improvements from comprehensive technical and security reviews:

🎯 GitHub Issue Fixes (All 3 Priority Items):
• Add headless installation flag (--headless) for agents/CI automation
• Implement automatic model name resolution (qwen3:1.7b → qwen3:1.7b-q8_0)
• Prominent copy-paste instructions for fresh Ubuntu/Windows/Mac systems

🔧 CI/CD Pipeline Fixes:
• Fix virtual environment activation in GitHub workflows
• Add comprehensive test execution with proper dependency context
• Resolve test pattern matching for safeguard preservation methods
• Eliminate CI failure emails with robust error handling

🔒 Security Hardening:
• Replace unsafe curl|sh patterns with secure download-verify-execute
• Add SSL certificate validation with retry logic and exponential backoff
• Implement model name sanitization to prevent injection attacks
• Add network timeout handling and connection resilience

 Enhanced Features:
• Robust model resolution with fuzzy matching for quantization variants
• Cross-platform headless installation for automation workflows
• Comprehensive error handling with graceful fallbacks
• Analysis directory gitignore protection for scan results

🧪 Testing & Quality:
• All test suites passing (4/4 tests successful)
• Security validation preventing injection attempts
• Model resolution tested with real Ollama instances
• CI workflows validated across Python 3.10/3.11/3.12

📚 Documentation:
• Security-hardened installation maintains beginner-friendly approach
• Copy-paste instructions work on completely fresh systems
• Progressive complexity preserved (TUI → CLI → advanced)
• Step-by-step explanations for all installation commands
2025-09-02 17:15:21 +10:00
92cb600dd6 Fix LLM response formatting and Windows installer robustness
- Preserve whitespace and newlines in streaming responses
- Clean thinking tags from final LLM responses
- Add lazy initialization to _call_ollama method
- Improve Windows installer to handle existing virtual environments
- Add better error reporting for import failures

These fixes address formatting corruption in numbered lists and
improve installer reliability when dependencies already exist.
2025-08-15 14:26:53 +10:00
a84ff94fba Improve UX with streaming tokens, fix model references, and add icon integration
This comprehensive update enhances user experience with several key improvements:

## Enhanced Streaming & Thinking Display
- Implement real-time streaming with gray thinking tokens that collapse after completion
- Fix thinking token redisplay bug with proper content filtering
- Add clear "AI Response:" headers to separate thinking from responses
- Enable streaming by default for better user engagement
- Keep thinking visible for exploration, collapse only for suggested questions

## Natural Conversation Responses
- Convert clunky JSON exploration responses to natural, conversational format
- Improve exploration prompts for friendly, colleague-style interactions
- Update summary generation with better context handling
- Eliminate double response display issues

## Model Reference Updates
- Remove all llama3.2 references in favor of qwen3 models
- Fix non-existent qwen3:3b references, replace with proper model names
- Update model rankings to prioritize working qwen models across all components
- Ensure consistent model recommendations in docs and examples

## Cross-Platform Icon Integration
- Add desktop icon setup to Linux installer with .desktop entry
- Add Windows shortcuts for desktop and Start Menu integration
- Improve installer user experience with visual branding

## Configuration & Navigation Fixes
- Fix "0" option in configuration menu to properly go back
- Improve configuration menu user-friendliness
- Update troubleshooting guides with correct model suggestions

These changes significantly improve the beginner experience while maintaining
technical accuracy and system reliability.
2025-08-15 12:20:06 +10:00
cc99edde79 Add comprehensive Windows compatibility and enhanced LLM setup
- Add Windows installer (install_windows.bat) and launcher (rag.bat)
- Enhance both Linux and Windows installers with intelligent Qwen3 model detection and setup
- Fix installation script continuation issues and improve user guidance
- Update README with side-by-side Linux/Windows commands
- Auto-save model preferences to config.yaml for consistent experience

Makes FSS-Mini-RAG fully cross-platform with zero-friction Windows adoption 🚀
2025-08-15 10:52:44 +10:00