fss-mini-rag-github

Author	SHA1	Message	Date
FSSCoding	af4db45ce9	Implement global rag-mini command with transparent virtual environment handling - Create global wrapper script in /usr/local/bin/rag-mini - Automatically handles virtual environment activation - Suppress virtual environment warnings when using global wrapper - Update installation scripts to install global wrapper automatically - Add comprehensive timing documentation (2-3 min fast, 5-10 min slow internet) - Add agent warnings for background process execution - Update all documentation with realistic timing expectations - Fix README commands to use correct syntax (rag-mini init -p .) Major improvements: - Users can now run 'rag-mini' from anywhere without activation - Installation creates transparent global command automatically - No more virtual environment complexity for end users - Comprehensive agent/CI/CD guidance with timeout warnings - Complete documentation consistency across all files 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-06 21:15:56 +10:00
FSSCoding	01ecd74983	Complete GitHub issue implementation and security hardening Major improvements from comprehensive technical and security reviews: 🎯 GitHub Issue Fixes (All 3 Priority Items): • Add headless installation flag (--headless) for agents/CI automation • Implement automatic model name resolution (qwen3:1.7b → qwen3:1.7b-q8_0) • Prominent copy-paste instructions for fresh Ubuntu/Windows/Mac systems 🔧 CI/CD Pipeline Fixes: • Fix virtual environment activation in GitHub workflows • Add comprehensive test execution with proper dependency context • Resolve test pattern matching for safeguard preservation methods • Eliminate CI failure emails with robust error handling 🔒 Security Hardening: • Replace unsafe curl|sh patterns with secure download-verify-execute • Add SSL certificate validation with retry logic and exponential backoff • Implement model name sanitization to prevent injection attacks • Add network timeout handling and connection resilience ⚡ Enhanced Features: • Robust model resolution with fuzzy matching for quantization variants • Cross-platform headless installation for automation workflows • Comprehensive error handling with graceful fallbacks • Analysis directory gitignore protection for scan results 🧪 Testing & Quality: • All test suites passing (4/4 tests successful) • Security validation preventing injection attempts • Model resolution tested with real Ollama instances • CI workflows validated across Python 3.10/3.11/3.12 📚 Documentation: • Security-hardened installation maintains beginner-friendly approach • Copy-paste instructions work on completely fresh systems • Progressive complexity preserved (TUI → CLI → advanced) • Step-by-step explanations for all installation commands	2025-09-02 17:15:21 +10:00
BobAi	a84ff94fba	Improve UX with streaming tokens, fix model references, and add icon integration This comprehensive update enhances user experience with several key improvements: ## Enhanced Streaming & Thinking Display - Implement real-time streaming with gray thinking tokens that collapse after completion - Fix thinking token redisplay bug with proper content filtering - Add clear "AI Response:" headers to separate thinking from responses - Enable streaming by default for better user engagement - Keep thinking visible for exploration, collapse only for suggested questions ## Natural Conversation Responses - Convert clunky JSON exploration responses to natural, conversational format - Improve exploration prompts for friendly, colleague-style interactions - Update summary generation with better context handling - Eliminate double response display issues ## Model Reference Updates - Remove all llama3.2 references in favor of qwen3 models - Fix non-existent qwen3:3b references, replace with proper model names - Update model rankings to prioritize working qwen models across all components - Ensure consistent model recommendations in docs and examples ## Cross-Platform Icon Integration - Add desktop icon setup to Linux installer with .desktop entry - Add Windows shortcuts for desktop and Start Menu integration - Improve installer user experience with visual branding ## Configuration & Navigation Fixes - Fix "0" option in configuration menu to properly go back - Improve configuration menu user-friendliness - Update troubleshooting guides with correct model suggestions These changes significantly improve the beginner experience while maintaining technical accuracy and system reliability.	2025-08-15 12:20:06 +10:00
Brett Fox	cc99edde79	Add comprehensive Windows compatibility and enhanced LLM setup - Add Windows installer (install_windows.bat) and launcher (rag.bat) - Enhance both Linux and Windows installers with intelligent Qwen3 model detection and setup - Fix installation script continuation issues and improve user guidance - Update README with side-by-side Linux/Windows commands - Auto-save model preferences to config.yaml for consistent experience Makes FSS-Mini-RAG fully cross-platform with zero-friction Windows adoption 🚀	2025-08-15 10:52:44 +10:00
BobAi	c201b3badd	Fix critical deployment issues and improve system reliability Major fixes: - Fix model selection to prioritize qwen3:1.7b instead of qwen3:4b for testing - Correct context length from 80,000 to 32,000 tokens (proper Qwen3 limit) - Implement content-preserving safeguards instead of dropping responses - Fix all test imports from claude_rag to mini_rag module naming - Add virtual environment warnings to all test entry points - Fix TUI EOF crash handling with proper error handling - Remove warmup delays that were causing startup lag and unwanted model calls - Fix command mappings between bash wrapper and Python script - Update documentation to reflect qwen3:1.7b as primary recommendation - Improve TUI box alignment and formatting - Make language generic for any documents, not just codebases - Add proper folder names in user feedback instead of generic terms Technical improvements: - Unified model rankings across all components - Better error handling for missing dependencies - Comprehensive testing and validation of all fixes - All tests now pass and system is deployment-ready All major crashes and deployment issues resolved.	2025-08-15 09:47:15 +10:00
BobAi	597c810034	Fix installer indexing hang and improve user experience 🔧 Script Handling Improvements: - Fix infinite recursion in bash wrapper for index/search commands - Improve embedding system diagnostics with intelligent detection - Add timeout protection and progress indicators to installer test - Enhance interactive input handling with graceful fallbacks 🎯 User Experience Enhancements: - Replace confusing error messages with educational diagnostics - Add RAG performance tips about model sizing (4B optimal, 8B+ overkill) - Correct model recommendations (qwen3:4b not qwen3:3b) - Smart Ollama model detection shows available models - Clear guidance for next steps after installation 🛠 Technical Fixes: - Add get_embedding_info() method to CodeEmbedder class - Robust test prompt handling with /dev/tty input - Path validation and permission fixing in test scripts - Comprehensive error diagnostics with actionable solutions Installation now completes reliably with clear feedback and guidance.	2025-08-14 20:23:57 +10:00
BobAi	11639c8237	Add Ollama auto-installation and educational LLM model suggestions ✨ Features: - One-click Ollama installation using official script - Educational LLM model recommendations after successful install - Smart 3-option menu: auto-install, manual, or skip - Clear performance vs quality guidance for model selection 🛡 Safety & UX: - Uses official ollama.com/install.sh script - Shows exact commands before execution - Graceful fallback to manual installation - Auto-starts Ollama server and verifies health - Educational approach with size/performance trade-offs 🎯 Model Recommendations: - qwen3:0.6b (lightweight, 400MB) - qwen3:1.7b (balanced, 1GB) - qwen3:3b (excellent for this project, 2GB) - qwen3:8b (premium results, 5GB) - Creative suggestions: mistral for storytelling, qwen3-coder for development Transforms installation from multi-step manual process to guided automation.	2025-08-14 19:50:12 +10:00
BobAi	3fe26ef138	Address PR feedback: Better samples and realistic search examples Based on feedback in PR comment, implemented: Installer improvements: - Added choice between code/docs sample testing - Created FSS-Mini-RAG specific sample files (chunker.py, ollama_integration.py, etc.) - Timing-based estimation for full project indexing - Better sample content that actually relates to this project TUI enhancements: - Replaced generic searches with FSS-Mini-RAG relevant questions: * "chunking strategy" * "ollama integration" * "indexing performance" * "why does indexing take long" - Added search count tracking and sample limitation reminder - Intelligent transition to full project after 2 sample searches - FSS-Mini-RAG specific follow-up question patterns Key fixes: - No more dead search results (removed auth/API queries that don't exist) - Sample questions now match actual content that will be found - User gets timing estimate for full indexing based on sample performance - Clear transition path from sample to full project exploration This prevents the "installed malware" feeling when searches return no results.	2025-08-14 08:55:53 +10:00
BobAi	e6d5f20f7d	Improve installer experience and beginner-friendly features - Replace slow full-project test with fast 3-file sample - Add beginner guidance and welcome messages - Add sample questions to combat prompt paralysis - Add intelligent follow-up question suggestions - Improve TUI with contextual next steps Installer improvements: - Create minimal sample project (3 files) for testing - Add helpful tips and guidance for new users - Better error messaging and progress indicators TUI enhancements: - Welcome message for first-time users - Sample search questions (authentication, error handling, etc.) - Pattern-based follow-up question generation - Contextual suggestions based on search results These changes address user feedback about installation taking too long and beginners not knowing what to search for.	2025-08-14 08:26:22 +10:00
BobAi	a96ddba3c9	MAJOR: Remove all Claude references and rename to Mini-RAG Complete rebrand to eliminate any Claude/Anthropic references: Directory Changes: - claude_rag/ → mini_rag/ (preserving git history) Content Changes: - Replaced 930+ Claude references across 40+ files - Updated all imports: from claude_rag → from mini_rag - Updated all file paths: .claude-rag → .mini-rag - Updated documentation and comments - Updated configuration files and examples Testing Changes: - All tests updated to use mini_rag imports - Integration tests verify new module structure This ensures complete independence from Claude/Anthropic branding while maintaining all functionality and git history.	2025-08-12 19:21:30 +10:00
BobAi	4166d0a362	Initial release: FSS-Mini-RAG - Lightweight semantic code search system 🎯 Complete transformation from 5.9GB bloated system to 70MB optimized solution ✨ Key Features: - Hybrid embedding system (Ollama + ML fallback + hash backup) - Intelligent chunking with language-aware parsing - Semantic + BM25 hybrid search with rich context - Zero-config portable design with graceful degradation - Beautiful TUI for beginners + powerful CLI for experts - Comprehensive documentation with 8+ Mermaid diagrams - Professional animated demo (183KB optimized GIF) 🏗️ Architecture Highlights: - LanceDB vector storage with streaming indexing - Smart file tracking (size/mtime) to avoid expensive rehashing - Progressive chunking: Markdown headers → Python functions → fixed-size - Quality filtering: 200+ chars, 20+ words, 30% alphanumeric content - Concurrent batch processing with error recovery 📦 Package Contents: - Core engine: claude_rag/ (11 modules, 2,847 lines) - Entry points: rag-mini (unified), rag-tui (beginner interface) - Documentation: README + 6 guides with visual diagrams - Assets: 3D icon, optimized demo GIF, recording tools - Tests: 8 comprehensive integration and validation tests - Examples: Usage patterns, config templates, dependency analysis 🎥 Demo System: - Scripted demonstration showing 12 files → 58 chunks indexing - Semantic search with multi-line result previews - Complete workflow from TUI startup to CLI mastery - Professional recording pipeline with asciinema + GIF conversion 🛡️ Security & Quality: - Complete .gitignore with personal data protection - Dependency optimization (removed python-dotenv) - Code quality validation and educational test suite - Agent-reviewed architecture and documentation Ready for production use - copy folder, run ./rag-mini, start searching!	2025-08-12 16:38:28 +10:00

11 Commits