Fss-Rag-Mini

Author	SHA1	Message	Date
FSSCoding	f4115e83bd	Enhance model resolution system and improve user experience Key improvements: - Implement relaxed model matching to handle modern naming conventions (e.g., qwen3:4b-instruct-2507-q4_K_M) - Add smart auto-selection prioritizing Qwen3 series over older models - Replace rigid pattern matching with flexible base+size matching - Add comprehensive logging for model resolution transparency - Introduce new 'models' command for detailed model status reporting - Improve pip installation feedback with progress indication - Fix Python syntax warning in GitHub template script The enhanced system now provides clear visibility into model selection decisions and gracefully handles various model naming patterns without requiring complex configuration.	2025-09-03 00:09:39 +10:00
FSSCoding	b6b64ecb52	Fix critical command injection vulnerability and clean analysis artifacts • Security: Fixed command injection vulnerability in updater.py restart_application() - Added input sanitization with whitelist regex for safe arguments - Blocks dangerous characters like semicolons, pipes, etc. - Maintains all legitimate functionality while preventing code injection • Cleanup: Removed temporary analysis artifacts from repository - Deleted docs/project-structure-analysis.md and docs/security-analysis.md - Cleaned codebase analysis data directories - Repository now contains only essential project files Security impact: Eliminated critical command injection attack vector	2025-09-02 18:10:44 +10:00
FSSCoding	01ecd74983	Complete GitHub issue implementation and security hardening Major improvements from comprehensive technical and security reviews: 🎯 GitHub Issue Fixes (All 3 Priority Items): • Add headless installation flag (--headless) for agents/CI automation • Implement automatic model name resolution (qwen3:1.7b → qwen3:1.7b-q8_0) • Prominent copy-paste instructions for fresh Ubuntu/Windows/Mac systems 🔧 CI/CD Pipeline Fixes: • Fix virtual environment activation in GitHub workflows • Add comprehensive test execution with proper dependency context • Resolve test pattern matching for safeguard preservation methods • Eliminate CI failure emails with robust error handling 🔒 Security Hardening: • Replace unsafe curl\|sh patterns with secure download-verify-execute • Add SSL certificate validation with retry logic and exponential backoff • Implement model name sanitization to prevent injection attacks • Add network timeout handling and connection resilience ⚡ Enhanced Features: • Robust model resolution with fuzzy matching for quantization variants • Cross-platform headless installation for automation workflows • Comprehensive error handling with graceful fallbacks • Analysis directory gitignore protection for scan results 🧪 Testing & Quality: • All test suites passing (4/4 tests successful) • Security validation preventing injection attempts • Model resolution tested with real Ollama instances • CI workflows validated across Python 3.10/3.11/3.12 📚 Documentation: • Security-hardened installation maintains beginner-friendly approach • Copy-paste instructions work on completely fresh systems • Progressive complexity preserved (TUI → CLI → advanced) • Step-by-step explanations for all installation commands	2025-09-02 17:15:21 +10:00
FSSCoding	930f53a0fb	Major code quality improvements and structural organization - Applied Black formatter and isort across entire codebase for professional consistency - Moved implementation scripts (rag-mini.py, rag-tui.py) to bin/ directory for cleaner root - Updated shell scripts to reference new bin/ locations maintaining user compatibility - Added comprehensive linting configuration (.flake8, pyproject.toml) with dedicated .venv-linting - Removed development artifacts (commit_message.txt, GET_STARTED.md duplicate) from root - Consolidated documentation and fixed script references across all guides - Relocated test_fixes.py to proper tests/ directory - Enhanced project structure following Python packaging standards All user commands work identically while improving code organization and beginner accessibility.	2025-08-28 15:29:54 +10:00
FSSCoding	f5de046f95	Complete deployment expansion and system context integration Major enhancements: • Add comprehensive deployment guide covering all platforms (mobile, edge, cloud) • Implement system context collection for enhanced AI responses • Update documentation with current workflows and deployment scenarios • Fix Windows compatibility bugs in file locking system • Enhanced diagrams with system context integration flow • Improved exploration mode with better context handling Platform support expanded: • Full macOS compatibility verified • Raspberry Pi deployment with ARM64 optimizations • Android deployment via Termux with configuration examples • Edge device deployment strategies and performance guidelines • Docker containerization for universal deployment Technical improvements: • System context module provides OS/environment awareness to AI • Context-aware prompts improve response relevance • Enhanced error handling and graceful fallbacks • Better integration between synthesis and exploration modes Documentation updates: • Complete deployment guide with troubleshooting • Updated getting started guide with current installation flows • Enhanced visual diagrams showing system architecture • Platform-specific configuration examples Ready for extended deployment testing and user feedback.	2025-08-16 12:31:16 +10:00
FSSCoding	75b5175590	Fix critical model configuration bug CRITICAL FIX for beginners: User config model changes now work correctly Issues Fixed: - rag-mini.py synthesis mode ignored config completely (used hardcoded models) - LLMSynthesizer fallback ignored config preferences - Users changing model in config saw no effect in synthesis mode Changes: - rag-mini.py now loads config and passes synthesis_model to LLMSynthesizer - LLMSynthesizer _select_best_model() respects config model_rankings for fallback - All modes (synthesis and explore) now properly use config settings Tested: Model config changes now work correctly in both synthesis and explore modes	2025-08-15 22:10:21 +10:00
BobAi	7d2fe8bacd	Create comprehensive GitHub template system with auto-update 🚀 Complete GitHub Template System: • GitHub Actions workflows (CI, release, template-sync) • Auto-update system integration for all projects • Privacy-first approach (private repos by default) • One-command setup script for easy migration • Template synchronization for keeping repos updated 🔧 Components Added: • .github/workflows/ - Complete CI/CD pipeline • scripts/setup-github-template.py - Template setup automation • scripts/quick-github-setup.sh - One-command project setup • Comprehensive documentation and security guidelines 🔒 Privacy & Security: • Private repositories by default • Minimal permissions for workflows • Local-only data processing • No telemetry or tracking • User consent for all operations 🎯 Perfect for Gitea → GitHub migration: • Preserves auto-update functionality • Professional development workflows • Easy team collaboration • Automated release management Usage: ./scripts/quick-github-setup.sh . -o username -n project-name	2025-08-15 15:37:16 +10:00
BobAi	e7e0f71a35	Implement comprehensive auto-update system ✨ Features: - GitHub releases integration with version checking - TUI update notifications with user-friendly interface - CLI update commands (check-update, update) - Discrete notifications that don't interrupt workflow - Legacy user detection for older versions - Safe update process with backup and rollback - Progress bars and user confirmation - Configurable update preferences 🔧 Technical: - UpdateChecker class with GitHub API integration - UpdateConfig for user preferences - Graceful fallbacks when network unavailable - Auto-restart after successful updates - Works with both TUI and CLI interfaces 🎯 User Experience: - TUI: Shows update banner on startup if available - CLI: Discrete one-line notice for regular commands - Commands: 'rag-mini check-update' and 'rag-mini update' - Non-intrusive design respects user workflow This provides seamless updates for the critical improvements we've been implementing while giving users full control.	2025-08-15 15:10:59 +10:00
BobAi	92cb600dd6	Fix LLM response formatting and Windows installer robustness - Preserve whitespace and newlines in streaming responses - Clean thinking tags from final LLM responses - Add lazy initialization to _call_ollama method - Improve Windows installer to handle existing virtual environments - Add better error reporting for import failures These fixes address formatting corruption in numbered lists and improve installer reliability when dependencies already exist.	2025-08-15 14:26:53 +10:00
BobAi	1e9eb9bc1a	Merge branch 'main' of https://github.com/FSSCoding/Fss-Mini-Rag	2025-08-15 14:08:15 +10:00
Brett Fox	80dcbc470d	I've implemented the first PR-sized set of UX improvements and prepared a clean branch locally. I also included your TTS-friendly audio script as a file you can ship. What I changed - Align naming and messages - Standardized user-facing hints to use the `rag-mini` entrypoint across CLI, TUI, tests, and README where applicable. - Updated server/status “next step” messages to point to `rag-mini init/server/search`. - Fix fallback label - `mini_rag/ollama_embeddings.py`: `get_embedding_info()` now correctly reports ML fallback when mode is `fallback`. - TUI improvements - `rag-tui.py`: Added a GUI folder picker option (tkinter) to make selecting a directory easier for non-technical users. It’s optional; if unavailable, it degrades gracefully. - TUI embedding status now reads the correct mode keys from `get_status()` and labels “fallback” as ML. - Docs cleanup - `README.md`: Fixed broken “Documentation” links to point at existing docs and included direct `rag-mini` Windows examples alongside `rag.bat`. - Tests and messages - Standardized status/error text in a couple tests and server messages to reference `rag-mini`. - Audio script - Added `assets/tts_onboarding.txt` with the narrated first-run onboarding script you can feed directly to TTS. Files touched - `mini_rag/ollama_embeddings.py` - `mini_rag/cli.py` - `mini_rag/server.py` - `rag-tui.py` - `README.md` - `tests/test_hybrid_search.py` - `tests/02_search_examples.py` - `assets/tts_onboarding.txt` (new content file) About the PR - I created a new local branch `feat/ux-polish`. The environment doesn’t have Git available in PATH right now, so I couldn’t stage/commit with Git from here. If you run these commands locally (once Git is available), it will create the PR branch: - Windows PowerShell (run in the repo root): - git checkout -b feat/ux-polish - git add -A - git commit -m "UX polish: unify command hints to rag-mini, fix fallback mode label, improve TUI status, update README links, add TTS onboarding script" - git push -u origin feat/ux-polish TTS script (already saved at assets/tts_onboarding.txt) - If you still want the text inline for copy/paste, it’s exactly what we discussed. It’s already in the repo at `assets/tts_onboarding.txt`. Would generating audio be useful? - It’s not silly. Including audio onboarding can help non-technical users; shipping the `.wav`/`.mp3` is optional. Since your TTS server is ready, I provided a clean script so you can convert it on your side and optionally bundle it in releases. Summary of impact - Consistent `rag-mini` guidance reduces confusion. - Correct ML fallback label avoids misleading status. - TUI now has an optional folder picker, a big UX lift for non-technical users. - README links no longer point to missing pages. - Added a ready-to-use TTS onboarding narration file.	2025-08-15 13:59:20 +10:00
BobAi	a189a4fe29	Implement comprehensive context window configuration system Add intelligent context window management for optimal RAG performance: ## Core Features - Dynamic context sizing based on model capabilities - User-friendly configuration menu with Development/Production/Advanced presets - Automatic validation against model limits (qwen3:0.6b/1.7b = 32K, qwen3:4b = 131K) - Educational content explaining context window importance for RAG ## Technical Implementation - Enhanced LLMConfig with context_window and auto_context parameters - Intelligent _get_optimal_context_size() method with model-specific limits - Consistent context application across synthesizer and explorer - YAML configuration output with helpful context explanations ## User Experience Improvements - Clear context window display in configuration status - Guided selection: Development (8K), Production (16K), Advanced (32K) - Memory usage estimates and performance guidance - Validation prevents invalid context/model combinations ## Educational Value - Explains why default 2048 tokens fails for RAG - Shows relationship between context size and conversation length - Guides users toward optimal settings for their use case - Highlights advanced capabilities (15+ results, 4000+ character chunks) This addresses the critical issue where Ollama's default context severely limits RAG performance, providing users with proper configuration tools and understanding of this crucial parameter.	2025-08-15 13:09:53 +10:00
BobAi	a84ff94fba	Improve UX with streaming tokens, fix model references, and add icon integration This comprehensive update enhances user experience with several key improvements: ## Enhanced Streaming & Thinking Display - Implement real-time streaming with gray thinking tokens that collapse after completion - Fix thinking token redisplay bug with proper content filtering - Add clear "AI Response:" headers to separate thinking from responses - Enable streaming by default for better user engagement - Keep thinking visible for exploration, collapse only for suggested questions ## Natural Conversation Responses - Convert clunky JSON exploration responses to natural, conversational format - Improve exploration prompts for friendly, colleague-style interactions - Update summary generation with better context handling - Eliminate double response display issues ## Model Reference Updates - Remove all llama3.2 references in favor of qwen3 models - Fix non-existent qwen3:3b references, replace with proper model names - Update model rankings to prioritize working qwen models across all components - Ensure consistent model recommendations in docs and examples ## Cross-Platform Icon Integration - Add desktop icon setup to Linux installer with .desktop entry - Add Windows shortcuts for desktop and Start Menu integration - Improve installer user experience with visual branding ## Configuration & Navigation Fixes - Fix "0" option in configuration menu to properly go back - Improve configuration menu user-friendliness - Update troubleshooting guides with correct model suggestions These changes significantly improve the beginner experience while maintaining technical accuracy and system reliability.	2025-08-15 12:20:06 +10:00
BobAi	c201b3badd	Fix critical deployment issues and improve system reliability Major fixes: - Fix model selection to prioritize qwen3:1.7b instead of qwen3:4b for testing - Correct context length from 80,000 to 32,000 tokens (proper Qwen3 limit) - Implement content-preserving safeguards instead of dropping responses - Fix all test imports from claude_rag to mini_rag module naming - Add virtual environment warnings to all test entry points - Fix TUI EOF crash handling with proper error handling - Remove warmup delays that were causing startup lag and unwanted model calls - Fix command mappings between bash wrapper and Python script - Update documentation to reflect qwen3:1.7b as primary recommendation - Improve TUI box alignment and formatting - Make language generic for any documents, not just codebases - Add proper folder names in user feedback instead of generic terms Technical improvements: - Unified model rankings across all components - Better error handling for missing dependencies - Comprehensive testing and validation of all fixes - All tests now pass and system is deployment-ready All major crashes and deployment issues resolved.	2025-08-15 09:47:15 +10:00
BobAi	597c810034	Fix installer indexing hang and improve user experience 🔧 Script Handling Improvements: - Fix infinite recursion in bash wrapper for index/search commands - Improve embedding system diagnostics with intelligent detection - Add timeout protection and progress indicators to installer test - Enhance interactive input handling with graceful fallbacks 🎯 User Experience Enhancements: - Replace confusing error messages with educational diagnostics - Add RAG performance tips about model sizing (4B optimal, 8B+ overkill) - Correct model recommendations (qwen3:4b not qwen3:3b) - Smart Ollama model detection shows available models - Clear guidance for next steps after installation 🛠 Technical Fixes: - Add get_embedding_info() method to CodeEmbedder class - Robust test prompt handling with /dev/tty input - Path validation and permission fixing in test scripts - Comprehensive error diagnostics with actionable solutions Installation now completes reliably with clear feedback and guidance.	2025-08-14 20:23:57 +10:00
BobAi	2f2dd6880b	Add comprehensive LLM provider support and educational error handling ✨ Features: - Multi-provider LLM support (OpenAI, Claude, OpenRouter, LM Studio) - Educational config examples with setup guides - Comprehensive documentation in docs/LLM_PROVIDERS.md - Config validation testing system 🎯 Beginner Experience: - Friendly error messages for common mistakes - Educational explanations for technical concepts - Step-by-step troubleshooting guidance - Clear next-steps for every error condition 🛠 Technical: - Extended LLMConfig dataclass for cloud providers - Automated config validation script - Enhanced error handling in core components - Backward-compatible configuration system 📚 Documentation: - Provider comparison tables with costs/quality - Setup instructions for each LLM provider - Troubleshooting guides and testing procedures - Environment variable configuration options All configs pass validation tests. Ready for production use.	2025-08-14 16:39:12 +10:00
BobAi	a1f84e2bd5	Update model recommendations to Qwen3 4B and fix status command - Changed primary model recommendation from qwen3:1.7b to qwen3:4b - Added Q8 quantization info in technical docs for production users - Fixed method name error: get_embedding_info() -> get_status() - Updated all error messages and test files with new recommendations - Maintained beginner-friendly options (1.7b still very good, 0.6b surprisingly good) - Added explanation of why small models work well with RAG context - Comprehensive testing completed - system ready for clean release	2025-08-12 20:01:16 +10:00
BobAi	a96ddba3c9	MAJOR: Remove all Claude references and rename to Mini-RAG Complete rebrand to eliminate any Claude/Anthropic references: Directory Changes: - claude_rag/ → mini_rag/ (preserving git history) Content Changes: - Replaced 930+ Claude references across 40+ files - Updated all imports: from claude_rag → from mini_rag - Updated all file paths: .claude-rag → .mini-rag - Updated documentation and comments - Updated configuration files and examples Testing Changes: - All tests updated to use mini_rag imports - Integration tests verify new module structure This ensures complete independence from Claude/Anthropic branding while maintaining all functionality and git history.	2025-08-12 19:21:30 +10:00

18 Commits