🎯 Complete transformation from 5.9GB bloated system to 70MB optimized solution

✨ Key Features:
- Hybrid embedding system (Ollama + ML fallback + hash backup)
- Intelligent chunking with language-aware parsing
- Semantic + BM25 hybrid search with rich context
- Zero-config portable design with graceful degradation
- Beautiful TUI for beginners + powerful CLI for experts
- Comprehensive documentation with 8+ Mermaid diagrams
- Professional animated demo (183KB optimized GIF)

🏗️ Architecture Highlights:
- LanceDB vector storage with streaming indexing
- Smart file tracking (size/mtime) to avoid expensive rehashing
- Progressive chunking: Markdown headers → Python functions → fixed-size
- Quality filtering: 200+ chars, 20+ words, 30% alphanumeric content
- Concurrent batch processing with error recovery

📦 Package Contents:
- Core engine: claude_rag/ (11 modules, 2,847 lines)
- Entry points: rag-mini (unified), rag-tui (beginner interface)
- Documentation: README + 6 guides with visual diagrams
- Assets: 3D icon, optimized demo GIF, recording tools
- Tests: 8 comprehensive integration and validation tests
- Examples: Usage patterns, config templates, dependency analysis

🎥 Demo System:
- Scripted demonstration showing 12 files → 58 chunks indexing
- Semantic search with multi-line result previews
- Complete workflow from TUI startup to CLI mastery
- Professional recording pipeline with asciinema + GIF conversion

🛡️ Security & Quality:
- Complete .gitignore with personal data protection
- Dependency optimization (removed python-dotenv)
- Code quality validation and educational test suite
- Agent-reviewed architecture and documentation

Ready for production use - copy folder, run ./rag-mini, start searching!
Getting Started with FSS-Mini-RAG
Step 1: Installation
Choose your installation based on what you want:
Option A: Ollama Only (Recommended)
# Install Ollama first
curl -fsSL https://ollama.ai/install.sh | sh
# Pull the embedding model
ollama pull nomic-embed-text
# Install Python dependencies
pip install -r requirements.txt
Option B: Full ML Stack
# Install everything including PyTorch
pip install -r requirements-full.txt
Step 2: Test Installation
# Index a project to verify the install (any directory works, including this repo)
./rag-mini index ~/my-project
# Search for something
./rag-mini search ~/my-project "chunker function"
# Check what got indexed
./rag-mini status ~/my-project
Step 3: Index Your First Project
# Index any project directory
./rag-mini index /path/to/your/project
# The system creates .claude-rag/ directory with:
# - config.json (settings)
# - manifest.json (file tracking)
# - database.lance/ (vector database)
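If you want to verify the index was actually written, you can peek at these files directly. A minimal sketch in Python, assuming manifest.json is plain JSON (its exact schema may differ between versions):

import json
from pathlib import Path

rag_dir = Path("/path/to/your/project") / ".claude-rag"

# Count whatever entries the manifest tracks and confirm the vector DB exists
manifest = json.loads((rag_dir / "manifest.json").read_text())
print(f"Entries tracked in manifest: {len(manifest)}")
print(f"Vector database present: {(rag_dir / 'database.lance').exists()}")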
Step 4: Search Your Code
# Basic semantic search
./rag-mini search /path/to/project "user login logic"
# Enhanced search with smart features
./rag-mini-enhanced search /path/to/project "authentication"
# Find similar patterns
./rag-mini-enhanced similar /path/to/project "def validate_input"
Step 5: Customize Configuration
Edit your project's .claude-rag/config.json:
{
"chunking": {
"max_size": 3000,
"strategy": "semantic"
},
"files": {
"min_file_size": 100
}
}
Then re-index to apply changes:
./rag-mini index /path/to/project --force
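If you prefer to script configuration changes, here is a minimal sketch that edits the same keys shown above (assuming config.json is plain JSON; remember to re-index with --force afterwards):

import json
from pathlib import Path

config_path = Path("/path/to/project") / ".claude-rag" / "config.json"
config = json.loads(config_path.read_text())

# Adjust the settings from the example above
config.setdefault("chunking", {})["max_size"] = 3000
config.setdefault("files", {})["min_file_size"] = 100

config_path.write_text(json.dumps(config, indent=2))
# Then run: ./rag-mini index /path/to/project --force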
Common Use Cases
Find Functions by Name
./rag-mini search /project "function named connect_to_database"
Find Code Patterns
./rag-mini search /project "error handling try catch"
./rag-mini search /project "database query with parameters"
Find Configuration
./rag-mini search /project "database connection settings"
./rag-mini search /project "environment variables"
Find Documentation
./rag-mini search /project "how to deploy"
./rag-mini search /project "API documentation"
Python API Usage
from claude_rag import ProjectIndexer, CodeSearcher, CodeEmbedder
from pathlib import Path
# Initialize
project_path = Path("/path/to/your/project")
embedder = CodeEmbedder()
indexer = ProjectIndexer(project_path, embedder)
searcher = CodeSearcher(project_path, embedder)
# Index the project
print("Indexing project...")
result = indexer.index_project()
print(f"Indexed {result['files_processed']} files, {result['chunks_created']} chunks")
# Search
print("\nSearching for authentication code...")
results = searcher.search("user authentication logic", limit=5)
for i, result in enumerate(results, 1):
    print(f"\n{i}. {result.file_path}")
    print(f"   Score: {result.score:.3f}")
    print(f"   Type: {result.chunk_type}")
    print(f"   Content: {result.content[:100]}...")
Advanced Features
Auto-optimization
# Get optimization suggestions
./rag-mini-enhanced analyze /path/to/project
# This analyzes your codebase and suggests:
# - Better chunk sizes for your language mix
# - Streaming settings for large files
# - File filtering optimizations
File Watching
from claude_rag import FileWatcher
# Watch for file changes and auto-update index
watcher = FileWatcher(project_path, indexer)
watcher.start_watching()
# Now any file changes automatically update the index
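In a standalone script you usually need to keep the process alive while the watcher runs. A minimal sketch, assuming start_watching() returns immediately and watching continues in the background:

import time

try:
    while True:
        time.sleep(1)  # idle loop; the watcher updates the index as files change
except KeyboardInterrupt:
    print("Stopping watcher")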
Custom Chunking
from claude_rag import CodeChunker
chunker = CodeChunker()
# Chunk a Python file
with open("example.py") as f:
    content = f.read()

chunks = chunker.chunk_text(content, "python", "example.py")

for chunk in chunks:
    print(f"Type: {chunk.chunk_type}")
    print(f"Content: {chunk.content}")
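Continuing from the chunks list above, you can filter chunks by type before doing anything else with them. The "function" label below is an assumption; inspect chunk.chunk_type on your own files to see the actual values:

# Keep only chunks that look like functions (label name is assumed, not guaranteed)
function_chunks = [c for c in chunks if "function" in str(c.chunk_type)]
print(f"{len(function_chunks)} of {len(chunks)} chunks look like functions")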
Tips and Best Practices
For Better Search Results
- Use descriptive phrases: "function that validates email addresses"
- Try different phrasings if the first search doesn't work (see the sketch after this list)
- Search for concepts, not just exact variable names
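For example, you can loop over a few phrasings with the Python API shown earlier and compare the top hits. A short sketch using only the documented CodeEmbedder/CodeSearcher calls:

from pathlib import Path
from claude_rag import CodeEmbedder, CodeSearcher

embedder = CodeEmbedder()
searcher = CodeSearcher(Path("/path/to/your/project"), embedder)

# Try several phrasings of the same question and report the best hit for each
queries = [
    "function that validates email addresses",
    "email validation",
    "check email format",
]
for query in queries:
    results = searcher.search(query, limit=3)
    if results:
        best = results[0]
        print(f"'{query}' -> {best.file_path} (score {best.score:.3f})")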
For Better Indexing
- Exclude build directories such as node_modules/, build/, and dist/
- Include documentation files - they often contain valuable context
- Use the semantic chunking strategy for most projects
For Configuration
- Start with default settings
- Use the analyze command to get optimization suggestions
- Increase chunk size for larger functions/classes
- Decrease chunk size for more granular search
For Troubleshooting
- Check ./rag-mini status to see what was indexed
- Look at .claude-rag/manifest.json for file details
- Run with --force to completely rebuild the index
- Check logs in the .claude-rag/ directory for errors
What's Next?
- Try the test suite to understand how components work: python -m pytest tests/ -v
- Look at the examples in the examples/ directory
- Read the main README.md for complete technical details
- Customize the system for your specific project needs