- PROJECT_BLUEPRINT.md: Full architecture with LightGBM, Qwen3, structured embeddings
- RESEARCH_FINDINGS.md: 2024 benchmarks, competition analysis, validation
- BUILD_INSTRUCTIONS.md: Step-by-step implementation guide
- README.md: User-friendly overview and quick start
- Research-backed hybrid ML/LLM email classifier
- 94-96% accuracy target, 17 min for 80k emails
- Privacy-first, local processing, distributable wheel
- Modular architecture with tiered dependencies
- LLM optional (graceful degradation)
- OpenAI-compatible API support
63 lines
546 B
Plaintext
# Python
__pycache__/
*.py[cod]
*$py.class
*.so
.Python
env/
venv/
*.egg-info/
dist/
build/

# Data and Models
data/training/
src/models/pretrained/*.pkl
src/models/pretrained/*.joblib
*.h5
*.joblib

# Credentials
.env
credentials/
*.json
!config/*.json
!config/*.yaml

# Logs
logs/*.log
*.log

# IDE
.vscode/
.idea/
*.swp
*.swo

# OS
.DS_Store
Thumbs.db

# Checkpoints
checkpoints/
*.checkpoint

# Results
results/
output/

# Pytest
.pytest_cache/
.coverage
htmlcov/

# MyPy
.mypy_cache/
.dmypy.json
dmypy.json

# Temporary files
*.tmp
*.bak
*~