microdao-daarion

Files

Apple 00f9102e50 feat: add Ollama runtime support and RAG implementation plan

Ollama Runtime:
- Add ollama_client.py for Ollama API integration
- Support for dots-ocr model via Ollama
- Add OLLAMA_BASE_URL configuration
- Update inference.py to support Ollama runtime (RUNTIME_TYPE=ollama)
- Update endpoints to handle async Ollama calls
- Alternative to local transformers model

RAG Implementation Plan:
- Create TODO-RAG.md with detailed Haystack integration plan
- Document Store setup (pgvector)
- Embedding model selection
- Ingest pipeline (PARSER → RAG)
- Query pipeline (RAG → LLM)
- Integration with DAGI Router
- Bot commands (/upload_doc, /ask_doc)
- Testing strategy

Now supports three runtime modes:
1. Local transformers (RUNTIME_TYPE=local)
2. Ollama (RUNTIME_TYPE=ollama)
3. Dummy (USE_DUMMY_PARSER=true)

2025-11-16 02:56:36 -08:00

memory-service

fix: synchronize all metadata fields to meta in schemas

2025-11-15 12:31:01 -08:00

parser-service

feat: add Ollama runtime support and RAG implementation plan

2025-11-16 02:56:36 -08:00

stt-service

refactor: rewrite STT service to use qwen3_asr_toolkit Python API

2025-11-15 12:55:21 -08:00