microdao-daarion

Author	SHA1	Message	Date
Apple	e9dedffa48	feat(production): sync all modified production files to git Includes updates across gateway, router, node-worker, memory-service, aurora-service, swapper, sofiia-console UI and node2 infrastructure: - gateway-bot: Dockerfile, http_api.py, druid/aistalk prompts, doc_service - services/router: main.py, router-config.yml, fabric_metrics, memory_retrieval, offload_client, prompt_builder - services/node-worker: worker.py, main.py, config.py, fabric_metrics - services/memory-service: Dockerfile, database.py, main.py, requirements - services/aurora-service: main.py (+399), kling.py, quality_report.py - services/swapper-service: main.py, swapper_config_node2.yaml - services/sofiia-console: static/index.html (console UI update) - config: agent_registry, crewai_agents/teams, router_agents - ops/fabric_preflight.sh: updated preflight checks - router-config.yml, docker-compose.node2.yml: infra updates - docs: NODA1-AGENT-ARCHITECTURE, fabric_contract updated Made-with: Cursor	2026-03-03 07:13:29 -08:00
Apple	9a36020316	P3.5-P3.7: 2-layer inventory, capability routing, STT/TTS adapters, Dev Contract NCS: - _collect_worker_caps() fetches capability flags from node-worker /caps - _derive_capabilities() merges served model types + worker provider flags - installed_artifacts replaces inventory_only (disk scan with DISK_SCAN_PATHS env) - New endpoints: /capabilities/caps, /capabilities/installed Node Worker: - STT_PROVIDER, TTS_PROVIDER, OCR_PROVIDER, IMAGE_PROVIDER env flags - /caps endpoint returns capabilities + providers for NCS aggregation - STT adapter (providers/stt_mlx_whisper.py) — remote + local mode - TTS adapter (providers/tts_mlx_kokoro.py) — remote + local mode - OCR handler via vision_prompted (ollama_vision with OCR prompt) - NATS subjects: node.{id}.stt/tts/ocr/image.request Router: - POST /v1/capability/{stt,tts,ocr,image} — capability-based offload routing - GET /v1/capabilities — global view with capabilities_by_node - require_fresh_caps(ttl) preflight guard - find_nodes_with_capability(cap) + load-based node selection Ops: - ops/fabric_snapshot.py — full runtime snapshot collector - ops/fabric_preflight.sh — quick check + snapshot save + diff - docs/fabric_contract.md — Dev Contract v0.1 (preflight-first) - tests/test_fabric_contract.py — CI enforcement (6 tests) Made-with: Cursor	2026-02-27 05:24:09 -08:00
Apple	194c87f53c	feat(fabric): decommission Swapper from critical path, NCS = source of truth - Node Worker: replace swapper_vision with ollama_vision (direct Ollama API) - Node Worker: add NATS subjects for stt/tts/image (stubs ready) - Node Worker: remove SWAPPER_URL dependency from config - Router: vision calls go directly to Ollama /api/generate with images - Router: local LLM calls go directly to Ollama /api/generate - Router: add OLLAMA_URL and PREFER_NODE_WORKER=true feature flag - Router: /v1/models now uses NCS global capabilities pool - NCS: SWAPPER_URL="" -> skip Swapper probing (status=disabled) - Swapper configs: remove all hardcoded model lists, keep only runtime URLs, timeouts, limits - docker-compose.node1.yml: add OLLAMA_URL, PREFER_NODE_WORKER for router; SWAPPER_URL= for NCS; remove swapper-service from node-worker depends_on - docker-compose.node2-sofiia.yml: same changes for NODA2 Swapper service still runs but is NOT in the critical inference path. Source of truth for models is now NCS -> Ollama /api/tags. Made-with: Cursor	2026-02-27 04:16:16 -08:00
Apple	a6531507df	merge: integrate remote codex/sync-node1-runtime with fabric layer changes Resolve conflicts in docker-compose.node1.yml, services/router/main.py, and gateway-bot/services/doc_service.py — keeping both fabric layer (NCS, node-worker, Prometheus) and document ingest/query endpoints. Made-with: Cursor	2026-02-27 03:09:12 -08:00
Apple	ed7ad49d3a	P3.2+P3.3+P3.4: NODA1 node-worker + NATS auth config + Prometheus counters P3.2 — Multi-node deployment: - Added node-worker service to docker-compose.node1.yml (NODE_ID=noda1) - NCS NODA1 now has NODE_WORKER_URL for metrics collection - Fixed NODE_ID consistency: router NODA1 uses 'noda1' - NODA2 node-worker/NCS gets NCS_REPORT_URL for latency reporting P3.3 — NATS accounts/auth (opt-in config): - config/nats-server.conf with 3 accounts: SYS, FABRIC, APP - Per-user topic permissions (router, ncs, node_worker) - Leafnode listener :7422 with auth - Not yet activated (requires credential provisioning) P3.4 — Prometheus counters: - Router /fabric_metrics: caps_refresh, caps_stale, model_select, offload_total, breaker_state, score_ms histogram - Node Worker /prom_metrics: jobs_total, inflight gauge, latency_ms histogram - NCS /prom_metrics: runtime_health, runtime_p50/p95, node_wait_ms - All bound to 127.0.0.1 (not externally exposed) Made-with: Cursor	2026-02-27 03:03:18 -08:00
Apple	c4b94a327d	P2.2+P2.3: NATS offload node-worker + router offload integration Node Worker (services/node-worker/): - NATS subscriber for node.{NODE_ID}.llm.request / vision.request - Canonical JobRequest/JobResponse envelope (Pydantic) - Idempotency cache (TTL 10min) with inflight dedup - Deadline enforcement (DEADLINE_EXCEEDED on expired jobs) - Concurrency limiter (semaphore, returns busy) - Ollama + Swapper vision providers Router offload (services/router/offload_client.py): - NATS req/reply with configurable retries - Circuit breaker per node+type (3 fails/60s → open 120s) - Concurrency semaphore for remote requests Model selection (services/router/model_select.py): - exclude_nodes parameter for circuit-broken nodes - force_local flag for fallback re-selection - Integrated circuit breaker state awareness Router /infer pipeline: - Remote offload path when NCS selects remote node - Automatic fallback: exclude failed node → force_local re-select - Deadline propagation from router to node-worker Tests: 17 unit tests (idempotency, deadline, circuit breaker) Docs: ops/offload_routing.md (subjects, envelope, verification) Made-with: Cursor	2026-02-27 02:44:05 -08:00
Apple	a92c424845	P2: Global multi-node model selection + NCS on NODA1 Architecture for 150+ nodes: - global_capabilities_client.py: NATS scatter-gather discovery using wildcard subject node.*.capabilities.get — zero static node lists. New nodes auto-register by deploying NCS and subscribing to NATS. Dead nodes expire from cache after 3x TTL automatically. Multi-node model_select.py: - ModelSelection now includes node, local, via_nats fields - select_best_model prefers local candidates, then remote - Prefer list resolution: local first, remote second - All logged per request: node, runtime, model, local/remote NODA1 compose: - Added node-capabilities service (NCS) to docker-compose.node1.yml - NATS subscription: node.noda1.capabilities.get - Router env: NODE_CAPABILITIES_URL + ENABLE_GLOBAL_CAPS_NATS=true NODA2 compose: - Router env: ENABLE_GLOBAL_CAPS_NATS=true Router main.py: - Startup: initializes global_capabilities_client (NATS connect + first discovery). Falls back to local-only capabilities_client if unavailable. - /infer: uses get_global_capabilities() for cross-node model pool - Offload support: send_offload_request(node_id, type, payload) via NATS Verified on NODA2: - Global caps: 1 node, 14 models (NODA1 not yet deployed) - Sofiia: cloud_grok → grok-4-1-fast-reasoning (OK) - Helion: NCS → qwen3:14b local (OK) - When NODA1 deploys NCS, its models appear automatically via NATS discovery Made-with: Cursor	2026-02-27 02:26:12 -08:00
Apple	89c3f2ac66	P1: NCS-first model selection + NATS capabilities + Grok 4.1 Router model selection: - New model_select.py: resolve_effective_profile → profile_requirements → select_best_model pipeline. NCS-first with graceful static fallback. - selection_policies in router-config.node2.yml define prefer order per profile without hardcoding models (e.g. local_default_coder prefers qwen3:14b then qwen3.5:35b-a3b). - Cloud profiles (cloud_grok, cloud_deepseek) skip NCS; on cloud failure use fallback_profile via NCS for local selection. - Structured logs: selected_profile, required_type, runtime, model, caps_age_s, fallback_reason on every infer request. Grok model fix: - grok-2-1212 no longer exists on xAI API → updated to grok-4-1-fast-reasoning across all 3 hardcoded locations in main.py and router-config.node2.yml. NCS NATS request/reply: - node-capabilities subscribes to node.noda2.capabilities.get (NATS request/reply). Enabled via ENABLE_NATS_CAPS=true in compose. - NODA1 router can query NODA2 capabilities over NATS leafnode without HTTP connectivity. Verified: - NCS: 14 served models from Ollama+Swapper+llama-server - NATS: request/reply returns full capabilities JSON - Sofiia: cloud_grok → grok-4-1-fast-reasoning (tested, 200 OK) - Helion: NCS → qwen3:14b via Ollama (caps_age=23.7s cache hit) - Router health: ok Made-with: Cursor	2026-02-27 02:17:34 -08:00
Apple	e2a3ae342a	node2: fix Sofiia routing determinism + Node Capabilities Service Bug fixes: - Bug A: GROK_API_KEY env mismatch — router expected GROK_API_KEY but only XAI_API_KEY was present. Added GROK_API_KEY=${XAI_API_KEY} alias in compose. - Bug B: 'grok' profile missing in router-config.node2.yml — added cloud_grok profile (provider: grok, model: grok-2-1212). Sofiia now has default_llm=cloud_grok with fallback_llm=local_default_coder. - Bug C: Router silently defaulted to cloud DeepSeek when profile was unknown. Now falls back to agent.fallback_llm or local_default_coder with WARNING log. Hardcoded Ollama URL (172.18.0.1) replaced with config-driven base_url. New service: Node Capabilities Service (NCS) - services/node-capabilities/ — FastAPI microservice exposing live model inventory from Ollama, Swapper, and llama-server. - GET /capabilities — canonical JSON with served_models[] and inventory_only[] - GET /capabilities/models — flat list of served models - POST /capabilities/refresh — force cache refresh - Cache TTL 15s, bound to 127.0.0.1:8099 - services/router/capabilities_client.py — async client with TTL cache Artifacts: - ops/node2_models_audit.md — 3-layer model view (served/disk/cloud) - ops/node2_models_audit.yml — machine-readable audit - ops/node2_capabilities_example.json — sample NCS output (14 served models) Made-with: Cursor	2026-02-27 02:07:40 -08:00
NODA1 System	f53e71a0f4	feat(docs): add versioned document update and versions APIs	2026-02-21 16:49:24 +01:00
NODA1 System	5d52cf81c4	feat(docs): add standard file processing and router document ingest/query	2026-02-21 14:02:59 +01:00
NODA1 System	f44e920486	agromatrix: enforce mentor auth and expose shared-memory review via gateway	2026-02-21 13:18:36 +01:00
NODA1 System	68ac8fa355	agromatrix: add shared-memory review api and crawl4ai robustness	2026-02-21 13:18:36 +01:00
NODA1 System	01bfa97783	agromatrix: tighten numeric source contract guard	2026-02-21 13:18:36 +01:00
NODA1 System	d963c52fe5	agromatrix: add pending-question memory, anti-repeat guard, and numeric contract	2026-02-21 13:18:36 +01:00
NODA1 System	a87a1fe52c	agromatrix: deterministic plant-id flow + confidence guard + plantnet env	2026-02-21 13:18:36 +01:00
NODA1 System	50dfcd7390	router: enforce direct image inputs for plant tools and inject runtime image_data	2026-02-21 13:18:36 +01:00
NODA1 System	a91309de11	agromatrix: deploy context/photo learning + deterministic excel policy	2026-02-21 13:18:36 +01:00
NODA1 System	a8a153a87a	router: add tool manager runtime and memory retrieval updates	2026-02-20 17:56:33 +01:00
Apple	de8bb36462	docs+router: formalize runtime policy and remove temporary cloud-first code override	2026-02-18 10:40:40 -08:00
Apple	05435e7fad	router: bypass local routing rules for cloud-first agents	2026-02-18 10:28:53 -08:00
Apple	ef59cb0950	router: enforce cloud-first direct path for top-level and monitor agents	2026-02-18 10:26:29 -08:00
Apple	635f2d7e37	helion: deepseek-first, on-demand CrewAI, local subagent profiles, concise post-synthesis	2026-02-18 09:21:47 -08:00
Apple	b9f83a5006	Sync NODE1 runtime config for Sofiia monitor + Clan canary fixes	2026-02-18 05:56:21 -08:00
Apple	21576f0ca3	node1: add universal file tool, gateway document delivery, and sync runbook	2026-02-15 01:50:37 -08:00
Apple	7f3ee700a4	fix(router): guard DSML tool-call flows Prevent DeepSeek DSML from leaking to users and avoid returning raw memory_search/web results when DSML is detected. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-10 04:19:57 -08:00
Apple	7887f7cbe9	fix: DSML fallback — 3rd LLM call for clean synthesis + think tag stripping Router (main.py): - When DSML detected in 2nd LLM response after tool execution, make a 3rd LLM call with explicit synthesis prompt instead of returning raw tool results to the user - Falls back to format_tool_calls_for_response only if 3rd call fails Router (tool_manager.py): - Added _strip_think_tags() helper for <think>...</think> removal from DeepSeek reasoning artifacts Gateway (http_api.py): - Strip <think>...</think> tags before sending to Telegram - Strip DSML/XML-like markup (function_calls, invoke, parameter tags) - Ensure empty text after stripping gets "..." fallback Deployed to NODE1 and verified services running. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-09 10:30:37 -08:00
Apple	acceac6929	fix: helion string literal + memory brief anti-echo in Router - Fixed unquoted `helion` variable reference to string literal `"helion"` in tool_manager.py search_memories fallback - Replaced `[Контекст пам'яті]` with `[INTERNAL MEMORY - do NOT repeat to user]` in all 3 injection points in main.py - Verified: Senpai now responds without Helion contamination or memory brief leaking Tested and deployed on NODE1. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-09 10:05:25 -08:00
Apple	b9f7ca8ecf	fix(critical): Senpai using Helion's memory — 3 root causes fixed 1. YAML structure bug: Senpai was in `policies:` instead of `agents:` in router-config.yml. Router couldn't find Senpai config → no routing rule → fallback to local model. 2. tool_manager agent_id not passed: memory_search and graph_query tools were called without agent_id → defaulted to "helion" → ALL agents' tool calls searched Helion's Qdrant collections. Fixed: agent_id now flows from main.py → execute_tool → _memory_search. 3. Config not mounted: router-config.yml was baked into Docker image, host changes had no effect. Added volume mount in docker-compose. Also added: - Sofiia agent config + routing rule (was completely missing) - Senpai routing rule: cloud_deepseek (was falling to local qwen3:8b) - Anti-echo instruction for memory brief injection Deployed and verified on NODE1: Senpai now searches senpai_* collections. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-09 10:00:08 -08:00
Apple	ef3473db21	snapshot: NODE1 production state 2026-02-09 Complete snapshot of /opt/microdao-daarion/ from NODE1 (144.76.224.179). This represents the actual running production code that has diverged significantly from the previous main branch. Key changes from old main: - Gateway (http_api.py): expanded from ~40KB to 164KB with full agent support - Router: new /v1/agents/{id}/infer endpoint with vision + DeepSeek routing - Behavior Policy: SOWA v2.2 (3-level: FULL/ACK/SILENT) - Agent Registry: config/agent_registry.yml as single source of truth - 13 agents configured (was 3) - Memory service integration - CrewAI teams and roles Excluded from snapshot: venv/, .env, data/, backups, .tgz archives Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-09 08:46:46 -08:00
Apple	0c8bef82f4	feat: Add Alateya, Clan, Eonarch agents + fix gateway-router connection ## Agents Added - Alateya: R&D, biotech, innovations - Clan (Spirit): Community spirit agent - Eonarch: Consciousness evolution agent ## Changes - docker-compose.node1.yml: Added tokens for all 3 new agents - gateway-bot/http_api.py: Added configs and webhook endpoints - gateway-bot/clan_prompt.txt: New prompt file - gateway-bot/eonarch_prompt.txt: New prompt file ## Fixes - Fixed ROUTER_URL from :9102 to :8000 (internal container port) - All 9 Telegram agents now working ## Documentation - Created PROJECT-MASTER-INDEX.md - single entry point - Added various status documents and scripts Tokens configured: - Helion, NUTRA, Agromatrix (existing) - Alateya, Clan, Eonarch (new) - Druid, GreenFood, DAARWIZZ (configured)	2026-01-28 06:40:34 -08:00
Apple	5290287058	feat: implement TTS, Document processing, and Memory Service /facts API - TTS: xtts-v2 integration with voice cloning support - Document: docling integration for PDF/DOCX/PPTX processing - Memory Service: added /facts/upsert, /facts/{key}, /facts endpoints - Added required dependencies (TTS, docling)	2026-01-17 08:16:37 -08:00
Apple	bca81dc719	feat: Node Self-Healing, DAGI Audit, Agent Prompts, Infra Invariants ### Backend (city-service) - Node Registry + Self-Healing API (migration 039) - Improved get_all_nodes() with robust fallback for node_registry/node_cache - Agent Prompts Runtime API for DAGI Router integration - DAGI Router Audit endpoints (phantom/stale detection) - Node Agents API (Guardian/Steward) - Node metrics extended (CPU/GPU/RAM/Disk) ### Frontend (apps/web) - Node Directory with improved error handling - Node Cabinet with metrics cards - DAGI Router Card component - Node Metrics Card component - useDAGIAudit hook ### Scripts - check-invariants.py - deploy verification - node-bootstrap.sh - node self-registration - node-guardian-loop.py - continuous self-healing - dagi_agent_audit.py - DAGI audit utility ### Migrations - 034: Agent prompts seed - 035: Agent DAGI audit - 036: Node metrics extended - 037: Node agents complete - 038: Agent prompts full coverage - 039: Node registry self-healing ### Tests - test_infra_smoke.py - test_agent_prompts_runtime.py - test_dagi_router_api.py ### Documentation - DEPLOY_CHECKLIST_2024_11_30.md - Multiple TASK_PHASE docs	2025-11-30 13:52:01 -08:00
Apple	776ab3a7ba	feat: add node-registry dashboard, matrix-presence-aggregator, ocr-service updates	2025-11-28 05:23:14 -08:00
Apple	6bd769ef40	feat(city-map): Add 2D City Map with coordinates and agent presence - Add migration 013_city_map_coordinates.sql with map coordinates, zones, and agents table - Add /city/map API endpoint in city-service - Add /city/agents and /city/agents/online endpoints - Extend presence aggregator to include agents[] in snapshot - Add AgentsSource for fetching agent data from DB - Create CityMap component with interactive room tiles - Add useCityMap hook for fetching map data - Update useGlobalPresence to include agents - Add map/list view toggle on /city page - Add agent badges to room cards and map tiles	2025-11-27 07:00:47 -08:00
Apple	3de3c8cb36	feat: Add presence heartbeat for Matrix online status - matrix-gateway: POST /internal/matrix/presence/online endpoint - usePresenceHeartbeat hook with activity tracking - Auto away after 5 min inactivity - Offline on page close/visibility change - Integrated in MatrixChatRoom component	2025-11-27 00:19:40 -08:00

36 Commits
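
Note on commit c4b94a327d: the message describes a circuit breaker "per node+type (3 fails/60s → open 120s)" whose state feeds the `exclude_nodes` parameter of model selection. The sketch below is one possible reading of that rule, not the code in `services/router/offload_client.py`. The class name, method names, and the injectable `clock` parameter are all hypothetical; only the three thresholds come from the commit message.

```python
import time
from collections import deque
from typing import Callable, Deque, Dict, Set, Tuple

Key = Tuple[str, str]  # (node_id, job_type), e.g. ("noda1", "llm")


class CircuitBreaker:
    """Hypothetical per-(node, job type) breaker; names are illustrative only."""

    def __init__(
        self,
        max_fails: int = 3,        # "3 fails" from the commit message
        window_s: float = 60.0,    # "/60s" failure-counting window
        open_s: float = 120.0,     # "open 120s" cool-down
        clock: Callable[[], float] = time.monotonic,  # assumption: injectable for tests
    ) -> None:
        self.max_fails = max_fails
        self.window_s = window_s
        self.open_s = open_s
        self.clock = clock
        self._fails: Dict[Key, Deque[float]] = {}
        self._open_until: Dict[Key, float] = {}

    def allow(self, node: str, job_type: str) -> bool:
        """Return True if requests to this node/type may be sent."""
        key = (node, job_type)
        until = self._open_until.get(key)
        if until is None:
            return True
        if self.clock() >= until:
            # Cool-down elapsed: close the breaker and forget old failures.
            del self._open_until[key]
            self._fails.pop(key, None)
            return True
        return False

    def record_failure(self, node: str, job_type: str) -> None:
        """Count a failure and open the breaker once the threshold is reached."""
        key = (node, job_type)
        now = self.clock()
        fails = self._fails.setdefault(key, deque())
        fails.append(now)
        # Drop failures that fell out of the counting window.
        while fails and now - fails[0] > self.window_s:
            fails.popleft()
        if len(fails) >= self.max_fails:
            self._open_until[key] = now + self.open_s

    def record_success(self, node: str, job_type: str) -> None:
        """A success clears the failure history for this node/type."""
        self._fails.pop((node, job_type), None)

    def open_nodes(self, job_type: str) -> Set[str]:
        """Nodes currently circuit-broken for a job type."""
        now = self.clock()
        return {
            node
            for (node, jtype), until in self._open_until.items()
            if jtype == job_type and now < until
        }
```

Under this reading, a router would pass `breaker.open_nodes("llm")` as `exclude_nodes` before selecting a node, and call `record_failure` or `record_success` after each offload attempt. Keying on the `(node, type)` pair means a node whose vision path is failing can still receive LLM jobs.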