update documentation
This commit is contained in:
@@ -39,21 +39,21 @@ All external access is routed through **Caddy** (reverse proxy) with **Authelia*
|
||||
|------|--------|
|
||||
| GPU | NVIDIA RTX A4000 |
|
||||
| Role | Primary AI inference node |
|
||||
| Key Services | Ollama (inference) |
|
||||
| Key Services | llama-server (llama.cpp), Inference Service |
|
||||
|
||||
### Mini PC 1 — Media Node (`192.168.0.81`)
|
||||
| Spec | Detail |
|
||||
|------|--------|
|
||||
| GPU | NVIDIA RTX 5050 |
|
||||
| Role | Media services, embeddings, vector storage |
|
||||
| Key Services | Jellyfin, Nextcloud, Qdrant, arr stack, NexusAI memory/embedding |
|
||||
| Key Services | Jellyfin, Nextcloud, Qdrant, arr stack, NexusAI memory/embedding, Ollama |
|
||||
| Storage | NVMe (OS) + 3x external HDDs (see [Storage Layout](#storage-layout)) |
|
||||
|
||||
### Mini PC 2 — Infrastructure Node (`192.168.0.205`)
|
||||
| Spec | Detail |
|
||||
|------|--------|
|
||||
| Role | Network management, monitoring, auth, DNS, git |
|
||||
| Key Services | Caddy, Authelia, Tailscale, Pihole, Grafana, Gitea |
|
||||
| Role | Network management, monitoring, auth, DNS, git, NexusAI orchestration |
|
||||
| Key Services | Caddy, Authelia, Tailscale, Pihole, Grafana, Gitea, NexusAI orchestration |
|
||||
| Storage | NVMe (OS only) |
|
||||
|
||||
---
|
||||
@@ -155,7 +155,8 @@ All external access is routed through **Caddy** (reverse proxy) with **Authelia*
|
||||
|
||||
| Service | Notes |
|
||||
|---------|-------|
|
||||
| Ollama | Runs LLM inference using the RTX A4000. Also serves `nomic-embed-text` embeddings (768-dim vectors) consumed by NexusAI's embedding service on Mini PC 1. |
|
||||
| llama-server (llama.cpp) | Primary LLM inference using the RTX A4000. Started manually before the inference service. Serves the OpenAI-compatible API on port 8080. |
|
||||
| Ollama | Serves `nomic-embed-text` embeddings (768-dim vectors) consumed by NexusAI's embedding service on Mini PC 1. |
|
||||
|
||||
---
|
||||
|
||||
@@ -234,7 +235,7 @@ Phase 1 focused on establishing a stable, secure, and observable foundation:
|
||||
- ✅ Self-hosted git (Gitea)
|
||||
- ✅ Media stack fully operational (Jellyfin, arr stack, Nextcloud)
|
||||
- ✅ Download pipeline with VPN isolation (Gluetun + qBittorrent)
|
||||
- ✅ NexusAI foundation services running (Qdrant, Ollama)
|
||||
- ✅ NexusAI foundation services running (Qdrant, Ollama, llama.cpp)
|
||||
- ✅ Container management across nodes (Portainer + agent)
|
||||
|
||||
---
|
||||
@@ -249,6 +250,6 @@ Phase 2 shifts focus to resilience, security hardening, and smart home integrati
|
||||
- **Additional security hardening** — Audit exposed services, tighten firewall rules, review Authelia policies
|
||||
- **IP webcam integration** — Add camera feeds into the homelab ecosystem
|
||||
- **Home Assistant** — Integrate smart home automation and sensor data
|
||||
- **Continued NexusAI development** — Entities layer, embedding service, inference and orchestration buildout
|
||||
- **Continued NexusAI development** — Entity extraction pipeline, summaries layer, SettingsView implementation
|
||||
|
||||
> This section will be expanded as Phase 2 planning matures.
|
||||
Reference in New Issue
Block a user