sam-4screen-desktop 2026-5-16:17:16:54

This commit is contained in:
2026-05-16 17:16:54 +10:00
parent 3baf6f09d3
commit 3879f053dc
6 changed files with 161 additions and 19 deletions

View File

@@ -0,0 +1,153 @@
---
created: 2026-05-16 17:02
modified: 2026-05-16 17:02
type: note
tags:
- ai dev-ops website iframe
aliases: []
id: 1778914902-WMFA
---
# [[1778914902-WMFA]]
# Local Hybrid Vector + Graph RAG Setup via Caddy & Docker
This document outlines the architecture and configuration files required to run a single, unified local RAG system (Vector search for static files + Graph search for Obsidian notes) served inside an iframe across three separate context-specific showcase websites (`devops.local`, `coding.local`, `ai.local`).
---
## 1. Network Routing (`Caddyfile`)
This configuration uses a `Caddyfile` snippets to proxy your backend container while securely handling cross-origin iframe security rules (`Content-Security-Policy`).
```caddy
# Core AI RAG Application Backend
ai.local {
reverse_proxy localhost:8000
header {
# Restrict iframe rendering specifically to your 3 interest domains
Content-Security-Policy "frame-ancestors 'self' https://devops.local https://coding.local https://ai-site.local"
# Standard security hardening
X-Content-Type-Options "nosniff"
Referrer-Policy "strict-origin-when-cross-origin"
}
}
# Example Configuration Blocks for Frontend Sites
devops.local {
root * /var/www/devops_site
file_server
}
coding.local {
root * /var/www/coding_site
file_server
}
ai-site.local {
root * /var/www/ai_site
file_server
}
```
---
## 2. Infrastructure Layer (`docker-compose.yml`)
The app runs out of a localized, slimmed-down Python environment container. Underlying vector files and Graph databases are explicitly mounted as **read-only** (`:ro`) to guarantee stability against prompt manipulation.
```yaml
version: '3.8'
services:
unified-ai-rag:
image: python:3.11-slim
container_name: local_ai_rag
working_dir: /app
volumes:
# Mount application scripts
- ./app:/app
# Mount databases and notes securely as READ-ONLY
- ./db/semantic_rag:/app/db/semantic_rag:ro
- ./db/obsidian_graph.gpickle:/app/db/obsidian_graph.gpickle:ro
ports:
- "8000:8000"
environment:
- OPENAI_API_KEY=your_openai_api_key_here
- ANTHROPIC_API_KEY=your_anthropic_api_key_here
command: ["pip", "install", "-r", "requirements.txt", "&&", "chainlit", "run", "app.py", "--port", "8000"]
restart: unless-stopped
```
---
## 3. Web UI Application & Context Controller (`app.py`)
This Chainlit-driven script reads incoming referrers or basic text prompts to shift its personality depending on which website the recruiter came from.
```python
import chainlit as cl
import networkx as nx
from chromadb import PersistentClient
# 1. Initialization hooks for Vector and Graph layers (Read-Only)
def load_databases():
vector_client = PersistentClient(path="/app/db/semantic_rag")
vector_layer = vector_client.get_collection(name="resume_and_docs")
graph_layer = nx.read_gpickle("/app/db/obsidian_graph.gpickle")
return vector_layer, graph_layer
vector_db, graph_db = load_databases()
@cl.on_chat_start
async def start():
# Detect the site referring the frame to adjust persona
http_headers = cl.user_session.get("http_headers", {})
referer = http_headers.get("referer", "")
system_prompt = "You are a helpful AI assistant reviewing my portfolio data."
welcome_msg = "Hello! Ask me any questions about my profile or experience."
if "devops.local" in referer:
system_prompt = "Persona: DevOps Engineer. Focus heavily on infrastructure, IoT architecture, CI/CD, and server logs."
welcome_msg = "Welcome Recruiter! Ask me anything about my DevOps automation and IoT infrastructure."
elif "coding.local" in referer:
system_prompt = "Persona: Software Engineer. Emphasize backend code, software design patterns, and clean programming methodologies."
welcome_msg = "Hello! Let's talk about my development portfolio and technical code paradigms."
elif "ai-site.local" in referer:
system_prompt = "Persona: AI/RAG Specialist. Discuss custom embedding techniques, semantic lookups, and graph networks."
welcome_msg = "Greetings! Feel free to pick my brain about graph-based indexing and large language models."
cl.user_session.set("system_prompt", system_prompt)
await cl.Message(content=welcome_msg).send()
@cl.on_message
async def main(message: cl.Message):
query = message.content
sys_prompt = cl.user_session.get("system_prompt")
# Executing the dual pass strategy
# (Extract semantic chunks + pull Obsidian link neighbors from the graph)
# Synthesize outputs down through the LLM context window here...
response_text = f"Processed query using profile persona contextualized framework rules."
await cl.Message(content=response_text).send()
```
---
## 4. Frontend Integration (`iframe`)
Embed this string block directly inside your target HTML sites:
```html
<iframe
src="https://ai.local"
style="width: 100%; height: 650px; border: 1px solid #ccc; border-radius: 8px;"
allow="clipboard-read; clipboard-write">
</iframe>
```