RESEARCH INTELLIGENCE DASHBOARD
Graph-Powered Research Triage & Agentic Experimentation
Key Metrics
4 GRAPH ALGORITHMS
3 MATCHING TIERS
10 ANALYSIS FUNCTIONS
362 VAULT NOTES
3 INGESTION SOURCES
$0.15 COST PER EVAL
OVERVIEW
The Research Intelligence Dashboard surfaces actionable insights from automated newsletter ingestion, Instagram transcription, and an Obsidian knowledge vault. Rather than reading dozens of research items weekly and guessing which are relevant, the system applies graph algorithms to rank, cluster, and predict connections — then uses Claude API for on-demand relevance scoring.
The core insight: an Obsidian vault with wiki-links is a graph. By loading it into NetworkX, I can run PageRank to find which notes are structurally important, Louvain community detection to find topic clusters, and Adamic-Adar link prediction to suggest missing connections. These algorithms feed a three-tier smart matcher that surfaces research items by explicit links, keyword overlap, and graph-propagated discovery.
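The vault-as-graph idea can be sketched in a few lines of NetworkX. The toy graph below stands in for the real vault (note names and edges are invented for illustration); the three algorithm calls are the ones named above.

```python
import networkx as nx

# Toy stand-in for the vault: nodes are note names, directed edges are wiki-links.
G = nx.DiGraph()
G.add_edges_from([
    ("ProjectA", "PaperX"), ("ProjectA", "PaperY"),
    ("PaperX", "ProjectB"), ("PaperY", "ProjectB"),
    ("Note1", "ProjectA"),
])

# Structural importance of each note.
ranks = nx.pagerank(G)

# Topic clusters -- Louvain runs on the undirected projection.
communities = nx.community.louvain_communities(G.to_undirected(), seed=42)

# Missing-link suggestions -- Adamic-Adar also expects an undirected graph.
suggestions = sorted(
    nx.adamic_adar_index(G.to_undirected()),
    key=lambda t: t[2],
    reverse=True,
)[:10]
```

The same three outputs (ranks, communities, suggestions) are exactly what the smart matcher described below consumes.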
DATA INGESTION PIPELINE
Three automated pipelines feed structured data into the Obsidian vault:
Newsletter Ingestion — Claude Code scheduled tasks run every Monday (JournalClub) and Friday (TLDR). Each agent reads the newsletter from Gmail, downloads cited papers, writes a synthesis per paper, assesses relevance to active projects, and outputs structured markdown with wiki-links.
Instagram Ingestion — Instaloader fetches posts from followed agentic engineering creators. faster-whisper (base model, int8 quantized, CPU) transcribes audio. Claude Haiku extracts structured metadata — title, key points, and keywords with wiki-links to relevant projects. Output is written as vault notes with YAML frontmatter.
Knowledge Linker — Retroactive wiki-link injection across the entire vault. When a new project or concept is added, the linker scans every existing note and injects wiki-links for any unlinked mention. It uses 16 aliases per concept with longest-match-first replacement, and protects code blocks and existing links from rewriting. The vault's link density doubled on the first run.
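A minimal sketch of that replacement strategy, assuming a function shape of my own invention (the real linker's API is not shown here): fenced code blocks and existing [[links]] are split out so they are never rewritten, and aliases are tried longest-first.

```python
import re

def inject_links(text: str, aliases: list[str], target: str) -> str:
    """Replace alias mentions with [[target]] wiki-links.

    Longest-match-first alternation; fenced code blocks and existing
    [[...]] links are left untouched. Illustrative sketch only.
    """
    # Longest aliases first, so "retrieval augmented generation" wins over "generation".
    pattern = re.compile(
        "|".join(re.escape(a) for a in sorted(aliases, key=len, reverse=True))
    )
    out = []
    # Split out fenced code blocks so they are never rewritten.
    for part in re.split(r"(```.*?```)", text, flags=re.DOTALL):
        if part.startswith("```"):
            out.append(part)
            continue
        # Protect existing [[...]] links the same way.
        chunks = re.split(r"(\[\[.*?\]\])", part)
        out.append("".join(
            c if c.startswith("[[") else pattern.sub(f"[[{target}]]", c)
            for c in chunks
        ))
    return "".join(out)
```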
GRAPH ENGINE & ALGORITHMS
The graph engine loads the Obsidian vault via obsidiantools.Vault, builds a MultiDiGraph from wiki-links, and collapses it to a NetworkX DiGraph (removing self-loops and non-existent notes). Nodes are markdown note names; directed edges are wiki-links.
PageRank — nx.pagerank(G) ranks notes by structural importance. The centrality rank for each note is injected into Claude analysis prompts as graph context, so the LLM knows which projects are structurally central.
Louvain Community Detection — nx.community.louvain_communities(G.to_undirected(), seed=42) finds topic clusters. Community membership drives Tier 3 matching — if a new paper lands in the same community as a project, the system surfaces it regardless of keyword overlap.
Adamic-Adar Link Prediction — nx.adamic_adar_index() predicts missing links within a 3-hop neighborhood. Top 10 suggestions surface as recommendations.
Graph Health Metrics — node count, edge count, orphan count (degree 0), weakly connected components, and bridge count (critical links whose removal fragments the graph). Betweenness centrality is skipped for graphs with more than 1,000 nodes as a performance guard.
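The health metrics listed above map directly onto NetworkX calls. A sketch with a hypothetical `graph_health` helper (the real function name and return shape are assumptions):

```python
import networkx as nx

def graph_health(G: nx.DiGraph) -> dict:
    """Summarize vault graph health: size, orphans, fragmentation, bridges."""
    U = G.to_undirected()
    return {
        "nodes": G.number_of_nodes(),
        "edges": G.number_of_edges(),
        # Orphans are notes with no inbound or outbound wiki-links.
        "orphans": sum(1 for _, d in G.degree() if d == 0),
        "components": nx.number_weakly_connected_components(G),
        # Bridges are edges whose removal disconnects part of the graph.
        "bridges": sum(1 for _ in nx.bridges(U)),
    }
```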
THREE-TIER SMART MATCHER
The smart matcher connects research items to projects across three confidence tiers:
Tier 1 — Explicit Links (confidence = 1.0) — Items with [[ProjectName]] wiki-links in vault markdown. Built by _build_explicit_index() mapping project names to linked items.
Tier 2 — Keyword Overlap (confidence 0.3–0.9) — Tech stack keywords weighted at 0.3 per match, general keywords at 0.1, capped at 0.9. Uses a 169-word stop-word frozenset and recognizes 22 multi-word tech terms ("retrieval augmented generation", "contrastive learning", etc.) as single tokens.
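The Tier 2 weighting can be sketched as follows. The weights (0.3 tech, 0.1 general, 0.9 cap) come from the description above; the stop-word set and multi-word term list are abridged stand-ins, and the function shape is an assumption.

```python
STOP_WORDS = frozenset({"the", "a", "and", "of", "to", "in", "for", "with"})  # abridged
MULTIWORD_TERMS = ["retrieval augmented generation", "contrastive learning"]  # abridged

def tier2_score(item_text: str, tech_keywords: set, general_keywords: set) -> float:
    """Weighted keyword-overlap confidence: 0.3 per tech match, 0.1 per
    general match, capped at 0.9 so only explicit links reach 1.0."""
    text = item_text.lower()
    tokens = set()
    # Recognize multi-word tech terms as single tokens before word-splitting.
    for term in MULTIWORD_TERMS:
        if term in text:
            tokens.add(term)
            text = text.replace(term, " ")
    tokens |= {t for t in text.split() if t not in STOP_WORDS}
    score = 0.3 * len(tokens & tech_keywords) + 0.1 * len(tokens & general_keywords)
    return min(score, 0.9)
```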
Tier 3 — Graph-Propagated Discovery — Pulls matches from Louvain community peers and Adamic-Adar predictions. Tagged with discovery_source and via_project for provenance. Limited to 15 community peers to avoid noise.
The matcher uses a 1-hour TTL cache (cachetools.TTLCache) with thread-safe locking. All three tiers compound — an item can have a Tier 1 match to one project and a Tier 3 match to another simultaneously.
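The caching pattern looks roughly like the stdlib-only stand-in below (the real matcher uses cachetools.TTLCache; this sketch just shows the TTL-plus-lock behavior, and the class and method names are invented).

```python
import threading
import time

class TTLMatchCache:
    """Minimal stand-in for a locked cachetools.TTLCache (sketch only)."""

    def __init__(self, ttl: float = 3600.0):  # 1-hour default, as in the matcher
        self._ttl = ttl
        self._lock = threading.Lock()
        self._store = {}  # key -> (expires_at, value)

    def get_or_compute(self, key, compute):
        with self._lock:
            hit = self._store.get(key)
            if hit and hit[0] > time.monotonic():
                return hit[1]  # fresh entry: skip recomputation
        value = compute()  # compute outside the lock so readers aren't blocked
        with self._lock:
            self._store[key] = (time.monotonic() + self._ttl, value)
        return value
```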
AGENTIC RESEARCH LAB
The workbench evaluates flagged items through an agentic pipeline with a state machine: queued → researching → researched → sandbox_creating → sandbox_ready | manual | failed.
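The state machine can be encoded as an enum plus a transition table. The states are the ones listed above; the exact set of allowed transitions is inferred from the pipeline description, not taken from the implementation.

```python
from enum import Enum

class Stage(str, Enum):
    QUEUED = "queued"
    RESEARCHING = "researching"
    RESEARCHED = "researched"
    SANDBOX_CREATING = "sandbox_creating"
    SANDBOX_READY = "sandbox_ready"
    MANUAL = "manual"
    FAILED = "failed"

# Allowed transitions (inferred; terminal states have no outgoing edges).
TRANSITIONS = {
    Stage.QUEUED: {Stage.RESEARCHING},
    Stage.RESEARCHING: {Stage.RESEARCHED, Stage.FAILED},
    Stage.RESEARCHED: {Stage.SANDBOX_CREATING, Stage.MANUAL},
    Stage.SANDBOX_CREATING: {Stage.SANDBOX_READY, Stage.MANUAL, Stage.FAILED},
}

def advance(current: Stage, target: Stage) -> Stage:
    """Reject any transition not in the table, keeping item state consistent."""
    if target not in TRANSITIONS.get(current, set()):
        raise ValueError(f"illegal transition {current.value} -> {target.value}")
    return target
```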
Research Phase — Spawns a claude -p subprocess (Opus, fallback Sonnet) with COSTAR-structured prompts. Allowed tools include WebSearch, WebFetch, Read, Write, context7, and exa-web-search. Output is a research.md with required sections: overview, installation, key APIs, programmatic assessment, experiment design, and safety notes. Cost detection scans for "subscription" or "pricing" keywords and flags items for manual review.
Sandbox Phase — Sonnet generates a Dockerfile (official base image, pinned versions, non-root USER), experiment.py, and run.sh. Docker runs with --network none for full isolation. Output must conform to a structured JSON schema: {metric_name, baseline, result, improvement, passed, description}.
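Validating the sandbox's JSON output against that schema might look like the sketch below; the field names come from the schema above, while the validator itself is a hypothetical helper.

```python
import json

# Expected fields and types, per the result schema described above.
REQUIRED = {
    "metric_name": str,
    "baseline": (int, float),
    "result": (int, float),
    "improvement": (int, float),
    "passed": bool,
    "description": str,
}

def validate_result(raw: str) -> dict:
    """Parse a sandbox run's JSON output and check it has the expected shape."""
    data = json.loads(raw)
    for key, typ in REQUIRED.items():
        if not isinstance(data.get(key), typ):
            raise ValueError(f"missing or mistyped field: {key}")
    return data
```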
Process Management — Agents spawn as subprocesses with start_new_session=True. Atomic file I/O uses tempfile + os.replace() throughout. Vault-sourced text is escaped via _fmt_safe() to prevent prompt injection.
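The tempfile + os.replace() idiom mentioned above looks like this (a generic sketch of the pattern, not the project's actual helper): write to a temp file in the same directory, fsync, then atomically rename over the target so readers never see a partial file.

```python
import os
import tempfile

def atomic_write(path: str, data: str) -> None:
    """Write-then-rename so a crash mid-write never corrupts the target."""
    dir_ = os.path.dirname(os.path.abspath(path))
    # Temp file must live on the same filesystem for os.replace to be atomic.
    fd, tmp = tempfile.mkstemp(dir=dir_, prefix=".tmp-")
    try:
        with os.fdopen(fd, "w") as f:
            f.write(data)
            f.flush()
            os.fsync(f.fileno())  # ensure bytes hit disk before the rename
        os.replace(tmp, path)  # atomic on POSIX
    except BaseException:
        os.unlink(tmp)
        raise
```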
FRONTEND & API
FastAPI Backend — REST endpoints expose parsed data from the vault, with WebSocket support for live updates. Analysis endpoints trigger Claude API calls with SHA-256 cache keys (v2:{item}:{project}:{type}:{has_graph}) to prevent redundant re-analysis.
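The cache key derivation follows directly from that format string; the function name below is an assumption, but the key layout is the one stated above.

```python
import hashlib

def analysis_cache_key(item: str, project: str,
                       analysis_type: str, has_graph: bool) -> str:
    """SHA-256 over the versioned key format v2:{item}:{project}:{type}:{has_graph}."""
    raw = f"v2:{item}:{project}:{analysis_type}:{has_graph}"
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()
```

Hashing keeps keys fixed-length regardless of note titles, and the v2 prefix invalidates old entries whenever the key format changes.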
Next.js Frontend — React with Tailwind v4 and shadcn/ui. Global feeds show all research items ranked by relevance. Project cockpit views show per-project items with graph context. The workbench UI tracks research and sandbox state transitions in real time.
Claude API Integration — 10 analysis functions spanning quick relevance (Haiku, 1024 tokens) to deep analysis (Sonnet, 2048 tokens) to full blog draft generation (Sonnet, 2000 tokens). Graph context — community peers, top 5 neighbors with PageRank scores, top 5 suggested connections with Adamic-Adar scores — is injected into every analysis prompt.
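Assembling that graph context for a prompt might look like the sketch below. The inputs are the PageRank, Louvain, and Adamic-Adar outputs described earlier; the function and its output format are illustrative, not the project's actual prompt template.

```python
import networkx as nx

def graph_context(G, ranks, communities, suggestions, note, k=5):
    """Render community peers, top-k neighbors (by PageRank), and top-k
    suggested connections (by Adamic-Adar) as prompt-ready lines."""
    peers = next((c for c in communities if note in c), set()) - {note}
    neighbors = sorted(G.neighbors(note),
                       key=lambda n: ranks.get(n, 0.0), reverse=True)[:k]
    links = [(u, v, s) for u, v, s in suggestions if note in (u, v)][:k]
    return "\n".join([
        f"Community peers: {', '.join(sorted(peers)) or 'none'}",
        "Top neighbors: " + ", ".join(
            f"{n} (PR {ranks.get(n, 0.0):.3f})" for n in neighbors),
        "Suggested connections: " + ", ".join(
            f"{u}<->{v} (AA {s:.2f})" for u, v, s in links),
    ])
```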
macOS App Bundle — Packaged as a native .app with a compiled C wrapper (launcher-wrapper.c) + ad-hoc codesign for Sequoia compatibility. Launches Caddy reverse proxy, conda environment, FastAPI, and Next.js.
Screenshots
Tech Stack
Details
Team
Steve Meadows
Timeline
Feb 2026 — Present