Intelligence Metrics — Expansion Roadmap¶

📊 Intelligence Metrics¶

The analytical core of RTSG — where the money lives

Strategic Vision

The Intelligence Engine is an API that every active AI agent, every CogOS instance, and every human user will want access to. It scores, compares, decomposes, and matches intelligence vectors — for models, documents, people, and teams. This page is the build roadmap.

1. The Product¶

What we're selling: An API + visual dashboard that lets anyone:

Score any entity (AI model, human, document, team) as an I-vector in ℝⁿ⁽ᵉ⁾ (n=12 for humans, variable per entity)
Compare any two or more entities across all n(e) dimensions (12 for humans) + Elo + composite metrics
Visualize comparisons in every modality: bars, radar, 3D shapes, heatmaps, scatter, parallel coordinates
Analyze geometric properties of the I-vector: shape class, density, parity, nearest neighbor, cosine similarity
Match entities for optimal synergy — find the best partner, team, or tool for a given task
Track changes over time — how does an entity's I-vector evolve?

Who pays:

AI labs benchmarking their models against the field
Enterprises choosing which model to deploy for which task
Individuals assessing their own cognitive profile
Teams optimizing composition for complementary strengths
Document/content analysis — "what kind of intelligence produced this?"

2. Comparison Engine — Every Way Possible¶

2.1 Visual Comparisons¶

Visualization	What it shows	Status
3D Polyhedra (Three.js)	Full I-vector as rotatable shape, overlay/grid/dim views	✅ Live
Radar Charts	Classic 8-axis spider plot, 2-5 models overlaid	🔨 Build next
Bar Charts	Side-by-side bars per dimension, grouped by model	🔨 Build next
Heatmap Matrix	Models × Dimensions, color-coded intensity	🔨 Build next
Parallel Coordinates	8 vertical axes, one line per model, crossings show trade-offs	🔨 Build next
Scatter Plots	Any 2 dimensions as X/Y, bubble size = Elo, color = group	🔨 Build next
Elo vs ‖I‖ Plot	X=Elo, Y=‖I‖, reveals models that over/under-perform for their shape	🔨 Build next
Shape Silhouettes	2D projection of 3D polyhedra, overlaid like a police lineup	📋 Planned
Time Series	I-vector components over model versions (V1→V2→V3)	📋 Planned
Synergy Map	Network graph where edge thickness = complementarity score	📋 Planned

2.2 Numerical Comparisons¶

Metric	Formula	What it reveals
Euclidean Norm ‖I‖	√(Σ I_d²)	Raw magnitude — total cognitive firepower
Cosine Similarity	(A·B)/(‖A‖‖B‖)	Shape similarity regardless of scale — "same profile, different magnitude?"
Euclidean Distance	‖A−B‖	How far apart two entities are in intelligence space
Manhattan Distance	Σ\|A_d − B_d\|	Sum of per-dimension gaps — "total disagreement"
Dimension Parity	max(I)/min(I)	How lopsided the vector is — specialists vs generalists
Entropy	−Σ p_d log p_d where p_d = I_d/Σ I_d	Evenness of distribution — high entropy = generalist
Density	‖I‖ / max_possible(‖I‖)	How much of the possible space is filled
Nearest Neighbor	argmin_B ‖A−B‖	Who is most similar to whom?
Farthest Neighbor	argmax_B ‖A−B‖	Maximum cognitive distance — the most complementary pair
Dominance Count	#{d : A_d > B_d}	In how many dimensions does A beat B?
Pareto Frontier	Non-dominated set	Which models are not strictly beaten on all dims by any other?
Centroid	(1/n)Σ I_i	The "average intelligence" of a group
Variance	(1/n)Σ ‖I_i − centroid‖²	How spread out a group is

2.3 Geometric Analysis — Shape Taxonomy¶

The shape of an I-vector polyhedron is not random. It encodes cognitive architecture. We propose a classification:

Shape Class	Signature	Example	Interpretation
Sphere	All dims ≈ equal, low parity	Balanced generalist	Even investment across all capabilities
Spike	1-2 dims >> rest, high parity	Math specialist	Deep investment in narrow capability
Slab	Top ring high, bottom ring low (or vice versa)	Technical-but-not-social	Systematic gap between capability families
Crown	Alternating high-low around the rings	Claude (L=9,M=8,S=5,K=1,N=7,A=9,P=8,E=7)	Strong on every other axis — complex trade-off pattern
Teardrop	Mostly high with 1-2 collapsed dims	Most frontier models (I_K≈1)	Missing capability drags overall shape
Diamond	Middle dims high, extremes low	Hypothetical balanced reasoner	Peaks in the center of each ring

Key insight: Two models with similar ‖I‖ can have radically different shapes, which means radically different strengths. Shape classification is more informative than any single number.

2.4 Synergy — The Killer Feature¶

Given two entities A and B, their synergy vector is:

S(A,B) = max(A_d, B_d) for each d

This represents the combined capability of the pair — the best of both on every dimension. The synergy gain is:

ΔS = ‖S(A,B)‖ − max(‖A‖, ‖B‖)

High ΔS means the pair covers each other's weaknesses. This is the matchmaking metric.

Complementarity score:

C(A,B) = 1 − cos(A,B)

Low cosine similarity = high complementarity = they're good together because they're different.

Optimal team of size k: Find the subset of k entities that maximizes ‖S(team)‖. This is the core matching algorithm.

Use cases:

"Which AI model should I pair with Claude for maximum coverage?"
"Given my personal I-vector, which AI compensates my weaknesses best?"
"Build me a 3-person team with maximal ‖S‖"
"Which document requires which intelligence profile to understand?"

3. Interactive Dashboard — Build Spec¶

Page: `/arena/compare/`¶

User flow:

User selects 2+ models from a checkbox list (or types a custom I-vector)
Dashboard instantly renders ALL comparison views:
Top row: Radar chart + 3D polyhedra side-by-side
Middle: Heatmap + parallel coordinates
Bottom: Numerical metrics table + scatter plot
Sidebar: full metric breakdown (cosine sim, distance, parity, entropy, synergy)
"Add to comparison" button — keeps adding models
"Match me" button — input your own scores, find your nearest/farthest/best-synergy model

Page: `/arena/geometry/`¶

Shape analysis dashboard:

Shows each model's shape classification with visual
Clusters models by shape similarity
"What shape is optimal for [task]?" — maps tasks to ideal shape profiles
Historical shape evolution (when we have version data)

Page: `/arena/synergy/`¶

Synergy calculator:

Select any pair → see synergy vector, gain, complementarity
"Build optimal team" — select pool, set team size, maximize coverage
Network visualization — all models as nodes, edges = synergy gain, thickness = complementarity
"Personal match" — input 8 self-assessment scores, get model recommendations

Page: `/arena/api/`¶

API documentation for the Intelligence Engine:

GET  /api/v1/models                    → List all scored models
GET  /api/v1/models/{id}/vector        → Get I-vector + Elo
GET  /api/v1/compare?a={id}&b={id}     → Full comparison (all metrics)
GET  /api/v1/similarity?a={id}&b={id}  → Cosine similarity
GET  /api/v1/nearest?id={id}&k=5       → k nearest neighbors
GET  /api/v1/synergy?ids=a,b,c         → Synergy vector + gain
GET  /api/v1/match?vector=9,8,5,1,7,9,8,7  → Best match for custom vector
GET  /api/v1/shape/{id}                → Shape classification + metrics
GET  /api/v1/pareto                    → Pareto frontier models
POST /api/v1/score                     → Score a document/text → I-vector

4. Build Phases¶

Phase 1 — Visual Dashboard (this week)¶

Interactive radar chart (select 2-5 models, overlaid spider plot)
Bar chart comparison (grouped bars, one group per dimension)
Heatmap matrix (models × dims, color intensity)
Numerical metrics table (cosine sim, distance, parity, entropy for selected pair)
All as a single React/HTML page, embeddable in wiki

Phase 2 — Geometric Analysis (next week)¶

Shape classifier (input I-vector → shape class + visual)
Parity/entropy/density calculations live in dashboard
Scatter plot (any 2 dims as axes, bubble = Elo)
Parallel coordinates plot
Pareto frontier visualization

Phase 3 — Synergy Engine (week 3)¶

Synergy calculator (pair → synergy vector + gain + complementarity)
"Match me" tool — input personal vector, get recommendations
Optimal team builder (select pool, set k, maximize ‖S‖)
Network graph visualization
"Score this document" prototype

Phase 4 — API + CogOS Integration (week 4+)¶

FastAPI endpoints at engine.smarthub.my
DuckDB backend for model vectors + historical data
Rate-limited public tier (free, 100 req/day)
Authenticated tier for agents/enterprises
CogOS SDK — any AI can call the Intelligence Engine mid-conversation
Webhook: "notify me when a new model enters my nearest-neighbor set"

5. Mathematical Foundation¶

All metrics derive from the I-vector space. The key equations:

I-vector: $\mathbf{I} = (I_L, I_M, I_S, I_K, I_N, I_A, I_P, I_{IE}, \ldots) \in \mathbb{R}^{n(e)}$

Norm: $\|\mathbf{I}\| = \sqrt{\sum_{d=1}^{8} I_d^2}$

Cosine similarity: $\cos(\mathbf{A}, \mathbf{B}) = \frac{\mathbf{A} \cdot \mathbf{B}}{\|\mathbf{A}\| \|\mathbf{B}\|}$

Complementarity: $C(\mathbf{A}, \mathbf{B}) = 1 - \cos(\mathbf{A}, \mathbf{B})$

Synergy vector: $S(\mathbf{A}, \mathbf{B})_d = \max(A_d, B_d)$

Synergy gain: $\Delta S = \|S(\mathbf{A}, \mathbf{B})\| - \max(\|\mathbf{A}\|, \|\mathbf{B}\|)$

Parity: $P(\mathbf{I}) = \frac{\max_d I_d}{\min_d I_d}$

Entropy: $H(\mathbf{I}) = -\sum_{d=1}^{8} p_d \log_2 p_d$ where $p_d = \frac{I_d}{\sum_d I_d}$

Density: $\rho(\mathbf{I}) = \frac{\|\mathbf{I}\|}{\sqrt{8 \cdot 10^2}} = \frac{\|\mathbf{I}\|}{10\sqrt{8}}$

Shape volume (convex hull of 8 vertices): Computed numerically from the polyhedron geometry. Measures how much of ℝ³ the projected shape occupies.

6. Competitive Advantage¶

Why this wins:

Nobody else has variable-dimensional intelligence vector (n=12 for humans)s. LMSYS has Elo (1D). We have Elo + I-vec (9D effective). That's 9x the information.
Shape analysis is novel. No one is classifying AI models by geometric cognitive signature.
Synergy is the matchmaking layer. The moment you can say "pair Claude + DeepSeek for 98% dimension coverage," you're selling infrastructure.
Self-assessment creates viral adoption. "What's my I-vector?" is inherently shareable.
Document scoring is the enterprise play. "This legal brief requires I_L=9, I_M=7 to parse correctly" → now you're routing work.
CogOS integration makes it a platform. Every AI agent that uses the Intelligence Engine becomes a node in the RTSG network.

7. Revenue Model¶

Tier	Price	Access
Free	$0	100 API calls/day, public model data, basic comparisons
Pro	$29/mo	Unlimited calls, synergy engine, custom vectors, team builder
Enterprise	$299/mo	Document scoring, CogOS integration, historical tracking, webhooks
API Partner	Custom	White-label, bulk scoring, embedded dashboards

This document is the strategic foundation. Phase 1 starts now.