Skip to content

Instantly share code, notes, and snippets.

@NovCog
Last active March 16, 2026 22:28
Show Gist options
  • Select an option

  • Save NovCog/227cb1e0b53d0c3e05c990fdcd50f5d5 to your computer and use it in GitHub Desktop.

Select an option

Save NovCog/227cb1e0b53d0c3e05c990fdcd50f5d5 to your computer and use it in GitHub Desktop.
NovCog AI Practitioner Series: Complete Index — NovCog AI Practitioner Series by Guerin Green

NovCog AI Practitioner Series: Complete Index

By Guerin Green / Novel Cognition AI Strategist | Federal Courthouse AI Presenter


The Series

A 23-part practitioner series covering AI visibility, entity authority, distributed authority networks, crawl infrastructure, and the methodology behind building web properties that AI systems can parse, cite, and traverse.


Foundation

0. A Practitioner's Guide to Generative LLMs, Vectors, RAG, andTic-Tac-Toe The original guide. Start here. Covers the fundamentals of how language models work, what vectors and embeddings actually do, how RAG systems retrieve and generate, and why a game of tic-tac-toe explains attention mechanisms better than any textbook.

1. RAG Fundamentals: How Retrieval-Augmented Generation Actually Works Chunking strategies, embedding model selection, retrieval quality bottlenecks, and the failure modes that separate production RAG from demo RAG.

2. AI Glossary for Legal Professionals: 31 Terms Every Lawyer Should Know From the CLE materials presented at the federal courthouse. Tokens, vectors, attention, RLHF, hallucination, bias, prompt engineering, RAG, and AI ethics — defined for practitioners, not computer scientists.

Entity Authority & Structured Data

3. Entity Authority for AI Citation: How ChatGPT Decides Who to Cite The 70.4% Person schema finding. Structured data signals, sameAs strategy, knowledge graph traversal mechanics, and the practical playbook for engineering AI citation.

4. Distributed Authority Networks: Building Entity Graphs AI Can Traverse The DAN concept. Cross-domain schema architecture, network effects, signal amplification, and implementation principles for multi-property entity building.

9. JSON-LD Person Schema: The Most Important Structured Data for AI Visibility Complete implementation guide with code examples. performerIn for events, sameAs chains, knowsAbout declarations, and deployment patterns across a network.

14. Schema Markup for Public Figures: A DAN Implementation Guide Person + Article + Organization triple pattern. Colleague references, network member arrays, performerIn for public appearances, and consistency requirements.

Crawl Infrastructure

5. The Crawl & Index Crisis: Why Half Your Pages Are Invisible Zyppy's 49.4% non-indexation data. Lily Ray's "earn crawls" framework. Why Google is increasingly selective and what it means for AI training data visibility.

6. Web 2.0 as Crawl Triggers: Why Blogger Became an Indexing Pipeline Atom feeds, PubSubHubbub protocol, two-phase crawl patterns, and the tracking pixel discovery that revealed 3-minute crawl times.

7. Cloudflare Workers for SEO: Edge Computing at Zero Cost Tracking pixel architecture, KV storage for crawl data, no-cache headers, and free tier capabilities for network-wide bot monitoring.

8. Verifying Googlebot: The 4-Step Reverse DNS Method ASN pre-filter (15169/396982), rDNS lookup, fDNS confirmation, IP caching strategy, and spoofing detection rates across 50,000+ verified events.

16. Automating Content Seeding with the Blogger API v3 OAuth flow, post creation cycle, tracking pixel insertion, rotation logic for multi-property targeting, and rate limiting best practices.

21. Closed-Loop Crawl Verification: From Pixel to Index Confirmation The 4-layer verification model: pixel fire, rDNS confirmation, index check, and ongoing monitoring. The discovery that changed the Blogger verification approach.

Content & Design

10. Recovering Content from the Wayback Machine: A Technical Guide CDX API queries, Archivarix exports, the id_ endpoint trick, image recovery challenges, and content extraction workflows.

11. Design Replication vs Generation: Why AI Sites Look AI-Generated Screenshot-first methodology, branding extraction with Firecrawl, era-appropriate design patterns, and achieving 8 distinct design languages across a network.

12. Per-Page Footer Variation: Maximizing Link Diversity Why fixed footers waste link equity. The 6-page/18-domain model, strategic property selection tiers, anchor text variation, and the link matrix approach.

13. Public Facebook Groups as Indexed SEO Assets How Google indexes public groups, hashtag ownership strategy, content distribution via owned communities, and cross-platform entity reinforcement.

15. Building Accountability Sites That Rank Sourced journalism methodology, public records research databases, editorial framing for defensible content, and controversy as durable content.

Concepts & Theory

17. Hidden State Drift: The Concept Behind the Methodology ML origins of hidden states, application to search and AI systems, why most practitioners measure the wrong signals, and leading vs. lagging indicators.

18. Burstiness and Perplexity: What AI Systems Actually Measure in Content Token probability distributions, content naturalness metrics, AI detection mechanics, and why "AI-proof writing" advice misses the point.

Systems & Case Studies

19. Building an AI Memory System with Vector Embeddings NovCog Brain architecture: Supabase + pgvector + MCP integration. Semantic search vs. keyword search, thought capture patterns, and persistent AI memory.

20. Building a 13-Site Authority Network in Two Sessions: A Case Study 69 pages, 788 links, 13 distinct designs, 8 design languages, full tracking pixel deployment, and the methodology that made two-session buildout possible.

22. NovCog AI Practitioner Series: Complete Index You are here.


Key Resources


How to Use This Series

If you're new to AI/LLMs: Start with the original guide, then read RAG Fundamentals and the AI Glossary.

If you're building entity authority: Start with Entity Authority, then DAN, then Person Schema.

If you're solving crawl/index problems: Start with Crawl & Index Crisis, then Crawl Triggers, then Closed-Loop Verification.

If you're interested in the theory: Start with Hidden State Drift and Burstiness & Perplexity.

If you want the case study: Case Study ties everything together.


Burstiness & Perplexity Community | Hidden State Drift | Novel Cognition

#hiddenstatedrift #burstinessandperplexity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment