GIG Gulf · Content + GEO Audit · Pipeline & Tool Gaps

Every audit, in order

Four stages — Download → Index → Save to DB → Analyse. Status measured against work done this session. Benchmarked against the Visible proposal (the standard GIG is being pitched).

Benchmark note: Visible (Gilles Praet) is pitching GIG a $3.5k–$9.5k/month GEO+SEO retainer. Their moat is one thing we don't yet have: a 100-prompt × 4-LLM AI-visibility measurement. Everything else in their proposal, we can match from first-party data we now hold. The pipeline below closes the gap.

Stage 1 · Download — acquire raw data

#Audit / pullSourceStatus
1Full site crawl + URL inventoryFirecrawl sitemapDONE 1,765 URLs
2Blog corpus scrape — 131 posts → recordsFirecrawlPARTIAL index ✓, 1/131
3Document-library inventory — 527 PDFsFirecrawl + PDF toolsTODO
4GSC export — pages · queries · dates (per market)Search ConsolePARTIAL UAE 16mo ✓
5GSC decay export — 3mo vs prior 3mo (fatigue input)Search Console compareTODO
6GA4 export — page engagement + quote/conversion eventsGA4PARTIAL live access + multi-market verified; per-page export pending
7Google Ads export — paid keywords + landing pagesGoogle AdsTODO
8Competitor crawl — Sukoon · Tawuniya · Salama · Orient + Lemonade/HippoFirecrawl (no access needed)TODO
9AI-visibility capture — prompts × LLMs (ChatGPT·Claude·Google; Perplexity/Gemini when keyed)LLM APIs (built in-house)DONE tool built + baseline run (20 prompts ×3)
10SERP + AI-Overview presence captureSerper (UAE geo)TODO
11Backlink / authority profileAhrefsBLOCKED plan tier

Stage 2 · Index — structure & classify the raw

#Audit / stepStatus
12URL taxonomy — classify all 1,765 by type · LOB · language · funnel-stage · formatTODO
13Blog record population — 22-field schema per postschema proven
14Date-stamping pass — publish/updated datesTODO
15Query classification — brand vs non-brand · intent · LOB · marketDONE 80/20
16Prompt → citation index — which LLM cited which page, per prompt (GEO)TODO (needs #9)

Stage 3 · Save to database — persist a queryable grid

#Audit / stepStatus
17Airtable master audit base — pages × metrics × scores × dispositionTODO (Airtable live)
18Per-URL metrics join — GSC + GA4 + Ads onto the inventoryGSC in hand
19Fatigue index compute — age + decay + cannibalisation + CTR-gap + repetitioninputs partial
20Richness scoring — the 10-element /20 per assetTODO (rubric ready)
21GEO phase scoring — Category Formation → Attribute Recall → Competitive Selection → Trust, per LOB/marketTODO (needs #9)

Stage 4 · Analyse — turn data into the plan

#Audit / analysisStatus
22Traffic concentration + dead-inventoryDONE top5=68%, 37%<10
23Category ROI — clicks per post by laneDONE Motor 897 vs Travel 86
24Brand vs non-brand demand gapDONE 80/20
25CTR quick-wins — high-impression, low-CTR giantsSTARTED
26Cannibalisation + technical-junk (duplicate URLs, language-switcher index bloat)STARTED
27Schema / AI-answer readiness audit (JSON-LD, FAQPage, llms.txt)TODO
28Competitor gap — volume · format · topics · share-of-voiceTODO
29AI-citation share — the GEO headline metricDONE 83% overall / 92% category (baseline)
30Multi-market + Arabic paritySTARTED Arabic dark
31Gap → 6-month roadmap + needle-moversDRAFTED

Missing tools — what to buy or build

CapabilityStatus / gapFix
AI-visibility measurement (100 prompts × 4 LLMs, citation tracking)Not wired. This is Visible's entire moat.BUILD — we have the LLM access (Claude, GPT, Gemini, Perplexity via API/OpenRouter) + Serper. ~1–2 days. Or BUY: Profound / Peec AI / otterly.ai.
GA4 Data API (scripted pulls)Service-account JSON not capturedGenerate in GCP (~15 min). UI export works for now.
Backlinks / keyword / competitor trafficAhrefs key valid but plan tier blocks APIUpgrade Ahrefs to an API tier, or Semrush.
Technical-SEO crawler at scaleFirecrawl scrapes; it's not a schema/indexation/redirect crawler for 1,765 URLsScreaming Frog or Sitebulb (~$200/yr).
Multi-market GSC + GA4Only UAE property reachableGIG grants .bh / .om / .qa / .sa properties.
llms.txt + schema validatorsNoneScriptable, no purchase.

Everything else the audit needs is keyed and working: Firecrawl, Serper, Airtable, the LLM stack, document/visual production, Cloudflare/Vercel hosting.

Articulate AI for GIG Gulf · Audit pipeline v1 · benchmarked vs Visible proposal (_competitor/visible-proposal-2026-05-14.pdf). Raw GSC saved to _gsc/.