Discover all customers of a given company by scanning websites, case studies, review sites, press, social media, job postings, and more. Use when you need competitive intelligence on who a company sells to.
npx goose-skills install customer-discovery --claude # Installs to: ~/.claude/skills/customer-discovery/
Find all customers of a company by scanning multiple public data sources. Produces a deduplicated report with confidence scoring.
Find all customers of DatadogWho are Notion's customers? Use deep mode.| Input | Required | Default | Description |
|---|---|---|---|
| Company name | Yes | — | The company to research |
| Website URL | No | Auto-detected | The company's website URL |
| Depth | No | standard | quick, standard, or deep |
Ask the user for:
mkdir -p customer-discovery-[company-slug]Collect all results into a running list. For each customer found, record:
1. Website logo wall
Run the scrape_website_logos.py script:
python3 skills/capabilities/customer-discovery/scripts/scrape_website_logos.py \
--url "[company-url]" --output jsonParse the JSON output and add each result to the customer list.
2. Case studies page
Use WebFetch on the company's case studies page (try /case-studies, /customers, /resources/case-studies). Extract customer names from page headings and content.
3. G2/Capterra reviews
If the review-scraper skill is available, use it to find reviewer companies:
python3 skills/capabilities/review-scraper/scripts/scrape_reviews.py \
--platform g2 --url "[g2-product-url]" --max-reviews 50 --output jsonFirst, WebSearch for the company's G2 page: site:g2.com "[company]". Extract reviewer company names from review author info.
4. Web search for press
WebSearch these queries and extract customer mentions from results:
"[company]" customer OR "case study" OR partnership"[company]" "we use" OR "switched to" OR "chose"5. Company blog posts
WebSearch: site:[company-domain] customer OR "case study" OR partnership OR "customer story"
6. Wayback Machine logos
Run the scrape_wayback_logos.py script:
python3 skills/capabilities/customer-discovery/scripts/scrape_wayback_logos.py \
--url "[company-url]" --output jsonLogos marked still_present: false are especially interesting — they indicate former customers.
7. Founder/exec LinkedIn posts
WebSearch: site:linkedin.com "[company]" customer OR "excited to announce" OR "welcome"
8. Twitter/X mentions
WebSearch: site:twitter.com "[company]" "we use" OR "just switched to" OR "loving"
9. Reddit/HN mentions
WebSearch these queries:
site:reddit.com "we use [company]" OR "[company] customer"site:news.ycombinator.com "[company]" customer OR user10. Job postings
WebSearch: "experience with [company]" site:linkedin.com/jobs OR site:greenhouse.io OR site:lever.co
Companies requiring experience with the product are likely customers.
11. YouTube testimonials
WebSearch: site:youtube.com "[company]" customer OR testimonial OR review
12. SEC filings
WebSearch: site:sec.gov "[company]" — Look for mentions in 10-K and 10-Q filings.
13. Podcast transcripts
WebSearch: "[company]" podcast customer OR transcript OR interview
14. GitHub usage signals
WebSearch: site:github.com "[company-package-name]" in dependency files, package.json, requirements.txt, etc.
15. Integration directories
WebFetch marketplace pages where the company lists integrations:
16. BuiltWith detection
python3 skills/capabilities/customer-discovery/scripts/search_builtwith.py \
--technology "[company-slug]" --max-results 50 --output json17. Crunchbase
WebSearch: site:crunchbase.com "[company]" customers OR partners
Merge results by company name using fuzzy matching:
Apply these rules:
High confidence:
Medium confidence:
Low confidence:
Create two output files:
customer-discovery-[company]/report.md:
# Customer Discovery: [Company Name]
**Date:** YYYY-MM-DD
**Depth:** quick | standard | deep
**Total customers found:** N
## High Confidence (N)
| Customer | Source | Evidence |
|----------|--------|----------|
| Shopify | Case study | [link] |
| ... | ... | ... |
## Medium Confidence (N)
| Customer | Source | Evidence |
|----------|--------|----------|
| ... | ... | ... |
## Low Confidence (N)
| Customer | Source | Evidence |
|----------|--------|----------|
| ... | ... | ... |
## Sources Scanned
- Website logo wall: [url] — N customers found
- G2 reviews: N reviews analyzed — N companies identified
- Wayback Machine: N snapshots checked — N logos found (N removed)
- Web search: N queries — N mentions
- ...
## Methodology
This report was generated using the customer-discovery skill, which scans
public data sources to identify companies that use [Company Name]. Confidence
levels reflect the strength and directness of the evidence found.customer-discovery-[company]/customers.csv:
CSV with columns: company_name,confidence,source_type,evidence_url,notes
Write the CSV using a code block or Python script.
| Script | Purpose | Key flags |
|---|---|---|
scrape_website_logos.py | Extract logos from current website | --url, --output json|summary |
scrape_wayback_logos.py | Find historical logos via Wayback Machine | --url, --paths, --output json|summary |
search_builtwith.py | BuiltWith technology detection (deep mode) | --technology, --max-results, --output json|summary |
All scripts require requests: pip3 install requests
External skill scripts (use if available):
skills/capabilities/review-scraper/scripts/scrape_reviews.py — G2/Capterra/Trustpilot reviews (requires Apify token)skills/capabilities/linkedin-post-research/scripts/search_posts.py — LinkedIn post search (requires Crustdata API key)--api-key flag); free scraping is used by default.Check and improve your brand's visibility across AI search engines (ChatGPT, Perplexity, Gemini, Grok, Claude, DeepSeek). Set up tracking, run visibility analyses, audit your website for AI readability, and get actionable recommendations. Uses the npx goose-aeo@latest CLI.
Extract competitor and customer intelligence from any company's landing page HTML. Discovers tech stack, analytics tools, ad pixels, customer logos, SEO metadata, CTAs, hidden elements, and more. No API keys required.
Detect buying signals from multiple sources, qualify leads, and generate outreach context