Skip to content

Instantly share code, notes, and snippets.

View triposat's full-sized avatar
🏠
Working from home

Satyam Tripathi triposat

🏠
Working from home
  • India
  • 15:58 (UTC +05:30)
View GitHub Profile
@triposat
triposat / n8n_workflow_template.json
Last active March 17, 2026 03:09
n8n workflow template for YouTube trending topics analysis — Apify YouTube Scraper + Gemini AI + Gmail
{
"nodes": [
{
"parameters": {
"rule": {
"interval": [
{
"field": "weeks",
"triggerAtDay": [1],
"triggerAtHour": 9
@triposat
triposat / glassdoor_company.py
Last active March 14, 2026 02:54
Glassdoor Company Profile Scraper - Standalone Python script using BFF API + curl_cffi + Decodo proxies (19 fields)
"""Glassdoor company profile scraper using the internal BFF API with Decodo proxies.
Extracts company overview data: size, industry, headquarters, ratings,
CEO approval, and more. Combines two BFF endpoints:
- /bff/employer-profile-mono/employer-data (company info)
- /bff/employer-profile-mono/employer-ratings (full ratings breakdown)
Data quality focus: raw JSON dumps, field-by-field validation, ghost field
detection across multiple companies.
@triposat
triposat / glassdoor_interviews.py
Last active March 14, 2026 02:55
Glassdoor Interviews Scraper - Standalone Python script using BFF API + curl_cffi + Decodo proxies (11 fields)
"""Glassdoor interview experiences scraper using the internal BFF API with Decodo proxies.
Handles Cloudflare challenge detection, session validation, retry logic,
CSRF token extraction, and Decodo proxy management.
Uses curl_cffi's advanced fingerprinting features:
- Pinned browser version (TLS fingerprint matches User-Agent exactly)
- TLS extension randomization (real Chrome permutes on every request)
- TLS GREASE values (Chrome adds random values for robustness)
- Brotli certificate compression (matches Chrome default)
@triposat
triposat / glassdoor_jobs.py
Last active March 14, 2026 02:55
Glassdoor Jobs Scraper - Standalone Python script using BFF API + curl_cffi + Decodo proxies (12 fields)
"""Glassdoor job listings scraper using the internal BFF API with Decodo proxies."""
import argparse
import csv
import json
import logging
import os
import random
import re
import sys
@triposat
triposat / glassdoor_reviews.py
Last active March 14, 2026 02:55
Glassdoor Reviews Scraper - Standalone Python script using BFF API + curl_cffi + Decodo proxies (21 fields)
"""Glassdoor reviews scraper using the internal BFF API with Decodo proxies.
Handles Cloudflare challenge detection, session validation, retry logic,
CSRF token extraction, and Decodo proxy management.
Uses curl_cffi's advanced fingerprinting features:
- Pinned browser version (TLS fingerprint matches User-Agent exactly)
- TLS extension randomization (real Chrome permutes on every request)
- TLS GREASE values (Chrome adds random values for robustness)
- Brotli certificate compression (matches Chrome default)
@triposat
triposat / yelp_business.py
Last active March 5, 2026 07:06
Yelp Scraper Toolkit – 4 Python scripts for scraping Yelp search results, business details, reviews, and not-recommended reviews
"""
Yelp Business Details Scraper
Usage:
python3 yelp_business.py "https://www.yelp.com/biz/raw-sugar-factory-san-francisco"
python3 yelp_business.py "https://www.yelp.com/biz/raw-sugar-factory-san-francisco" -o details.json
Proxy: Set PROXY_URL in .env file (e.g., PROXY_URL=http://user:pass@host:port)
"""
@triposat
triposat / glassdoor_reviews_scraper.py
Last active March 2, 2026 01:01
Glassdoor Reviews Scraper — Scrapes company reviews from Glassdoor using their internal BFF API with curl_cffi. Auto-resolves company names to employer IDs, supports rating/employment/text filters, and exports to CSV/JSON.
"""
Glassdoor Reviews Scraper using curl_cffi
Scrapes company reviews from Glassdoor using their internal BFF API.
Uses curl_cffi for browser TLS fingerprint impersonation to bypass Cloudflare.
Supports multiple Glassdoor regional sites (co.in, sg, com, co.uk, etc.).
Usage:
python glassdoor_reviews_scraper.py --company Amazon --pages 3
python glassdoor_reviews_scraper.py --company Google --site com --sort date --rating 4
@triposat
triposat / glassdoor_scraper.py
Last active March 2, 2026 01:01
Glassdoor Job Scraper — Scrapes job listings from Glassdoor using their internal BFF API with curl_cffi for TLS fingerprint impersonation. Supports multi-region sites, salary/rating/date filters, city name resolution, and CSV/JSON export.
"""
Glassdoor Job Scraper using curl_cffi
Scrapes job listings and details from Glassdoor using their internal BFF API.
Uses curl_cffi for browser TLS fingerprint impersonation to bypass Cloudflare.
Supports multiple Glassdoor regional sites (co.in, sg, com, co.uk, etc.).
Usage:
python glassdoor_scraper.py --keyword "machine learning engineer" --location-id 2671300 --pages 3
python glassdoor_scraper.py --site sg -k "data scientist" -l 2671300 --sort date
{
"nodes": [
{
"parameters": {
"rule": {
"interval": [
{}
]
}
},
{
"nodes": [
{
"parameters": {
"rule": {
"interval": [
{}
]
}
},