Summary
Extract structured data from Google Search Engine Results Pages (SERPs) with a powerful and configurable UI. This script scrapes results and exports them to various formats (JSON, CSV, Markdown, etc.). It is enhanced with deep parsing for forum-style results and includes an experimental auto-scraping feature.
Features
- Manual Scraping: Scrape all results on the current page with a single click.
- Automatic Scraping (Experimental): An optional feature that monitors the page for changes and automatically scrapes new results added by other helper extensions (e.g., Infy Scroll or uAutoPagerize).
- Floating UI Panel: Draggable, minimizable, and maximizable interface.
- Dual View Modes: JSON view and a filterable List/Preview view.
- Multiple Export Options: Copy or download as JSON, CSV, Markdown, URL list.
- Comprehensive Settings: Customize everything from data fetching (including forum stats and related posts), CSS selectors, export fields, to UI preferences and automation.
- Advanced Features: Right-click context menu, smart URL decoding, and multi-lingual date parsing (EN, JA, ZH-TW).
How to Use
- After installation, the floating panel appears on Google search pages.
- Click the "Scrape Page" button to scrape results from the currently displayed page. Please note: this action will clear and overwrite any previously scraped data.
- To Scrape Multiple Pages (Strongly Recommended):
- Since Google now uses pagination, it is strongly recommended to first install a browser extension like Infy Scroll or uAutoPagerize to merge all desired pages into one.
- Once all content is loaded, click this script's "Scrape Page" button once to process everything.
- Automatic Scraping (Advanced): This feature is designed specifically for use with the helper extensions mentioned above. After enabling a helper tool, turn on the "Auto-scrape" option in this script's settings to automatically append new results as they are loaded.
Important Notes
- Without a helper extension, the auto-scrape feature is completely ineffective as it cannot navigate to the next page on its own.
- Google's page structure can change. If the script fails, try adjusting the "Title Element Selector" in settings or report an issue.
- This script is primarily tested on English, Japanese, and Traditional Chinese Google sites.
Acknowledgements / Third-Party Code
- Includes an implementation of Punycode.js for decoding internationalized domain names.