faeller/pagespider

pagespider

browser extension inspired by httrack that crawls websites and saves pages as self-contained html files for offline viewing.


features

  • saves pages with assets inlined as base64 (images, css, fonts) or as separate files
  • by default rewrites links between pages for offline navigation (like httrack)
  • static mode strips javascript for clean archives of js-heavy sites (sveltekit, next.js, etc.)
  • url filtering with contains/path-starts/regex
  • screenshots via native scroll-stitch or html2canvas
  • two storage backends: browser downloads or filesystem (via dashboard)
  • crawl queue is persisted in indexeddb, so interrupted crawls survive browser crashes
  • cross-browser: works in chrome and firefox
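
the inline-assets mode boils down to embedding each fetched asset as a base64 `data:` uri inside the saved html. a minimal sketch of that idea (function names and the regex approach are illustrative, not pagespider's actual code, which presumably works on the parsed dom):

```javascript
// Turn raw asset bytes into a data: URI so the saved html is self-contained.
// Buffer is the node.js way; in a browser extension you'd use btoa/FileReader.
function toDataUri(mimeType, bytes) {
  const base64 = Buffer.from(bytes).toString("base64");
  return `data:${mimeType};base64,${base64}`;
}

// Rewrite every <img src> in an html string to an inlined data: URI.
// `fetchBytes(src)` is a hypothetical stand-in returning { mime, bytes }.
function inlineImages(html, fetchBytes) {
  return html.replace(/<img([^>]*?)src="([^"]+)"/g, (match, attrs, src) => {
    const { mime, bytes } = fetchBytes(src);
    return `<img${attrs}src="${toDataUri(mime, bytes)}"`;
  });
}
```

the same transform applies to css `url(...)` references and font files; only the regex differs.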

install

  1. clone repo
  2. open chrome://extensions (or about:addons in firefox)
  3. enable developer mode
  4. click "load unpacked" and select the pagespider/ folder

usage

  1. click extension icon
  2. enter url or click "use current tab"
  3. set depth (how many links deep to crawl)
  4. optionally set url filter to limit scope
  5. choose a storage method (the filesystem backend requires opening the dashboard first)
  6. start crawl
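
conceptually, the depth setting bounds a breadth-first walk over discovered links. a rough sketch of that loop, where `extractLinks` is a hypothetical stand-in for fetching a page and pulling out its `<a href>` targets:

```javascript
// Breadth-first crawl bounded by depth: depth 0 saves only the start page,
// depth -1 means unlimited. `extractLinks(url)` is a stand-in for the
// extension's real fetch-and-parse step.
function crawl(startUrl, maxDepth, extractLinks) {
  const visited = new Set();
  const queue = [{ url: startUrl, depth: 0 }];
  const saved = [];
  while (queue.length > 0) {
    const { url, depth } = queue.shift();
    if (visited.has(url)) continue;
    visited.add(url);
    saved.push(url); // in the extension: fetch, inline assets, store
    if (maxDepth !== -1 && depth >= maxDepth) continue; // depth budget spent
    for (const link of extractLinks(url)) {
      if (!visited.has(link)) queue.push({ url: link, depth: depth + 1 });
    }
  }
  return saved;
}
```

the persistent indexeddb queue mentioned under features would replace the in-memory array here.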

storage methods

downloads. saves to the browser's downloads folder. simple, but without a real folder structure.

filesystem. saves to a folder you choose, with a proper directory structure. requires keeping the dashboard tab open during the crawl.
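
the "proper structure" amounts to mapping each url onto a relative file path, httrack-style. one plausible mapping (an assumption for illustration, not necessarily pagespider's exact scheme):

```javascript
// Map a url to a relative file path: host/path/..., treating directory
// urls as index.html and extensionless pages as .html. Illustrative only.
function urlToLocalPath(url) {
  const u = new URL(url);
  let path = u.pathname;
  if (path.endsWith("/")) path += "index.html";
  else if (!/\.[a-z0-9]+$/i.test(path)) path += ".html";
  return u.hostname + path;
}
```

with a scheme like this, rewriting a link between two saved pages is just computing the relative path between their two mapped locations.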

options

| option | description |
| --- | --- |
| url filter | only crawl urls matching the pattern |
| depth | how many links deep (0 = single page, -1 = unlimited) |
| delay | milliseconds between requests |
| assets | inline (base64) or separate files |
| screenshots | native (scroll-stitch) and/or html2canvas |
| static mode | strip scripts for clean static pages |
| rewrite links | convert links to local files |
| same origin | stay on the same domain |
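
the url filter and same-origin options together decide whether a discovered link gets crawled. a sketch of that check, where the three filter modes mirror the contains/path-starts/regex choices above (the option names and shapes here are assumptions):

```javascript
// Test a url against the configured filter. A missing filter matches everything.
function matchesFilter(url, filter) {
  if (!filter) return true;
  switch (filter.mode) {
    case "contains":    return url.includes(filter.pattern);
    case "path-starts": return new URL(url).pathname.startsWith(filter.pattern);
    case "regex":       return new RegExp(filter.pattern).test(url);
    default:            return false;
  }
}

// Combine the same-origin restriction with the url filter.
function shouldCrawl(url, startUrl, { sameOrigin, filter }) {
  if (sameOrigin && new URL(url).origin !== new URL(startUrl).origin) return false;
  return matchesFilter(url, filter);
}
```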

limitations

  • buttons that navigate via js (e.g. `router.push()`) won't work in static mode - only `<a href>` links get rewritten
  • html2canvas may miss some css effects
  • native screenshots may duplicate fixed/sticky elements
