
Sosse

Selenium-based search engine and crawler with offline archiving.

Sosse is a self‑hosted utility that uses Selenium to crawl web pages, index their content, and store archived copies for offline browsing. Its web‑based administration interface lets users submit URLs to crawl, manage the crawl queue, and configure crawlers, collections, and RSS or Atom feeds. The system tracks documents, tags, domains, cookies, and MIME types, and it can exclude specific URLs or integrate external search engines.

The platform exposes a REST API and command‑line tools for interacting with the indexed data, supporting features such as profile history, screenshot capture, and analytics. Users can set up webhooks, permission rules, and AI‑driven automation for tasks like tagging or summarizing content, while configuration files allow fine‑grained control over the web server, crawler behavior, and other settings.

Sosse is released under the AGPL‑3.0 license, has no subscription or tiered pricing, and is considered stable for production use. It can be installed from Debian packages, with pip, or with Docker or Docker Compose.
