Based on the little-used HTML5 outline spec, investigate and implement an in-browser tool (currently a Chrome extension or browser user script) that makes it easy to interactively scrape a documentation web page into an 'index-content' map for (offline) searching.

Motivated by the fact that most scrapers today are command-line tools, which demand too much technical skill from ordinary users.

Currently this only targets documentation web pages, which are much better structured and therefore easier to scrape. I also think these pages benefit most from indexing and scraping.
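
To make the 'index-content' map idea concrete, here is a minimal sketch. It is an assumption about what such a map could look like, not the project's actual design: it approximates the outline from heading levels (h1 to h6) only, rather than implementing the full HTML5 outline algorithm, and it is written in Python with the standard `html.parser` module for brevity, whereas the in-browser tool would walk the live DOM. The names `OutlineIndexer`, `build_index` and `search` are hypothetical.

```python
from html.parser import HTMLParser

HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
SKIPPED = {"script", "style"}  # their text is not page content


class OutlineIndexer(HTMLParser):
    """Collect the text under each heading, keyed by the heading's path."""

    def __init__(self):
        super().__init__()
        self.index = {}            # "Guide > Install" -> text below that heading
        self._stack = []           # currently open headings as (level, title)
        self._title_parts = None   # a list while inside a heading tag
        self._level = 0
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED:
            self._skip_depth += 1
        elif tag in HEADINGS:
            self._title_parts = []
            self._level = int(tag[1])

    def handle_endtag(self, tag):
        if tag in SKIPPED and self._skip_depth:
            self._skip_depth -= 1
        elif tag in HEADINGS and self._title_parts is not None:
            title = " ".join("".join(self._title_parts).split())
            # A new heading closes every open heading at the same or deeper level.
            while self._stack and self._stack[-1][0] >= self._level:
                self._stack.pop()
            self._stack.append((self._level, title))
            self.index.setdefault(self._key(), "")
            self._title_parts = None

    def handle_data(self, data):
        if self._skip_depth:
            return
        if self._title_parts is not None:
            self._title_parts.append(data)
        elif self._stack:
            text = " ".join(data.split())
            if text:
                key = self._key()
                self.index[key] = (self.index[key] + " " + text).strip()

    def _key(self):
        return " > ".join(title for _, title in self._stack)


def build_index(html):
    """Return a dict mapping each heading path to the text found under it."""
    parser = OutlineIndexer()
    parser.feed(html)
    parser.close()
    return parser.index


def search(index, term):
    """Return the heading paths whose title or content contains the term."""
    term = term.lower()
    return [k for k, v in index.items() if term in k.lower() or term in v.lower()]
```

For a page with an h1 "Guide" followed by an h2 "Install", `build_index` yields the keys "Guide" and "Guide > Install", and `search` does a plain case-insensitive substring match over both keys and content. Text that appears before the first heading is dropped in this sketch, and sections with identical heading paths are merged.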

Looking for hackers with the skills:

web scrapper indexer documentation

This project is part of:

Hack Week 13

Activity

  • over 8 years ago: cxiong added keyword "web" to this project.
  • over 8 years ago: cxiong added keyword "scrapper" to this project.
  • over 8 years ago: cxiong added keyword "indexer" to this project.
  • over 8 years ago: cxiong added keyword "documentation" to this project.
  • over 8 years ago: cxiong started this project.
  • over 8 years ago: cxiong originated this project.
