Based on the little-used HTML5 outline spec, investigate&implement an in-browser tool (currently a chrome extension or browser user script) to easily, interactively scrap a documentation web page into an 'index-content' map for (offline) searching.

Motivated by the fact that most scrappers today are command line tools, too tech-savvy.

Currently only target documentation web pages, which are much better structured and so easier to scrap. Also I think these pages benefit most from indexing&scraping.

Looking for hackers with the skills:

web scrapper indexer documentation

This project is part of:

Hack Week 13

Activity

  • almost 6 years ago: cxiong added keyword "web" to this project.
  • almost 6 years ago: cxiong added keyword "scrapper" to this project.
  • almost 6 years ago: cxiong added keyword "indexer" to this project.
  • almost 6 years ago: cxiong added keyword "documentation" to this project.
  • almost 6 years ago: cxiong started this project.
  • almost 6 years ago: cxiong originated this project.

  • Comments

    Be the first to comment!

    Similar Projects

    Chimera Policy Hub by flavio_castelli

    [comment]: # (Please use the project descriptio...


    Sharing logic between desktop and web based applications through WASM by IGonzalezSosa

    Project Description

    A few months ago, the...


    Cockpit for YES Certification by nm75

    [comment]: # (Please use the project descriptio...


    WebRTC individual track recorder by avicenzi

    [comment]: # (Please use the project descriptio...


    Convert openqa-mon to webassembly by ybonatakis

    [comment]: # (Please use the project descriptio...


    Developing an opinionated storage appliance by asettle

    [comment]: # (Please use the project descriptio...


    document/blog commit -> container workflow by hennevogel

    we have fresh containers for every commit for O...