tREADME - webdump - [FORK] git://git.codemadness.org/webdump
 (HTM) git clone git://git.z3bra.org/webdump.git
 (DIR) Log
 (DIR) Files
 (DIR) Refs
 (DIR) README
 (DIR) LICENSE
       ---
       tREADME (1308B)
       ---
            1 !!!
            2 
            3 NOTE! work-in-progress (very slowly) and experimental.
            4 This code has many dirty hacks and ugliness. Intended for my personal use only.
            5 Knowing this: of course feel free to use it in any way you like, see LICENSE.
            6 
            7 !!!
            8 
            9 
           10 webdump
           11 -------
           12 
           13 Text-based HTML dump
           14 
           15 
           16 Build and install
           17 -----------------
           18 
           19 $ make
           20 # make install
           21 
           22 
           23 Dependencies
           24 ------------
           25 
           26 - C compiler.
           27 - libc + some BSDisms.
           28 
           29 
           30 Usage
           31 -----
           32 
           33 webdump < file.html | less -r
           34 
           35 hurl 'https://codemadness.org/' | webdump | less -r
           36 
           37 webdump -a -l -r -w $(tput cols) < file.html | less -r
           38 
           39 
           40 Goals / scope
           41 -------------
           42 
           43 The tool will only render HTML to stdout, similarly to links -dump or
           44 lynx -dump but simpler and more secure.
           45 
           46 - HTML and XHTML will be supported.
           47 - There will be some workarounds and quirks for broken and also legacy HTML code.
           48 - It will be usable and secure for reading HTML from mails and RSS/Atom feeds.
           49 - No remote resources which are part of the HTML will be downloaded:
           50   images, video, audio, etc. But these may be visible as an link reference.
           51 - Data will be written to stdout only.
           52 - No support for Javascript, CSS, frame rendering or forms.
           53 - No HTTP or network protocol handling in general: HTML data is read from stdin.
           54 
           55 
           56 License
           57 -------
           58 
           59 ISC, see LICENSE file.
           60 
           61 
           62 Author
           63 ------
           64 
           65 Hiltjo Posthuma <hiltjo@codemadness.org>