README - webdump - [FORK] git://git.codemadness.org/webdump
git clone git://git.z3bra.org/webdump.git
---
!!!

NOTE! work-in-progress (very slowly) and experimental.
This code has many dirty hacks and ugliness. Intended for my personal use only.
Knowing this: of course feel free to use it in any way you like, see LICENSE.

!!!


webdump
-------

Text-based HTML dump


Build and install
-----------------

$ make
# make install


Dependencies
------------

- C compiler.
- libc + some BSDisms.


Usage
-----

webdump < file.html | less -r

hurl 'https://codemadness.org/' | webdump | less -r

webdump -a -l -r -w $(tput cols) < file.html | less -r
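
The last invocation above can be wrapped in a small shell function so the
terminal width does not have to be typed each time. This is only a sketch: the
name dump_html and the 80-column fallback are assumptions, not part of webdump,
and the flags are copied verbatim from the example above.

```shell
# dump_html: hypothetical helper, not part of webdump.
# Reads HTML from stdin and renders it with the flags from the usage example.
dump_html() {
	# Ask the terminal for its width. Note: cols is a global variable,
	# POSIX sh has no "local".
	cols=$(tput cols 2>/dev/null) || cols=80
	# Fall back to 80 columns (an assumed default) if tput printed
	# nothing or something that is not a number.
	case $cols in
	''|*[!0-9]*) cols=80 ;;
	esac
	webdump -a -l -r -w "$cols"
}
```

It is used like the plain command:

dump_html < file.html | less -r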


Goals / scope
-------------

The tool will only render HTML to stdout, similar to links -dump or
lynx -dump, but simpler and more secure.

- HTML and XHTML will be supported.
- There will be some workarounds and quirks for broken and legacy HTML.
- It will be usable and secure for reading HTML from mails and RSS/Atom feeds.
- No remote resources which are part of the HTML will be downloaded:
  images, video, audio, etc. These may still be shown as a link reference.
- Data will be written to stdout only.
- No support for JavaScript, CSS, frame rendering or forms.
- No HTTP or network protocol handling in general: HTML data is read from stdin.
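
Because webdump does no networking itself, fetching is left to a separate
program, as in the hurl example under Usage. A sketch of that split, assuming
curl is installed (curl is not mentioned in this README, any tool that writes
the document to stdout should do) and using a hypothetical function name:

```shell
# fetch_and_dump: hypothetical helper, not part of webdump.
# usage: fetch_and_dump url
fetch_and_dump() {
	[ $# -eq 1 ] || { echo "usage: fetch_and_dump url" >&2; return 1; }
	# curl only fetches: -s hides the progress meter, -f fails on HTTP
	# errors, -L follows redirects. webdump only renders its stdin.
	curl -sfL "$1" | webdump
}
```

For example:

fetch_and_dump 'https://codemadness.org/' | less -r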


License
-------

ISC, see LICENSE file.


Author
------

Hiltjo Posthuma <hiltjo@codemadness.org>