sfeed_web.1 - sfeed - RSS and Atom parser
(HTM) git clone git://git.codemadness.org/sfeed
(DIR) Log
(DIR) Files
(DIR) Refs
(DIR) README
(DIR) LICENSE
---
sfeed_web.1 (1153B)
---
1 .Dd July 27, 2021
2 .Dt SFEED_WEB 1
3 .Os
4 .Sh NAME
5 .Nm sfeed_web
6 .Nd finds URLs to feeds from a HTML webpage
7 .Sh SYNOPSIS
8 .Nm
9 .Op Ar baseurl
10 .Sh DESCRIPTION
11 .Nm
12 reads the HTML data of the webpage from stdin and writes the found URLs to
13 stdout.
14 .Pp
15 Such a link reference in HTML code looks like:
16 .Bd -literal -offset 4n
17 <link rel="alternate" href="atom.xml" type="application/atom+xml" />
18 .Ed
19 .Sh OPTIONS
20 .Bl -tag -width 8n
21 .It Ar baseurl
22 Optional base URL to use for found feed URLs that are relative.
23 .El
24 .Sh OUTPUT FORMAT
25 url<TAB>content-type<newline>
26 .Bl -tag -width Ds
27 .It URL
28 Found relative or absolute URL.
29 .Pp
30 For relative URLs if a <base href="..." /> tag is found it will be used,
31 otherwise if the
32 .Ar baseurl
33 option is specified then that is used, if neither are set then the relative URL
34 is printed.
35 .It content-type
36 Usually application/atom+xml or application/rss+xml.
37 .El
38 .Sh EXIT STATUS
39 .Ex -std
40 .Sh EXAMPLES
41 Get URLs from a website:
42 .Bd -literal -offset 4n
43 curl -s -L 'https://codemadness.org/' | sfeed_web 'https://codemadness.org/'
44 .Ed
45 .Sh SEE ALSO
46 .Xr sfeed_update 1 ,
47 .Xr sh 1
48 .Sh AUTHORS
49 .An Hiltjo Posthuma Aq Mt hiltjo@codemadness.org