[HN Gopher] Extracting web page content using Readability.js and...
___________________________________________________________________
Extracting web page content using Readability.js and shot-scraper
Author : simonw
Score : 3 points
Date : 2022-03-24 20:58 UTC (2 hours ago)
(HTM) web link (til.simonwillison.net)
(TXT) w3m dump (til.simonwillison.net)
| simonw wrote:
| Here's the terminal one-liner: shot-scraper
| javascript https://simonwillison.net/2022/Mar/24/datasette-061/ "
| async () => { const readability = await
| import('https://cdn.skypack.dev/@mozilla/readability');
| return (new readability.Readability(document)).parse();
| }"
|
| To first install shot-scraper, run: pip install
| shot-scraper shot-scraper install
___________________________________________________________________
(page generated 2022-03-24 23:02 UTC)