[HN Gopher] Show HN: A self-updating list of the most current us...
___________________________________________________________________
Show HN: A self-updating list of the most current useragents
Hi Hacker News! I made a site which displays the most common
useragents found on the web. The site updates weekly with data
sourced from the server access logs of another site I run in order
to give an accurate picture of the devices and browsers being used
on the web. I do a lot of web scraping in my work and it's this
group of people who I had in mind when creating the site. The data
is presented as useragent, browser, os, and relative percentage of
occurence. It can be viewed as a table on the site or via json in
the API. Please let me know your thoughts or feedback and I hope
you find it useful! Thanks!
Author : stayml
Score : 15 points
Date : 2022-12-17 21:50 UTC (1 hours ago)
(HTM) web link (www.useragents.me)
(TXT) w3m dump (www.useragents.me)
| dj_mc_merlin wrote:
| First of all, good job on the project. I do not mean to be too
| negative, but this:
|
| > The site updates weekly with data sourced from the server
| access logs of another site I run in order to give an accurate
| picture of the devices and browsers being used on the web.
|
| If you only source data from one website, it is not an accurate
| picture by any means. Consider that other websites have different
| user bases. Your Firefox usage seems way too high for example,
| it's likely your other website is largely used by other technical
| people.
| playingalong wrote:
| AFAICT they are not trying to provide current browser usage
| stats. They are only after having a list of popular browsers
| which is only a subset of the former. The last paragraph about
| scraping seems to indicate that.
|
| So they only need what is a reasonable UA as of this week. They
| don't need what is the most popular one.
| stayml wrote:
| Yes, this too. It should just be a -passable- sample of
| what's popular and seen on the web
| stayml wrote:
| Thanks! And yep, fair comment, and I had noticed this as well
| even more so in last week's list. I have been thinking about
| how I could adjust the numbers in some way to counteract this
| or add another data source.
| modeless wrote:
| Where are Android and iOS?
| jeroenhd wrote:
| How do you control for the faked user agents scrapers use? A bad
| scraper will be much more common in your logs than a well-behaved
| browser, after all.
| drusepth wrote:
| Always interesting to browse these lists. Figured I'd add this
| week's browser %s from a site I run to the thread for others
| interested in another source:
|
| > Chrome 62.09%
|
| > Safari 26.55%
|
| > Edge 3.23%
|
| > Opera 2.94%
|
| > Firefox 2.64%
|
| > Samsung Internet 0.88%
|
| > Internet Explorer 0.54%
|
| > Android Webview 0.24%
|
| > YaBrowser 0.16%
|
| Bias: site audience is mostly writers/authors
| black3r wrote:
| The OS column comes off as a bit deceptive, because windows 11
| still presents as Windows NT 10.0 and newer versions of macOS
| still present themselves as "Intel Mac OS X 10_15_7", even if
| you're on an ARM chip...,
|
| If you're parsing this info into the OS column, you should
| probably display this info in there at least with a note, or
| something like "Windows 10/11", "macOS 10.5 or newer"
| stayml wrote:
| Good point, thanks. I'll add that in
___________________________________________________________________
(page generated 2022-12-17 23:00 UTC)