[HN Gopher] Data Cascades in Machine Learning
___________________________________________________________________
Data Cascades in Machine Learning
Author : theafh
Score : 41 points
Date : 2021-06-04 16:50 UTC (6 hours ago)
(HTM) web link (ai.googleblog.com)
(TXT) w3m dump (ai.googleblog.com)
| washedup wrote:
| Good article. I would suggest following it up with
| https://pair.withgoogle.com/chapter/data-collection/
|
| Monitoring drift in the inputs, predictions, and performance is
| crucial for any model in production. Personally, for input/target
| drift, I prefer using the D statistic from the Kolmogorov-Smirnov
| test to look for distribution changes.
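
For anyone who wants to try the drift check described above, here is a minimal sketch of the two-sample Kolmogorov-Smirnov D statistic in plain Python. The function names `ks_d_statistic` and `has_drifted`, and the default `threshold` of 0.1, are illustrative assumptions and not something from the article or the comment; a sensible threshold depends on sample size and on how much shift your model tolerates.

```python
from bisect import bisect_right


def ks_d_statistic(reference, current):
    """Two-sample KS D: the largest gap between the two empirical CDFs."""
    ref = sorted(reference)
    cur = sorted(current)
    if not ref or not cur:
        raise ValueError("both samples must be non-empty")

    d = 0.0
    # Both empirical CDFs are step functions that only change at observed
    # values, so checking every pooled observation finds the maximum gap.
    for x in ref + cur:
        f_ref = bisect_right(ref, x) / len(ref)  # share of ref values <= x
        f_cur = bisect_right(cur, x) / len(cur)  # share of cur values <= x
        d = max(d, abs(f_ref - f_cur))
    return d


def has_drifted(reference, current, threshold=0.1):
    """Hypothetical helper: flag drift when D exceeds an assumed threshold."""
    return ks_d_statistic(reference, current) > threshold


if __name__ == "__main__":
    training_feature = [1, 2, 3, 4]    # values seen at training time
    production_feature = [3, 4, 5, 6]  # values seen in production
    print(ks_d_statistic(training_feature, production_feature))
    print(has_drifted(training_feature, production_feature))
```

In practice `scipy.stats.ks_2samp` returns the same statistic along with a p-value, so the hand-rolled version is mainly useful for seeing what D measures or for environments without SciPy.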
| 6gvONxR4sf7o wrote:
| These kinds of papers are my favorite ML papers (see also:
| 'Machine Learning: The High Interest Credit Card of Technical
| Debt'). The org design and project aspects of ML projects are
| some of the most pernicious issues I face, while the modeling and
| other _fun_ stuff often ends up not being that hard _once the
| right pieces are in place._
___________________________________________________________________
(page generated 2021-06-04 23:02 UTC)