[HN Gopher] Data Cascades in Machine Learning
       ___________________________________________________________________
        
       Data Cascades in Machine Learning
        
       Author : theafh
       Score  : 41 points
       Date   : 2021-06-04 16:50 UTC (6 hours ago)
        
 (HTM) web link (ai.googleblog.com)
 (TXT) w3m dump (ai.googleblog.com)
        
       | washedup wrote:
       | Good article. I would suggest following it up with
       | https://pair.withgoogle.com/chapter/data-collection/
       | 
       | Monitoring drift in the inputs, predictions, and performance are
       | all crucial for any models in production. Personally, for
       | input/target drift, I prefer using the D-stat from Kolmogorov-
       | Smirnov test to look for any distribution changes.
        
       | 6gvONxR4sf7o wrote:
       | These kinds of papers are my favorite ML papers (see also:
       | 'machine learning is the high interest credit card of technical
       | debt'). The org design and project aspects of ML projects are
       | some of the most pernicious issues I face, while the modeling and
       | other _fun_ stuff often ends up not being that hard _once the
       | right pieces are in place._
        
       ___________________________________________________________________
       (page generated 2021-06-04 23:02 UTC)