[HN Gopher] Dirty data science: Machine learning on non-curated ...
       ___________________________________________________________________
        
       Dirty data science: Machine learning on non-curated data
        
       Author : sebg
       Score  : 57 points
       Date   : 2021-10-27 13:31 UTC (9 hours ago)
        
 (HTM) web link (www.slideshare.net)
 (TXT) w3m dump (www.slideshare.net)
        
       | ninja3925 wrote:
       | The author (Gaël Varoquaux) is deeply involved in
       | NumPy/scikit-learn and is a talented researcher. Quite an
       | impressive guy, worth following.
        
         | dr_kiszonka wrote:
         | The slides were pretty good. I just wish they were shared in a
         | different format and/or via a different medium.
        
       | aimor wrote:
       | Is there a soundtrack/transcript to go with this?
        
         | nighthawk454 wrote:
         | I googled the talk name and found these:
         | 
         | https://www.youtube.com/watch?v=dw5u4nth6_M
         | 
         | https://www.youtube.com/watch?v=BsDeG3jQ61s
         | 
         | Neither seems to be _exactly_ the same deck, but there's lots
         | of overlap, so they should be pretty close.
        
         | albert_e wrote:
         | +1 very interested to hear this talk
        
       | LegitShady wrote:
       | I'm currently trying to build a dataset on cost overruns/project
       | management at work, which has required quite a bit of manual
       | work looking up project information in documents not exposed to
       | the database.
       | 
       | I've looked through the slide deck and hopefully this will help
       | me figure out some improvements to the way my data is structured.
        
       ___________________________________________________________________
       (page generated 2021-10-27 23:01 UTC)