[HN Gopher] Dirty data science: Machine learning on non-curated ...
___________________________________________________________________
Dirty data science: Machine learning on non-curated data
Author : sebg
Score : 57 points
Date : 2021-10-27 13:31 UTC (9 hours ago)
(HTM) web link (www.slideshare.net)
(TXT) w3m dump (www.slideshare.net)
| [deleted]
| ninja3925 wrote:
| The author (Gael Varoquaux) is deeply involved in numpy/Sklearn
| and a talented researcher. Quite an impressive guy worth
| following.
| dr_kiszonka wrote:
| The slides were pretty good. I just wish they were shared in a
| different format and/or via a different medium.
| aimor wrote:
| Is there a soundtrack/transcript to go with this?
| nighthawk454 wrote:
| I googled the talk name and found these:
|
| https://www.youtube.com/watch?v=dw5u4nth6_M
|
| https://www.youtube.com/watch?v=BsDeG3jQ61s
|
| Neither seems to be _exactly_ the same deck, but lot's of
| overlap - should be pretty close.
| albert_e wrote:
| +1 very interested to hear this talk
| LegitShady wrote:
| Trying to make a dataset on cost overruns/project management at
| work currently which has required quite a bit of manual work
| looking up project information in documents not exposed to the
| database.
|
| I've looked through the slide deck and hopefully this will help
| me figure out some improvements to the way my data is structured.
___________________________________________________________________
(page generated 2021-10-27 23:01 UTC)