[HN Gopher] Building an open data pipeline in 2024
___________________________________________________________________
Building an open data pipeline in 2024
Author : dangoldin
Score : 22 points
Date : 2024-04-26 20:47 UTC (2 hours ago)
(HTM) web link (blog.twingdata.com)
(TXT) w3m dump (blog.twingdata.com)
| RadiozRadioz wrote:
| > And if you're dealing with truly massive datasets you can take
| advantage of GPUs for your data jobs.
|
| I don't think scale is the key deciding factor for whether GPUs
| are applicable for a given dataset.
|
| I don't think this is a particularly insightful article. Read the
| first paragraph of the "Cost" section.
| gchamonlive wrote:
| Yes I also believe both the dataset and the transformation
| algorithms have to lend themselves well to parallelization for
| GPUs to be useful. GPUs don't do magic they are just really
| good at parallel computing.
___________________________________________________________________
(page generated 2024-04-26 23:00 UTC)