[HN Gopher] Apache Hop 2.0
___________________________________________________________________
Apache Hop 2.0
Author : CharlesW
Score : 52 points
Date : 2022-06-08 18:58 UTC (4 hours ago)
(HTM) web link (hop.apache.org)
(TXT) w3m dump (hop.apache.org)
| waynesonfire wrote:
| I'd be curious how this contrasts with apache nifi.
| mring33621 wrote:
| potato, potahto
| frellus wrote:
| Aside from this platform, which I've never heard about until now,
| I'm wondering what others are using in the workflow orchestration
| space?
|
| I'd assume Airflow is the most prevalent, but there's also Argo
| getting quite a bit of momentum lately.
| yamrzou wrote:
| In my previous job, I used Dagster. It has served us well.
|
| See my comment about it here:
| https://news.ycombinator.com/item?id=28803117
| cmcconomy wrote:
| "What is HOP?"
|
| https://hop.apache.org/manual/latest/getting-started/hop-wha...
| chrisweekly wrote:
| > "VISUAL DESIGN AND METADATA
|
| > Apache Hop, short for Hop Orchestration Platform, is a data
| orchestration and data engineering platform that aims to
| facillitate all aspects of data and metadata orchestration. Hop
| lets you focus on the problem you're trying to solve without
| technology getting in the way. Simple tasks should be easy,
| complex tasks need to be possible.
|
| > Hop allows data professionals to work visually, using
| metadata to describe how data should be processed. Visual
| design enables data developers to focus on what they want to do
| instead of how that task needs to be done. This focus on the
| task at hand lets Hop developers be more productive than they
| would be when writing code."
| Simon_O_Rourke wrote:
| Thanks for expanding that, it reads like it's some Airflow
| competitor. Would be curious how it handles all the
| authentication management for the various pipeline elements.
| jjtheblunt wrote:
| I'm misunderstanding how so many Apache hosted projects P let
| someone focus on X without Y getting in the way, totally
| ignoring the complexity of introducing P and altering
| everything to align with P, thereby forbidding focus on X.
|
| Are these really often useful?
| waynesonfire wrote:
| "Hop initially (late 2019) started as a fork of the Kettle
| (Pentaho Data Integration)."
| mason55 wrote:
| Wow. PDI is one of the worst pieces of software I've ever
| used. Possibly only second to Pentaho Report Designer.
|
| From looking at the Apache HOP docs, it doesn't look like
| they have changed the UI much (if at all). I wonder if they
| at least made it less buggy.
| arthurcolle wrote:
| Wow I was expecting this to be the 463rd distributed computing
| streaming framework since its Apache. Shocked that its not
| mi_lk wrote:
| tangent - what is it about Apache or big data that the associated
| softwares are mostly written in Java?
| oaiey wrote:
| Java, like .NET, are just solid application platforms which are
| statically typed and their performance is good.
|
| Java has a history in big systems for soon 30 years.
|
| Rust, Python and Go are just not there yet. Rust is too low
| level, Python is not statically typed and will always suffer
| performance wise and Go ... I is a youngster :). And .NET is
| always not everyone's free choice.
|
| And Apache, well they just liked Java for their applications.
| They started with some C/C++ code but then quickly aggregated a
| lot of Java tech.
| GordonS wrote:
| > And .NET is always not everyone's free choice.
|
| Hey, sometimes it really is!!
| ojhughes wrote:
| MapReduce and HDFS were written in Java and they paved the way
| for a lot of the other big data tools
| manish_gill wrote:
| Something I've been asking for a long time as well. Java/JVM
| are great, but it would be great to see _some_ diversity in the
| Big Data ecosystem when it comes to implementations. :)
| mrtweetyhack wrote:
| qbasic_forever wrote:
| From like 2000 to 2010 or even 2015 either java or .NET was the
| default choice for big enterprise companies. Nobody ever got
| fired for picking Microsoft or Java (I would add), as they say.
| A lot of these Apache projects have been donated from work at
| big enterprises so it comes out of that background from
| enterprise I imagine.
| zmmmmm wrote:
| Performance, portability, stability, scalability, concurrency,
| ecosystem (libraries, etc) .... despite all the new languages
| around, there actually still aren't many alternatives that give
| you the same combination of all these to the same level as Java
| does.
| zekrioca wrote:
| Good question, but maybe due to Java's stability and
| portability.
| rektide wrote:
| Ooh Drools plugins, for rule based event-processing. Neat. Hope I
| can find some examples!
|
| I havent used Airflow, but my impression is this fits a similar
| role. That itcs built atop good tech like Apache Beam & can use
| things like Flink is, in my book, a nice win.
___________________________________________________________________
(page generated 2022-06-08 23:00 UTC)