https://www.r-bloggers.com/2019/07/complexity-is-a-source-of-income-in-open-source-ecosystems/ R-bloggers R-bloggers R news and tutorials contributed by hundreds of R bloggers * Home * About * RSS * add your blog! * Learn R * R jobs + Submit a new job (it's free) + Browse latest jobs (also free) * Contact us Complexity is a source of income in open source ecosystems Posted on July 8, 2019 by Derek Jones in R bloggers | 0 Comments [This article was first published on The Shape of Code >> R, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here) --------------------------------------------------------------------- Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. ShareTweet I am someone who regularly uses R, and my interest in programming languages means that on a semi-regular basis spend time reading blog posts about the language. Over the last year, or so, I had noticed several patterns of behavior, and after reading a recent blog post things started to make sense (the blog post gets a lot of things wrong, but more of that later). What are the patterns that have caught my attention? Some background: Hadley Wickham is the guy behind some very useful R packages. Hadley was an academic, and is now the chief scientist at RStudio, the company behind the R language specific IDE of the same name. As Hadley's thinking about how to manipulate data has evolved, he has created new packages, and has been very prolific. The term Hadley-verse was coined to describe an approach to data manipulation and program structuring, based around use of packages written by the man. For the last nine-months I have noticed that the term Tidyverse is being used more regularly to describe what had been the Hadley-verse. And??? Another thing that has become very noticeable, over the last six-months, is the extent to which a wide range of packages now have dependencies on packages in the [DEL:Hadley:DEL]Tidyverse. And??? A recent post by Norman Matloff complains about the Tidyverse's complexity (and about the consistency between its packages; which I had always thought was a good design principle), and how RStudio's promotion of the Tidyverse could result in it becoming the dominant R world view. Matloff has an academic world view and misses what is going on. RStudio, the company, need to sell their services (their IDE is clunky and will be wiped out if a top of the range product, such as Jetbrains, adds support for R). If R were simple to use, companies would have less need to hire external experts. A widely used complicated library of packages is a god-send for a company looking to sell R services. I don't think Hadley Wickam intentionally made things complicated, any more than the creators of the Microsoft server protocols added interdependencies to make life difficult for competitors. A complex package ecosystem was probably not part of RStudio's product vision, at least for many years. But sooner or later, RStudio management will have realised that simplicity and ease of use is not in their interest. Once a collection of complicated packages exist, it is in RStudio's interest to get as many other packages using them, as quickly as possible. Infect the host quickly, before anybody notices; all the while telling people how much the company is investing in the community that it cares about (making lots of money from). Having this package ecosystem known as the Hadley-verse gives too much influence to one person, and makes it difficult to fire him later. Rebranding as the Tidyverse solves these problems. Matloff accuses RStudio of monopoly behavior, I would have said they are fighting for survival (i.e., creating an environment capable of generating the kind of income a VC funded company is expected to make). Having worked in language environments where multiple, and incompatible, package ecosystems existed, I can see advantages in there being a monopoly. Matloff is also upset about a commercial company swooping in to steal their precious, a common academic complaint (academics swooping in to steal ideas from commercially developed software is, of course, perfectly respectable). Matloff also makes claims about teachability of programming that are not derived from any experimental evidence, but then everybody makes claims about programming languages without there being any experimental evidence. RStudio management rode in on the data science wave, raising money from VCs. The wave is subsiding and they now need to appear to have a viable business (so they can be sold to a bigger fish), which means there has to be a visible market they can sell into. One way to sell in an open source environment is for things to be so complicated, that large companies will pay somebody to handle the complexity. Related ShareTweet To leave a comment for the author, please follow the link and comment on their blog: The Shape of Code >> R. --------------------------------------------------------------------- R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job. --------------------------------------------------------------------- Want to share your content on R-bloggers? click here if you have a blog, or here if you don't. - Previous post Next post - [ ] [Go] [Your e-mail here ] [Subscribe] [RBloggers_][RBloggers_] R bloggers Facebook page Most viewed posts (weekly) * How to build your own image recognition app with R! [Part 2] * Dynamic Regression (ARIMA) vs. XGBoost * 10 Tips and Tricks for Data Scientists Vol.3 * R compiler Application-Installation Guide * How to write the first for loop in R * 5 Ways to Subset a Data Frame in R * tidyverse in r - Complete Tutorial Sponsors Recent Posts * How to run R code in PyCharm? * 10 Tips and Tricks for Data Scientists Vol.4 * Ten Years vs The Spread II: Calculating publication lag times in R * How to clean the datasets in R? * Long time, no see: Virtual Lunch Roulette * rOpenSci's R-universe Project * R compiler Application-Installation Guide * The Easter Bunny is Cashing In * 10 Tips and Tricks for Data Scientists Vol.3 * Methow Valley Air Quality * The top 10 R errors, the 7th one will surprise you * tidyverse in r - Complete Tutorial * Visual Representation of Text Data Sets using the R tm and wordcloud packages: part one, Beginner's Guide * Not so soft softmax * Microeconomic Theory and Linear Regression (Part 1) RSS Jobs for R-users * Junior Data Scientist / Quantitative economist * Senior Quantitative Analyst * R programmer * Data Scientist - CGIAR Excellence in Agronomy (Ref No: DDG-R4D/DS /1/CG/EA/06/20) * Data Analytics Auditor, Future of Audit Lead @ London or Newcastle RSS python-bloggers.com (python/data-science news) * How to Predict the Position of Runners in a Race * Create a Keylogger using Python * Why most "coding for spreadsheet users" training fails * Not so soft softmax * Redact Name Entities with SpaCy * How to Redact PII Data using AWS Comprehend * Compatibility of nnetsauce and mlsauce with scikit-learn Full list of contributing R-bloggers Archives Archives [Select Month ] Other sites * Jobs for R-users * SAS blogs Copyright (c) 2021 | MH Corporate basic by MH Themes Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.) [ ] Submit Click here to close (This popup will not appear again)