[HN Gopher] Show HN: Turn Your Pandas Dataframe into a Tableau-S...
       ___________________________________________________________________
        
       Show HN: Turn Your Pandas Dataframe into a Tableau-Style UI for
       Visual Analysis
        
       Hey, guys. I've just made a plugin which turns your pandas
       dataframe into a tableau-style component. It allows you to explore
       the dataframe with easy drag-and-drop UI.  You can use PyGWalker in
       Jupyter, Google Colab, or even Kaggle Notebook to easily explore
       your data and generate interactive visualizations.  PyGWalker
       (pronounced like "Pig Walker", just for fun) is named as an
       abbreviation of "Python binding of Graphic Walker".  Here are some
       links to check it out:  The Github Repo:
       https://github.com/Kanaries/pygwalker  Use PyGWalker in Kaggle:
       https://www.kaggle.com/asmdef/pygwalker-test  Feedback and
       suggestions are appreciated! Please feel free to try it out and let
       me know what you think. Thanks for your support!
        
       Author : AwsmDef
       Score  : 332 points
       Date   : 2023-02-20 15:50 UTC (7 hours ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | short_sells_poo wrote:
       | This looks incredible. Well done! Can we give money to you for
       | this?
        
         | tyfon wrote:
         | Yes, I can easily convince my boss this is worth done some
         | money too...
        
       | 323 wrote:
       | I thing DfWalker would be a better name.
        
       | anigbrowl wrote:
       | Looks good! was impressed with your web app, having hte same
       | functionality in a Jupyter notebook is a huge plus.
        
       | TrackerFF wrote:
       | Is there any support or implantation for writeback functions? We
       | use tableau at work for lots of reporting, and it just pissed me
       | off how expensive many functionalities are if you need to
       | purchase from third party vendors.
        
         | razor_router wrote:
         | It sounds like you are frustrated with the cost that comes with
         | using third party vendors for additional functionalities in
         | Tableau. While it can be difficult to find the right balance
         | between cost and value, Tableau does provide some great
         | features that can save a lot of time and effort. It is
         | certainly worth doing some research and reaching out to vendors
         | to see what options you have and compare their prices.
        
         | dgudkov wrote:
         | What do you mean by writeback functions?
        
       | selfawareMammal wrote:
       | Amazing! Does it work with Pyspark?
        
       | cjbprime wrote:
       | Nice! Do you think it's useful for working with non-numeric
       | tabular data too? I'm using a dataframe in vscode as a kind of
       | mini SQL database. Vscode visualizations of dataframe are pretty
       | bad, would be great to have another option for rendering.
        
         | loa_observer wrote:
         | Yes, it's a general visualization tool. But it depends on what
         | you need for your data. Dimensions in non-numeric tabular data
         | can be used with a 'count' to make charts of the distribution
         | of values in the dimensions.
        
         | kennethologist wrote:
         | According to the Github VSCode support isn't available as yet.
         | Major bummer tbh.
        
       | scrollbar wrote:
       | Mega impressed, I can see myself using this regularly. Charting
       | and viz with matplotlib and pandas transforms is great, but a lot
       | of time there's a benefit of dumping to point-and-click mode with
       | tableau to quickly spin data around without much coding. This
       | will make that workflow much smoother
        
         | boredemployee wrote:
         | Yep. And also there are so many alternatives of dataviz that I
         | struggle to decide which one to use. You learn a few syntax,
         | then come a newer and more beautiful visualization lib and you
         | have to learn all basic syntax again.
        
       | selfawareMammal wrote:
       | Hmm Im trying to open a few csv's but none of them seem to work.
       | The ui opens but I have no filters or ways to display anything, I
       | just can see the name of the columns/fields from my data. Am I
       | doing something wrong?
        
       | RandomWorker wrote:
       | This is the best thing since sliced bread! As someone that uses
       | both Jupyter notebooks in lab(and loves to make vis) and uses
       | tableau to prototype this is epic. It works trying it out of the
       | box.
       | 
       | However, the most important feature I love about tableau and why
       | I'm not dropping it is: the data import and sharing dashboard
       | section. I'm sure this is something that could be interesting to
       | investigate.
       | 
       | The data section where you can link data columns (and filter
       | across all datasets after linking) and do pivot work in an
       | intuitive way(see melt in pandas).
       | 
       | The data dashboards are great to share with my clients the output
       | of my analysis work. I'd love it move to a bokeh style, but
       | customizing those dashboards is not for the faint of heart,
       | although you get an opensource very robust product at the end,
       | and you don't need to pay license fees going forward. Clients
       | that want to keep their analysis for the long term can go for
       | this option.
        
       | maneesh wrote:
       | Wow! this is super cool!
        
       | aarondia wrote:
       | This is very cool. I'm the creator of Mito [1] -- we're also
       | building a data visualization tool in JupyterLab. The Tabluea
       | approach that you took is really interesting! Going to send you a
       | message -- would love to learn more!
       | 
       | [1] https://www.trymito.io
        
       | BigElephant wrote:
       | Is it possible to generate the matplotlib (or whatever charting
       | library) is used to make the visuals?
        
       | buremba wrote:
       | This is pretty cool! It uses Vega transforms under the hood,
       | right? The default backend uses JS for transformation which is
       | slow for large datasets. Did you consider using Duckdb under the
       | hood via WASM? I run into this project but not sure how active it
       | is: https://github.com/vega/vega-plus
        
         | loa_observer wrote:
         | There is a WIP PR for combining DuckDB with Graphic Walker:
         | https://github.com/Kanaries/graphic-walker/pull/18
         | 
         | WASM of Graphic Walker is coming soon!
        
           | buremba wrote:
           | Cool! How does it work in practice? Looking at the PR, it
           | seems to pull all the data from Duckdb but is there any plan
           | to apply the transformations (filter, aggregate that the user
           | selected in UI) in Duckdb while rendering the chart?
        
             | loa_observer wrote:
             | This PR had just tested running DuckDB-WASM. For more
             | complex computation, I am planning to build a computation
             | engine to generate SQL and push it to DuckDB or other
             | databases. (I built a POC like this in another project,
             | RATH, but the SQL is push to clickhouse instead.)
             | 
             | If you are interested in this, welcome to discuss with us
             | in the PRs/issues on Github or in our Discord:
             | https://discord.gg/Z4ngFWXz2U
        
       | faizshah wrote:
       | I love this, it seems like the heavy lifting is done by the web
       | app here: https://github.com/Kanaries/graphic-walker
       | 
       | I'm amazed that this is open source, it's incredibly useful.
       | 
       | I wish there was a profiler implementation, the best profiler is
       | in GCP's DataPrep.
        
         | faizshah wrote:
         | Ah, there's a really nice profiler implemented in one of their
         | other projects here (AGPLv3):
         | https://github.com/Kanaries/Rath/tree/master/packages/rath-c...
         | 
         | There's a lot of really nice features in this other tool, the
         | author's thought of everything:
         | https://github.com/Kanaries/Rath
        
           | isoprophlex wrote:
           | Holy shit that's incredible! Thanks for sharing..!
        
       | thirdmunky wrote:
       | This is incredible! I've wanted something like this in a notebook
       | for a long time.
        
       | chrsig wrote:
       | oh my, thank you! This could definitely fill a very noticeable
       | gap in my tools.
       | 
       | I'm looking forward to playing with it this week
        
       | damey wrote:
       | This was quite slow and never worked for 100K rows 20 columns.
       | Any work on improving performance ?
        
       | frgtpsswrdlame wrote:
       | Wow this looks absolutely awesome. I'll need to play around with
       | it a bit but this would be huge for me.
        
       | tyfon wrote:
       | This goes in my toolbox, thanks!
        
       ___________________________________________________________________
       (page generated 2023-02-20 23:00 UTC)