[HN Gopher] Solving Sudoku in Python Packaging
       ___________________________________________________________________
        
       Solving Sudoku in Python Packaging
        
       Author : Yenrabbit
       Score  : 281 points
       Date   : 2024-10-21 02:30 UTC (2 days ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | niyonx wrote:
       | How did you even think of that? Nice!
        
       | yochem wrote:
       | No way pip actually is a really inefficient SAT solver!
        
         | stabbles wrote:
          | For a long time it was not, because there was no backtracking.
          | 
          | Now it is just an exhaustive, recursive search: for the current
          | package, try versions from newest to oldest and enqueue its
          | dependencies; if everything is satisfied, return; on a
          | conflict, continue with the next version.
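          | 
          | A minimal sketch of that style of search (illustrative only,
          | not pip's actual code; newest_to_oldest, conflicts and
          | dependencies_of are made-up helpers):
          | 
          |     def resolve(requirements, pins):
          |         # pins: package name -> version chosen so far
          |         if not requirements:
          |             return pins  # everything satisfied
          |         package, remaining = requirements[0], requirements[1:]
          |         for version in newest_to_oldest(package):
          |             if conflicts(package, version, pins):
          |                 continue  # conflict: try an older version
          |             deps = dependencies_of(package, version)
          |             new_pins = {**pins, package: version}
          |             solution = resolve(remaining + deps, new_pins)
          |             if solution is not None:
          |                 return solution
          |         return None  # no version works; caller backtracks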
        
           | taeric wrote:
           | If there was no backtracking, that implies it couldn't solve
           | every sudoku? That is rather amusing with the implication
           | that it couldn't solve every dependency, as well?
        
         | fernandotakai wrote:
         | uv actually talks about this in their resolver docs
         | https://docs.astral.sh/uv/reference/resolver-internals/
        
       | ziofill wrote:
       | but how does it know the constraints?
        
         | jsnell wrote:
          | The constraints are going to be static and independent of the
          | puzzle, so I expect they're encoded in the package
          | dependencies. For example, version 1 of the package
          | sudoku_0_0 will conflict with all of: version 1 of
          | sudoku_[0-8]_0; version 1 of sudoku_0_[0-8]; version 1 of
          | sudoku_[012]_[012].
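          | 
          | Something along these lines would emit those rules (a rough
          | sketch of the idea only, not the project's actual
          | generate_packages code):
          | 
          |     def conflicting_cells(row, col):
          |         # cells that may not share a value with (row, col):
          |         # same row, same column, same 3x3 box
          |         cells = set()
          |         for i in range(9):
          |             cells.add((row, i))
          |             cells.add((i, col))
          |         br, bc = 3 * (row // 3), 3 * (col // 3)
          |         for r in range(br, br + 3):
          |             for c in range(bc, bc + 3):
          |                 cells.add((r, c))
          |         cells.discard((row, col))
          |         return cells
          | 
          |     def requires_dist(row, col, value):
          |         # metadata for sudoku_<row>_<col> at version <value>
          |         return [f"Requires-Dist: sudoku_{r}_{c} != {value}"
          |                 for r, c in sorted(conflicting_cells(row, col))]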
        
           | roywiggins wrote:
           | generate_packages makes it moderately clear:
           | 
           | https://github.com/konstin/sudoku-in-python-
           | packaging/blob/m...
        
         | thangngoc89 wrote:
          | This is the content of sudoku_0_0-1-py3-none-any.whl. So when
          | the (0,0) cell is 1, none of the cells in the same row, column
          | and subgrid should be 1.
          | 
          |     Requires-Dist: sudoku_0_1 != 1
          |     Requires-Dist: sudoku_0_2 != 1
          |     Requires-Dist: sudoku_0_3 != 1
          |     Requires-Dist: sudoku_0_4 != 1
          |     Requires-Dist: sudoku_0_5 != 1
          |     Requires-Dist: sudoku_0_6 != 1
          |     Requires-Dist: sudoku_0_7 != 1
          |     Requires-Dist: sudoku_0_8 != 1
          |     Requires-Dist: sudoku_1_0 != 1
          |     Requires-Dist: sudoku_2_0 != 1
          |     Requires-Dist: sudoku_3_0 != 1
          |     Requires-Dist: sudoku_4_0 != 1
          |     Requires-Dist: sudoku_5_0 != 1
          |     Requires-Dist: sudoku_6_0 != 1
          |     Requires-Dist: sudoku_7_0 != 1
          |     Requires-Dist: sudoku_8_0 != 1
          |     Requires-Dist: sudoku_0_1 != 1
          |     Requires-Dist: sudoku_0_2 != 1
          |     Requires-Dist: sudoku_1_0 != 1
          |     Requires-Dist: sudoku_1_1 != 1
          |     Requires-Dist: sudoku_1_2 != 1
          |     Requires-Dist: sudoku_2_0 != 1
          |     Requires-Dist: sudoku_2_1 != 1
          |     Requires-Dist: sudoku_2_2 != 1
        
         | IshKebab wrote:
         | Yeah they missed out the actual interesting bit from the
         | readme...
        
       | simonw wrote:
       | I love this so much. I dug around a bit and figured out how it
       | works - I have an explanation (with an illustrative diagram)
       | here: https://simonwillison.net/2024/Oct/21/sudoku-in-python-
       | packa...
       | 
       | Figuring out how it works is a great way to learn a bit more
       | about how Python packaging works under the hood. I learned that
       | .whl files contain a METADATA file listing dependency constraints
       | as "Requires-Dist" rules.
       | 
       | I ran a speed comparison too. Using the uv pip resolver it took
       | 0.24s - with the older pip-compile tool it took 17s.
        
         | seanw444 wrote:
         | Wow, uv really is fast.
        
           | jebebeebehhe wrote:
            | As is simonw, writing that post in under 60 minutes, assuming
            | he first saw the concept here on HN.
        
             | Medox wrote:
              | Using the simonw resolver it took under 3600s *
        
             | simonw wrote:
             | Nah I wrote this one a couple of days ago when I first saw
             | the Sudoku solving project on Mastodon.
        
               | jeanlucas wrote:
                | Thank you, I was going to point out that the post on your
                | blog has a publication date lol
        
         | zahlman wrote:
         | People keep trying to sell the speed of such solutions as a
         | killer feature for uv, but I think I must not be anywhere near
         | the target audience. The constraint-solving required for the
         | sorts of projects I would typically work on is not even
         | remotely as complex, while I'm bottlenecked by a slow,
         | unreliable Internet connection (and the lack of a good way to
         | tell Pip _not_ to check PyPI for new versions and _only_
        | consider what's currently in the wheel cache).
        
           | itsbjoern wrote:
            | Personally I'm just a fan of people improving dev tooling,
            | regardless of whether it ultimately makes a huge difference
            | to my workflow. I haven't used uv yet, but I'm still
            | tangentially following it because, despite pip and poetry
            | being great tools, I have had my fair share of grievances
            | with them.
        
             | yunohn wrote:
             | Using uv (IME) should be a drop-in replacement for almost
             | all of your python packaging needs. I highly recommend
             | giving it a shot - it's saved me measurable time in just a
             | few months.
        
           | kvdveer wrote:
           | Our CI took 2 minutes to install the requirements. Adding UV
           | dropped that to seconds. Now most time is spent on running
           | tests, instead of installing requirements.
           | 
           | Of course we could've cached the venv, but cache invalidation
           | is hard, and this is a very cheap way to avoid it.
        
             | zahlman wrote:
              | Or you could cache the _solve_, surely? Simply explicitly
              | including your transitive dependencies and pinning
              | everything should do the trick - but there _is_ work being
              | done actively on a lockfile standard, it's just the sort
              | of topic that attracts ungodly amounts of bikeshedding.
             | 
             | (One of these days I'll have to figure out this "CI" thing.
             | But my main focus is really just solving interesting
             | programming problems, and making simple elegant tools for
             | mostly-small tasks.)
        
           | the_mitsuhiko wrote:
           | > while I'm bottlenecked by a slow, unreliable Internet
           | connection (and the lack of a good way to tell Pip not to
           | check PyPI for new versions and only consider what's
           | currently in the wheel cache).
           | 
            | Which is one of the reasons why uv is so fast. It reduces the
            | number of times it needs to go to PyPI! Not only does it
            | cache really well, it also hits PyPI more efficiently and in
            | parallel. Once you have resolved once, future resolutions
            | will likely bypass PyPI almost entirely.
        
             | zahlman wrote:
             | Oh, that's good to hear. The performance discourse around
             | uv seems to revolve around "written in Rust! Statically
             | compiled!" all the time, but an algorithmic change like
             | that is something that could conceivably make its way back
             | into Pip. (Or perhaps into future competing tools still
             | written in Python. I happen to have a design of my own.)
        
           | simonw wrote:
           | More significant than the speed improvement in my opinion is
           | the space saving.
           | 
           | The reason uv is fast is that it creates hard links from each
           | of your virtual environments to a single shared cached copy
           | of the dependencies (using copy-on-write in case you want to
           | edit them).
           | 
           | This means that if you have 100 projects on your machine that
           | all use PyTorch you still only have one copy of PyTorch!
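            | 
            | Very roughly (illustrative only, not uv's code - os.link
            | creates the hard link, with a plain copy as the fallback;
            | the copy-on-write part relies on filesystem cloning, which
            | this sketch doesn't show):
            | 
            |     import os
            |     import shutil
            | 
            |     def place(cached_file, venv_file):
            |         # hard-link the cached file into the venv so the
            |         # bytes exist only once on disk; copy if linking
            |         # fails (e.g. cache and venv on different volumes)
            |         try:
            |             os.link(cached_file, venv_file)
            |         except OSError:
            |             shutil.copy2(cached_file, venv_file)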
        
             | zahlman wrote:
             | This is definitely a feature I wanted to have in my own
             | attempt (hopefully I can start work on it before the end of
             | the year; I think I'll bump up my mental priority for
             | blogging about the design). I also have considered symlink
             | (not sure if this really works) and .pth file-based
             | approaches.
             | 
             | (TIL `os.link` has been supported on Windows since 3.2.)
        
         | TeMPOraL wrote:
         | Tangent, but I wondered what libuv had to do with speeding up
         | Python packaging, and it turns out nothing. I wonder why
          | someone chose to name a pip replacement in a way that
         | effectively collides with several tools and libraries across
         | many languages...
        
           | giancarlostoro wrote:
            | I agree. While I think it looks amazing, it's a poor naming
            | choice.
        
         | ilyagr wrote:
         | How does it encode the idea of having all the numbers on each
         | line/square?
        
       | visarga wrote:
        | That's why installing an ML repo feels like sudoku. You install
        | everything and at the last step you realize your neural net uses
        | FlashAttention2, which only works on an NVIDIA compute capability
        | that isn't available on your cloud VM, and you need to start over
        | from scratch.
        
         | hskalin wrote:
         | Sometimes I just change the version of the package in
         | requirements to fit with others and pray that it works out (a
         | few times it does)
        
         | nicman23 wrote:
          | honestly, if the ML repo does not ship a Docker image - not a
          | compose file, not "build an image yourself" - I do not even
          | bother any more
        
         | austinjp wrote:
         | This describes the day I wasted on Monday before I gave up and
         | wrote some damn deterministic code instead of using some damn
         | AI.
        
         | pjc50 wrote:
         | See the discussion on why sqlite insists on vendoring its build
         | dependencies as far as possible and not using, say, CMake.
        
         | anthk wrote:
        | Guix fixes that on the spot.
        
       | chatmasta wrote:
       | Here's the same thing in Poetry (2022):
       | https://www.splitgraph.com/blog/poetry-dependency-resolver-s...
        
         | teschmitt wrote:
         | Was just about to say: I've seen this before but building it
         | with a universally usable requirements.txt is even cooler.
        
       | echoangle wrote:
       | > Solving the versions of python package from your requirements
       | is NP-complete, in the worst case it runs exponentially slow.
       | Sudokus are also NP-complete, which means we can solve sudokus
       | with python packaging.
       | 
       | Is that actually sufficient? Can every system that's solving
       | something that's NP-complete solve every other NP-complete
       | problem?
        
         | rolisz wrote:
         | Yes, NP complete means that every other NP problem is reducible
         | to it.
        
         | zahlman wrote:
         | >Can every system that's solving something that's NP-complete
         | solve every other NP-complete problem?
         | 
          | Yes, by definition
          | (https://en.wikipedia.org/wiki/NP-completeness, point 4).
        
         | arjvik wrote:
         | I think for Sudoku to be NP-Complete, it needs to be
         | generalized to arbitrary board sizes (at the very least)
        
           | tetha wrote:
            | Correct: when the complexity guys talk about NP-complete
            | sudoku, it's always about solving Sudokus of arbitrary size
            | (each individual instance still being fixed and finite).
           | 
           | The problem class of "Solve an arbitrary Sudoku of Size 9"
           | might even be constant runtime, since it's a finite set to
           | search through.
        
             | SJC_Hacker wrote:
              | Does the Sudoku size have to be a perfect square, or can
             | other sizes exist?
             | 
             | I suppose you could leave some blank and make the squares
             | the next largest perfect square
        
               | tetha wrote:
                | Hm. The biggest problems there are the definitions of
                | "diagonal" and "box".
               | 
               | I mean non-square sudokus are still in NP. If a solution
               | can be validated in polynomial time, a problem is in NP.
               | 
                | And to validate an (x, y)-sized sudoku, you need to check
                | (x * y) [total number of squares] * (x [row] + y
                | [column] + max(x, y)-ish [diagonal-ish] + ((x/3) + (y/3))
                | [box]). The boxes might be weird, but boxes are smaller
                | than the overall sudoku, so we are looking at some
                | max(x, y)^4 or so. Same for the diagonals. The input is
                | x*y numbers, so max(x, y)^2. Very much polynomial in the
                | size of the input[1].
               | 
                | And it should also be easy to show that if an (n, n)
                | sized sudoku has a solution, an (n+k, n+k) sized sudoku
                | has one too. You kinda shove the new numbers in with
                | knight-like moves and that's it.
               | 
               | 1: this can be a bit weird, because you need to be
               | careful "what you're polynomial in". If your input is a
               | number or two, you might be polynomial in the magnitude
               | of the number, which however is exponential with the
               | input length.
               | 
               | In this case however, we wouldn't have encoding
               | shenanigans, since we're just placing abstract symbols
                | from the Turing machine's alphabet onto an imagined grid.
        
         | empath75 wrote:
         | https://www.youtube.com/watch?v=6OPsH8PK7xM
         | 
         | This video I think makes it obvious why that's true in a pretty
         | intuitive way. I posted it a few days ago as a link and it
         | never got traction.
         | 
         | SAT is the equivalent of being able to find the inverse of
         | _any_ function, because you can describe any function with
         | logic gates (for obvious reasons), and any collection of logic
         | gates that describes a function is equivalent to a SAT problem.
         | All you need to do is codify the function in logic gates,
          | including the output you want, and then ask a SAT solver to find
         | the inputs that produce that output.
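          | 
          | A toy version of that framing (the "function" here is a single
          | AND gate, and brute force stands in for a real SAT solver, but
          | the question being answered is the same):
          | 
          |     from itertools import product
          | 
          |     # CNF clauses for out = a AND b, plus a unit clause that
          |     # forces out to be true. Variables: 1 = a, 2 = b, 3 = out;
          |     # a negative literal means "that variable is false".
          |     clauses = [(-1, -2, 3), (1, -3), (2, -3), (3,)]
          | 
          |     def satisfies(asg, clauses):
          |         # a clause holds if at least one literal is true
          |         return all(
          |             any(asg[abs(lit)] == (lit > 0) for lit in clause)
          |             for clause in clauses
          |         )
          | 
          |     # stand-in for a SAT solver: try every assignment
          |     for bits in product([False, True], repeat=3):
          |         asg = {1: bits[0], 2: bits[1], 3: bits[2]}
          |         if satisfies(asg, clauses):
          |             print("a =", asg[1], "b =", asg[2])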
        
         | tzs wrote:
         | > Can every system that's solving something that's NP-complete
         | solve every other NP-complete problem?
         | 
         | Others have given the answer (yes) and provided some links. But
         | it is nice to have an explanation in thread so I'll have a go
         | at it.
         | 
         | The key idea is the idea of transforming one problem to
         | another. Suppose you have some problem X that you do not know
         | how to solve, and you've got some other problem Y that you do
         | know how to solve.
         | 
         | If you can find some transform that you can apply to instances
         | of X that turns them into instances of Y and that can transform
         | solutions of those instances of Y back to solutions of X, then
         | you've got an X solver. It will be slower than your Y solver
         | because of the work to transform the problem and the solution.
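          | 
          | Schematically (transform, solve_Y and transform_back are just
          | placeholders for whatever the two concrete problems provide):
          | 
          |     def solve_X(instance_x):
          |         instance_y = transform(instance_x)   # reduce X to Y
          |         solution_y = solve_Y(instance_y)     # solver we have
          |         return transform_back(solution_y)    # map answer back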
         | 
          | Now let's limit ourselves to problems in NP. This includes
          | problems in P, which is a subset of NP. (Whether or not it is a
          | proper subset is the famous P=NP open problem.)
         | 
         | If X and Y are in NP and you can find a polynomial time
         | transformation that turns X into Y then in a sense we can say
         | that X cannot be harder than Y, because if you know how to
         | solve Y then with that transformation you also know how to
         | solve X albeit slower because of the polynomial time
         | transformations.
         | 
         | In 1971 Stephen Cook proved that a particular NP problem,
         | boolean satisfiability, could serve as problem Y for _every_
         | other problem X in NP. In a sense then no other NP problem can
         | be harder than boolean satisfiability.
         | 
         | Later other problems were also found that were universal Y
         | problems, and the set of them was called NP-complete.
         | 
         | So if Python packaging is NP-complete then every other NP
         | problem can be turned into an equivalent Python packaging
         | problem. Note that the other problem does not have to also be
         | NP-complete. It just has to be in NP.
         | 
         | Sudoku and Python Packaging both being NP-complete means it
         | goes both ways. You can use a Python package solver to solve
         | your sudoku problems _and_ you can use a sudoku solver to solve
         | your Python packaging problems.
        
       | mi_lk wrote:
        | See also this 2008 post using the Debian package system to solve
       | Sudoku:
       | 
       | https://web.archive.org/web/20160326062818/http://algebraict...
        
       | revskill wrote:
       | This is a hack.
        
         | jessekv wrote:
          | And that's why I come here for... er, news.
        
       | worewood wrote:
        | This is the type of cool hacking I like to see. Kudos! (Or better,
       | Sukodus :) )
        
       | alentred wrote:
        | This is BRILLIANT! I knew of a trend to implement lots of
        | different things at compile time (in the Scala and Haskell
        | communities at least) - definitely fun and quirky, but it never
        | seemed that "special". This one has an air of old-school
        | computer magic around it, probably because it is so elegant and
        | simple.
        
       | anthk wrote:
       | Now, in MicroLisp, Common Lisp and maybe Emacs' Elisp too:
       | 
       | http://www.ulisp.com/show?33J9
        
       ___________________________________________________________________
       (page generated 2024-10-23 23:01 UTC)