Paragon Haskell Large Codebases?

Liamzy · September 29, 2023, 2:25pm

Just curious, but what are the best publicly-available large code bases in Haskell?

I.e, something that’s useful for people to learn from as examples of well-written, well-commented, and well-designed codebases of substantial size?

wiz · September 29, 2023, 3:05pm

I’m curious how would you use them.

f-a · September 29, 2023, 3:12pm

Large codebases tend to be messy, at least a bit! Large also means “developed through many years”, so you can see idioms shifting, different styles, different testing guidelines etc.

I now and then contribute to cabal, it is not super-big but enough to learn things by exploring it.

Ambrose · September 29, 2023, 3:19pm

heh I would like to test graphex on a giant public codebase instead of just my work one

But for graphex - the messier, the better!

Liamzy · September 29, 2023, 3:36pm

I’m having difficulty going through a large Haskell codebase right now–I’m aiming to make sure everything is commented–and I’m a bit stalled because it has to support legacy features, and has certain restrictions on available libraries, and I’d think it’s baroque for that reason.

I want to find an easier, albeit still large, codebase, to get some practice parsing large Haskell codebases available on Github.

silky · September 29, 2023, 3:43pm

Gosh, there are so many!

I’ll just list some of my favourites and regular go-to’s:

flora-server: I’m not sure there is a better example of a servant-based website out there. I think the code is quite well-structured, and there a lots of interesting things to discover as you dive deeper into the implementation in various places.
stack: whether or not you use it as a package manager, it was my first love as an extremely friendly ecosystem with very well designed code, for my money.
hasura graphql-server (for a limited time only!): Hasura are moving away from Haskell, but still I think this codebase is interesting as a reference because it’s probably the largest Haskell codebase if you count by end-users? From that point of view I think, even though it’s being phased out, it’s very interesting to learn from; and I’m sure I’ve used it in the past as a reference for different things.
hasktorch: I think this is interesting for just how sophisticated it is; I think they do a lot of complicated/interesting things with types, and (if I’m remembering correctly) have a very interesting way of integrating with the torch ecosystem in a way that some of the other Haskell ML projects did not.
pandoc: A classic, and probably doen’t require much introduction. I find pandoc interesting for how in theory complicated it could be, for how easily you can work with it to do real transformations. I think it has a very nice data model.

This of course excludes pure libraries; but really, I think one of Haskell’s big strengths in how much can be learned by reading the source code of the libraries, and, in some sense, how small a lot of them manage to be; i.e. how well they compose together to allow us to build useful things with them That is to say, while I’ve learned a lot from glancing at these projects above, I’ve learned at least as much from just looking at the codebases of the larger libraries and dev-tools themselves!

hasufell · September 29, 2023, 3:51pm

I second that. And I’d take its codebase over cabal’s any day (purely talking about code hygiene), although there are some rough spots too.

You can also skim through IOGs large blockchain codebases, which are all public. E.g. the node or the wallet. Kadena is public too. Whatever you may think of blockchain, those are usually well designed codebases.

Agnishom · September 29, 2023, 4:00pm

Does the Diagrams library count? I think it is a very neat library with which you can do a lot of stuff

brandonchinn178 · September 30, 2023, 12:19am

I recently contributed to hackage-server, and found it to be a really nice experience! It’s very well organized + well architected

andreabedini · October 2, 2023, 12:08am

Thank you for the link! I think I am going to try this today right on cabal’s codebase!

Liamzy · October 4, 2023, 11:21am

Speaking of Hasura, are there any Haskellers proficient at Rust seeing obvious issues with moving graphql to a Rust idiom?

@Ambrose brought up in the Hasura thread that there is no guarantee of Graphql-Engine being successfully ported to Rust.

Via sloccount, the codebase is 200k lines Haskell and the estimated cost at 60k developer salaries is 8 million. Considering that a lot of Haskell development work is done with senior engineers, and even cutting costs based on Indian Haskellers, you’re still looking at about 16 million USD worth of developer hours.

Consider that Rust is between 150% to 300% the code length of Haskell, and that’s easily 48 million USD worth for the rewrite, and possibly 187.5 developer years.

Of course, this doesn’t say that rewriting GraphQL is impossible, and perhaps people more familiar with both Rust and Haskell might be able to clarify Hasura’s apparent project.

If this is in fact an expensively intractable problem, I’m guessing Hasura execs might end up kicking themselves for not trying, instead, a port to Linear Haskell, or paying Well-Typed or Tweag to help mature Haskell-Rust FFI and clone Sigma / SC’s Mu, except using Rust instead of C++.

If using the most current languages is a key Hasura value, wrapping Rust with Haskell would likely have given them the most avant-garde backend possible.

Topic		Replies	Views
Hasura migrating to Rust? Links	106	11695	July 27, 2023
DevOps Weekly Log, 2024-01-17 Haskell Foundation	11	1403	January 22, 2024
Blog post: Why I Support the Haskell Foundation Links	4	798	June 22, 2021
Isomorphic web apps in Haskell Links	7	1358	April 9, 2019
Maintaining haskell programs	104	7927	December 6, 2023

Paragon Haskell Large Codebases?

Related topics