How to do user facing records in 2024

tomjaguarpaw · March 28, 2024, 8:30am

Why not? The user can still make optics based on the public API.

This is still what I use because all the other approaches seem to have too many caveats, but it’s possible I just haven’t found the sweet spot yet.

michaelpj · March 28, 2024, 10:54am

I have been pondering this situation for some time for lsp-types, which exports a huge number of auto-generated record types, so needs a consistent policy (since a machine is going to apply it). The current approach is _-prefixed field names; DuplicateRecordFields (rather unavoidable, since the source has many duplicate field names, and I want to provide combined modules that export lots of things); and makeClassy from lens.

The alternative I am most attracted to is the one described by @velveteer : remove the _ prefixes, don’t provide any lens definitions, and let downstream use generic-lens. This would let us drop the lens dependency, which would be nice, and would also let us drop the _ prefixes, which have always felt artificial to me. In practice, people often still use the field selectors, which is unnecessarily ugly because of the underscores. There is an issue with type families also, see “Consider switching to microlens or overloaded dot syntax?” (Issue #465, haskell/lsp on GitHub) for discussion. Maybe I should just pull the trigger, though.

Now the user can’t make optics for my types if I don’t do it, and I can’t split off the optics definitions to a separate package because they need access to the record constructors.

I agree with @tomjaguarpaw : if you export a getter and setter, then the user can make a lens from them. If you don’t… then presumably they shouldn’t be able to write a lens, or they’ll break your attempt to hide the fields!
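To illustrate the point about building a lens from the public API, here is a minimal sketch using only base. The type Config and the functions defaultConfig, getPort and setPort are hypothetical stand-ins for a library that hides its constructor. The Lens' synonym is the usual van Laarhoven encoding, so the same definition should also be usable with lens or microlens.

```haskell
{-# LANGUAGE RankNTypes #-}
module Main (main) where

import Data.Functor.Const (Const (..))
import Data.Functor.Identity (Identity (..))

-- Hypothetical library type. Imagine the constructor is NOT exported and
-- only defaultConfig, getPort and setPort are public.
newtype Config = Config Int

defaultConfig :: Config
defaultConfig = Config 8080

getPort :: Config -> Int
getPort (Config p) = p

setPort :: Int -> Config -> Config
setPort p _ = Config p

-- Downstream code from here on: it never touches the constructor.
type Lens' s a = forall f. Functor f => (a -> f a) -> s -> f s

-- A lens assembled purely from the exported getter and setter.
port :: Lens' Config Int
port f c = (\p -> setPort p c) <$> f (getPort c)

view :: Lens' s a -> s -> a
view l = getConst . l Const

over :: Lens' s a -> (a -> a) -> s -> s
over l g = runIdentity . l (Identity . g)

main :: IO ()
main = print (view port (over port (+ 1) defaultConfig))
```

If the library exports no setter, the same construction is not available, which matches the intent of hiding the field.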

mixphix · March 28, 2024, 1:04pm

I tend to prefer -XOverloadedRecordDot with -XDuplicateRecordFields. I would also like to use -XNoFieldSelectors but it’s a lot of tedious work to convert a codebase that started pre-9.0 to one that uses dot syntax instead of selectors.

Here’s the thing about optics: people usually define them with TemplateHaskell. You could write a different templating function than makeLenses to use the RecordDot syntax, i.e. to produce code that looked like this:

fooY = lens (.y) (\foo y -> foo{y})

Then you avoid the need for selectors even here. Even the choice of templating function is opinionated!
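As a sketch of what such generated code could look like when written by hand, the following compiles with only base on GHC 9.2 or later. The record Foo and the local lens and over helpers are assumptions made for this example; a real project would take them from its optics library of choice.

```haskell
{-# LANGUAGE NoFieldSelectors #-}
{-# LANGUAGE OverloadedRecordDot #-}
{-# LANGUAGE RankNTypes #-}
module Main (main) where

import Data.Functor.Identity (Identity (..))

-- With NoFieldSelectors, no top-level functions x and y are generated.
data Foo = Foo { x :: Int, y :: Int }
  deriving (Show)

type Lens' s a = forall f. Functor f => (a -> f a) -> s -> f s

-- Local stand-in for the lens function from an optics library.
lens :: (s -> a) -> (s -> a -> s) -> Lens' s a
lens get set f s = set s <$> f (get s)

-- The getter is a record-dot section; the setter is a record update.
-- Neither needs a selector function.
fooY :: Lens' Foo Int
fooY = lens (.y) (\foo newY -> foo { y = newY })

over :: Lens' s a -> (a -> a) -> s -> s
over l g = runIdentity . l (Identity . g)

main :: IO ()
main = print (over fooY (* 2) (Foo { x = 1, y = 21 }))
```

The setter here spells out the update as foo { y = newY }; the punned form foo{y} additionally needs NamedFieldPuns.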

Another common pattern in Haskell libraries is to provide the core datatypes and functionality from one package, and then various wrapper libraries that export the lens/microlens/optics flavour of choice. It’s more maintainer effort, but helps keep downstream dependency trees tailored!

One thing I find unfortunate about record dot syntax is that it’s not available for nullary constructors. You can’t say

instance HasField "message" (Maybe String) String where
  getField = \case
    Just msg -> msg
    Nothing -> "no message"

bad = Nothing.message

because Nothing gets parsed as a module name!
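One possible workaround, sketched here as an assumption rather than a recommendation, is to stop the lexer from seeing a qualified name: parenthesise the constructor or bind it to a variable first. The instance is the one from the post, with the pragmas it needs.

```haskell
{-# LANGUAGE DataKinds #-}
{-# LANGUAGE FlexibleInstances #-}
{-# LANGUAGE MultiParamTypeClasses #-}
{-# LANGUAGE OverloadedRecordDot #-}
module Main (main) where

import GHC.Records (HasField (..))

-- A virtual field on Maybe String, as in the post above.
instance HasField "message" (Maybe String) String where
  getField (Just msg) = msg
  getField Nothing = "no message"

-- Parentheses keep Nothing from being read as a module qualifier.
-- The annotation is needed so the instance can be selected.
fromNothing :: String
fromNothing = (Nothing :: Maybe String).message

-- Going through a variable also works.
fromBinding :: Maybe String -> String
fromBinding m = m.message

main :: IO ()
main = do
  putStrLn fromNothing
  putStrLn (fromBinding (Just "hello"))
```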

jaror · March 28, 2024, 1:21pm

I don’t think makeLenses uses selectors. The documentation says it just uses the constructor positionally:

e.g.

data FooBar
  = Foo { _x, _y :: Int }
  | Bar { _x :: Int }
makeLenses ''FooBar

will create

x :: Lens' FooBar Int
x f (Foo a b) = (\a' -> Foo a' b) <$> f a
x f (Bar a)   = Bar <$> f a
y :: Traversal' FooBar Int
y f (Foo a b) = (\b' -> Foo a  b') <$> f b
y _ c@(Bar _) = pure c
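To make the "positional" point concrete, here is a small base-only sketch in which the type has no field names at all, yet the same shape of definition for y still works. The over helper is defined locally for the example and is not the one from lens.

```haskell
{-# LANGUAGE RankNTypes #-}
module Main (main) where

import Data.Functor.Identity (Identity (..))

-- Same shape as FooBar above, but without any record fields.
data FooBar = Foo Int Int | Bar Int
  deriving (Eq, Show)

type Traversal' s a = forall f. Applicative f => (a -> f a) -> s -> f s

-- Pattern matching on constructor positions; no selectors involved.
y :: Traversal' FooBar Int
y f (Foo a b) = (\b' -> Foo a b') <$> f b
y _ c@(Bar _) = pure c

-- Apply a function to every target of the traversal.
over :: Traversal' s a -> (a -> a) -> s -> s
over t g = runIdentity . t (Identity . g)

main :: IO ()
main = do
  print (over y (+ 1) (Foo 1 2)) -- prints: Foo 1 3
  print (over y (+ 1) (Bar 5))   -- prints: Bar 5 (Bar has no second field)
```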

chreekat · March 28, 2024, 1:29pm

How does record-dot-preprocessor (ndmitchell/record-dot-preprocessor on GitHub, “A preprocessor for a Haskell record syntax using dot”) fit into the landscape here?

(I am also interested in knowing how to start a new project in 2024 – assuming I can rely on GHC 9.4 or even 9.6)

michaelpj · March 28, 2024, 2:14pm

Note that today you can do this IMO more pleasantly using additional public sub-libraries. I think this is a very natural way to do these “shim-for-using-my-package-with-package-X” little packages that otherwise proliferate.

eldritch-cookie · March 28, 2024, 2:57pm

Is there even a non-opinionated option? Every option will make some part of the interface awkward: you can’t support OverloadedRecordDot, record selectors and every optics library simultaneously. That said, there is an option with maximum freedom: use NoFieldSelectors and provide lenses for every library you want to support in a separate sublibrary. Anyone who wants a record selector can easily get one by using OverloadedRecordDot.

DrewFenwick · March 28, 2024, 5:48pm

Hmm, it seems like the take-away here is that there is no agreed upon best practice yet.

DuplicateRecordFields seems to be liked though!

Ah, as in actually using the multi-library package feature for once?

It hadn’t even occurred to me that you could use such a feature to let users opt in to more dependencies for more features without a second package.

Well, you can… just not necessarily first-class support. Either your fields or your lenses need to have a prefix, so you have to pick a favorite there. OverloadedRecordDot is I think pretty unintrusive and doesn’t get in the way of the other options it would seem, but it’s a bit of work to show examples of how to use all three in your documentation, so it will be very tempting to pick a favorite to recommend.

Good point… in my zeal to bring up as many problems as I can think of with interacting record extensions I may have imagined one.

DrewFenwick · March 28, 2024, 5:56pm

Ah, there’s another thing. When you use DuplicateRecordFields, do you put all your records into separate modules so selector functions can always be disambiguated by qualified imports, or do you just throw records with duplicate fields in the same module, let the compiler generate selectors and offer them “as is”, and leave it up to the user to find an alternative solution when field selectors are ambiguous?

I suppose if duplicate field selectors are defined in the same module then maybe users can avoid ambiguity with:

import M (Foo, Bar)
import M qualified as Foo (Foo(..))
import M qualified as Bar (Bar(..))

I’ve only just thought of doing that…

velveteer · March 28, 2024, 6:27pm

Yeah I usually have duplicate record fields enabled for the same reason that @michaelpj mentioned, where I have a top-level module as a namespace that imports multiple modules under it, and it’s likely there are duplicate selectors within that namespace.

How users get around ambiguity is up to them in this case. I still expose the child modules if they want to avoid importing the entire namespace. Your example is also an option, and I think it’s generally a good practice.

michaelpj · March 28, 2024, 8:56pm

I think the “just record selectors and Generic instances” is the non-opinionated option. You export something that is pretty much “normal-Haskell”:

There are record selectors
They have normal names (you don’t prefix them)
They can be used as normal (including with OverloadedRecordDot if your users want)
You don’t depend on any optics library since you’re not providing any optics
Users who want optics can get a pretty good experience still using generic-lens or generic-optics
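A minimal sketch of what that "normal Haskell" export might look like from the user's side, assuming two hypothetical records Person and Company that share a field name. It uses only base; a downstream user who wants optics could add generic-lens or generic-optics on top of the Generic instances.

```haskell
{-# LANGUAGE DeriveGeneric #-}
{-# LANGUAGE DuplicateRecordFields #-}
{-# LANGUAGE OverloadedRecordDot #-}
module Main (main) where

import GHC.Generics (Generic)

-- Unprefixed field names; both records have a field called name.
data Person = Person { name :: String, age :: Int }
  deriving (Show, Generic)

data Company = Company { name :: String, staff :: [Person] }
  deriving (Show, Generic)

-- Dot syntax picks the right name field from the record's type.
describe :: Company -> String
describe c = c.name ++ " employs " ++ unwords (map (.name) c.staff)

-- Ordinary selectors still work where the field name is unique.
totalAge :: Company -> Int
totalAge = sum . map age . staff

acme :: Company
acme =
  Company
    { name = "Acme"
    , staff = [Person { name = "Ada", age = 36 }, Person { name = "Grace", age = 45 }]
    }

main :: IO ()
main = do
  putStrLn (describe acme)
  print (totalAge acme)
```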

Yes, this is IMO one of the key usecases for multiple libraries. e.g. lsp-types has a sublibrary for the quickcheck instances, so we can publish them together but people who don’t want them don’t need to incur the quickcheck dependency. It’s great.

DrewFenwick · March 29, 2024, 9:31am

There’s a snag when trying to use sublibraries to make optics dependencies optional.

Ideally I’d want to make use of optics-th's templates for creating lenses for my records, and I’d also like to be able to use optics’ LabelOptic tech to refer to these lenses with overloaded labels.

Alas, this requires defining instances of the LabelOptic class, which the templates do for you, but to put these instances in a sub-library they have to be orphan instances.

Is it just not that big of a deal in practice that I should define them anyway?

noinia · March 29, 2024, 9:42am

I wouldn’t worry about orphan instances in that case.

Maybe a more worrisome snag is this cabal bug: Per-component dependency solving · Issue #4087 · haskell/cabal · GitHub. Because of it, I don’t think the dependency on optics in such a sublibrary is actually optional.

As an additional data point: I still use the regular _foo names + lens (sometimes makeLenses or makeClassy, or hand-written lens functions). I find the optics approach of using % instead of . too noisy.

tomjaguarpaw · March 29, 2024, 9:59am

I don’t understand why sublibraries per se are beneficial here, rather than just having separate libraries.

michaelpj · March 29, 2024, 6:26pm

This is a bigger topic, but I think basically they’re just easier.

One cabal file, so you don’t have to repeat metadata and you can use common stanzas across them
One version, so you don’t have to think about how to version them independently
One package, so you don’t have to release and upload them separately

Flipping it around: why would you use a separate package when you have a sublibrary? The main thing that a separate library gets you is a separate version… but often you don’t need or want that.

(There are of course still tooling issues, which are legitimate reasons to avoid them, but conceptually I think they’re pretty great.)

jhenahan · March 29, 2024, 7:32pm

As a follow-on idea to this, you could expose your getter-setter pair in a private optics-compat sublibrary or similar and then have mything-lens and mything-optics just construct the native optics with the constructors (e.g., lens, in both libraries). In this way, you can hide the particularities from a user while also providing the tool support. If I remember how OverloadedRecordDot works, you could even use these to implement HasField instances and provide opt-in support for that syntax as a library at the cost of some semi-orphans.
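The HasField part of this idea might look roughly like the sketch below. Config, mkConfig, getPort and getHost are hypothetical names; in a real layout the instances would live in the opt-in library, away from the type, which is what would make them orphans.

```haskell
{-# LANGUAGE DataKinds #-}
{-# LANGUAGE FlexibleInstances #-}
{-# LANGUAGE MultiParamTypeClasses #-}
{-# LANGUAGE OverloadedRecordDot #-}
module Main (main) where

import GHC.Records (HasField (..))

-- Hypothetical core type: positional constructor, assumed to be hidden
-- from users, with only the getters and mkConfig exported.
data Config = Config Int String

mkConfig :: Int -> String -> Config
mkConfig = Config

getPort :: Config -> Int
getPort (Config p _) = p

getHost :: Config -> String
getHost (Config _ h) = h

-- Opt-in dot syntax, implemented only in terms of the public getters.
instance HasField "port" Config Int where
  getField = getPort

instance HasField "host" Config String where
  getField = getHost

address :: Config -> String
address c = c.host ++ ":" ++ show c.port

main :: IO ()
main = putStrLn (address (mkConfig 443 "example.org"))
```

GHC rejects a hand-written HasField instance when the type already has a real record field of that name, so this only applies to fields the core package does not expose as record fields.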

jackdk · March 31, 2024, 9:14pm

Amazonka has similar concerns (generating lots of records from service definitions), exports records with no leading underscore and no other prefix, and I think it works well there. This needs -XDuplicateRecordFields in the modules where the record is defined (for GHC <= 9.6) and in any module which collects and re-exports them (for GHC >= 9.8, if you do that).

Library clients are expected to use whatever record technology they prefer. The Generic instance allows generic-lens/generic-optics, but because all the constructors are exported, normal selectors/updates and dot syntax are also usable.

chreekat · April 2, 2024, 8:36am

Based on a number of comments in this thread, I feel like the answer to “how to do user facing records in 2024” is “Do what amazonka-2.0 does”

jackdk · April 2, 2024, 10:35am

I wish I could take credit, but that predates my maintainership. It does seem to have worked well.

blamario · April 10, 2024, 4:39am

My answer is no. If you expect the qualified imports, you still don’t need to scatter your records across separate modules. The disambiguation requires different module prefixes, not different source modules. So instead of

import qualified Library.Record1 as Record1
import qualified Library.Record2 as Record2

the user can say

import qualified Library as Record1 (Record1(..))
import qualified Library as Record2 (Record2(..))

with the same effect.