Using Immutable Vector extractors on Mutable Vectors

kalhauge · August 13, 2022, 9:40am

Hi everyone,

I have been trying to use the bitvec: Space-efficient bit vectors library, but most of the interesting operations are using Immutable Vectors. I however is planning to use mutable bit-vectors in my code.

Is it possible to use a mutable vector temporarily as a vector in a function call, without copying it?

Essentially, I want something like this:

extract :: 
  PrimMonad m => 
     MVector (PrimState s) a 
  -> (Vector a -> b) 
  -> m b

jaror · August 13, 2022, 10:17am

You cannot do it in a safe way: what if you modify the original MVector while the Vector a -> b function is running? However, you can use unsafeFreeze.

kalhauge · August 13, 2022, 1:24pm

Ah, the three problems with extract is that,

you might access the MVector in parallel,
b might contain (Vector a) and therefore keep the reference alive outside the scope, and
since (Vector a -> b) is a pure function and might first be evaluated when b is forced.

Keeping that i mind, I’m unsure how unsafeFreeze solves the problem since it states that: “The mutable vector may not be used after this operation.”

Can I implement the unsafeExtract function like this:

unsafeExtract :: 
  PrimMonad m => 
     MVector (PrimState s) a 
  -> (Vector a -> b) 
  -> m b
unsafeExtact mv fn = do 
  v <- unsafeFreeze mv
  let !b = fn v
  pure b

And use it in a context like a bit-vector set:

import Data.Bit as BV
import Data.Vector.Unboxed.Mutable as UM

add :: 
  PrimMonad m => 
     UM.MVector (PrimState m) BV.Bit 
  -> Int 
  -> m ()
add mv ix = do
  UM.insert mv ix 1

countUnique :: 
  PrimMonad m => 
     UM.MVector (PrimState m) BV.Bit 
  -> m Int
countUnique mv = extractUnsafe mv countBits

jaror · August 13, 2022, 2:21pm

Yeah, you can do that and it might be somewhat safer than just using unsafeFreeze directly. But either way there is some burden of proof on the user of these functions. The compiler does not check that you are always using them correctly.

kalhauge · August 13, 2022, 3:32pm

Thank you!

This is kind of interesting, so we are missing a way in the type-system to represent “Burn after reading” items, with two abilities 1) cannot be saved in hunks, and 2) cannot be saved in constructors or returned without being copied. This smells a little like the borrow checker in Rust.

jaror · August 13, 2022, 4:34pm

You’re completely right and there is already some work in that direction for Haskell. We call it Linear Types. I think this video is a pretty good introduction.

But for practical info you’ll have to look at the documentation and linear-base.

kalhauge · August 13, 2022, 6:09pm

Linear types are not quite right, since it’s all about arguments being consumed exactly once, in this case we want our argument to be consumed many times, but never returned. It’s about ‘borrowing’ the argument to the function, expecting it to not leak out on execution.

“Borrowing” the syntax of linear types, we might want something like a %0 -> b, ei, something that is returned exactly zero times. Destructive functions have these types:

index :: Int -> Vector a %0-> a
maybe :: (a -> b) -> b -> Maybe a %0-> a

Maybe something like this could also be beneficial to reduce the use of the garbage collector. If you call a destructive function with an argument, you can reuse the piece of memory right after the call, because it is not refereed to in the results.

Bodigrim · August 13, 2022, 8:13pm

Which operations are you interested in?

kalhauge · August 13, 2022, 9:48pm

I’m looking at the ‘countBits’ and ‘nthBitIndex’, there does not seem to be mutable versions of the same operations. I’m specifically trying to implement a succinct dictionary to improve the storage of a data structure I’m working on in the http://discourse.haskell.org/t/counting-words-but-can-we-go-faster/ project.

Ps. Always happy to talk to the author of the library

Bodigrim · August 13, 2022, 10:20pm

Are you looking for popkey: Static key-value storage backed by poppy and hw-rankselect: Rank-select?

kalhauge · August 14, 2022, 2:03pm

I’ll look into them, they look interesting! Do you recommend them instead of the bitvec library and should bitvec be considered obsolete? Alternatively, is calling the immutable operators on a mutable bit vector through the ‘unsafeFreeze’ operator the correct way to use the library?

Bodigrim · August 14, 2022, 5:26pm

bitvec is a general purpose library for bit vectors, while popkey is an implementation of a succinct data structure. They have different purposes and are not interchangeable.

I have not used popkey myself, so cannot tell if it’s a right choice for your purposes.

Yes, your unsafeExtract is fine, as much as “unsafe” functions can be “fine”. But I’m also open for someone to extend bitvec with countBitsM and nthBitIndexM for mutable vectors, it should be straightforward to implement.

kalhauge · August 14, 2022, 7:11pm

It’s mostly a pet project so ‘popkey’ is not the best choice! Thank you for your help though. If I can figure it out, I’ll love to send a pull request. I’ll be busy the next couple of though.

Topic		Replies	Views
Linearly-typed mutable vector of unboxed values (hack) for NFA benchmark Learn	0	116	April 7, 2025
How to use mutable vector	1	472	February 3, 2021
Type of `get` in `Data.Vector.Mutable.Linear` Learn	6	147	May 27, 2025
Is unsafeIOToSTM ever safe?	6	590	July 28, 2022
[ANN] vector-hashtables-0.1.0.1 Announcements	4	535	September 10, 2021

Using Immutable Vector extractors on Mutable Vectors

Related topics