Implementing incremental hashing of a data type

Nosrep · June 18, 2023, 11:54pm

I’m writing a chess engine in Haskell (not my greatest idea) and have implemented Zobrist hashing, which is a form of incremental hashing. Before implementing hashing, this was my Game type:

data Pieces = Pieces
    { _pawns   :: !Bitboard
    , _knights :: !Bitboard
    , _bishops :: !Bitboard
    , _rooks   :: !Bitboard
    , _queens  :: !Bitboard
    , _kings   :: !Bitboard
    } deriving (Eq, Show)
mkLenses ''Pieces

data Game = Game
    { _gamePlaying   :: Pieces
    , _gameWaiting   :: Pieces
    , _gameCastling  :: !Int
    , _gameEnPassant :: !(Maybe Int)
    , _gameTurn      :: !Color
    } deriving (Eq, Show)
makeLenses ''Game

For incremental hashing I came up with two approaches: adding the hash to the Game type

data Game = Game
    { _gamePlaying   :: Pieces
    , _gameWaiting   :: Pieces
    , _gameCastling  :: !Int
    , _gameEnPassant :: !(Maybe Int)
    , _gameTurn      :: !Color
    , _gameHash      :: !Int
    } deriving (Eq, Show)
makeLenses ''Game

and creating a separate type that wraps Game

data HGame = HGame
    { _hgGame :: {-# UNPACK #-} !Game
    , _hgHash :: !Int
    } deriving (Eq, Show)
makeLenses ''HGame

None of these approaches are really satisfactory to me. The first approach isn’t great because ideally I’d like the board specific information (info required to generate moves) to stay separate from auxiliary info like the hash. The second approach is unwieldy because most of the time the code is accessing hgGame, not hgHash, which leaves lots of game ^. hgGame . whatever in the code. It also causes some extra unboxing/reboxing to happen when the Game is passed to a function from within a HGame; I tried removing the UNPACK, performance got worse. I’ve also tried benchmarking both of these; performance is better during perft for the first approach, performance during searching is better for the second.

Is there any approach I’m overlooking? Can any of these be improved?

bradrn · June 19, 2023, 1:08am

Not necessarily! If you use makeClassy to make your lenses, then you can just do game ^. whatever even with the nested structure. I like microlens’s documentation here, but lens and optics both have this too.

jhenahan · June 19, 2023, 1:14am

Is the hash a fact about a game, or a fact about a board state? Would it make sense to carry around a state parameter and transform it at each game step? Has a certain CPS feel to it, too.

Nosrep · June 19, 2023, 1:35am

makeClassy looks like a pretty good fit for HGame, thanks!

Nosrep · June 19, 2023, 1:36am

The hash is created from all the fields of Game, which is closer to a board state than a game because it doesn’t store things like time control or history. What do you mean by a state parameter?

jhenahan · June 19, 2023, 1:54am

E.g., something State BitVector Game-ish, so that every step of the game you don’t have to tie together the hash and the board logic, at all. So each step of your game then looks like

Update the Game
modify (flip xor $ hashGame newGameState) (or something close)
Then each game is just initialized with flip runState hashOfStartingPosition

I don’t know how you’re currently handling updating the hash between game steps, so this might not actually be helpful.

Nosrep · June 19, 2023, 2:55am

I don’t think that would work unfortunately, currently the hash updating needs to be tied with the game state updating

Topic		Replies	Views
Please comment on my Hangman game Learn	7	461	October 29, 2023
How to optimize the following algorithms Learn	14	702	June 18, 2021
Defect Process: 2D commercial hack 'n slash game in Haskell Links	1	513	October 7, 2021
My knight's tour is slow Learn	13	1349	November 19, 2022
Hashable instance for vinyl records Learn	4	765	March 22, 2021

Implementing incremental hashing of a data type

Related topics