Tasty-bench-fit: benchmark a function and find out its asymptotic complexity

Bodigrim · April 10, 2023, 11:53am

I wrote a naive and stupid package, which tries to guess asymptotic complexity from benchmarks:

Here is an example of output:

> fit $ mkFitConfig (\x -> sum [1..x]) (10, 10000)
1.2153e-8 * x
> fit $ mkFitConfig (\x -> Data.List.nub [1..x]) (10, 10000)
2.8369e-9 * x ^ 2
> fit $ mkFitConfig (\x -> Data.List.sort $ take x $ iterate (\n -> n * 6364136223846793005 + 1) (1 :: Int)) (10, 10000)
5.2990e-8 * x * log x

I tested some toy examples, but curious how tasty-bench-fit behaves on data it was not “trained” on. Could someone please give it a try and share results (ideally with debug flag on)?

kindaro · April 10, 2023, 1:21pm

Wow, you actually made it!

ChShersh · April 11, 2023, 11:40am

Looks like an amazing project!

I give it a run on different implementations of nub:

nub from base:Data.List: should be O(n^2)
nubOrd from containers:Data.Containers.ListUtils: should be O(n log n)
nubHash based on HashSet from unordered-containers: should be O(n) (not really but still)

Here are the summary of tests:

Ordinary nub (unique)
1.8289e-10 * x ^ 2 * log x
Ordinary nub (dups)
3.2545e-9 * x ^ 2
Efficient nubOrd (unique)
2.2492e-9 * x * log ^ 2 x
Efficient nubOrd (dups)
4.0699e-8 * x ^ 1.1977
Efficient nubHash (unique)
1.2586e-10 * x * log ^ 3 x
Efficient nubHash (dups)
1.8058e-11 * x * log ^ 4 x

Running the same test suite result with debug information:

Link to debug info

For anyone interested in trying, here’s how I tested this:

Create a directory with a project, e.g. fit-example:
```
mkdir fit-example
cd fit-example
```

Create the following cabal.project:

packages: .

constraints: tasty-bench-fit +debug

source-repository-package
    type: git
    location: https://github.com/Bodigrim/tasty-bench-fit.git
    tag: be972944ee26184892213fd6e700565b643cffd5

Create the following fit-example.cabal file:

cabal-version:      3.0
name:               fit-example
version:            0.1.0.0

executable fit-example
    main-is:          Main.hs

    build-depends:
      , base
      , containers
      , hashable
      , tasty-bench-fit
      , unordered-containers

    default-language: GHC2021

And here’s my Main.hs test code:

module Main where

import Data.Hashable (Hashable)
import Data.HashSet (HashSet)
import qualified Data.HashSet as HashSet
import Data.List (nub)
import Data.Containers.ListUtils (nubOrd)
import Test.Tasty.Bench.Fit (fit, mkFitConfig)

nubHash :: (Eq a, Hashable a) => [a] -> [a]
nubHash = go mempty
  where
    go acc [] = []
    go acc (x:xs)
      | HashSet.member x acc = go acc xs
      | otherwise = x : go (HashSet.insert x acc) xs

main :: IO ()
main = do
    -- Nub (Eq)

    putStrLn "Ordinary nub (unique)"
    complexity <- fit $ mkFitConfig (\x -> nub [1..x]) (10, 10000)
    print complexity
    putStrLn "Ordinary nub (dups)"
    complexity <- fit $ mkFitConfig (\x -> nub ([1..x] ++ [1..x])) (10, 10000)
    print complexity

    -- Nub (Ord)

    putStrLn "Efficient nubOrd (unique)"
    complexity <- fit $ mkFitConfig (\x -> nubOrd [1..x]) (10, 10000)
    print complexity
    putStrLn "Efficient nubOrd (dups)"
    complexity <- fit $ mkFitConfig (\x -> nubOrd ([1..x] ++ [1..x])) (10, 10000)
    print complexity

    -- Nub (Eq + Hash)

    putStrLn "Efficient nubHash (unique)"
    complexity <- fit $ mkFitConfig (\x -> nubHash [1..x]) (10, 10000)
    print complexity
    putStrLn "Efficient nubHash (dups)"
    complexity <- fit $ mkFitConfig (\x -> nubHash ([1..x] ++ [1..x])) (10, 10000)
    print complexity

Running the code with:

cabal run

jaror · April 11, 2023, 12:03pm

With the same setup I got these results:

Ordinary nub (unique)
2.2311e-9 * x ^ 2
Ordinary nub (dups)
4.5887e-9 * x ^ 2
Efficient nubOrd (unique)
2.9811e-10 * x * log ^ 3 x
Efficient nubOrd (dups)
3.3858e-9 * x * log ^ 2 x
Efficient nubHash (unique)
1.6405e-10 * x * log ^ 3 x
Efficient nubHash (dups)
2.5267e-11 * x * log ^ 4 x

david-christiansen · April 12, 2023, 9:17am

This is a really cool project

Abab9579 · April 12, 2023, 11:09am

I would like a log-log fit, which should provide close watch on the order of increase.

noinia · April 12, 2023, 5:06pm

Can you give an example of an algorithm --as it would be implemented in Haskell-- that would have an asymptotic complexity of O(X loglog n) time for some appropriate X? (and clearly I don’t mean picking e.g. X = n/loglog n). Since almost everything is pointer/comparison based, I don’t think there are all that many algorithms for which that would be the right answer.

Aside from that. Indeed a cool idea. I would very much like to know what it is actually computing though. I.e. if the benchmark suite claims the running time is 5f(n), then what does that actually mean? I presume 5f(n) was the best fit over functions g selected from some particular set F? It would be nice to document that; otherwise it is hard to interpret the results. [1]

[1] I guess this should mostly be regarded as a general community TODO somewhere; I may just read the source at some point and write s.t. if it doesn’t exist by then :).

Bodigrim · April 12, 2023, 5:49pm

As experiments by @ChShersh and @jaror demonstrate (thanks, guys, I’ll get back to your results soon!), it is challenging to catch even log n factor right. Determining log (log n) is likely impossible for statistical methods, and I know a single algorithm with expected log (log n) term: Schönhage–Strassen algorithm - Wikipedia.

I presume 5*f(n) was the best fit over functions g selected from some particular set F?

See the definition of Complexity, it is a x^b log^c x.

Bodigrim · April 12, 2023, 10:14pm

@ChShersh thanks for a neat reproducible example! I’ve updated tasty-bench-fit to cope with certain situations better, so if anyone wants to provide another data point, please bump tag to the latest commit.

I’m getting more or less reliable results for the following derived program:

{-# OPTIONS_GHC -O0 #-}

module Main where

import Data.Hashable (Hashable)
import Data.HashSet (HashSet)
import qualified Data.HashSet as HashSet
import Data.List (nub)
import Data.Containers.ListUtils (nubOrd)
import Test.Tasty.Bench.Fit (fit, mkFitConfig)

nubHash :: (Eq a, Hashable a) => [a] -> [a]
nubHash = go mempty
  where
    go acc [] = []
    go acc (x:xs)
      | HashSet.member x acc = go acc xs
      | otherwise = x : go (HashSet.insert x acc) xs

main :: IO ()
main = do
    -- Nub (Eq)

    putStrLn "Ordinary nub (unique)"
    complexity <- fit $ mkFitConfig (\x -> nub [1..x]) (100, 10000)
    print complexity
    putStrLn "Ordinary nub (dups)"
    complexity <- fit $ mkFitConfig (\x -> nub ([1..x] ++ [1..x])) (100, 10000)
    print complexity

    -- Nub (Ord)

    putStrLn "Efficient nubOrd (unique)"
    complexity <- fit $ mkFitConfig (\x -> nubOrd [1..x]) (10000, 100000)
    print complexity
    putStrLn "Efficient nubOrd (dups)"
    complexity <- fit $ mkFitConfig (\x -> nubOrd ([1..x] ++ [1..x])) (10000, 100000)
    print complexity

The most surprising part is why -O0 is needed: my hypothesis is described at GitHub - Bodigrim/tasty-bench-fit: Benchmark a given function for variable input sizes and find out its time complexity. I also increased both low and high input sizes. The results are in line with expectations:

Ordinary nub (unique)
2.4935e-9 * x ^ 2
Ordinary nub (dups)
6.6628e-9 * x ^ 2
Efficient nubOrd (unique)
3.8730e-8 * x * log x
Efficient nubOrd (dups)
5.0239e-8 * x * log x

As for nubHash, fit indeed produces very surprising results like x log ^ 3 x. The thing is that feeding continuous Ints into a HashSet hits a very peculiar spot: instance Hashable Int where hash = id, so there is no hashing happening at all, we just pack small arrays element by element. There are sharp changes in performance when another level of small arrays needed, altogether making asymptotics difficult to predict from observations.

ChShersh · April 13, 2023, 9:35am

This is fun to play with! I know it’s not a real benchmark but still, it’s interesting to implement various algorithms and then check whether you guessed the complexity right.

This time, I wanted to test various sorting implementations:

Data.List.sort: should be O(n log n) with small constant because it uses several optimizations
Naive QuickSort: should be O(n log n) on a random list
Top-down mergeSort with the split in the middle: should be O(n log n) with big constant
Top-down mergeSort with split by even-odd: should be O(n log n) with smaller constant
Bottom-up mergeSort: should be O(n log n)
Sort based on IntMap: should be close to O(n)

Suprisingly, the results are not deterministic, and different runs produce different asymptotics. So I run every algorithm twice from GHCi:

-- Data.List.sort
ghci> fit $ mkFitConfig (Data.List.sort . mkList) (10, 10000)
7.0969e-9 * x * log ^ 2 x
ghci> fit $ mkFitConfig (Data.List.sort . mkList) (10, 10000)
1.0800e-7 * x ^ 1.1837

-- Quick Sort
ghci> fit $ mkFitConfig (quickSort . mkList) (10, 10000)
2.8411e-7 * x * log x
ghci> fit $ mkFitConfig (quickSort . mkList) (10, 10000)
2.8529e-7 * x * log x

-- Merge Sort (with split in the middle)
ghci> fit $ mkFitConfig (mergeSortWithLength . mkList) (10, 10000)
4.7956e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortWithLength . mkList) (10, 10000)
5.6328e-8 * x * log ^ 2 x

-- Merge Sort (with the split by even-odd)
ghci> fit $ mkFitConfig (mergeSortEvenOdd . mkList) (10, 10000)
5.2711e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortEvenOdd . mkList) (10, 10000)
6.1624e-8 * x * log ^ 2 x

-- Merge Sort (bottom up)
ghci> fit $ mkFitConfig (mergeSortBottomUp . mkList) (10, 10000)
4.1648e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortBottomUp . mkList) (10, 10000)
8.2759e-7 * x ^ 1.1684

-- Int Sort
ghci> fit $ mkFitConfig (intSort . mkList) (10, 10000)
6.3846e-8 * x * log x
ghci> fit $ mkFitConfig (intSort . mkList) (10, 10000)
2.3924e-7 * x ^ 1.0959

And here’s full code:

{-# OPTIONS_GHC -O0 #-}

module Main where

import Data.IntMap.Strict (IntMap)
import qualified Data.IntMap.Strict as IntMap
import Data.List (sort, partition, foldl')
import Test.Tasty.Bench.Fit (fit, mkFitConfig)

quickSort :: Ord a => [a] -> [a]
quickSort [] = []
quickSort (x : xs) =
  let (less, greater) = partition (< x) xs
  in quickSort less ++ (x : quickSort greater)

merge :: Ord a => [a] -> [a] -> [a]
merge xs [] = xs
merge [] ys = ys
merge (x : xs) (y : ys) = case compare x y of
  EQ -> x : y : merge xs ys
  LT -> x : merge xs (y : ys)
  GT -> y : merge (x : xs) ys

mergeSortWithLength :: Ord a => [a] -> [a]
mergeSortWithLength [] = []
mergeSortWithLength [x] = [x]
mergeSortWithLength xs =
    let (l, r) = splitAt (length xs `div` 2) xs
    in merge (mergeSortWithLength l) (mergeSortWithLength r)

mergeSortEvenOdd :: Ord a => [a] -> [a]
mergeSortEvenOdd [] = []
mergeSortEvenOdd [x] = [x]
mergeSortEvenOdd xs =
    let (l, r) = splitEvenOdd id id xs
    in merge (mergeSortWithLength l) (mergeSortWithLength r)
  where
    splitEvenOdd :: ([a] -> [a]) -> ([a] -> [a]) -> [a] -> ([a], [a])
    splitEvenOdd mkL mkR [] = (mkL [], mkR [])
    splitEvenOdd mkL mkR [x] = (mkL [x], mkR [])
    splitEvenOdd mkL mkR (x : y : xs) = splitEvenOdd (mkL . (x :)) (mkR . (y :)) xs

mergeSortBottomUp :: Ord a => [a] -> [a]
mergeSortBottomUp = mergeLists . map (:[])
  where
    mergeLists :: Ord a => [[a]] -> [a]
    mergeLists [] = []
    mergeLists [x] = x
    mergeLists xs = mergeLists $ mergePairs xs

    mergePairs :: Ord a => [[a]] -> [[a]]
    mergePairs [] = []
    mergePairs [x] = [x]
    mergePairs (x : y : ys) = merge x y : mergePairs ys

intSort :: [Int] -> [Int]
intSort = unfold . compress
  where
    compress :: [Int] -> IntMap Int
    compress = foldl' (\acc x -> IntMap.insertWith (+) x 1 acc) mempty

    unfold :: IntMap Int -> [Int]
    unfold = concatMap (\(x, frequency) -> replicate frequency x) . IntMap.toAscList

mkList :: Int -> [Int]
mkList n = take n $ iterate (\n -> n * 6364136223846793005 + 1) n

Bodigrim · April 17, 2023, 8:03pm

Thanks again for testing and providing interesting examples, @ChShersh!

I’ve updated the package one more time. Things seem to get closer to expected now:

ghci> fit $ mkFitConfig (Data.List.sort . mkList) (10, 10000)
5.714e-8 * x * log x
ghci> fit $ mkFitConfig (Data.List.sort . mkList) (10, 10000)
5.728e-8 * x * log x
ghci> fit $ mkFitConfig (quickSort . mkList) (10, 10000)
2.286e-7 * x * log x
ghci> fit $ mkFitConfig (quickSort . mkList) (10, 10000)
2.276e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortWithLength . mkList) (10, 10000)
4.176e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortWithLength . mkList) (10, 10000)
4.144e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortEvenOdd . mkList) (10, 10000)
4.460e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortEvenOdd . mkList) (10, 10000)
4.434e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortBottomUp . mkList) (10, 10000)
3.677e-7 * x * log x
ghci> fit $ mkFitConfig (mergeSortBottomUp . mkList) (10, 10000)
3.709e-7 * x * log x
ghci> fit $ mkFitConfig (intSort . mkList) (10, 10000)
3.797e-7 * x
ghci> fit $ mkFitConfig (intSort . mkList) (10, 100000)
5.978e-8 * x * log x
ghci> fit $ mkFitConfig (intSort . mkList) (10, 100000)
5.759e-8 * x * log x

Bodigrim · May 1, 2023, 9:27pm

And the package is now live: tasty-bench-fit: Determine time complexity of a given function

jaror · August 14, 2023, 9:03pm

I just used this to determine my extremely naive CFG parser is probably around O(n^4) (for a specific very simple grammar)

gist.github.com

https://gist.github.com/noughtmare/1def5cbea85e5d43fd0ca73671b5afd6

CFG.hs

import Data.List qualified as List
import Test.Tasty.Bench.Fit (fit, mkFitConfig)

data CFG = CFG String [(String, [Symbol])]

data Symbol = T Char | NT String

splits :: String -> [(String, String)]
splits [] = [([], [])]
splits (x : xs) = ([], x : xs) : map (\(y, z) -> (x : y, z)) (splits xs)

This file has been truncated. show original

The only catch is that it has a huge constant factor compared to less naive parsers. On my machine an input of 201 characters for a very simple grammar already takes almost 10 seconds.

Thanks a lot for this neat tool @Bodigrim!

tomjaguarpaw · August 15, 2023, 6:55am

Do you mean other CFG parsers are at least quartic too? That surprises me!

jaror · August 15, 2023, 7:08am

No, there is a proof that CFG parsing is reducible to binary matrix multiplication, for which the best practical algorithms are around O(n^2.7). But most CFG parsers are O(n^3) and I think O(n^4) is pretty close to that for a very naive approach, especially considering pretty much all parser combinator libraries are exponential in the worst case (and they don’t terminate at all on left-recursive grammars).

But the biggest caveat isthat I’ve only tested my implementation for a specific and quite simple grammar which should be parseable in O(n), so then O(n^4) is not so impressive.

Topic		Replies	Views
Quick Sort variants performance Learn	2	1603	July 7, 2020
Counting Words, but can we go faster? Show and Tell	30	3602	August 10, 2022
Memory performance when reading large files Learn	11	2780	December 18, 2021
Subquadratic behaviour of Data.List.nub Learn	9	622	April 10, 2023
All my sieves are the same! Learn	15	840	June 29, 2021

Tasty-bench-fit: benchmark a function and find out its asymptotic complexity

Related topics