Solutions and Write-Ups for my Advent Of Code adventures (mainly in Haskell)
This year’s longest path problem is finally here!
In order to represent the input, I have a data structure with three fields:
data Input = Input { start :: (Int, Int), end :: (Int, Int), grid :: Matrix Char } deriving (Show)
parseInput :: String -> Input
parseInput input = Input { start=start, end=end, grid=grid }
where grid = (fromLists . lines) input
start = head [(1 , c) | c <- [1 .. ncols grid], grid ! (1 , c) == '.']
end = head [(nrows grid, c) | c <- [1 .. ncols grid], grid ! (nrows grid, c) == '.']
By now, finding neighbours functions are pretty common, so I won’t spend much time describing it! The basic idea is:
getNeighbours :: Bool -> (Int, Int) -> Matrix Char -> [(Int, Int)]
getNeighbours isSlippy pos@(r, c) grid = filter isNotWall . filter isInGrid $ neighbours
where char = grid ! pos
isInGrid (r, c) = 0 < r && r <= nrows grid && 0 < c && c <= ncols grid
isNotWall (r, c) = grid ! (r, c) /= '#'
neighbours | isSlippy && char == '>' = [(r, c + 1)]
| isSlippy && char == '<' = [(r, c - 1)]
| isSlippy && char == 'v' = [(r + 1, c)]
| isSlippy && char == '^' = [(r - 1, c)]
| otherwise = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
The longest path problem is NP-HARD. If you don’t know what that means (don’t worry, I don’t exactly know either), basically remember that there is HARD in the name :).
So, the best approach that we can have here is to brute-force our way through:
This works quite well on the example inputs, and it runs decently fast on the actual input for part 1.
However, if you try to naively bruteforce your way through part 2, you’re going to have to wait at least an hour!
And that’s only if you have a good computer! (Which is not my case!)
So, we need to find some optimisation here. My first thought was to try caching some results. However, caching with the visited tiles in my key proved too slow, and caching with the path length in my key didn’t give correct results.
After some time, I decided to analyse the input, and I noticed something quite interesting:
So, silly as I was, I thought about caching using the visited junctions. This was too slow, so I went to have breakfast.
After my breakfast, I realised something: I can simplify my grid in a weighted graph of connected junctions!
Basically:
#.#########
#.........#
#.#######.#
#.######..#
#........##
########.##
Here, there are four junctions:
So I can represent my grid with a weight graph:
This greatly simplifies my problem, as I now only have to find the longest path in a graph with 4 nodes!
On my actual input, this changes my number of nodes from about 9500 to just 36!
This also automatically prunes dead ends! For example:
#.#############
#......########
#.####..#######
#.#####.#######
#.####..#######
#.####.########
#.####........#
#.#############
Here, it is quite obvious that following the right (as in going Eastbound first, not as in “the correct”) path will lead to a dead end! However, when creating our junction graph, we get the following junctions:
#S#############
#A.....########
#.####..#######
#.#####.#######
#.####..#######
#.####.########
#.####........#
#E#############
In graph form:
This means that, instead of following two paths, we only have one path to follow! (Which is quite fast, as the longest path is now the only one)
Now, my initial way of doing creating this graph was to simply launch a bfs from the starting junction. The encountered junctions would not propagate the bfs further, and I keep the first found distance as the weight between two junctions.
type WGraph = M.Map (Int, Int) [(Int, (Int, Int))] -- Weighted directed graph. Edges are represented by a list of (cost, successor)
makeJunctionGraph :: Bool -> Input -> WGraph
makeJunctionGraph isSlippy (Input start end grid) = treatJunctions (S.singleton start) M.empty [start]
where isJunction pos | pos == start || pos == end = True
| otherwise = length (getNeighbours isSlippy pos grid) > 2
treatJunctions _ graph [] = graph
treatJunctions seen graph (x:queue) = treatJunctions seen' graph' queue'
where nextJunctions = findNextJunctions (S.singleton x) [(0, x)]
graph' = M.insert x nextJunctions graph
junctions = filter (`S.notMember` seen) $ map snd nextJunctions
seen' = foldr S.insert seen junctions
queue' = queue ++ junctions
findNextJunctions _ [] = []
findNextJunctions seen ((d, x):queue) | d > 0 && isJunction x = (d, x) : findNextJunctions seen queue
| otherwise = findNextJunctions seen' queue'
where neighbours = filter (`S.notMember` seen) $ getNeighbours isSlippy x grid
seen' = foldr S.insert seen neighbours
queue' = queue ++ zip (repeat (d + 1)) neighbours
My makeJunctionGraph function here creates the graph of junctions from an input. It works with a queue of junctions to treat:
What the bfs does is:
This works on my input (and probably most inputs) because it has one interesting property:
This, however, is not the case for the first example showed above! It would tell me that the distance between A and B is 11, not 12! (it would take the lower path, at it is the shortest path)
I fixed this as soon as I noticed it:
type WGraph = M.Map (Int, Int) [((Int, Int), Int)] -- Weighted directed graph. Edges are represented by a list of (successor, cost)
makeJunctionGraph :: Bool -> Input -> WGraph
makeJunctionGraph isSlippy (Input start end grid) = treatJunctions (S.singleton start) M.empty [start]
where isJunction pos | pos == start || pos == end = True
| otherwise = length (getNeighbours isSlippy pos grid) > 2
treatJunctions _ graph [] = graph
treatJunctions seen graph (x:queue) = treatJunctions seen' graph' queue'
where nextJunctions = M.assocs $ findNextJunctions (S.singleton x) [(x, 0)] M.empty
graph' = M.insert x nextJunctions graph
junctions = filter (`S.notMember` seen) $ map fst nextJunctions
seen' = foldr S.insert seen junctions
queue' = queue ++ junctions
findNextJunctions _ [] found = found
findNextJunctions seen ((x, d):queue) found | d > 0 && isJunction x = findNextJunctions seen queue found'
| otherwise = findNextJunctions seen' queue' found
where neighbours = filter (`S.notMember` seen) $ getNeighbours isSlippy x grid
seen' = foldr S.insert seen $ filter (not . isJunction) neighbours
queue' = queue ++ zip neighbours (repeat (d + 1))
found' | x `M.member` found = M.adjust (max d) x found -- There is another way to get to this junction: keep the longest
| otherwise = M.insert x d found
The code is almost the same, to one small difference:
My code for that could be cleaner, but I quickly hacked that fix in because I didn’t have much time :C (at least it works and doesn’t deter performances!)
Now that I have a way to create my junction graph, all I need to do is to brute-force every possible path and to keep the best result:
findLongestPath :: Bool -> Input -> Output
findLongestPath isSlippy input = go S.empty 0 (start input)
where graph = makeJunctionGraph isSlippy input
go seen pathLen cur | cur == end input = pathLen
| otherwise = best
where seen' = S.insert cur seen
neighbours = [(pos, dist) | (pos, dist) <- graph M.! cur, pos `S.notMember` seen]
best = maximum (0 : parMap rseq (\(p, d) -> go seen' (pathLen + d) p) neighbours)
I do this using a dfs:
Note that I do my recursion in parallel to go a little bit faster!
And now, solving part one and part two is simply calling the findLongestPath function, with and without slipperiness!
partOne :: Input -> Output
partOne = findLongestPath True
partTwo :: Input -> Output
partTwo = findLongestPath False
Done!