Serializing and Deserializing Binary Tree - Depth First Search / DFS on Tree

algo.monster · December 13, 2021, 8:43am

https://algo.monster/problems/serializing_tree

Coder1 · December 13, 2021, 8:43am

for JS noobs like me who don’t like to mess with Symbol.iterator, use const val = nodes.shift(); when deserializing. eg:

function dfsDeserealize(nodes) {
    const val = nodes.shift();
    if (val === 'x') return;
    const cur = new Node(parseInt(val, 10));
    cur.left = dfsDeserealize(nodes);
    cur.right = dfsDeserealize(nodes);
    return cur;
}

nor · March 11, 2022, 10:42am

This seems more practical to me

mike · March 15, 2022, 7:27am

Less insane JS solution referencing @Coders idea so we don’t have to write some weird syntax coalescing an iterator out of something.

This uses a js array as a queue, I believe.

function serialize_dfs(root, res){
if(!root) {
res.push(“x”);
return;
}
res.push(root.val);
serialize_dfs(root.left, res);
serialize_dfs(root.right, res);
}

function serialize(root) {
let res = [];
serialize_dfs(root, res);
return res.join(" ");
}

function deserialize_dfs(nodes){
let node = nodes.shift();
if(node === “x”) return null;
let cur = new Node(parseInt(node,10));
cur.left = deserialize_dfs(nodes);
cur.right = deserialize_dfs(nodes);
return cur;
}

function deserialize(s) {
return deserialize_dfs(s.split(" "));
}

AnotherCoder · March 20, 2022, 7:43am

we can may be avoid using shift by returning below during Serialize:
return nodes.reverse().join(" ")

Now you can use pop in deserialize

kopi22 · May 5, 2022, 6:38pm

Problem with using shift is that it takes O(n) time to remove the first element making the overall time complexity at least O(n^2)

Ryan · May 18, 2022, 1:34am

Rather than using an iterator, we can also use a global or nonlocal pointer variable in the deserialize function to track our location in the string array.

jay · May 23, 2022, 8:04am

here’s another solution if you don’t want to use iter() and next() in python

def deserialize(curr_s):
    def dfs(arr):
        val = arr.pop(0)
        if val == 'x':
            return None
        node = Node(int(val))
        node.left = dfs(arr)
        node.right = dfs(arr)

        return node
    
    arr = curr_s.split(' ')
    return dfs(arr)

floridaman · May 24, 2022, 12:04am

Why do we need to return cur?

Im_just_guessing_her · June 12, 2022, 8:43am

TLDR: We return curr because it is what allows us to “bubble up” the information we need to build the tree or returns the root

DFS for this problem works like:
Check the current value of our iterator and see if it is Null
If it is return None (think of it as returning curr as null)
If it isn’t then set our curr node’s value to the value of our next(iterator)

We can’t return yet because we do not know curr children (curr.left and curr.right)

How do we find the children of curr?

We use the DFS / Recursion to find out!

Search the left side - dfs(curr.left)
Search the right side - dfs(curr.right)

Once this completes we can return curr because curr has its left and right children as well as having it’s value.
We don’t know if this was the first function call or if it’s a recursive call being used to build the tree, regardless we are returning curr because it is either: the answer or part of the answer (being used to build out the tree).

j112 · June 15, 2022, 12:45pm

val = next(nodes) can you explain next? I think its from iter in main

Mod2 · June 21, 2022, 5:14am

Yes, next retrieves the next element from an iterator. The iterator is created on line 45 in the main function.

Alex · June 30, 2022, 5:44pm

It’s vital to consider that this is a preorder tree traversal. Otherwise, the whole problem changes. For example, if you serialize the tree into inorder or post order, the solution wouldn’t be valid.

archurro · July 7, 2022, 1:05pm

Probably overcomplicating it but I took an iterative approach with deserialize, let me know how I could make it cleaner:

def deserialize(s):
if s == ‘x’:
return None

s = s.split()
stack = []
nodes = []
i = 0
xCount = 0

for i in range(len(s)):      
    stack.append(s[i])
    
    if s[i] == 'x':
        xCount += 1
    
    if xCount == 2:
        xCount = 0
        
        # pop the 2 x's
        stack.pop()
        stack.pop()
        
        # next pop is the node value
        n = Node(int(stack.pop()), None, None)
        nodes.append(n)
    
    if len(nodes) == 2:
        right = nodes.pop()
        left = nodes.pop()
        val = int(stack.pop())
        n = Node(val, left, right)
        nodes.append(n)

return nodes.pop()

NewCoder · July 13, 2022, 10:29am

Thanks @Coder & @AnotherCoder, this helped a lot! Hope you both found your dream jobs

lava1 · August 22, 2022, 8:07am

def serialize(root):
# WRITE YOUR BRILLIANT CODE HERE
ans = []
f1(root, ans)
return ans

def f1(root, ans):
if root is None:
ans.append(“X”)
return

ans.append(root.val)
f1(root.left, ans)
f1(root.right, ans)

def deserialize(s):
# AND HERE
return f2(s, 0)[0]

def f2(s, idx):
if idx >= len(s):
return

if s[idx] == "X":
    return None, idx

root = Node(s[idx])
idx += 1
root.left, idx = f2(s, idx)
idx += 1
root.right, idx = f2(s, idx)
return root, idx

rd1 · September 22, 2022, 11:09pm

arr.pop(0) is O(N) in Python so this will work but total time complexity won’t be O(N) anymore since for each DFS call you are doing O(N) work.

Chris · October 12, 2022, 2:24am

In previous sections, we draw a parallel between DFS and pre-order. Can this serialize/deserialize be done with in-order or post-order representations of strings instead, by rearranging the order of the calls to left/right and calls to self?

mod1 · December 22, 2022, 7:59am

you can but you’d need dummy values when you serialize to tell when a new node starts

jz1 · January 4, 2023, 1:58am

For those wondering, List.pop(k) in Python pops the index k and shifts all elements up one. This causes List.pop() to be O(N). An alternative is to use a deque which has O(1) pop / append operations.