Skip to content

Latest commit

 

History

History
994 lines (702 loc) · 50.3 KB

README.md

File metadata and controls

994 lines (702 loc) · 50.3 KB

Interview Prep Notes

This document contains notes to improve my problem solving skills, and possibly other skills such as system design in the distant future. The solutions outlined below are not copies of the actual solutions, but rather contain the intuition or thought process that has one arrive at a solution. Above all, LeetCode is a matter of being able to recognize and apply patterns, which this document intends to help with. Additionally, I do note some implementation tips for myself, such as dummy heads in lists.

LeetCode Solutions

These are my notes for the LeetCode questions I've solved, with respective code under solutions/*.cpp. All solutions are in C++, as it has a phenomenal standard library.

Q1 - Two Sum

  • As with many situations, it's possible to sacrifice memory in exchange for superior time complexity.
  • Storing each element of a vector into a map where the values are indices reduces the algorithm to a one-pass. Lookup is made constant in this way.

Q2 - Add Two Numbers

  • When working with linked lists, it's sometimes useful to have a dummy node at the beginning. Once you need to return the result, just return dummy->next. This technique may simplify loops.
  • If you are doing something that produces extra terms to be included into the algorithm, be careful about whether your conditional clauses need to include said term as well. For example, when adding two digits, the resulting carry may be stored as an extra term, in which case, you would need to take into account whether the conditionals should include the carry beside the two digits.

Q3 - Longest Substring Without Repeating Characters

  • The sliding window technique is great for one-pass solutions. You should consider the rules of what your left and right pointers mean. For example, the window may be defined as [left, right], [left, right), or (left, right]. The exclusion of the left pointer may simplify things at times.
  • Consider keeping a store of indices according to which you can advance the left pointer.
  • Vectors can be used as frequency tables when you're working with a type that maps easily to integer-based indexing, such as char. When the chars are ASCII, use size 26 for az or AZ, 128 for full ASCII, 256 for extended ASCII.

Q4 - Median of Two Sorted Arrays

  • When working with multiple sorted arrays, you might be able to save both memory and time by just simulating a merge operation with pointers and setting a breaking condition such that what you are left with is the answer.

Q5 - Longest Palindromic Substring

  • Refer to Q647 notes for palindromic substrings.

Q7 - Reverse Integer

  • When working with the individual digits of a number, modulo and integer division are your best friends.
  • If possible, instead of keeping track of what you do (multiplier, etc.), just cut down the number itself to keep track (integer division helps with this).

Q9 - Palindrome Number

  • When trying to validate a palindrome, rather than checking through iteration, you can reverse the input into another variable and check if input == reverse.
  • Sometimes, constraints may prevent reversing the entire input. In that case, you can reverse only one half of the input and check if the remaining input is equal to the reversed half.

Q13 - Roman to Integer

  • Basic understanding of roman numerals: Each symbol is converted to its integer equivalent and summed. Symbols like I/X/C become negative when positioned right before V/X or L/C or D/M respectively.

Q14 - Longest Common Prefix

  • Nothing much to add. Just remember to consider edge cases.

Q15 - 3Sum

  • If you deduce the BCR to be quadratic, it may be a good idea to sort the input. Doing so makes this problem an extension to the two sum two pointer approach.
  • You hold the first item constant, and move the other two to find matches.
  • Since the input is sorted, it's easy to skip duplicates by skipping equivalent adjacent elements.

Q19 - Remove Nth Node From End of List

  • Again, dummy nodes help but be careful picking dummy->next instead of dummy.next.
  • Queues with a fixed size help you keep track of the nth most recent element of something.

Q20 - Valid Parentheses

  • Don't access the top of a stack when it's empty, because "calling back on an empty container causes UB".
  • When processing something "outward to inward", stacks can help.

Q21 - Merge Two Sorted Lists

  • When merging two linked lists, if you consume one list entirely, you can just have the node at that moment point to the list that still has elements left. In other words, there's no need to construct the remaining result list by hand.

Q22 - Generate Parentheses

  • One way to approach a recursive backtrack problem is to make sure each recursive call constructs a valid subresult, and let it run until it hits a base case of what we are looking for. If there are two independent recursive calls within a recursive, the first call is going to exhaust all possibilities, and only then will the second call be run for the first time.
  • This kind of recursion is VERY similar to DFS. We exhaust the first option we have, then exhaust the second option. Once the second option is exhausted, we backtrack, use a second option instead of a first one, and exhaust that. Then just repeat until all possibilities are exhausted.

Q23 - Merge k Sorted Lists

  • When reducing a list of something to a single item, you can iteratively combine the first and last items while reducing the size by size -= size / 2 each time, so you are eventually left with a single item.

Q26 - Remove Duplicates from Sorted Array

  • The two pointer technique is commonly used such that one is the "slow-runner" and the other is the "fast-runner".

Q27 - Remove Element

Too easy, nothing to add.

Q28 - Implement strStr()

  • String matching is optimally done with KMP or Rabin Karp, but that's a bit of an advanced topic.
  • Sometimes it's just better to deal with some edge cases before the main logic.
  • For questions where you only need to find the very first instance of something, you can return as soon as you find it, and in fact have the default return statement return some constant like -1.
  • You can move the variable declaration statement of a for block to the outside of the loop so you get to keep it after the for block.
  • In cases like substring matching where there needs to be a minimum amount of indices left to fit in, you can set loop conditionals such that the impossible indices are not iterated over. See: lPtr != haystack.size() - needle.size() + 1 (+1 depending on how the range is to be used).

Q32 - Longest Valid Parentheses

  • Dynamic Programming:
    • In array questions, it's possible to do DP by using another array of the same size. You might be able to store information about a certain element by examining patterns. This may include using the DP results of the previous elements.
    • Having a base case helps to get the DP array rolling.
  • Stacks:
    • Indices are pushed onto an array, and popped once it's verified that that index marks a valid substring. Thus, when an element is popped, the top element is one before the largest verified substring index at that time.
    • By pushing invalid indices when the stack becomes empty, we make sure there's always a "one before valid index" element.
  • It's sometimes possible to accomplish wizardry by running an algorithm forwards then backwards.

Q33 - Search in Rotated Sorted Array

  • Again, binary search is the way to go. To determine which side to look next, examine the patterns. In a rotated sorted array, one side is going to be properly sorted still. Do a classic binary search on that side and decide from there.

Q35 - Search Insert Position

  • You always need to think of binary search when a sorted array is mentioned.

Q39 - Combination Sum

  • Recursive backtrack solutions have a general pattern of;
    • Lay out a base case to return and backtrack on invalid elements, and also to record sought results if obtained.
    • Have either a loop or some kind of conditional to go over your possible options for the next attempt. The new option is added here, then a recursive call is made with the new state, and then the new option is popped away.
  • It again helps to think of the process in terms of call stack and DFS.
  • Within the options loop of the recursive function, consider where to start your iteration from each time. For example, when going over number candidates, you don't need to iterate over previous candidates on the list, but you should iterate the current item once more at least.

Q46 - Permutations

  • Generating permutations by backtracking works as a DFS where elements are swapped. On each level of the recursion, the current item is swapped with an element on the right. The base case occurs when there are no elements left to swap.

Q48 - Rotate Image

  • Straightforward problem, it's just kinda tricky to get all the indices right.

Q53 - Maximum Subarray

  • Following Kadane's algorithm, you need to discard any current sum that is negative, as it will never contribute positively.
  • To be able to process negative results as well, set the max so INT_MIN so that any negative item may still be worth keeping.

Q56 - Merge Intervals

  • In brute-force-like algorithms, it may be helpful to sort the vector and process it afterwards.
  • Determine when you are going to add the interval as is, and when you are going to merge. Setting up a conditional for this is easy. Then, determine what you would need to do to merge two intervals. Step by step.

Q57 - Insert Interval

  • Instead of dealing with edge case hell by modifying the current array, start a new one and weave the items into it as you go along. Dealing with intervals that need to be merged is easier this way. If an interval is to be merged, only its endpoint changes to the max option. If there is no overlap, it's just added as is.

Q58 - Length of Last Word

  • Trivial. Just start from the end and think about edge cases.

Q62 - Unique Paths

  • Dynamic programming is good for these kinds of counting problems. Each position can only be reached from two adjacent directions, so just sum those and put it in a DP variable.
  • Before adding a lot of code to cover edge cases, consider whether you can adjust your base case to cover those as well.

Q63 - Unique Paths II

  • Implementing obstacles is as simple as forcing their DP sum to be 0.
  • For the first row and column, it's better to do a loop beforehand and determine their values so that you don't have to deal with it as an edge case in the main loop.

Q66 - Plus One

  • Straightforward grade-school addition. Notice that you don't need to keep going if the carry is 0, as it will never become 1 again.

Q70 - Climbing Stairs

  • Great fit for DP since the current step depends on the previous two steps. Again, make sure you do a good mapping of the DP indices and values, the rest will come naturally.
  • Can also be solved with recursion, which can be optimized with memoization. As always, you just give it a base case and call the recursion with the necessary increments.

Q79 - Word Search

  • This follows the usual recursive backtracking pattern where you change a thing, run recursion, change it back. Of course, there needs to be a base case too. In this problem, the value is changed by invalidating the current cell, effectively marking it as visited. The base case is whether the current cell can be part of the word or not.
  • The recursive function is actually called for every cell in the grid through simple iteration.
  • You can either invalidate a cell by changing its content, or add its address to a set of visited coordinates. The latter is slower, but doesn't modify the original grid.

Q88 - Merge Sorted Array

  • Instead of using extra space to store the merged array, there is a trick to do this in constant space (merge in-place). Use three pointers (2 for the arrays, 1 for where the item will go), and work from the end to the front. This way, no values will be overwritten. In fact, this trick is helpful in any array question where in-place actions are needed.

Q92 - Reverse Linked List II

  • Nothing tricky other than keeping track of what node is pointing where and a few edge cases.

Q94 - Binary Tree Inorder Traversal

  • In-order traversal is very easy with recursion, but doing it iteratively is a bit trickier. For every node, you traverse left and push to stack until you can't anymore. Then pop from the stack, visit the node, move to the right node, repeat.

Q98 - Validate Binary Search Tree

  • You need to keep a lower and higher limit to compare against. Adjacent nodes aren't enough to validate.
  • Another solution would be to do an in-order traversal since BST's will be ordered in that case.

Q100 - Same Tree

  • Recursion. Just gotta take care of the base cases.

Q101 - Symmetric Tree

  • Simple stuff recursively. Iteratively, it works with a BFS-like queue where the elements are pairs of supposedly symmetric nodes.

Q102 - Binary Tree Level Order Traversal

  • Recursion is once again easier. Iterative solution needs BFS where the level is kept track of, perhaps in a pair alongside the node.

Q104 - Maximum Depth of Binary Tree

  • Recursion easy, iteration uses pair for depth.

Q105 - Construct Binary Tree from Preorder and Inorder Traversal

  • A pre-order array is going to have the root node at the beginning. An in-order array is going to have its left and right subtrees to the left and right of its index in the array. So, you get the root from the pre-order array and find it in the in-order array, and construct subtrees recursively.
  • As usual, when you need to keep referring back to an array, to make lookup constant, use a hashtable.

Q106 - Construct Binary Tree from Postorder and Inorder Traversal

  • Exact same logic as Q105 above. However, note that the subtrees need to be constructed according to the order of the array besides the in-order one. In other words, Q105 needs to go left->right but Q106 needs to right->left, since that's how the trees will look in their respective pre/post-order arrays.

Q107 - Binary Tree Level Order Traversal II

  • Pretty much the same solution as Q102, except you reverse the result. Keep that in mind I guess.

Q111 - Minimum Depth of Binary Tree

  • Doable with DFS/BFS/recursion. Note that with BFS, you can keep track of depth/level by using a for loop within the while loop. If you run the loop as many times as the queue has elements at the beginning, you will have gone through all queue items on the current level. Something like int n = q.size(); for (int i = 0; i < n; i++) {}.

Q112 - Path Sum

  • Pretty recursion, or paired DFS.

Q113 - Path Sum II

  • Once again, the elegant solution is to use recursive backtracking to add in then pop a node on the path.
  • If insisting on an iterative DFS, you can use tuple to keep track of node/sum/path all at once.

Q114 - Flatten Binary Tree to Linked List

  • Dummy nodes help with trees as well. You can do flattening in O(1) space by appending DFS nodes to a dummy node.
  • When doing DFS, be very careful with the order in which you push the branches to the stack. Pushing the left branch before the right branch means you explore the right branch first. In other words, if you want to go left to right, push right then left.

Q121 - Best Time to Buy and Sell Stock

  • One-pass algorithm by keeping track of minimum value at each iteration.
  • Doing comparisons to determine the optimized value itself rather than variables that should result in the most optimal value is useful at times. For example, when looking for max profit, do comparisons on the potential max profit itself rather than trying to find a max value from which to subtract a min value. Doing final comparisons according to what is already known at that moment helps to simplify the flow of the code as well. In other words, work with maxProfit and minValue instead of maxValue and minValue.

Q124 - Binary Tree Maximum Path Sum

  • This problem has a very concise recursive solution, consisting of multiple little tricks.
  • The function calls itself for its two branches, but discards their value if negative thanks to max(<possible gain>, 0).
  • The return value is the current node plus the maximum of the branches explored: max(left, right), so the path property is kept.
  • The maximum sum is updated through a reference with a distinct expression node->val + left + right. This is only valid for the actual sum since the root can bind the two branches together as a single path.
  • As per usual, the max sum is initialized as negative infinity rather than 0, so that max sums that are negative can be found as well.

Q125 - Valid Palindrome

  • Instead of building a purified string, there is a way to reduce space usage to constant by skipping over non-alphanumeric characters with an inner while loop.

Q136 - Single Number

  • Hash table, or for constant space, bit manipulation: a XOR 0 = a and a XOR a = 0, so a XOR b XOR a = b. In other words, all the doubles will cancel themselves out to leave the single number behind.

Q141 - Linked List Cycle

  • There are three approaches to a cycle question:
    • Prune the visited nodes by destroying/invalidating them, and check to see if you ever reach a visited node.
    • Use a hashset to store memory addresses etc. and see if you ever revisit a node.
    • Floyd's cycle finding algorithm: Have one slow and one fast pointer, if the fast pointer reaches the end, no cycle. If the fast pointer "catches up" to the slow pointer, there must be a cycle.

Q142 - Linked List Cycle II

  • Floyd's cycle finding algorithm does not actually guarantee the intersection to be at the beginning of the cycle. However, if after finding the intersection, the slow pointer is moved back to the head and both pointers are advanced one by one, the intersection will be guaranteed to be the beginning node of the cycle.
  • Be very careful with what you set your initial variables to be in a while loop. Sometimes, both the beginning and ending conditions will be equal, in which case, the loop will actually never run! The simplest way out of this may be a do while loop instead.

Q143 - Reorder List

  • Instead of using extra memory, you can combine several easy operations to achieve the desired order.
    • Find the middle of the list (slow/fast pointer)
    • Reverse the second half of the list
    • Merge the two halves together to obtain result

Q146 - LRU Cache

  • You can combine two data structures (doubly linked list and hashmap) to achieve good runtime and also queue-like ability. The doubly linked list holds the key-value pairs, and the hashmap points from the key to the respective node.
  • You can eliminate some edge cases in doubly linked lists by always keeping a head and tail node. All operations will be done to whatever is adjacent to these two nodes, so no dealing with null nodes.

Q152 - Maximum Product Subarray

  • The intuition is similar to Kadane's algorithm, but you have to reconsider when you're going to reset the current product. If the product is 0, it will never change, so discard it. To deal with negatives, you do two passes: one from the left and one from the right. This way, you will have optimized the negative numbers as well.
  • To do it in a single pass, you keep two variables, maxCurrent and minCurrent. Since these two can change places because of a negative number, you will need to consider both while finding the real maximum.

Q153 - Find Minimum in Rotated Sorted Array

  • Even with a rotation, binary search is still viable. You need to examine the patterns and play around with a few samples. The boundaries of the current selection can direct which side to go next.
  • In binary search, the middle element isn't guaranteed to be the same as when you calculate it from the container size. If you need it to be consistent, it may be a good idea to increment the index depending on parity.

Q167 - Two Sum - Input Array Sorted

  • Think it through step by step, and you'll get it. Many of the steps are made redundant by the input property anyway.
  • The usual way to overcome overflow is to subtract rather than add. For example, left + right == target may instead be left == target - right. Though this may not apply when negative values are involved.

Q190 - Reverse Bits

  • Use a bitmask. Add a 1 each time both number and mask are a 1. Shift mask and result left to iterate over bits.

Q191 - Number of 1 Bits

  • Same as Q190 but even simpler. Increment count whenever bitmask and number are 1. Keep shifting.

Q200 - Number of Islands

  • In certain questions, you can invalidate (destroy) adjacent data to eliminate recounting the same data.
  • Rather than destroying the data, it's a better practice to either make a copy or keep track of the data that would have been destroyed by putting it in a hashmap etc. Additional conditional checks against the "visited" data would be added in this case.
  • Both BFS and DFS can be used in a 2D array.
  • When you need to access an array at a certain index, check that the index is within the array. It's easier to write the conditional once you've written the array expression.
  • The index checks mentioned above are sometimes greatly simplified if you can reduce the check down to a single point. Instead of checking multiple indices by hand before calling a function, instead have the function begin by checking if the input index is valid. Doing so reduces the check down to a single location.
  • Read the solution with Union Find.

Q202 - Happy Number

  • The simple solution is to keep a hashset, but you can actually solve this with Floyd's cycle detection. The slow pointer will move one number ahead, and the fast pointer will move two numbers ahead.

Q203 - Remove Linked List Elements

  • Edge case where the head node is to be removed. Simply advance to the first node that is to be kept. You could also use a dummy head.
  • Removal is as simple as pointing to the next next.

Q206 - Reverse Linked List

  • One of the recursive approaches is to keep stacking the function until you've reached the end, then change pointers as you pop. Note that the iterative solution is actually better in terms of space used.

Q215 - Kth Largest Element in an Array

  • Very simple use of a min-heap. Just pop the top whenever size goes over capacity.

Q217 - Contains Duplicate

  • Sort to group duplicates together, or use a hashset.

Q225 - Implement Stack using Queues

  • You can keep rotating a single queue to get a stack. You would arrive at this solution by writing down the real state of the queue and also the ideal stack representation, and compare to see how you might accomplish it.

Q226 - Invert Binary Tree

  • Recursion. It actually doesn't matter whether you do the swap bottom-up or top-down.

Q230 - Kth Smallest Element in a BST

  • Simple enough to use a max-heap and pop as size goes over capacity, as you iterate over nodes.
  • But you should actually take advantage of the BST property and do in-order traversal, and return when you get to the Kth element. Refer to Q94 for in-order traversal.

Q232 - Implement Queue using Stacks

  • Whenever you need to push in an item, move all of the stack to another stack, put new item in main stack, push the temporary stack back onto main. Honestly it helps to think of what you would do by hand if asked to do this.

Q234 - Palindrome Linked List

  • To solve a palindrome linked list question in space O(1), reverse the second half of the list and verify.
  • To find the middle of the list, use slow & fast running pointers; if the 2nd pointer moves 2 nodes, and the 1st pointer moves 1 node; by the time the fast one reaches the end, the slow one will be at the middle.

Q237 - Delete Node in a Linked List

  • This is a misleading question but basically rather than removing a node, you shift every element after it up by one. So it is more like effectively deleting a node.

Q238 - Product of Array Except Self

  • The intuition here is to build a running array. In fact, build it from both left and right. Notice that for each index, you can use the running products to get the result. To build the arrays, you start traversing from the left, and then from the right, multiplying the previous product with the previous missing item. To generalize, a running array may be described as an array that contains the result up to that point, and the next index can be obtained by modifying the previous index.
  • You can use the output array as a space to build a running array, and use a single variable to hold the other running product, to achieve constant space.

Q242 - Valid Anagram

  • Classic situation where you use a frequency table to check if equal.
  • You could also sort and compare since sorted anagrams become identical.
  • A hash table is a more generic solution as it can take unicode characters as well.
  • It might be a good idea to look for early returns on some validation questions, since once you encounter, say, a negative frequency in this case, you can't recover from it.

Q257 - Binary Tree Paths

  • Not much to say, straightforward stuff.

Q258 - Add Digits

  • Straightforward naive solution, but this is devisable in constant time mathematically.

Q268 - Missing Number

  • Sorting is an obvious way to make the result, well, obvious.
  • The formula n(n+1)/2 is still useful, keep it in mind.
  • Bit manipulation using XOR.

Q278 - First Bad Version

  • Once again, to minimize overhead when something is sorted, even if that thing is sorted chronologically in this case, binary search helps.

Q322 - Coin Change

  • Classic DP problem. You build up to the target amount, and at each step, you pick the minimum amount of ways among the coins. The indices are sums, the values are the minimum coins needed to get to that sum.
  • You can also solve it with a recursive backtrack using memoization. You go top-down on this one though.

Q338 - Counting Bits

  • Count up using & bitmask checking bit by bit.

Q344 - Reverse String

  • Straightforward with two pointers, but consider that it may be more readable to use a while loop.

Q345 - Reverse Vowels of a String

  • Pretty usual two pointer approach where on each iteration you move both pointers until they are at a valid index.

Q347 - Top K Frequent Elements

  • Go step by step, solving subproblems. Other than that, it's the usual heap question with a custom functor comparison.

Q359 - Logger Rate Limiter

  • Using a hashmap to keep track of messages is simple, but over time there is no garbage collection on the memory. Instead you can use a queue+set combination to wipe all messages that are at the front of the queue with a timestamp difference over the constraints. The set serves as the hashmap check in this case.

Q366 - Find Leaves of Binary Tree

  • This can be solved iteratively both destructively and by keeping a hashset.
  • However, recursion is much simpler, and the intuition is basically that the current level is determined by the maximum height of either the left or right branch. Once the current height is found, the node can be added to its respective vector index.

Q369 - Plus One Linked List

  • Could have solved this one with constant space...

Q371 - Sum of Two Integers

  • To do addition without the + operator, you use XOR for addition and AND for carry.
  • In certain languages, a bitmask is necessary. It has something to do with overflows, but I'm not sure what yet.

Q382 - Linked List Random Node

  • You simply load the list into an array and pick from there with ((float) rand() / RAND_MAX) * list.size(). But if the height is unknown, etc., you need reservoir sampling.

Q387 - First Unique Character in a String

  • Simple use of a basic frequency table. Do note that for such simple uses, vectors are much faster than maps/sets.

Q389 - Find the Difference

  • Either frequency table or bit manipulation using XOR.

Q394 - Decode String

  • You can convert chars to int by doing charValue - '0'.
  • Recursive: Mostly straightforward, just beware of your loop conditionals. This approach takes away the complexity of managing your own stacks. (Also, there's actually a much more concise recursive solution)
  • Iterative: Solving a recursive problem with iteration usually involves stacks, but beware that you may even need to manage multiple stacks for different things.

Q412 - Fizz Buzz

  • Coding this so that the strings are concatenated makes it more extensible. You could even go overboard have a map of int to string pairs and just iterate over keys to check divisibility.

Q414 - Third Maximum Number

  • When you need to get an iterator to the last item, use rbegin. For a reference, use back.
  • You can keep a fixed-size ordered set as a heap with no duplicates.

Q415 - Add Strings

  • Standard grade-school addition but be careful with data structure sizes. size_t is unsigned, so subtracting directly from it can be a very bad idea.

Q423 - Reconstruct Original Digits from English

  • More like a brain twister, you need to notice that certain numbers have unique characters and such. Getting them out of the way sooner means you get the correct result at the end.

Q509 - Fibonacci Number

  • You can solve this in many ways (DP/Recursion+Memoization), but honestly just keep the previous two values in a variable for constant space.

Q528 - Random Pick with Weight

  • You can set up a probability table using prefix sums. Add the last weight plus the current weight and append to buckets. When picking randomly among the buckets, you can actually use binary search to optimize further.

Q540 - Single Element in a Sorted Array

  • You hear the word "sorted", you need to think binary search. You can find even the non-duplicate item with binary search.
  • Some rules of thumb regarding the binary search conditional:
    • If returning inside loop, low <= high may be better.
    • If reducing but returning outside, low < high may be better.
    • If discarding mid for next iteration, low <= high may be better.
    • If keeping mid for next iteration, low < high may be better.
    • If you don't discard mid, you may end up in an infinite loop, be careful.

Q541 - Reverse String II

  • Straightforward but make sure you think it through before coding.

Q557 - Reverse Words in a String III

  • Use a stack to reverse char order until you hit a delimiter.

Q559 - Maximum Depth of N-ary Tree

  • Iterative solution uses BFS with pair to keep track of depth. Recursive solution is pretty standard as well.

Q637 - Average of Levels in Binary Tree

  • Instead of keeping track of levels in BFS pairs, it's better to use the level traversal trick in Q111.

Q647 - Palindromic Substrings

  • There are two ways to approach this kind of palindrome question:
    • Dynamic Programming: Keep a 2d matrix as a boolean store for indices where there is a palindrome. For two index pointers i, j, the vector dp[i][j] would be mapped to that substring. Since there are two base cases, you get those out of the way and work out all the other lengths up to the length of the string.
    • Expand Around Center: For any index, you can check if the characters to the left and right are equal. Once you find an unequality, you can break out and return the count so far. Note that this applies differently for an odd or even palindrome. A single index is enough for an odd palindrome, but when checking even palindromes you would need two indices i and i+1.

Q680 - Valid Palindrome II

  • If you're validating something, and you're allowed to remove an element at most, you can simply feed the subsequence after the deletion into the function, to solve it via recursion.

Q696 - Count Binary Substrings

  • Very tricky intuition. Keep two pointers, prev and cur. Whenever a transition to the other group is made, add to count min(prev, cur), and move cur to prev then reset cur to 1.

Q703 - Kth Largest Element in a Stream

  • Standard min-heap question.

Q706 - Design HashMap

  • When you've got ints as keys, a hash function is as simple as using a modulo. Preferably do it with a table size that is a prime number to reduce collisions.

Q746 - Min Cost Climbing Stairs

  • Pretty much the same as the original climbing stairs, only now the values are mapped to the minimum cost.

Q811 - Subdomain Visit Count

Need to add notes here.

Q833 - Find And Replace in String

Need to add notes here.

Q876 - Middle of the Linked List

  • Classic two pointer approach where one is slow and the other is fast. Once the fast one reaches the end, the slow one is at the middle (the same one that is size / 2).

Q938 - Range Sum of BST

  • BST (Binary Search Tree) lets you discard paths that do not meet constraints. It's basically a tree property that lets you naturally perform binary search.
  • DFS uses either a stack or recursion, and BFS uses queues.

Q989 - Add to Array-Form of Integer

  • Another grade-school addition problem.

Q1005 - Maximize Sum Of Array After K Negations

  • The intuition is to keep all values in a min-heap. For k times, you pop the minimum item, and push back its negative. If the new minimum is "different", that's fine. If the new minimum is the "same" element, it'll be negated once again to return to its original sign. Then after having done the optimal negations, you just sum up everything in the heap.

Q1051 - Height Checker

  • Simply construct the sorted version and compare with original.

Q1119 - Remove Vowels from a String

  • Super easy. Skip concatenation whenever a vowel is encountered.

Q1408 - String Matching in an Array

  • For string matching, you either brute-force your way through, or you use something like KMP.

Q1985 - Find the Kth Largest Integer in the Array

  • Just a standard min-heap problem.
  • When comparing strings that are actually integers, you don't have to parse the integer. If their lengths aren't equal, the longer one is the bigger number. If the length are equal, a simple a > b to check lexicographically will suffice.

Q2095 - Delete the Middle Node of a Linked List

  • As simple as finding the middle really.

Q2099 - Find Subsequence of Length K With the Largest Sum

  • Note that subsequences do not consist of adjacent elements, but apparently need to preserve original order. This is best achieved with a combination of heap and hashmap. The max-heap keeps the top elements, which are counted into a hashmap. Then, the original numbers are iterated, in order, and are added into result based on hashmap findings. Thus, we leverage the original array's order and decide on insertion with another data structure.

Cracking the Coding Interview - Notes

Interview Process

This section includes thoughts from both the book and experiences I've read around the web.

  • Always talk out loud throughout the problem and expose your thought process.
  • It's okay to receive some assistance from the interviewer, the process is interactive after all. If anything, it's better to pay attention to their feedback.
  • Communication skills are nearly as important as other skills.
  • Going through an example is extremely important both to verify and come up with solutions.
  • When looking to find the source of a bug, it's better to go for smaller examples.
  • Likewise, to find bugs, it's a good idea to introduce edge cases that are small.
  • Ask clarifying questions to the interviewer to really grasp the problem. Questions are left vague for a reason. It's a good idea to get together a Constraints section like the ones LeetCode questions have.
  • If you make ANY non-trivial assumptions in your code/algorithm, make sure to bring them up with the interviewer.
  • Don't start coding until you have a solution cleared with the interviewer.
  • Do a dry-run of the solution to catch bugs and verify.
  • If you don't see an optimal solution, start off with the brute-force one and optimize from there. In fact, even if you are able to find an optimal solution, it's still a good idea to start with the brute one. It's good to let the interview see the alternatives you're able to come up with.
  • Often times, it helps to think of what you would do if you were to solve the problem manually with your hands. For example, people do binary search on sorted lists almost by reflex. Something similar may occur with other problems.
  • You might want to come up with a base case and build up from there. For example, for a permutation question, find the base cases {a} and {ab, ba} then realize that for permutations of abc you just need to insert a c into it.
  • It's a good idea to have some genuine questions prepared for the interviewer.
  • Don't forget to go over the items on your resume as it's possible to be asked about them in depth.
  • Recommended questions from Google to think about:
    • How do you work best, both as an individual and as part of a team?
    • What challenges have you faced at school or at work and how did you overcome them?
    • Which of your skills or experiences would be assets in the role and why?
    • What is something you learned that made everything that came after easier?
    • Have more of your achievements come as a result of solitary effort or teamwork?
    • What do you enjoy more, solving problems or pushing the discussion forward?
    • What is the most rewarding job you've ever had? Why?
    • Describe the best team you ever worked with. What made that experience stand out?
  • Things that get me excited about Google:
    • The flexibility to work the way you want
    • Financial peace of mind
    • The chance to be surrounded by Googlers
    • The room for growth
    • All kinds of perks to enjoy
    • To be able to work on significant projects
    • To experience new things & diversity
    • There's clearly so much more on the inside that I don't know yet

Big O Notation

Big O is used to describe the efficiency of algorithms. It's concerned more with the growth of complexity rather than the actual runtime itself. Time complexity and space complexity can be described with Big O. Commonly seen are, from best to worst; O(1), O(logN), O(N), O(NlogN), O(N^2), O(2^N), O(N!). The runtimes may not occur exactly, as in, there may exist best and worst cases. For example, in code with the possibility of early returns, the best case may be significantly better.

Recursive functions can have their time/space complexities determined by the amount of times they are called, since each call is added onto the call stack. Other times, it may help to draw a graph with branches of function calls, to clearly see the complexity by comparing levels.

Constants are dropped from Big O notation. As mentioned, we are only concerned with the rate of increase. Also drop non-dominant terms, for example, O(N^2 + N) is reduced to O(N^2). However, be careful, as some operations have their complexities multiplied rather than added. These are usually dependent on one another, such as by being nested.

Sometimes, an aspect of the complexity may not be explicit. For example, a vector will need to resize and copy all elements to a new inner array after some insertions. Similarly, a string will need to copy its contents over to a new memory location, especially when a lot of concatenations are taking place.

A logarithmic time complexity often indicates that the elements in question get halved (or more) upon each iteration.

Be very careful with what variables you use in a Big O notation, using N for several unrelated quantities is a common mistake. The amount of elements in an array and the length of each string in that array cannot both be represented by N.

When code runs in a fashion like 1 + 2 + ... + n, the runtime is still O(N^2) because (N(N+1))/2 -> N^2.

The Essential Data Structures & Algorithms

Data Structures

Arrays - vector
  • The techniques with arrays almost always apply to strings as well.
  • Many languages do not have resizable arrays natively, and instead use library classes like vector. These provide dynamic resizing, and may multiply their size when they are internally full.
  • Be careful with string concatenation as it can result in many internal copies. However, since C++ has a mutable string class, the resize behavior is similar to that of a vector.
  • Rabin-Karp substring search is the technique to check if two substrings have the same hash, ideally using a rolling hash function.
  • std::vector;
    • Constructors: vector(), vector(count, value), vector(size), vector(itr_begin, itr_end), vector(vector), vector({init_list}).
    • operator[index] returns reference in O(1).
    • front and back return references to first and last elements respectively.
    • begin/end and rbegin/rend return iterators in respective directions.
    • empty and size for element count.
    • clear (O(N)), erase(itr) or erase(itr_begin, itr_end) (O(N)), push_back (amortized O(1)), pop_back (O(1)).
    • insert(itr, value) for insertion before itr, insert(itr, itr_begin, itr_end) inserts range before itr, insert(itr, {init_list}) inserts list before itr. Runtime is linear in the distance to the end of array.
  • std::vector<bool> is a special case, as it is implemented for use as a dynamic bitset. It has a hash function, unlike other vector types, and has a member function flip to flip all bits.
  • std::string;
    • Constructors: string() and string(count, char) (initialized with count amount of char).
    • operator[index] returns reference to char.
    • Same as vector; front, back, begin/etc, empty, size, clear, pop_back.
    • operator+= to append char or string.
    • insert has lots of overloads.
    • erase(i_begin, count) or erase(itr) or erase(itr_begin, itr_end).
    • substr(i_begin, count = end) returns substring [i_begin, i_begin+count)
    • find(string/char, i_begin) returns index to first match, searches from i_begin.
    • to_string non-member function for numeric values.
Linked Lists - list & ListNode
  • Lists don't provide random access, but they do allow inserting/removing items at the head in constant time.
  • Lists may be singly or doubly linked.
Hash Tables/Sets - unordered_map/unordered_set/etc
  • Hash tables map keys to values for efficient lookup. This is usually done via hashing functions. The hash of the key is computed, and the hash is mapped to an index on the internal array. This internal array consists of linked lists, which hold the items whose hash is the same. If the hash collisions are high, the lookup runtime may increase to linear.
  • Hash tables may also be implemented using balanced binary search trees (also called red-black tree). This makes most runtimes a O(logN), but the space used potentially decreases.
  • std::unordered_map<Key, Value>;
    • Constructors: unordered_map(), unordered_map(itr_begin, itr_end), unordered_map({init_list}). The initializers need to be of form {{key, value}, ...}.
    • begin/end return iterators.
    • empty/size/clear are supported.
    • erase(key) or erase(itr).
    • operator[key] returns reference to value. If no such key exists, it's inserted.
    • count(key) returns 1 or 0 as this container does not support duplicate keys.
    • find(key) returns iterator if found, otherwise end.
    • iterator points to pair<Key, Value>.
  • std::unordered_set<Key>;
    • Mostly similar to unordered_map.
    • Insertion is done via insert(key).
  • There are also set and map, which are implemented with trees, and are sorted.
  • There are also multi versions of these, which allow duplicate keys.
Stacks/Queues - stack/queue
  • Stacks are often useful in implementing a recursive function iteratively.
  • Queues are good for implementing BFS and caches.
  • std::stack<Value>;
    • top returns reference to top item.
    • size/empty are supported.
    • push adds new items, has runtime of internal container.
    • pop removes top item, has runtime of internal container.
  • std::queue<Value>;
    • front/back return references to first/last item pushed respectively.
    • size/empty are supported.
    • push adds new items, has runtime of internal container.
    • pop removes front item, has runtime of internal container.
Heaps - priority_queue
  • Priority queues are implemented with heaps and are great for finding the max/min of a thing.
  • A heap is actually a binary tree that has a specific order among nodes. For example, in a min-heap, each node is smaller than its children.
  • std::priority_queue<Value, Container=vector<Value>, Compare=less<Value>>;
    • Is a max-heap by default, with the largest element at the top. Can be made min-heap with Compare -> greater<Value>. Can also have custom comparison by creating a new class with an overloaded operator().
    • top returns top item of the heap tree, value depends on priority criteria.
    • size/empty are supported.
    • push adds new items, in correct order, so has logarithmic runtime and of internal container.
    • pop removes top item, then corrects order, so has logarithmic runtime and of internal container.
Trees - TreeNode
  • Not all trees are binary trees.
  • A node is called a "leaf" if it has no children.
  • A binary search tree is a tree where a specific order is kept: all left descendents <= n < all right descendents.
  • Not all trees are balanced.
  • Binary Tree Traversal:
    • In-Order Traversal: The visits are done in order of left branch -> current node -> right branch. When used on a BST, the nodes are in ascending order.
    • Pre-Order Traversal: The visits are done in order of current node -> left branch -> right branch.
    • Post-Order Traversal: The visits are done in order of left branch -> right branch -> current node.
  • Tries (Prefix Trees) are n-ary trees in whose nodes are the characters to a word. The root connects to the first character of a string, which is laid out node to node until its end is marked in some way, such as with a "null" character or special termination node.

Concepts (Algorithms, etc.)

Breadth-First Search (BFS)
  • Implemented with queues.
  • Starts at the root, each neighbour is explored before moving onto their children.
Depth-First Search (DFS)
  • Implemented either via recursion or stacks.
  • Starts at the root, explores each branch completely before moving onto the next branch.
Binary Search
  • Look at the middle, determine whether to move left or right next, keep going until the middle is the element we need.
  • Has runtime O(logN), since it keeps halving.
Merge Sort / Quick Sort
  • Merge Sort (O(NlogN)) keeps dividing the array in half until there are only two elements to be merged, and merges them in the correct order.
  • Quick Sort (O(NlogN) average, O(N^2) worst case) picks a random element and partitions the array such that it is eventually sorted.
  • Radix Sort (O(kN) where k is the number of passes) works nicely in cases where finite integers are involved.
Bit Manipulation
  • XOR is useful for addition, AND for carry, shifts for bitmasking and such. And bitmasking is great. It's probably preferable to use uint32_t if possible.
Dynamic Programming
  • Usually done by keeping track of previous subproblems within an array, and it's useful to map the indices to subproblems that could aid with the current subproblem, and the values behind those indices to be the result of that previous subproblem. For example, for combination sum, the indices could be a certain sum and the values could be the number of ways to reach that sum, and so on.
Backtracking
  • Usually involves recursion and invalidating a path once you're done with it.
Greedy Algorithms
  • Simply go for the naive, greedy solution that may work.
NP-complete Problems
  • P problems have solutions in polynomial time.
  • NP problems have solutions that can be verified in polynomial time.
  • NP-complete problems are those that can have other NP problems reduced to them.
  • It is unknown whether NP-complete problems can be solved in polynomial time.

Mock Interview with a Googler! - Feedback

  • It's always a good idea to display your interest in CS/Engineering, even as you introduce yourself.
  • Start by clarifying things, such as input types, overflows and limits, and special cases such as negative numbers or unequal lengths, etc.
  • Immediately come up with your own examples, don't just rely on the ones provided.
  • When you say something, you're going to have to justify it. For example, to say something is "complicated" will urge the interviewer to question you as to why.
  • Preferably don't throw all your code into a single function, make things modular and ideally functions should only be achieving one purpose.
  • Taking initiative is a good sign. Proposing to do a dry-run without being prompted is good. Make sure to do a dry-run for both an input that passes and one that fails. In other words, show the different paths that the code could take.
  • Try to come up with concise ways to represent structures on the document, such as by representing a frequency table vector as a map: {1: 2, 7: 3, 9: 1, 8: -1}.
  • If you don't precisely understand what the interviewer is asking, it's better to ask for clarification rather than roll with a possibly incorrect solution.
  • Don't overcomplicate the complexity analysis. State what N is then just describe it in terms of N.
  • It's a good idea to talk about scalability as you're coming up with an algorithm.
  • Try not to focus on a specific time complexity, at least not vocally. Simply go for problem solving instead.
  • When you're not able to hash a structure, you might be able to serialize it as a string instead and use that as a hashable key.
  • Keep in mind that you can prune unnecessary data to optimize space. For example, you don't need to store tons of indices if they're invalidated once there are multiple of them.
  • Slow down and spend a bit more time to consider potential drawbacks of the individual design choices you make.
  • Always state/ask about assumptions before making them.
  • Try to have multiple ways in mind to do a thing if possible.
  • Be careful with typos, it's better not to make them.
  • Asking a decent balance of technical and social questions at the end is a good idea.
  • It might be a good idea to write down the general steps of your algorithm, so that they're there when you need to code.