-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathDSA
320 lines (229 loc) · 20.7 KB
/
DSA
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
1. Learn at least one Programming language
This should be your first step while starting to learn data structure and algorithms. We as human beings, before learning to write a sentence or an essay on a topic, first try to learn that language: the alphabet, letters, and punctuations in it, how and when to use them. The same goes for programming also.
Firstly, select a language of your choice, be it Java, C, C++, Python, or any other language of your choice. Before learning how to code in that language you should learn about the building pieces of the language: the basic syntax, the data types, variables, operators, conditional statements, loops, functions, etc. You may also learn the concept of OOP (Object Oriented Programming).
To help you get started with the language of your choice, we have created a complete course to start as a beginner, such as:
C Programming (Basic to Advanced) – Self Paced
Fork CPP Programming – Self Paced
Fork Java Programming – Self Paced
Fork Python Programming – Self Paced
Fork Javascript -Self Paced
2. Learn about Complexities
Here comes one of the interesting and important topics. The primary motive to use DSA is to solve a problem effectively and efficiently. How can you decide if a program written by you is efficient or not? This is measured by complexities. Complexity is of two types:
Time Complexity: Time complexity is used to measure the amount of time required to execute the code.
Space Complexity: Space complexity means the amount of space required to execute successfully the functionalities of the code.
You will also come across the term Auxiliary Space very commonly in DSA, which refers to the extra space used in the program other than the input data structure.
Both of the above complexities are measured with respect to the input parameters. But here arises a problem. The time required for executing a code depends on several factors, such as:
The number of operations performed in the program,
The speed of the device, and also
The speed of data transfer if being executed on an online platform.
So how can we determine which one is efficient? The answer is the use of asymptotic notation.
Asymptotic notation is a mathematical tool that calculates the required time in terms of input size and does not require the execution of the code.
It neglects the system-dependent constants and is related to only the number of modular operations being performed in the whole program. The following 3 asymptotic notations are mostly used to represent the time complexity of algorithms:
Big-O Notation (Ο) – Big-O notation specifically describes the worst-case scenario.
Omega Notation (Ω) – Omega(Ω) notation specifically describes the best-case scenario.
Theta Notation (θ) – This notation represents the average complexity of an algorithm.
Rate of Growth of Algorithms
The most used notation in the analysis of a code is the Big O Notation which gives an upper bound of the running time of the code (or the amount of memory used in terms of input size).
To learn about complexity analysis in detail, you can refer to our complete set of articles on the Analysis of Algorithms.
3. Learn Data Structures and Algorithms
Here comes the most crucial and the most awaited stage of the roadmap for learning data structure and algorithm – the stage where you start learning about DSA. The topic of DSA consists of two parts:
Data Structures
Algorithms
Though they are two different things, they are highly interrelated, and it is very important to follow the right track to learn them most efficiently. If you are confused about which one to learn first, we recommend you to go through our detailed analysis on the topic: What should I learn first- Data Structures or Algorithms?
Here we have followed the flow of learning a data structure and then the most related and important algorithms used by that data structure.
Roadmap to learn DSA
3.1. Array
The most basic yet important data structure is the array. It is a linear data structure. An array is a collection of homogeneous data types where the elements are allocated contiguous memory. Because of the contiguous allocation of memory, any element of an array can be accessed in constant time. Each array element has a corresponding index number.
Array Data Structure
To learn more about arrays, refer to the article “Introduction to Arrays“.
Here are some topics about array which you must learn:
Rotation of Array – Rotation of array means shifting the elements of an array in a circular manner i.e., in the case of right circular shift the last element becomes the first element, and all other element moves one point to the right.
Rearranging an array – Rearrangement of array elements suggests the changing of an initial order of elements following some conditions or operations.
Range queries in the array – Often you need to perform operations on a range of elements. These functions are known as range queries.
Multidimensional array – These are arrays having more than one dimension. The most used one is the 2-dimensional array, commonly known as a matrix.
Kadane’s algorithm
Dutch national flag algorithm
3.2. String
A string is also a type of array. It can be interpreted as an array of characters. But it has some special characteristics like the last character of a string is a null character to denote the end of the string. Also, there are some unique operations, like concatenation which concatenates two strings into one.
String Data Structure
some must-know concepts of string:
Subsequence and substring – A subsequence is a sequence that can be derived from a string deleting one or more elements. A substring is a contiguous segment of the string.
Reverse and rotation in a string – Reverse operation is interchanging the position of characters of a string such that the first becomes the last, the second becomes the second last, and so on.
Binary String – A binary string is a string made up of only two types of characters.
Palindrome – A palindrome string is a string in which the elements at the same distance from the center of the string are the same.
Lexicographic pattern – Lexicographical pattern is the pattern based on the ASCII value or can be said in dictionary order.
Pattern searching – Pattern searching is searching a given pattern in the string. It is an advanced topic of string.
3.3. Linked List
As the above data structures, the linked list is also a linear data structure. But Linked List is different from Array in its configuration. It is not allocated to contiguous memory locations. Instead, each node of the linked list is allocated to some random memory space and the previous node maintains a pointer that points to this node. So no direct memory access of any node is possible and it is also dynamic i.e., the size of the linked list can be adjusted at any time. To learn more about linked lists refer to the article “Introduction to Linked List“.
Linked List Data Structure
The topics which you must want to cover are:
Singly Linked List – In this, each node of the linked list points only to its next node.
Circular Linked List – This is the type of linked list where the last node points back to the head of the linked list.
Doubly Linked List – In this case, each node of the linked list holds two pointers, one point to the next node and the other points to the previous node.
3.4. Searching Algorithm
Now we have learned about some linear data structures and is time to learn about some basic and most used algorithms which are hugely used in these types of data structures. One such algorithm is the searching algorithm.
Searching algorithms are used to find a specific element in an array, string, linked list, or some other data structure.
The most common searching algorithms are:
Linear Search – In this searching algorithm, we check for the element iteratively from one end to the other.
Binary Search – In this type of searching algorithm, we break the data structure into two equal parts and try to decide in which half we need to find for the element.
Ternary Search – In this case, the array is divided into three parts, and based on the values at partitioning positions we decide the segment where we need to find the required element.
Besides these, there are other searching algorithms also like
Jump Search
Interpolation Search
Exponential Search
3.5. Sorting Algorithm
Here is one other most used algorithm. Often we need to arrange or sort data as per a specific condition. The sorting algorithm is the one that is used in these cases. Based on conditions we can sort a set of homogeneous data in order like sorting an array in increasing or decreasing order.
Sorting Algorithm is used to rearrange a given array or list elements according to a comparison operator on the elements. The comparison operator is used to decide the new order of element in the respective data structure.
An example to show Sorting
There are a lot of different types of sorting algorithms. Some widely used algorithms are:
Bubble Sort
Selection Sort
Insertion Sort
Quick Sort
Merge Sort
There are several other sorting algorithms also and they are beneficial in different cases. You can learn about them and more in our dedicated article on Sorting algorithms.
3.6. Divide and Conquer Algorithm
This is one interesting and important algorithm to be learned in your path of programming. As the name suggests, it breaks the problem into parts, then solves each part and after that again merges the solved subtasks to get the actual problem solved.
Divide and Conquer is an algorithmic paradigm. A typical Divide and Conquer algorithm solves a problem using following three steps.
Divide: Break the given problem into subproblems of same type.
Conquer: Recursively solve these subproblems
Combine: Appropriately combine the answers
This is the primary technique mentioned in the two sorting algorithms Merge Sort and Quick Sort which are mentioned earlier. To learn more about the technique, the cases where it is used, and its implementation and solve some interesting problems, please refer to the dedicated article Divide and Conquer Algorithm.
3.7. Stack
Now you should move to some more complex data structures, such as Stack and Queue.
Stack is a linear data structure which follows a particular order in which the operations are performed. The order may be LIFO(Last In First Out) or FILO(First In Last Out).
Stack Data Structure
The reason why Stack is considered a complex data structure is that it uses other data structures for implementation, such as Arrays, Linked lists, etc. based on the characteristics and features of Stack data structure.
3.8. Queue
Another data structure that is similar to Stack, yet different in its characteristics, is Queue.
A Queue is a linear structure which follows First In First Out (FIFO) approach in its individual operations.
Queue Data Structure
A queue can be of different types like
Circular queue – In a circular queue the last element is connected to the first element of the queue
Double-ended queue (or known as deque) – A double-ended queue is a special type of queue where one can perform the operations from both ends of the queue.
Priority queue – It is a special type of queue where the elements are arranged as per their priority. A low priority element is dequeued after a high priority element.
3.9. Tree Data Structure
After having the basics covered about the linear data structure, now it is time to take a step forward to learn about the non-linear data structures. The first non-linear data structure you should learn is the tree.
Tree data structure is similar to a tree we see in nature but it is upside down. It also has a root and leaves. The root is the first node of the tree and the leaves are the ones at the bottom-most level. The special characteristic of a tree is that there is only one path to go from any of its nodes to any other node.
Tree Data Structure
Based on the maximum number of children of a node of the tree it can be –
Binary tree – This is a special type of tree where each node can have a maximum of 2 children.
Ternary tree – This is a special type of tree where each node can have a maximum of 3 children.
N-ary tree – In this type of tree, a node can have at most N children.
Based on the configuration of nodes there are also several classifications. Some of them are:
Complete Binary Tree – In this type of binary tree all the levels are filled except maybe for the last level. But the last level elements are filled as left as possible.
Perfect Binary Tree – A perfect binary tree has all the levels filled
Binary Search Tree – A binary search tree is a special type of binary tree where the smaller node is put to the left of a node and a higher value node is put to the right of a node
Ternary Search Tree – It is similar to a binary search tree, except for the fact that here one element can have at most 3 children.
3.10. Graph Data Structure
Another important non-linear data structure is the graph. It is similar to the Tree data structure, with the difference that there is no particular root or leaf node, and it can be traversed in any order.
A Graph is a non-linear data structure consisting of a finite set of vertices(or nodes) and a set of edges that connect a pair of nodes.
Graph Data Structure
Each edge shows a connection between a pair of nodes. This data structure helps solve many real-life problems. Based on the orientation of the edges and the nodes there are various types of graphs.
Here are some must to know concepts of graphs:
Types of graphs – There are different types of graphs based on connectivity or weights of nodes.
Introduction to BFS and DFS – These are the algorithms for traversing through a graph
Cycles in a graph – Cycles are a series of connections following which we will be moving in a loop.
Topological sorting in the graph
Minimum Spanning tree in graph
3.11. Greedy methodology
As the name suggests, this algorithm builds up the solution one piece at a time and chooses the next piece which gives the most obvious and immediate benefit i.e., which is the most optimal choice at that moment. So the problems where choosing locally optimal also leads to the global solutions are best fit for Greedy.
For example, consider the Fractional Knapsack Problem. The local optimal strategy is to choose the item that has maximum value vs weight ratio. This strategy also leads to a globally optimal solution because we are allowed to take fractions of an item.
Fractional Knapsack Problem
Here is how you can get started with the Greedy algorithm with the help of relevant sub-topics:
Standard greedy algorithms
Greedy algorithms in graphs
Greedy Algorithms in Operating Systems
Greedy algorithms in array
Approximate greedy algorithms for NP-complete problems
3.12. Recursion
Recursion is one of the most important algorithms which uses the concept of code reusability and repeated usage of the same piece of code.
Recursion
The point which makes Recursion one of the most used algorithms is that it forms the base for many other algorithms such as:
Tree traversals
Graph traversals
Divide and Conquers Algorithms
Backtracking algorithms
In Recursion, you can follow the below articles/links to get the most out of it:
Recursion
Recursive Functions
Tail Recursion
Towers of Hanoi (TOH)
3.13. Backtracking Algorithm
As mentioned earlier, the Backtracking algorithm is derived from the Recursion algorithm, with the option to revert if a recursive solution fails, i.e. in case a solution fails, the program traces back to the moment where it failed and builds on another solution. So basically it tries out all the possible solutions and finds the correct one.
Backtracking is an algorithmic technique for solving problems recursively by trying to build a solution incrementally, one piece at a time, removing those solutions that fail to satisfy the constraints of the problem at any point of time
Some important and most common problems of backtracking algorithms, that you must solve before moving ahead, are:
Knight’s tour problem
Rat in a maze
N-Queen problem
Subset sum problem
m-coloring problem
Hamiltonian cycle
Sudoku
3.14. Dynamic Programming
Another crucial algorithm is dynamic programming. Dynamic Programming is mainly an optimization over plain recursion. Wherever we see a recursive solution that has repeated calls for the same inputs, we can optimize it using Dynamic Programming.
The main concept of the Dynamic Programming algorithm is to use the previously calculated result to avoid repeated calculations of the same subtask which helps in reducing the time complexity.
Dynamic Programming
To learn more about dynamic programming and practice some interesting problems related to it, refer to the following articles:
Tabulation vs Memoization
Optimal Substructure Property
Overlapping Subproblems Property
How to solve a Dynamic Programming Problem?
Bitmasking and Dynamic Programming | Set 1
Bitmasking and Dynamic Programming | Set-2 (TSP)
Digit DP | Introduction
4. Practice, Practice and Practice more
With this, we have completed the basics of major Data structure and Algorithms, and now it’s time to try our hands on each of them.
“Practice makes a man perfect.”
This is highly applicable for learning DSA. You have learned a lot of data structures and algorithms and now you need a lot of practice. This may be seen as a separate step or an integrated part of the process of learning DSA. Because of its importance, we are discussing it as a separate step.
For practicing problems on individual data structures and algorithms, you can use the following links:
Practice problems on Arrays
Practice problems on Strings
Practice problems on Linked Lists
Practice problems on Searching algorithm
Practice problems on Sorting algorithm
Practice problems on Divide And Conquer algorithm
Practice problems on Stack
Practice problems on Queue
Practice problems on Tree
Practice problems on Graph
Practice problems on Greedy algorithm
Practice problems on Recursion algorithm
Practice problems on Backtracking algorithm
Practice problems on Dynamic Programming algorithm
Apart from these, there are many other practice problems that you can refer based on their respective difficulties:
School-level
Basic level
Easy level
Medium level
Hard level
You can also try to solve the most asked interview questions based on the list curated by us at:
Must-Do Coding Questions for Companies
Top 50 Array Coding Problems for Interviews
Top 50 String Coding Problems for Interviews
Top 50 Tree Coding Problems for Interviews
Top 50 Dynamic Programming Coding Problems for Interviews
You can also try our curated lists of problems from below articles:
SDE SHEET – A Complete Guide for SDE Preparation
DSA Sheet by Love Babbar
5. Compete and Become A Pro
Now it is time to test out your skills and efficiency. The best possible way is to compete with others. This will help you find out your position among others and also give you a hint on the areas you are lacking.
There are several online competitive platforms available where you can participate regularly. Also, some online challenges are held from time to time in a year which also provides lots of prizes and opportunities, such as:
Monthly Job-a-thon: It is a contest for individual participants. Participants get the opportunity to get hired by a bunch of companies that shortlist for interviews as per their criteria.
Bi-Wizard Coding: A coding competition exclusively for students. The top 100 students get chances of winning exciting rewards and also access to free courses.
Interview Series: A weekly challenge that gives a great opportunity for aspirants to practice a lot of questions based on important data structure and algorithms concepts for the preparation of interviews.
Problem of the Day: A new problem every day to strengthen the base of data structure and algorithm.
To learn more about where to compete, you can refer to our detailed article Top 15 Websites for Coding Challenges and Competitions.
Tips to boost your learning
By far we have discussed in-depth the 5 crucial steps to learning DSA from scratch. During the complete journey on the roadmap to learn DSA, here are some tips which will surely help you:
Learn the Fundamentals of chosen Programming Language thoroughly
Implement each small concept that you are learning. Make sure to learn the following concepts:
Basic Syntax
Data Types
Operators, Variables, functions
Conditional Statement, loops
OOP(Object-Oriented Programming)
Get a good grasp of the Complexity Analysis
Understand how the complexity is calculated, and try to solve multiple questions to find the complexities of programs. You can also try out our quiz on Analysis of Algorithms for better practice.
Focus on Logic Building
The best way to do this is to solve as many problems as you can from scratch, without looking into solutions or editorials. The more you solve, the more strong your logic building will be