Revision as of 03:12, 5 August 2018

External

Internal

Programming

Overview

A data structure is an arrangement of data in computer's memory or external storage, designed to facilitate a specific way to access or modify it. Data structures include arrays, linked lists, stacks, binary trees, hash tables, etc.

Algorithms manipulate the data in these structures in various ways. An algorithm is any well-defined computational procedure consisting in a sequence of steps, which takes some value or set of values, called input and produces a value, or a set of values, called output. An algorithm can be viewed as a tool for solving a well-specified computational problem. In this context, a specific set of input values provided to the algorithm is called an instance of the problem.

One of the most important characteristics of an algorithm is its correctness. An algorithm is said to be correct if, for every input instance, it halts with the correct output. It is almost always desirable for an algorithm to be correct. However, in some cases, even incorrect algorithms are useful if we can control the error rate. An example of such algorithm is Miller-Rabin primality test.

Another characteristic of algorithms is efficiency. The efficiency of an algorithm can be analyzed through formal methods and expressed using a special notation, called asymptotic notation. The asymptotic notation uses functions that bound the algorithm's running time from above and from below. To say that the running time is asymptotically bounded from above by a specific function, say n², we use the "big-O" notation: O(n²). All these notions are expanded upon in the algorithm complexity section.

Algorithms can be coarsely categorized in iterative and recursive. A common recursive technique is divide-and-conquer. The running time of recursive algorithms can be estimated using recurrences.

A problem that arises frequent in practice is to sort a sequence of numbers into nondecreasing order. The class of algorithms that addresses this problem are the sorting algorithms.

Algorithms whose behavior is determined not only by input, but also by the values produced by a random-number generator that are called randomized algorithms. A randomized algorithm implies an inherent probability distribution for one or more variable, so the running time of such an algorithm may differ on different inputs on the same size. Probabilistic analysis is used to analyze running time of randomized algorithms.

Arrays.
Lists (single linked and double linked)
Stack
Queue. Difference between list and queue.
Sorting
- Comparison Sort. Comparison sort algorithm class. Know that comparison sort cannot do better than n ln n. Proof in a separate article.
- Non-comparison Sort
- Sorting algorithms characteristics (in-place, stability)
Algorithm complexity, Bounds. Understand and document O, Omega and Theta notations. Perform complexity analysis on all the algorithms I examine. Expected running time, worst-case running time. Average case running time.
Iteration and recursion – fundamentally different approaches.
Set
Map
Hash Map
Distributed Hash Map. Insist on this as is key to systems that scale.
Collision resistant hash function.
Trees
- Binary Trees. Binary tree height.
  - Sorting trees. Difference between a binary search tree and a red-black tree.
- BTrees
Tree walking algorithms. Difference between depth first and breadth first.
Graphs
Graph algorithms
Dynamic programming and memoization.
Greedy algorithms.
Matrix multiplication (NOKB and code).
lg and ln notation. Link from the Mathematics main page.
Random variable analysis. Indicator random variable.
Probability Distribution.
Consider starting upside down, with the bottom (math) sections, and NOKB concepts from them first.
Mathematics: Understand and document induction.
Need to understand aggregate analysis Section 17.1

Number-theoretic algorithms are important due in large part to the invention of cryptographic schemes based on large prime numbers. Algorithms in this category are used to generate large prime numbers. Some of these algorithms, for example Miller-Rabin primality test algorithm, are not entirely correct, the sense that there is a very small chance of error, but the chance of error is so small that is considered acceptable.

Almost all the algorithms mentioned so far have been polynomial-time algorithms, which is to say that on an input of size n, their worst running time is O(n^k) for some constant k. Generally, we think of a problem that is solvable by a polynomial-time algorithm as tractable or easy. A problem that requires super-polynomial time is designated intractable or hard. There are also problems whose status is unknown: no polynomial-time algorithm has been yet discovered for them, nor has anyone yet been able to prove that no polynomial-time algorithm can exist for any of them. This class of problems is called NP-complete problems. The set of NP-complete problems has the property that if an efficient algorithm exists for any one of them, then efficient algorithms exist for all of them. There are methods to show that a problem is NP-complete, and if that is the case, an approximation algorithm instead of an efficient algorithm, can be developed form it.

@@ Line 59: / Line 59: @@
 <span id='Number-Theoretic_Algorithms'></span>'''Number-theoretic algorithms''' are important due in large part to the invention of cryptographic schemes based on large prime numbers. Algorithms in this category are used to generate large prime numbers. Some of these algorithms, for example '''[[Miller-Rabin Primality Test Algorithm|Miller-Rabin primality test algorithm]]''', are not entirely correct, the sense that there is a very small chance of error, but the chance of error is so small that is considered acceptable.
-<span id='NP-complete_Problems'></span>Almost all the algorithms mentioned so far have been '''polynomial-time algorithms''', which is to say that on an input of size n, their worst running time is O(n<sup>k</sup>) for some constant k. Generally, we think of a problem that is solvable by a polynomial-time algorithm as '''tractable''' or '''easy'''. A problem that requires super-polynomial time is designated '''intractable''' or '''hard'''. There are also problems whose status is unknown: no polynomial-time algorithm has been yet discovered for them, nor has anyone yet been able to prove that no polynomial-time algorithm can exist for any of them. This class of problems is called '''[[NP-complete Problems#Overview|NP-complete problems]]'''. The set of NP-complete problems has the property that if an efficient algorithm exists for any one of them, then efficient algorithms exist for all of them.
+<span id='NP-complete_Problems'></span>Almost all the algorithms mentioned so far have been '''polynomial-time algorithms''', which is to say that on an input of size n, their worst running time is O(n<sup>k</sup>) for some constant k. Generally, we think of a problem that is solvable by a polynomial-time algorithm as '''tractable''' or '''easy'''. A problem that requires super-polynomial time is designated '''intractable''' or '''hard'''. There are also problems whose status is unknown: no polynomial-time algorithm has been yet discovered for them, nor has anyone yet been able to prove that no polynomial-time algorithm can exist for any of them. This class of problems is called '''[[NP-complete Problems#Overview|NP-complete problems]]'''. The set of NP-complete problems has the property that if an efficient algorithm exists for any one of them, then efficient algorithms exist for all of them. There are methods to show that a problem is NP-complete, and if that is the case, an '''approximation algorithm''' instead of an efficient algorithm, can be developed form it.
 =Organizatorium=

Algorithms: Difference between revisions

Revision as of 03:12, 5 August 2018

Contents

External

Internal

Overview

Organizatorium

Navigation menu

Algorithms: Difference between revisions

Revision as of 03:12, 5 August 2018

External

Internal

Overview

Organizatorium

Navigation menu

Search