Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
ADVANCED
LIST..........................................................
Memory efficient doubly linked list
Asked by Varun Bhatia.
Question:
Write a code for implementation of doubly linked list with use of single pointer in each node.
Solution:
This question is solved and very well explained at http://www.linuxjournal.com/article/6828.
SysAdmin
Save precious bytes with a new twist on a standard data type.
In the quest to make small devices cost effective, manufacturers often need
to think about reducing the memory size. One option is to find alternative
implementations of the abstract data types (ADTs) we are used to for our
day-to-day implementations. One such ADT is a doubly linked list
structure.
In this article, I present a conventional implementation and an alternative
implementation of the doubly linked list ADT, with insertion, traversal and
deletion operations. I also provide the time and memory measurements of
each to compare the pros and cons. The alternative implementation is
based on pointer distance, so I call it the pointer distance implementation
for this discussion. Each node would carry only one pointer field to traverse
the list back and forth. In a conventional implementation, we need to keep
a forward pointer to the next item on the list and a backward pointer to the
previous item. The overhead is 66% for a conventional node and 50% for
the pointer distance implementation. If we use multidimensional doubly
linked lists, such as a dynamic grid, the savings would be even greater.
A detailed discussion of the conventional implementation of doubly linked
lists is not offered here, because they are discussed in almost every data
structure and algorithm book. The conventional and the distance pointer
implementations are even used in the same fashion to have comparable
memory and time usage statistics.
Node Definition
We define a node of pointer distance implementation like this:
typedef int T;
typedef struct listNode{
T elm;
struct listNode * ptrdiff;
};
The ptrdiff pointer field holds the difference between the pointer to the next
node and the pointer to the previous node. Pointer difference is captured by
using exclusive OR. Any instance of such a list has a StartNode and an
EndNode. StartNode points to the head of the list, and EndNode points to
the tail of the list. By definition, the previous node of the StartNode is a
NULL node; the next node of the EndNode also is a NULL node. For a
singleton list, both the previous node and next node are NULL nodes, so
the ptrdiff field holds the NULL pointer. In a two-node list, the previous
node to the StartNode is NULL and the next node is the EndNode. The
ptrdiff of the StartNode is the exclusive OR of EndNode and NULL node:
EndNode. And, the ptrdiff of the EndNode is StartNode.
Traversal
The insertion and deletion of a specific node depends on traversal. We need
only one simple routine to traverse back and forth. If we provide the
StartNode as an argument and because the previous node is NULL, our
direction of traversal implicitly is defined to be left to right. On the other
hand, if we provide the EndNode as an argument, the implicitly defined
direction of traversal is right to left. The present implementation does not
support traversal from the middle of the list, but it should be an easy
enhancement. The NextNode is defined as follows:
typedef listNode * plistNode;
plistNode NextNode( plistNode pNode,
plistNode pPrevNode){
return ((plistNode)
((int) pNode->ptrdiff ^ ( int)pPrevNode) );
}
perform another exclusive OR with the previous node, we get the pointer to
the next node.
Insertion
Given a new node and the element of an existing node, we would like to
insert the new node after the first node in the direction of traversal that has
the given element (Listing 1). Inserting a node in an existing doubly linked
list requires pointer fixing of three nodes: the current node, the next node
of the current node and the new node. When we provide the element of the
last node as an argument, this insertion degenerates into insertion at the
end of the list. We build the list this way to obtain our timing statistics. If
the InsertAfter() routine does not find the given element, it would not
insert the new element.
Listing 1. Function to Insert a New Node
void insertAfter(plistNode pNew, T theElm)
{
plistNode pPrev, pCurrent, pNext;
pPrev = NULL;
pCurrent = pStart;
while (pCurrent) {
pNext = NextNode(pCurrent, pPrev);
if (pCurrent->elm == theElm) {
/* traversal is done */
if (pNext) {
/* fix the existing next node */
pNext->ptrdiff =
(plistNode) ((int) pNext->ptrdiff
^ (int) pCurrent
^ (int) pNew);
/* fix the current node */
pCurrent->ptrdiff =
(plistNode) ((int) pNew ^ (int) pNext
^ (int) pCurrent->ptrdiff);
/* fix the new node */
pNew->ptrdiff =
(plistNode) ((int) pCurrent
^ (int) pNext);
break;
}
pPrev = pCurrent;
pCurrent = pNext;
}
}
First, we traverse the list up to the node containing the given element by
using the NextNode() routine. If we find it, we then place the node after this
found node. Because the next node has pointer difference, we dissolve it by
exclusive ORing with the found node. Next, we do exclusive ORing with the
new node, as the new node would be its previous node. Fixing the current
node by following the same logic, we first dissolve the pointer difference by
exclusive ORing with the next current node. We then do another exclusive
ORing with the new node, which provides us with the correct pointer
difference. Finally, since the new node would sit between the found current
node and the next node, we get the pointer difference of it by exclusively
ORing them.
SysAdmin
Save precious bytes with a new twist on a standard data type.
Deletion
The current delete implementation erases the whole list. For this article,
our objective is to show the dynamic memory usage and execution times for
the implemented primitives. It should not be difficult to come up with a
canonical set of primitive operations for all the known operations of a
doubly linked list.
Consider the above Doubly Linked List. Following are the Ordinary and XOR (or Memory Effiecient) representations of the Doubly Linked List.
Ordinary Representation:
Node A:
prev = NULL, next = add(B) // previous is NULL and next is address of B
Node B:
prev = add(A), next = add(C) // previous is address of A and next is address of C
Node C:
prev = add(B), next = add(D) // previous is address of B and next is address of D
Node D:
prev = add(C), next = NULL // previous is address of C and next is NULL
XOR List Representation:
Let us call the address variable in XOR representation npx (XOR of next and previous)
Node A:
npx = 0 XOR add(B) // bitwise XOR of zero and address of B
Node B:
npx = add(A) XOR add(C) // bitwise XOR of address of A and address of C
Node C:
npx = add(B) XOR add(D) // bitwise XOR of address of B and address of D
Node D:
npx = add(C) XOR 0 // bitwise XOR of address of C and 0
Traversal of XOR Linked List:
We can traverse the XOR list in both forward and reverse direction. While traversing the list we need to remember the address of the previously accessed
node in order to calculate the next nodes address. For example when we are at node C, we must have address of B. XOR of add(B) and npx of C gives us
the add(D). The reason is simple: npx(C) is add(B) XOR add(D). If we do xor of npx(C) with add(B), we get the result as add(B) XOR add(D) XOR add(B)
which is add(D) XOR 0 which is add(D). So we have the address of next node. Similarly we can traverse the list in backward direction.
We have covered more on XOR Linked List in the following post.
XOR Linked List A Memory Efficient Doubly Linked List | Set 2
}
// Driver program to test above functions
int main ()
{
/* Create following Doubly Linked List
head-->40<-->30<-->20<-->10
*/
struct node *head = NULL;
insert(&head, 10);
insert(&head, 20);
insert(&head, 30);
insert(&head, 40);
// print the created list
printList (head);
return (0);
}
Run on IDE
Output:
Note that XOR of pointers is not defined by C/C++ standard. So the above implementation may not work on all platforms.
What is the time complexity with two layers? The worst case time complexity is number of nodes on express lane plus number of nodes in a segment (A
segment is number of normal lane nodes between two express lane nodes) of normal lane. So if we have n nodes on normal lane, n (square root of n)
nodes on express lane and we equally divide the normal lane, then there will be n nodes in every segment of normal lane . n is actually optimal
division with two layers. With this arrangement, the number of nodes traversed for a search will be O(n). Therefore, with O(n) extra space, we are able to
reduce the time complexity to O(n).
Can we do better?
The time complexity of skip lists can be reduced further by adding more layers. In fact, the time complexity of search, insert and delete can become O(Logn)
in average case. We will soon be publishing more posts on Skip Lists.
For online strategies and algorithms like above, we have a totally different way of analyzing them calledcompetitive analysis where performance of an online
algorithm is compared to the performance of an optimal offline algorithm (that can view the sequence of requests in advance). Competitive analysis is used in
many practical algorithms like caching, disk paging, high performance computers. The best thing about competitive analysis is, we dont need to assume
anything about probability distribution of input. The Move-to-front method is 4-competitive, means it never does more than a factor of 4 operations than offline
algorithm (See the MIT video lecture for proof).
We will soon be discussing implementation and proof of the analysis given in the video lecture.