Skip to content

tamarit/edd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Erlang Declarative Debugger (edd)

A declarative (algorithmic) debugger for Erlang. This debugger asks the programmers questions about the expected results of some evaluations (function and fun calls, evaluation of case expressions, etc.) and points out the program fragment that is causing the bug.

The Erlang Declarative Debugger uses the module 'smerl.erl' for metaprogramming in Erlang. This module has been written by Yariv Sadan for Erlyweb (https://github.com/yariv/erlyweb). It also uses the module 'mochijson.erl' for handling JSON. This module has been written by Bob Ippolito for Mochiweb (https://github.com/mochi/mochiweb). We have included these files in the repository to make the installation of edd easier, since it does not require a complete Erlyweb and Mochiweb installed in the system to run edd. Both modules have MIT license.

Getting edd

The code of 'edd' is contained in a GIT repository on GitHub. The URL is https://github.com/tamarit/edd

To get a copy of the repository you only have to write in your Linux/MacOS console:

$ git clone https://github.com/tamarit/edd

This will create a folder called 'edd' in your current directory containing the files of the Erlang Declarative Debugger. If you do not have git installed in your system (or you are using Windows) you can also download a copy of the repository in a ZIP file using the button Download ZIP in the upper right part of the https://github.com/tamarit/edd page, or following the link https://github.com/tamarit/edd/archive/master.zip.

Compiling and launching edd

The Erlang Declarative Debugger is written in Erlang, so you will need an Erlang system installed in you computer. Concretely, it has been tested with Erlang/OTP 21. We provide a 'Makefile' to make compilation and launch easy:

  1. First, open a terminal and move to the directory of the repository (for example /home/john/edd)

     $ cd /home/john/edd
    
  2. Compilation: type make

     $ make
     erlc edd_comp.erl
     erl -run edd_comp compile -noshell -s erlang halt
    
  3. Launch: type make load

     $ make load
     erl -run edd_comp load -rsh ssh -sname edd_main -setcookie edd_cookie
     Erlang/OTP 21 [erts-10.2.3] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1]
    
     Eshell V10.2.3  (abort with ^G)
     (edd_main@YOUR_MACHINE)1>
    

After the compilation, the modules will be compiled into the 'ebin' directory, and the edoc documentation will be generated into the folder 'edoc'. The launch process will open an interactive Erlang interpreter with the module edd loaded.

Using edd for debugging sequential Erlang programs

To debug an expression that is returning an incorrect value, simply call the edd:dd/1 function with the string of the problematic expression. The 'edd' directory contains a 'examples' folder with some buggy programs to debug. For example, if you compile and load the 'merge.erl' program you can observe that the mergesort/2 function is buggy:

> cd("examples/mergesort"), c(merge).
{ok,merge}
> merge:mergesort([b,a], fun merge:comp/2).
[b,a]

To debug this function just call edd:dd/1 with the buggy expression. The following lines show a complete debugging session:

> edd:dd( "merge:mergesort([b,a], fun merge:comp/2)", tree ).
Please, insert a list of trusted functions [m1:f1/a1, m2:f2/a2 ...]: 
merge:merge([b], [a], fun merge:comp/2) = [b, a]? [y/n/t/v/d/i/s/u/a]: n
merge:comp(b, a) = false? [y/n/t/v/d/i/s/u/a]: t
merge:merge([a], [], fun merge:comp/2) = [a]? [y/n/t/v/d/i/s/u/a]: y
Call to a function that contains an error:
merge:merge([b], [a], fun merge:comp/2) = [b, a]
Please, revise the fourth clause:
merge([H1 | T1], [H2 | T2], Comp) ->
    case Comp(H1, H2) of
      false -> [H1 | merge([H2 | T1], T2, Comp)];
      true -> [H1 | merge(T1, [H2 | T2], Comp)]
    end.
Do you want to follow the debugging session inside this function? [y/n]: y

First, the debugger asks for a list of trusted functions, i.e., functions that the programmer is sure are correct, so that 'edd' will never ask about them. This functions must be introduced as a comma-separated lists of function signatures

module1:function1/arity1, module2:function2/arity2, ...

If you do not want to mark any function as trusted, simply press 'Enter'. After the trusted functions the debugger will ask some question about function and 'fun' applications of the shape

fun(val1, ..., valn) = val? [y/n/t/v/d/i/u/a]: 

to know whether the function 'fun' applied to values 'val1', ..., 'valn' must return the value 'val' or not. The possible answers to these questions are:

  • Yes (y): the evaluation is correct.
  • No (n): the evaluation is incorrect.
  • Trusted (t): the evaluation is correct and the function is trusted, so any subsequent application of the function will be considered as correct without asking.
  • No + Value (v): the evaluation is incorrect and the user choose to provide the expected value.
  • Don't know (d): the question is too hard to answer.
  • Inadmissible (i): the function call has no sense with those values as arguments.
  • Undo (u): the debugger forgets the last answer and asks again the previous question.
  • Abort (a): finishes the debugging session.

After asking some questions, the debugger will point out what is the function call that is causing the bug and it will also shows the concrete function rule responsible. In the example:

Call to a function that contains an error: merge:merge([b], [a], fun merge:comp/2) = [b, a]
Please, revise the fourth clause:
merge([H1 | T1], [H2 | T2], Comp) ->
    case Comp(H1, H2) of
      false -> [H1 | merge([H2 | T1], T2, Comp)];
      true -> [H1 | merge(T1, [H2 | T2], Comp)]
    end.

After the debugger finds the wrong function, it can follow the debugging session inside this function to find the the part of the function that is causing the erroneous behavior. In the 'merge.erl' example, the session continues as follows:

Do you want to follow the debugging session inside this function? [y/n]: y
For the case expression:
case Comp(H1, H2) of
  false -> [H1 | merge([H2 | T1], T2, Comp)];
  true -> [H1 | merge(T1, [H2 | T2], Comp)]
end
Is there anything incorrect?
1.- The context:
        Comp = fun merge:comp/2
        H1 = b
        H2 = a
        T1 = []
        T2 = []
2.- The argument value: false.
3.- Enter in the first clause.
4.- The final value: [b,a].
5.- Nothing.
[1/2/3/4/5/d/s/u/a]? 4


This is the reason for the error:
Value [b,a] for the final expression [H1 | merge([H2 | T1], T2, Comp)] (Line 27) is not correct.

In this second step of debugging, the debugger will not ask about function/fun applications but about the evaluation of different constructions like guards, case/if expressions, bindings, etc.

edd options for sequential programs

The Erlang Declarative Debugger has two different strategies to traverse the proof tree looking for the buggy node: "Divide & Query" and "Top Down". The "Divide & Query" strategy tries to make the minimum number of questions, and selects those nodes that split the tree into two of about the same size. On the other hand, "Top Down" traverses the tree from the root to the leaves until a buggy node is found. The default strategy for the first step of debugging (to find a wrong function) is "Divide & Query", and the default strategy for the second step is "Top Down". To use the "Top Down" strategy in both debugging steps, use the 'top_down' argument in the dd/2 call. For example:

> edd:dd("merge:mergesort([b,a], fun merge:comp/2)", top_down).

Note that you can change the strategy at any point of the debugging session by pressing the key 's'.

The Erlang Declarative Debugger can also generate the proof tree in a DOT file (http://en.wikipedia.org/wiki/DOT_language) and a PDF file using the 'dot' command. To activate this option, use the 'tree' argument in dd/2. For example the call:

> edd:dd("merge:mergesort([b,a], fun merge:comp/2)", tree).

will start a debugging session using the default strategy "Divide & Query" and generate the proof tree of the evaluation of the expression in the files "dbg.dot"/"dbg.pdf" for the first step of the debugging session, and "dbg_zoom.dot"/"dbg_zoom.pdf" for the second one. These files will be stored in the current directory. If your system does not have the 'dot' command to create PDF from DOT files, the PDF will not be create but no error will be raised.

To generate the proof tree file and also use the "Top Down" strategy, use the dd/3 function:

> edd:dd("merge:mergesort([b,a], fun merge:comp/2)", tree, top_down).

or

> edd:dd("merge:mergesort([b,a], fun merge:comp/2)", top_down, tree).

or dd/2 which also allows to use a list of options:

> edd:dd("merge:mergesort([b,a], fun merge:comp/2)", [top_down, tree]).

edd for concurrent programs

The Erlang Declarative Debugger allows the user to debug concurrent programs in a similar way that sequential ones. To debug a function with concurrent features, the user would use 'edd:cdd':

  • edd:cdd/2(Call, Timeout).

The first parameter is a call with a wrong behavior, and the second is the timeout (in milliseconds) for the tracing process needed in this kind of debugging

  • edd:cdd(Call,Timeout,Strategy,Priority).

The first two parameters are the same as in the previous case. Stategy sets the way of exploring the debugging tree, that can be top_down or divide_query. On the other hand, Priority sets how to select the next node to debug in case of ties. It can take values old to select the oldest node (the closest to the root), new to select the newer node (the furthest from the root), or indet to select the next node randomly.

The directory examples/Concurrency contains some concurrent examples with bugs to test the debugger. For example, the program mergesort can be debugged as follows:

> cd("examples/Concurrency/mergesort").
/home/john/edd/examples/Concurrency/mergesort
ok
> c(mergesort).
{ok,mergesort}
> edd:cdd("mergesort:mergesort([3,2,1])", 2000). 

Execution result: [2,2,3]

**********************************
PROCESS <0.795.0>
First call mergesort:mergesort([2], #Fun<mergesort.comp.2>, <0.793.0>)
Result {result,[2]}
Sent messages:
        {result,[2]} (from <0.795.0> to <0.793.0>)
No spawned processes
**********************************

**********************************
PROCESS <0.794.0>
First call mergesort:mergesort([2], #Fun<mergesort.comp.2>, <0.793.0>)
Result {result,[2]}
Sent messages:
        {result,[2]} (from <0.794.0> to <0.793.0>)
No spawned processes
**********************************

**********************************
PROCESS <0.793.0>
First call mergesort:mergesort([2,2], #Fun<mergesort.comp.2>, <0.790.0>)
Result {result,[2,2]}
Sent messages:
        {result,[2,2]} (from <0.793.0> to <0.790.0>)
Spawned processes:
        <0.795.0>
        <0.794.0>
**********************************

**********************************
PROCESS <0.792.0>
First call mergesort:mergesort([3], #Fun<mergesort.comp.2>, <0.790.0>)
Result {result,[3]}
Sent messages:
        {result,[3]} (from <0.792.0> to <0.790.0>)
No spawned processes
**********************************

**********************************
PROCESS <0.790.0>
First call mergesort:mergesort([3,2,1])
Result [2,2,3]
No sent messages
Spawned processes:
        <0.793.0>
        <0.792.0>
**********************************

**********************************
Pid selection
**********************************
1.- <0.795.0>
        First call:mergesort:mergesort([2], #Fun<mergesort.comp.2>, <0.793.0>)
        Result: {result,[2]}
2.- <0.794.0>
        First call:mergesort:mergesort([2], #Fun<mergesort.comp.2>, <0.793.0>)
        Result: {result,[2]}
3.- <0.793.0>
        First call:mergesort:mergesort([2,2], #Fun<mergesort.comp.2>, <0.790.0>)
        Result: {result,[2,2]}
4.- <0.792.0>
        First call:mergesort:mergesort([3], #Fun<mergesort.comp.2>, <0.790.0>)
        Result: {result,[3]}
5.- <0.790.0>
        First call:mergesort:mergesort([3,2,1])
        Result: [2,2,3]
6.- Choose an event
7.- None

The returned value [2,2,3] is wrong, and the debugger shows the different processes involved as well as their results, messages sent, and spawns. At this point, before starting the debugging session, edd creates three files in the current directory to ease the debugging process:

  • eval_tree.pdf: The debugging tree with the different fragments evaluated.

  • comm_seq_diag.pdf: Sequence diagram showing process creation, message passing, and message consumption. Each event in this diagram has an identifier that can be used to start the debugging session.

  • pids_tree.pdf: Tree showing the hierarchy of process creation.

To start the debugging session the user can select a process or an event from the sequence diagram. In this case we will select the process number 5 that incorrectly sorts the list [3,2,1].

Please, insert a PID where you have observed a wrong behavior (or 6 to select an event): [1..7]: 5

Selected initial PID: <0.790.0>
        First call:mergesort:mergesort([3,2,1])
        Result: [2,2,3]

**********************************

**********************************
Process <0.790.0> called mergesort:mergesort([3,2,1]).
What is wrong?
1. - Previous evaluated receive:
        receive {result, LOrd2_} -> LOrd2_ end
        in mergesort.erl:25
        Context:
                'Comp' = #Fun<mergesort.comp.2>
                'Half' = 1
                'L' = [3,2,1]
                'L1' = [3]
                'L2' = [2,2]
                'LOrd1' = [3]
                'LOrd2_' = [2,2]
                'Parent' = none
        Received messages:
                {result,[2,2]} (from <0.793.0> to <0.790.0>)
        Consumed messages:
                {result,[2,2]} (from <0.793.0> to <0.790.0>) 
2. - Evaluated to value: [2,2,3] 
3. - No sent messages 
4. - No created processes 
5. - Nothing 

[1/2/3/4/5/t/d/c/s/p/r/u/h/a]: 1

**********************************
Process <0.790.0> evaluated
receive {result, LOrd2_} -> LOrd2_ end
in mergesort.erl:25
What is wrong?
1. - Context:
        'Comp' = #Fun<mergesort.comp.2>
        'Half' = 1
        'L' = [3,2,1]
        'L1' = [3]
        'L2' = [2,2]
        'LOrd1' = [3]
        'LOrd2_' = [2,2]
        'Parent' = none 
2. - Received messages:
        {result,[2,2]} (from <0.793.0> to <0.790.0>) 
3. - Consumed messages:
        {result,[2,2]} (from <0.793.0> to <0.790.0>) 
4. - Evaluated to value: [2,2] 
5. - No sent messages 
6. - No created processes 
7. - Nothing 

[1/2/3/4/5/6/7/t/d/c/s/p/r/u/h/a]: 1

**********************************
Process <0.790.0> called mergesort:mergesort([3,2,1], #Fun<mergesort.comp.2>, none).
What is wrong?
1. - Reached receive:
        receive {result, LOrd1_} -> LOrd1_ end
        in mergesort.erl:20
        Context:
                'Comp' = #Fun<mergesort.comp.2>
                'Half' = 1
                'L' = [3,2,1]
                'L1' = [3]
                'L2' = [2,2]
                'Parent' = none 
2. - No sent messages 
3. - Created processes:
        <0.792.0>
        <0.793.0> 
4. - Nothing 

[1/2/3/4/t/d/c/s/p/r/u/h/a]: 1

**********************************
Process <0.790.0> called mergesort:take(1, [2,3]).
What is wrong?
1. - Evaluated to value: [2] 
2. - No sent messages 
3. - No created processes  
4. - Nothing 

[1/2/3/4/t/d/c/s/p/r/u/h/a]: 4

**********************************
Process <0.790.0> called mergesort:take(2, [1,2,3]).
What is wrong?
1. - Evaluated to value: [2,2] 
2. - No sent messages 
3. - No created processes 
4. - Nothing 

[1/2/3/4/t/d/c/s/p/r/u/h/a]: 1

**********************************
Process <0.790.0> called mergesort:take(1, [3,2,1]).
What is wrong?
1. - Evaluated to value: [3] 
2. - No sent messages 
3. - No created processes 
4. - Nothing 

[1/2/3/4/t/d/c/s/p/r/u/h/a]: 4

The error has been detected:
The problem is in pid <0.790.0>
while running call mergesort:take(2, [1,2,3])

After the debugging session, edd detects the function mergesort:take as buggy, concretely in the call mergesort:take(2, [1,2,3]).

edd and eunit/proper

The Erlang Declarative Debugger can interact with eunit/proper in two ways:

  • It reads eunit/proper tests from given files and use this information to automatically answer some questions.

  • It generates and stores new eunit tests from the user answers. These new tests are always stored in a function named 'edd_test/0'

The default behavior for edd is to read and store the eunit/proper tests inside debugged modules. There are three options related with these tests:

  • "not_load_tests" avoids loading tests from the debugged modules

  • "not_save_tests" avoids saving generated test to the debugged modules

  • "{test_files, Files :: list()}" edd will automatically load additional eunit tests from the given files