CS110 Lab 2

Goals

  • Perform specific bit manipulations through compositions of bit operations.
  • Introduced to the C debugger and gain practical experience using gdb to debug C programs.
  • Identify potential issues with dynamic memory management.

Lab 2

Computer Architecture I @ ShanghaiTech University

Exercises

Download the files for Lab 2 first.

Exercise 1: git init; git add; git commit

After extracting the tar file with tar command, initialize a git repository in the lab2 directory and make your first commit.

Exercise 2: vector

First read vector.h and vector.c
The four functions in vector.h are declared as follows:

  • Vector *vector_create(void);
  • void vector_push(Vector *vector, double element);
  • double vector_get(Vector *vector, int index);
  • void vector_free(Vector *vector);

Make sure you understand what every line of code is doing before moving on.

Tell the TA what malloc()realloc(), and free() do.

This exercise will give some subtasks, you should make a git commit after each subtask to track your changes.

  1. Consider the cases where malloc() and realloc() fails, do necessary changes to make code work under these circumstances.
  2. The code never check the Vector *vector passed to it is valid, modify the code to add a null check at the beginning of each function.
  3. In function vector_get(), it is possible that the user visits a position that is not initialized, you should handle this case.
  4. Some implementation suggests void another_vector_free(Vector **vector), add this to your code. Explain why this could be beneficial.

Show your TA the git commits after each subtask and make necessary explanations. You must compile your code with Makefile we've provided.

Exercise 3: Catch those bugs!

debugger, as the name suggests, is a program which is designed specifically to help you find bugs, or logical errors and mistakes in your code (side note: if you want to know why errors are called bugs, look here). Different debuggers have different features, but it is common for all debuggers to be able to do the following things:

  • Set a breakpoint in your program. A breakpoint is a specific line in your code where you would like to stop execution of the program so that you can take a look at what's going on nearby.
  • Step line-by-line through the program. Code only ever executes line by line, but it happens too quickly for us to figure out which lines cause mistakes. Being able to step line-by-line through your code allows you to hone in on exactly what is causing a bug in your program.

For this exercise, you will find the GDB reference card useful. GDB stands for "GNU De-Bugger." 😃 Use the code you've written in the previous exercise to try the debugger.

This causes gcc to store information in the executable program for gdb to make sense of it. Now start our debugger, (c)gdb:

1
$ cgdb ./lab2

ACTION ITEM: step through the whole program by doing the following:

  1. setting a breakpoint at main
  2. using gdb's run command
  3. using gdb's single-step command

Type help from within gdb to find out the commands to do these things, or use the reference card.

Look here if you see an error message like printf.c: No such file or directory. You probably stepped into a printf function! If you keep stepping, you'll feel like you're going nowhere! GDB is complaining because you don't have the actual file where printf is defined. This is pretty annoying. To free yourself from this black hole, use the command finish to run the program until the current frame returns (in this case, until printf is finished). And NEXT time, use next to skip over the line which used printf.

ACTION ITEM: Learn MORE gdb commands Learning these commands will prove useful for the rest of this lab, and your C programming career in general. Create a text file containing answers to the following questions (or write them down on a piece of paper, or just memorize them if you think you want to become a GDB pro).

  1. How do you pass command line arguments to a program when using gdb?
  2. How do you set a breakpoint which only occurs when a set of conditions is true (e.g. when certain variables are a certain value)?
  3. How do you execute the next line of C code in the program after stopping at a breakpoint?
  4. If the next line of code is a function call, you'll execute the whole function call at once if you use your answer to #3. (If not, consider a different command for #3!) How do you tell GDB that you want to debug the code inside the function instead? (If you changed your answer to #3, then that answer is most likely now applicable here.)
  5. How do you resume the program after stopping at a breakpoint?
  6. How can you see the value of a variable (or even an expression like 1+2) in gdb?
  7. How do you configure gdb so it prints the value of a variable after every step ?
  8. How do you print a list of all variables and their values in the current function?
  9. How do you exit out of gdb?

Show your TA that you are able to run through the above steps and provide answers to the questions.

Exercise 4: Memory Management

Valgrind is an open-source framework built for dynamic analysis. In this course, we will use a very popular tool, memcheck, to detect any possible memory-related issue.

Installation

In linux, we can take advantage of package managers: sudo apt install valgrind (in Ubuntu) for minimal installation.

Example usages

When there are no memleaks

1
2
3
4
5
6
7
8
// File name: main.c
#include <stdio.h>

int main() {
int a = 42;
printf("%d\n", a);
return 0;
}

We can easily point out that this program with output 42 is memory safe.

1
$ valgrind --tool=memcheck --leak-check=full --track-origins=yes ./main

Here's what Valgrind gives us, which indicate that no memleaks are detected.

1
2
3
4
5
6
7
8
==371318== HEAP SUMMARY:
==371318== in use at exit: 0 bytes in 0 blocks
==371318== total heap usage: 1 allocs, 1 frees, 1,024 bytes allocated
==371318==
==371318== All heap blocks were freed -- no leaks are possible
==371318==
==371318== For lists of detected and suppressed errors, rerun with: -s
==371318== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 0 from 0)

When there are memleaks:

1
2
3
4
5
6
7
8
9
10
11
12
13
// File name: memleak.c
#include <stdio.h>
#include <stdlib.h>

int main() {
int a = 42;
char *normal = malloc(135);
char *leak = malloc(246);
leak[0] = 'q'; // Avoid unused variable warning
printf("%d\n", a);
free(normal);
return 0;
}

In this example, we allocated memory from heap for variable leak , but the call to free() for the variable is missing, causing us to fail Gradescope test 😦
We compile with gcc -Wpedantic -Wall -Wextra -Wvla -Werror memleak.c -o memleak to make sure there are no warnings, which means that compiler (gcc) is unable to find any potential issues.
Now we harness the power of Valgrind to detect the issue:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
$ valgrind --tool=memcheck --leak-check=full --track-origins=yes ./memleak

==372058== HEAP SUMMARY:
==372058== in use at exit: 246 bytes in 1 blocks
==372058== total heap usage: 3 allocs, 2 frees, 1,405 bytes allocated
==372058==
==372058== 246 bytes in 1 blocks are definitely lost in loss record 1 of 1
==372058== at 0x4848899: malloc (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so)
==372058== by 0x1091B3: main (in /home/caoster/Desktop/temp/memleak)
==372058==
==372058== LEAK SUMMARY:
==372058== definitely lost: 246 bytes in 1 blocks
==372058== indirectly lost: 0 bytes in 0 blocks
==372058== possibly lost: 0 bytes in 0 blocks
==372058== still reachable: 0 bytes in 0 blocks
==372058== suppressed: 0 bytes in 0 blocks
==372058==
==372058== For lists of detected and suppressed errors, rerun with: -s
==372058== ERROR SUMMARY: 1 errors from 1 contexts (suppressed: 0 from 0)

We can tell from above that:

  1. When we exit the program, there are still 246 bytes on the heap that belongs to us.
  2. The missing 246 bytes were allocated in main with malloc().
  3. ...

By adding more options/flags of Valgrind, there are absolutely more messages to reveal. What's more:

1
2
3
4
5
6
7
#include <stdio.h>

int main() {
int a[5] = {13, 24, 35, 46, 57};
printf("%d\n", a[5]);
return 0;
}

With Valgrind, we can find those uninitialized visits effortlessly (only a fraction of the message is shown here):

1
2
3
4
5
6
==373455== Conditional jump or move depends on uninitialised value(s)
==373455== at 0x48EFB56: __vfprintf_internal (vfprintf-internal.c:1516)
==373455== by 0x48D981E: printf (printf.c:33)
==373455== by 0x1091BF: main (in /home/caoster/Desktop/temp/main)
==373455== Uninitialised value was created by a stack allocation
==373455== at 0x109169: main (in /home/caoster/Desktop/temp/main)

Time is limited, so we won't check upon this part, but please make sure that you understand the usage of Valgrind and the messages it gives you.
Throughout the course, you will be asked to submit code with no memory issues.


The following TA(s) are responsible for this lab:

Suting Chen <chenst AT shanghaitech.edu.cn>