2021-01-13 20:59:15 +09:00
|
|
|
Up: [Readme.md](../Readme.md), Prev: [Section 5](sec5.md), Next: [Section 7](sec7.md)
|
2021-01-11 11:15:04 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
# String and memory management
|
2020-12-21 21:12:05 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
GtkTextView and GtkTextBuffer have functions that have string parameters or return a string.
|
|
|
|
The knowledge of strings and memory management is very important for us to understand the functions above.
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
## String and memory
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
String is an array of characters that is terminated with '\0'.
|
|
|
|
String is not a C type such as char, int, float or double.
|
|
|
|
But the pointer to a character array behaves like a string type of other languages.
|
|
|
|
So, the pointer is often called string.
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
char a[10], *b;
|
|
|
|
|
|
|
|
a[0] = 'H';
|
|
|
|
a[1] = 'e';
|
|
|
|
a[2] = 'l';
|
|
|
|
a[3] = 'l';
|
|
|
|
a[4] = 'o';
|
|
|
|
a[5] = '\0';
|
|
|
|
|
|
|
|
b = a;
|
|
|
|
/* *b is 'H' */
|
|
|
|
/* *(++b) is 'e' */
|
|
|
|
~~~
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The array `a` has `char` elements and the size is ten.
|
|
|
|
The first six elements are 'H', 'e', 'l', 'l', 'o' and '\0'.
|
|
|
|
This array represents a string "Hello".
|
|
|
|
The first five elements are character codes that correspond to the characters.
|
|
|
|
The sixth element is '\0' that is the same as zero.
|
|
|
|
And it indicates that the string ends there.
|
|
|
|
The size of the array is 10, so 4 bytes aren't used, but it's OK.
|
|
|
|
They are just ignored.
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The variable 'b' is a pointer to a character.
|
|
|
|
Because `a` is assigned to `b`, `a` and `b` point the same character ('H').
|
|
|
|
The variable `a` is defined as an array and it can't be changed.
|
|
|
|
It always point the top address of the array.
|
|
|
|
On the other hand, pointers are mutable, so `b` can change itself.
|
|
|
|
It is possible to write `++b` (`b` is increased by one).
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
If a pointer is NULL, it points nothing.
|
|
|
|
So, the pointer is not a string.
|
|
|
|
Programs with string will include bugs if you aren't careful about NULL pointer.
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
Another annoying problem is memory that a string is allocated.
|
|
|
|
There are four cases.
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
- The string is read only.
|
|
|
|
- The string is in static memory area.
|
|
|
|
- The string is in stack.
|
|
|
|
- The string is in memory allocated from the heap area.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
## Read only string
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
A string literal in C program is surrounded by double quotes.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
char *s;
|
|
|
|
s = "Hello"
|
2021-01-25 21:20:15 +09:00
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
"Hello" is a string literal.
|
|
|
|
A string literal is read only.
|
|
|
|
In the program above, `s` points the string literal.
|
|
|
|
So, the following program is illegal.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-01-25 21:20:15 +09:00
|
|
|
~~~C
|
2021-06-15 11:24:42 +09:00
|
|
|
*(s+1) = 'a';
|
2021-01-25 21:20:15 +09:00
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The result is undefined.
|
|
|
|
Probably a bad thing will happen, for example, a segmentation fault.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
## Strings defined as arrays
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
If a string is defined as an array, it's in static memory area or stack.
|
|
|
|
It depends on the class of the array.
|
|
|
|
If the array's class is `static`, then it's placed in static memory area.
|
|
|
|
It keeps its value and remains for the life of the program.
|
|
|
|
This area is writable.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
If the array's class is `auto`, then it's placed in stack.
|
|
|
|
If the array is defined in a function, its default class is `auto`.
|
|
|
|
The stack area will disappear when the function returns to the caller.
|
|
|
|
stack is writable.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-01-25 21:20:15 +09:00
|
|
|
~~~C
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
static char a[] = {'H', 'e', 'l', 'l', 'o', '\0'};
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
void
|
|
|
|
print_strings (void) {
|
|
|
|
char b[] = "Hello";
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
a[1] = 'a'; /* Because the array is static, it's writable. */
|
|
|
|
b[1] = 'a'; /* Because the array is auto, it's writable. */
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
printf ("%s\n", a); /* Hallo */
|
|
|
|
printf ("%s\n", b); /* Hallo */
|
|
|
|
}
|
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The array `a` is defined externally to functions.
|
|
|
|
Such variables are placed in static memory area even if the `static` class is left out.
|
|
|
|
First, the compiler calculates the number of the elements in the right hand side.
|
|
|
|
It is six.
|
|
|
|
The compiler allocates six bytes memory in the static memory area and copies the data to the memory.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The array `b` is defined inside the function.
|
|
|
|
So, its class is `auto`.
|
|
|
|
The compiler calculates the number of the elements in the string literal.
|
|
|
|
It is six because the string is zero terminated.
|
|
|
|
The compiler allocates six bytes memory in the stack and copies the data to the memory.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
Both `a` and `b` are writable.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The memory is managed by the executable program.
|
|
|
|
You don't need to program to allocate or free the memory for `a` and `b`.
|
|
|
|
The array `a` remains for the life of the program.
|
|
|
|
The array `b` disappears when the function returns to the caller.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
## Strings in the heap area
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
You can get memory from the heap area and put back the memory to the heap area.
|
|
|
|
The standard C library provides `malloc` to get memory and `free` to put back memory.
|
|
|
|
Similarly, GLib provides `g_new` and `g_free`.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
g_new (struct_type, n_struct)
|
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
`g_new` is a macro to allocate memory for an array.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
- `struct_type` is the type of the element of the array.
|
|
|
|
- `n_struct` is the size of the array.
|
|
|
|
- The return value is a pointer to the array.
|
|
|
|
Its type is a pointer to `struct_type`.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
For example,
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
char *s;
|
|
|
|
s = g_new (char, 10);
|
|
|
|
/* s points an array of char. The size of the array is 10. */
|
|
|
|
|
|
|
|
struct tuple (int x, int y) *t;
|
|
|
|
t = g_new (struct tuple, 5);
|
|
|
|
/* t points an array of struct tuple. */
|
|
|
|
/* The size of the array is 5. */
|
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
`g_free` frees memory.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
void
|
|
|
|
g_free (gpointer mem);
|
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
If `mem` is NULL, `g_free` does nothing.
|
|
|
|
`gpointer` is a type of general pointer.
|
|
|
|
It is the same as `void *`.
|
|
|
|
This pointer can be casted to any pointer type.
|
|
|
|
Conversely, any pointer type can be casted to gpointer.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-01-25 21:20:15 +09:00
|
|
|
~~~C
|
2021-06-15 11:24:42 +09:00
|
|
|
g_free (s);
|
|
|
|
/* Frees the memory allocated to s. */
|
|
|
|
|
|
|
|
g_free (t);
|
|
|
|
/* Frees the memory allocated to t. */
|
2021-01-25 21:20:15 +09:00
|
|
|
~~~
|
2020-12-22 11:30:06 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
If the argument doesn't point allocated memory, it will cause an error, for example, segmentation fault.
|
|
|
|
|
|
|
|
Some Glib functions allocate memory.
|
|
|
|
For example, `g_strdup` allocates memory and copy a string given as an argument.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
char *s;
|
|
|
|
s = g_strdup ("Hello");
|
|
|
|
g_free (s);
|
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The string literal "Hello" has 6 bytes because the string has '\0' at the end it.
|
|
|
|
`g_strdup` gets 6 bytes from the heap area and copies the string to the memory.
|
|
|
|
`s` is assigned the top address of the memory.
|
|
|
|
`g_free` returns the memory to the heap area.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-08-12 14:04:22 +09:00
|
|
|
`g_strdup` is described in [GLib API Reference](https://docs.gtk.org/glib/func.strdup.html).
|
2021-06-15 11:24:42 +09:00
|
|
|
The following is extracted from the reference.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
> The returned string should be freed with `g_free()` when no longer needed.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
The reference usually describes if the returned value needs to be freed.
|
|
|
|
If you forget to free the allocated memory, it causes memory leak.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
Some GLib functions return a string which mustn't be freed by the caller.
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-01-25 18:35:49 +09:00
|
|
|
~~~C
|
2021-06-15 11:24:42 +09:00
|
|
|
const char *
|
|
|
|
g_quark_to_string (GQuark quark);
|
2021-01-25 18:35:49 +09:00
|
|
|
~~~
|
2021-01-13 13:29:35 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
This function returns `const char*` type.
|
|
|
|
The qualifier `const` means immutable.
|
|
|
|
Therefore, the characters pointed by the returned value aren't be allowed to change or free.
|
2021-06-14 17:10:23 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
If a variable is qualified with `const`, the variable can't be assigned except initialization.
|
2021-06-14 17:10:23 +09:00
|
|
|
|
2021-06-15 11:24:42 +09:00
|
|
|
~~~C
|
|
|
|
const int x = 10; /* initialization is OK. */
|
|
|
|
|
|
|
|
x = 20; /* This is illegal because x is qualified with const */
|
|
|
|
~~~
|
2021-01-07 20:29:13 +09:00
|
|
|
|
2021-01-11 11:15:04 +09:00
|
|
|
|
2021-01-13 20:59:15 +09:00
|
|
|
Up: [Readme.md](../Readme.md), Prev: [Section 5](sec5.md), Next: [Section 7](sec7.md)
|