-
Notifications
You must be signed in to change notification settings - Fork 0
/
issues-C++-layout.html
264 lines (249 loc) · 13.2 KB
/
issues-C++-layout.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
<!doctype html public "-//w3c//dtd html 4.0 transitional//en">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<meta name="GENERATOR" content="Mozilla/4.51C-SGI [en] (X11; I; IRIX 6.5 IP22) [Netscape]">
<title>C++ class layout and member function invocation</title>
</head>
<body>
<center>
<h1>
C++ Class layout and member function invocation</h1></center>
<hr WIDTH="100%">
<h2>
Ordinary class, no inheritance or virtual functions</h2>
Same as in C. An object is a contiguous region of storage; the data
members appear in the order of declaration. (Not quite required by
the C++ standard, but everyone does it that way.) Every object has
nonzero size. Padding is same as in C.
<p>Member function invocation: calling convention for a member function
of class <tt>X</tt> is just the same as for a non-member, except that it
gets an extra argument. This argument comes before all of the ones
<br>that the user declares. For non-<tt>const</tt> member functions
this argument is of type <tt>X*</tt>, and for const member functions it
is of type const <tt>X*</tt>.
<p>Static data members and static member functions are not part of the
class, except for name resolution and access rules. They get mangled
names, but otherwise they are handled just the same way as any ordinary
global variable or global function.
<br>
<hr WIDTH="100%">
<h2>
Base class(es), no virtual functions</h2>
An object in C++ is a contiguous region of storage, and it's possible to
convert a pointer to a derived class to a pointer to any of its base classes,
so there isn't much choice here: a derived class must have a contiguous
region for each of its base class subobjects, and then a region for all
of its own data members. Issues: what order do the base class subobjects
come in (only matters for multiple inheritance), how do we handle alignment
restrictions, and do we do anything special for empty base classes?
<p>Cfront solution:
<ul>
<li>
Base class subobjects are in order of declaration. That is, if we've
got</li>
<pre> struct Y : public X1, public X2, public X3 { int n; };</pre>
then, in order of increasing address, we've got an <tt>X1</tt> subobject
starting at the same address as the <tt>Y</tt> object; then an <tt>X2</tt>
subobject; then an <tt>X3</tt> subobject; then an <tt>int</tt>.
<li>
Each of the subobjects has the appropriate padding. There is as much
space reserved for an <tt>X1</tt> subobject in a <tt>Y</tt> as for <tt>sizeof(X1)</tt>,
including <tt>X1</tt>'s padding. Implication: if each of <tt>X1</tt>,
<tt>X2</tt>,
<tt>X3</tt>
consists of a single <tt>char</tt>, we can expect that <tt>sizeof(Y)</tt>
is considerably larger than <tt>3*sizeof(char) + sizeof(int)</tt>.
(Rationale: simplifies the handling of the subobjects. We can freely
use bitwise copying, which copies the base class padding areas as well
as their data members. If we didn't include the padding in the derived
classes, we'd have to make sure to handle an <tt>X1</tt> subobject more
carefully than a standalone <tt>X1</tt> object.)</li>
<li>
No special handling for empty base classes.</li>
</ul>
Sun solution: same as above, except they do have special handling
for empty base classes. An empty base class takes up no space in
the complete class, except that no other instance of the same empty class
will be allocated at the same offset.
<br>
<hr WIDTH="100%">
<a name=vfunc>
<h2> Virtual functions and a single base class </h2>
A class with virtual functions differs in two ways. First, it has
a type id tag for the RTTI mechanism. Second, it has a virtual function
table (vtbl) and a virtual function pointer (vptr). This isn't required
by the C++ standard, but everyone does it this way.
<p>A virtual function table is a table of function addresses, If
a class <tt>X</tt> has three virtual functions <tt>f</tt>, <tt>g</tt>,
and <tt>h</tt>, then the addresses of those virtual functions appear at
fixed offsets, known at compile time, from the beginning of the vtbl.
A vtbl is one-per-class, not one-per-object. Each object of type
<tt>X</tt>
has a vptr, a pointer to the single <tt>X</tt> vtbl.
<p>(Implication: a class with virtual functions can never be empty.
Even if the user declares no data members, it always contains a vptr data
member.)
<p>If class <tt>Y</tt> is derived from class <tt>X</tt>, then <tt>Y</tt>'s
vtbl puts <tt>f</tt>, <tt>g</tt>, and <tt>h</tt> in the same slots as <tt>X</tt>'s
vtbl did. Any additional virtual functions introduced in <tt>Y</tt>
come later. That is, anyone who is expecting a pointer
<br>to the beginning of <tt>X</tt>'s vtbl and get a pointer to the beginning
of <tt>Y</tt>'s vtbl instead won't get any surprises.
<p>An object of class <tt>Y</tt> has two vptrs, one for the <tt>X</tt>
subobject (a subobject always has the same layout as a standalone object
of the same type) and one for the <tt>Y</tt> object itself. While
<tt>X</tt>'s
constructor is being run, the <tt>X</tt> subobject's vptr points to <tt>X</tt>'s
vptr. While <tt>Y</tt>'s constructor is being run, and for the remainder
of the object's lifetime, both of the <tt>Y</tt> object's vptrs point to
<tt>Y</tt>'s
vtbl.
<p>A virtual function call is just like an ordinary member call, except
that the call is through a function pointer in the vtbl. For example
if <tt>p</tt> is of type <tt>X*</tt> and if <tt>f</tt> is a member function
that takes a single argument of type <tt>int</tt>, then the expression
<tt>p->f(5)</tt>
is implemented as something like this:
<br><tt>(*(p->__vptr[__VTBL_OFFSET_OF_f]))(p, 5)</tt>. This is all
done on the caller side. The callee, <tt>f</tt>, need know nothing
about it.
<p>(Note: it is essential that this all be done on the caller side.
In some cases the function resolution can be performed at compile time.
If we are to allow the virtual function mechanism to be bypassed in such
cases, virtual functions and nonvirtual functions must use the same calling
convention.)
<p>Issues:
<ol>
<li>
What is the name of the vptr data member, here shown as <tt>__vptr</tt>?</li>
<li>
What is the location of the vptr relative to the class's other data members?
(Usual answer: either it's the first data member, or it's the last.)</li>
<li>
There is a single vtbl for each virtual-function-containing class, meaning
that the vtbl is emitted only in a single translation unit. Which
one?</li>
<li>
What is the exact layout of entries within a vtbl?</li>
</ol>
Sun: the vtbl is a static data member called __vtbl. The vptr
is at offset 0 within the class. In the vtbl, offset 0 is for RTTI;
it is the address of a function called _RT returning a <tt>const type_info*</tt>.
Offset 1 is unused except for base class subobjects within derived class
objects. It is the offset of the subobject from the beginning of
the complete object. (See below for why this is needed.) The
remaining indices are allocated to virtual functions in this order: first,
virtual functions from the leftmost immediate base class, with the same
index assignments as in the base; then virtual functions from other base
classes; then virtual functions newly declared in the derived class.
Index -1 is unused. Ordering of virtual base class pointers: first
the ones along the leftmost path, in depth-first order, then the other
virtual base classes, in depth-first, left-to-right order.
<br>
<hr WIDTH="100%">
<h2>
Virtual functions and multiple base classes</h2>
If <tt>Y</tt> inherits from <tt>X1</tt> and <tt>X2</tt>, then either the
<tt>X1</tt>
or the <tt>X2</tt> subobject begins at the same address as the full Y object.
Let's say, for argument's sake, that it's the
<tt>X1</tt> subobject.
That means that whenever we have an <tt>X2*</tt> that points to an <tt>X2</tt>
subobject, and we need to convert it to a
<tt>Y*</tt> pointing to the full
<tt>Y</tt> object, we need to subtract some fixed offset. That's
fine if the user is performing an explicit cast; the difficulty is what
happens when the conversion is part of the virtual function mechanism.
A virtual member function invoked through an <tt>X2*</tt> can yield a <tt>Y</tt>
member function, and the first argument of a <tt>Y</tt> member function
(provided automatically by the compiler) must be a <tt>Y*</tt>. This
means there must be a mechanism to perform the <tt>X2*</tt> to <tt>Y*</tt>
adjustment automatically whenever it's needed for virtual function calls.
<p>Cfront solution: A vtbl is no longer a table of addresses, but a table
of pairs: address and offset. The vtbl for the <tt>Y</tt> itself
and for its <tt>X1</tt> subobjects are no longer the same. They contain
the same function addresses, but different offsets. Note that we
now have two vtbls for <tt>Y</tt> rather than 1, and that each one is twice
as large.
<p>Sun solution: Thunks. A vtbl entry can be a stub that performs
pointer adjustment and then jumps to the appropriate virtual function.
The multiple vtbls are merged into one; a secondary vtbl is accessed as
a fixed offset from the start of the primary. One additional optimization: the
vptr is stored at offset 0 from the beginning of the class, and base class
subobjects are layed out in declaration order. If the class's leftmost
immediate base has a vptr, and if it is a non-virtual base class or a virtual
base class with zero size, then it is allocated at zero offset so that
the base and derived class can share a vptr.
<p>IBM and Microsoft also use thunks, with some differences. (Caller/callee
issues.) Microsoft's thunk mechanism is patented. Thunks, or something
like them, are needed anyway for another reason. An overriding virtual
function can sometimes have a different type than the virtual function
in the base class. ("Covariant return types.") If a derived
class virtual function is invoked through a base class pointer, there are
cases where fixup on the return value is necessary.
<p>Issue: special handling for a class that only uses single inheritance?
<p>(TO DO: describe pointer-to-member layout. A pointer-to-member
must be a struct, not just a single address or offset. This is critical
for ABI compatibility.)
<br>
<br>
<p>
<hr WIDTH="100%">
<br>
<h2>
Virtual base classes</h2>
Virtual base classes are very complicated, and implementation techniques
vary widely. The point of a virtual base class is if class <tt>Y</tt> inherits
from classes <tt>X1</tt> and <tt>X2</tt>, where <tt>X1</tt> and <tt>X2</tt>
both have a virtual base class <tt>A</tt>, then <tt>Y</tt> has only one
<tt>A</tt>
subobject, not two. This means that we can't use the same class layout
as we described above: the <tt>X1</tt> and <tt>X2</tt> subobjects can't
both have <tt>A</tt> subobjects the same way that standalone
<tt>X1</tt>
and <tt>X2</tt> objects would.
<p>The basic idea, which is common to all implementations, is that if a
class <tt>X1</tt> has a virtual base class <tt>A</tt>, the location of
the <tt>A</tt> subobject within an <tt>X1</tt> object can vary. (It
varies depending on whether the <tt>X1</tt> object is a standalone object
or a subobject.) Somewhere, then, at some fixed location, we need
to store a pointer or offset so that the <tt>A</tt> subobject can
be found at runtime.
<p>Cfront solution:
<ul>
<li>
Virtual base class subobjects go at the end of the derived class object,
in declaration order. In the example above, a <tt>X1</tt> object
would consist of <tt>X1</tt>'s own members, followed by an <tt>A </tt>subobject.
<tt>X2</tt> would have similar layout. <tt>Y</tt> would consist
of an <tt>X1</tt> subobject (without the <tt>A</tt> subobject), followed
by an <tt>X2</tt> subobject, followed by a shared <tt>A</tt> subobject.</li>
<li>
Each derived class object contains a pointer to the virtual base class
subobject. So, for example, <tt>X1</tt> would contain a <tt>__vbc_A</tt>
pointer of type <tt>A*</tt>. A <tt>Y</tt> object would contain three
such pointers, one for the Y itself and one for each of the <tt>X1</tt>
and <tt>X2</tt> subobjects. These pointers are set up in the constructor.</li>
</ul>
Sun solution: similar, except that what's stored is offsets instead of
pointers. That makes it possible for this information to be pulled
out of the objects themselves and put into the vtbl. Objects are
thus smaller than in the cfront model, and constructors don't have to initialize
any vbv pointers. In Sun's implementation, virtual member functions
have positive indices in the vtbl and virtual base class offsets have negative
indices. That may just be for backward compatibility with their old
ABI; the solution would be essentially the same if the entries were instead
interleaved.
<p>Issue: is there any reason to have special handling for an empty
virtual base class?
<p>(TO DO: describe pointer-to-member layout. Converting a pointer-to-data-member
in a virtual base class into a pointer-to-data-member of a derived class
requires runtime fixup.)
<br>
<br>
<hr>
<a href=issues-C++-layout-ex.txt>
<h2> Virtual Function Layout Examples </h2>
</body>
</html>