Lesson 7 - Type Conversions
C++ follows certain rules to convert one type to another (as long as the conversions are valid). Conversions are generally automatic (implicit), while in others the developer specifies the type of conversion needed (explicit).
Implicit conversion
Implicit conversions are automatically performed when a value is copied to a compatible type. For example:
short a=2000;
int b;
b=a;
Here, the value of a
is promoted from short
to int
without the need of any explicit operator. This is known as a standard conversion. Standard conversions affect fundamental data types, and allow the conversions between numerical types (short
to int
, int
to float
, double
to int
…), to or from bool
, and some pointer conversions.
Converting to int
from some smaller integer type, or to double
from float
is known as promotion, and is guaranteed to produce the exact same value in the destination type. Other conversions between arithmetic types may not always be able to represent the same value exactly:
- If a negative integer value is converted to an unsigned type, the resulting value corresponds to its 2's complement bitwise representation (i.e., -1 becomes the largest value representable by the type, -2 the second largest, ...).
- The conversions from/to
bool
consider false equivalent to zero (for numeric types) and to null pointer (for pointer types);true
is equivalent to all other values and is converted to the equivalent of1
. - If the conversion is from a floating-point type to an integer type, the value is truncated (the decimal part is removed). If the result lies outside the range of representable values by the type, the conversion causes undefined behaviour.
- Otherwise, if the conversion is between numeric types of the same kind (integer-to-integer or floating-to-floating), the conversion is valid, but the value is implementation-specific (and may not be portable).
Some of these conversions may imply a loss of precision, which the compiler can signal with a warning. This warning can be avoided with an explicit conversion.
For non-fundamental types, arrays and functions implicitly convert to pointers, and pointers in general allow the following conversions:
- Null pointers can be converted to pointers of any type
- Pointers to any type can be converted to
void
pointers. - Pointer upcast: pointers to a derived class can be converted to a pointer of an accessible and unambiguous base class, without modifying its
const
orvolatile
qualification.
Implicit conversions with classes
For classes, implicit conversions can be controlled by means of three member functions:
- Single-argument constructors - allow implicit conversion from a particular type to initialise an object.
- Assignment operator - allow implicit conversion from a particular type on assignments.
- Type-cast operator - allow implicit conversion to a particular type.
class A {};
class B
{
public:
// conversion from A (constructor):
B (const A& x) {}
// conversion from A (assignment):
B& operator= (const A& x) {return *this;}
// conversion to A (type-cast operator)
operator A() {return A();}
};
The type-cast operator uses a particular syntax: it uses the operator
keyword followed by the destination type and an empty set of parentheses. Notice that the return type is the destination type and is not specified before the operator
keyword.
Keyword explicit
On a function call, C++ allows one implicit conversion to happen for each argument. This may be somewhat problematic for classes, because it is not always what is intended. For example, if we add the following function to the last example:
void fn(B arg) {}
This function takes an argument of type B
, but it could as well be called with an object of type A
as argument:
fn(A{});
This may or may not be what was intended. But, in any case, it can be prevented by marking the affected constructor with the explicit
keyword. In general, prefer the use of explicit constructors to avoid hard to trace down bugs.
class B
{
public:
// conversion from A (constructor):
explicit B (const A& x) {}
// conversion from A (assignment):
B& operator= (const A& x) {return *this;}
// conversion to A (type-cast operator)
operator A() {return A();}
};
Type-cast member functions can also be specified as explicit
. This prevents implicit conversions in the same way as explicit-specified constructors do for the destination type.
Type casting
C++ is a strongly-typed language. Many conversions, specially those that imply a different interpretation of the value, require an explicit conversion, known in C++ as type-casting. There are two main syntaxes for generic type-casting: functional and c-like:
double x = 10.3;
int y;
y = int(x); // functional notation
y = (int) x; // c-like cast notation
The functionality of these generic forms of type-casting is enough for most needs with fundamental data types. However, these operators can be applied indiscriminately on classes and pointers to classes, which can lead to code that while being syntactically correct can cause runtime errors.
class Dummy
{
double i,j;
};
class Addition
{
int x,y;
public:
Addition(int a, int b) { x=a; y=b; }
int result() const { return x+y;}
};
int main()
{
Dummy d;
Addition* padd;
padd = (Addition*) &d;
padd->result();
return 0;
}
The program declares a pointer to Addition, but then it assigns to it a reference to an object of another unrelated type using explicit type-casting:
padd = (Addition*) &d;
Unrestricted explicit type-casting allows to convert any pointer into any other pointer type, independent of the types they point to. The subsequent call to member function result
will produce either a run-time error or some other unexpected results.
In order to control these types of conversions between classes, C++ has four specific casting operators: dynamic_cast
, reinterpret_cast
, static_cast
and const_cast
. Their format is to follow the new type enclosed between angle-brackets (<>
) and immediately after, the expression to be converted between parentheses.
dynamic_cast<new_type>(expression)
reinterpret_cast<new_type>(expression)
static_cast<new_type>(expression)
const_cast<new_type>(expression)
dynamic_cast
This cast operator can only be used with pointers and references to classes (or with void*
). Its purpose is to ensure that the result of the type conversion points to a valid complete object of the destination pointer type.
This naturally includes pointer upcast (converting from pointer-to-derived to pointer-to-base), in the same way as allowed as an implicit conversion.
But dynamic_cast
can also downcast (convert from pointer-to-base to pointer-to-derived) polymorphic classes (those with virtual members) if and only if the pointed object is a valid complete object of the target type.
#include <iostream>
#include <exception>
class Base { virtual void dummy() {} };
class Derived: public Base { int a; };
int main()
{
try
{
Base* pba = new Derived;
Base* pbb = new Base;
Derived* pd;
pd = dynamic_cast<Derived*>(pba);
if (pd==nullptr) std::cout << "Null pointer on first type-cast." << std::endl;
pd = dynamic_cast<Derived*>(pbb);
if (!pd) std::cout << "Null pointer on second type-cast." << std::endl;
}
catch ( const std::exception& e ) { std::cout << "Exception: " << e.what() << std::endl;}
return 0;
}
The code above tries to perform two dynamic casts from pointer objects of type Base*
(pba
and pbb
) to a pointer object of type Derived*
, but only the first one is successful. Notice their respective initialisations:
Base* pba = new Derived;
Base* pbb = new Base;
Even though both are pointers of type Base*
, pba
actually points to an object of type Derived
, while pbb
points to an object of type Base
. Therefore, when their respective type-casts are performed using dynamic_cast
, pba
is pointing to a full object of class Derived
, whereas pbb
is pointing to an object of class Base
, which is an incomplete object of class Derived
.
When dynamic_cast
cannot cast a pointer because it is not a complete object of the required class it returns a null pointer to indicate the failure. If dynamic_cast
is used to convert to a reference type and the conversion is not possible, an exception of type std::bad_cast
is thrown instead.
dynamic_cast
can also perform the other implicit casts allowed on pointers: casting null pointers between pointers types (even between unrelated classes), and casting any pointer of any type to a void*
pointer.
static_cast
This type cast operator can perform conversions between pointers to related classes, not only upcasts (from pointer-to-derived to pointer-to-base), but also downcasts (from pointer-to-base to pointer-to-derived). No checks are performed during runtime to guarantee that the object being converted is in fact a full object of the destination type. Therefore, it is up to the programmer to ensure that the conversion is safe. On the other side, it does not incur the overhead of the type-safety checks of dynamic_cast
.
class Base {};
class Derived: public Base {};
Base* a = new Base;
Derived* b = static_cast<Derived*>(a);
This would be valid code, although b
would point to an incomplete object of the class and could lead to runtime errors if dereferenced.
Therefore, static_cast
is able to perform with pointers to classes not only the conversions allowed implicitly, but also their opposite conversions.
static_cast
is also able to perform all conversions allowed implicitly (not only those with pointers to classes), and is also able to perform the opposite of these. It can:
- Convert from
void*
to any pointer type. In this case, it guarantees that if thevoid*
value was obtained by converting from that same pointer type, the resulting pointer value is the same. - Convert integers, floating-point values and enum types to enum types.
Additionally, static_cast
can also perform the following:
- Explicitly call a single-argument constructor or a conversion operator.
- Convert to
rvalue
references. - Convert
enum class
values into integers or floating-point values. - Convert any type to
void
, evaluating and discarding the value.
reinterpret_cast
This type-cast operator converts any pointer type to any other pointer type, even of unrelated classes. The operation result is a simple binary copy of the value from one pointer to the other. All pointer conversions are allowed: neither the content pointed nor the pointer type itself is checked.
It can also cast pointers to or from integer types. The format in which this integer value represents a pointer is platform-specific. The only guarantee is that a pointer cast to an integer type large enough to fully contain it (such as intptr_t
), is guaranteed to be able to be cast back to a valid pointer.
The conversions that can be performed by reinterpret_cast
but not by static_cast
are low-level operations based on reinterpreting the binary representations of the types, which in most cases results in code which is system-specific, and thus non-portable.
void swapEndian( const int32_t source, int32_t& dest )
{
const char* in = reinterpret_cast<const char*>( &source );
char* out = reinterpret_cast<char*>( &dest );
out[0] = in[3];
out[1] = in[2];
out[2] = in[1];
out[3] = in[0];
}
const_cast
This type-cast operator manipulates the const’ness
of the object pointed by a pointer, either to be set or to be removed. For example, in order to pass a const pointer
to a function that expects a non-const argument:
#include <iostream>
void print(char * str)
{
std::cout << str << std::endl;
}
int main()
{
const char* c = "sample text";
print( const_cast<char*>(c) );
return 0;
}
The example above is guaranteed to work because function print does not write to the pointed object. Note though, that removing the const’ness of a pointed object to actually write to it causes undefined behaviour.
typeid
typeid
allows to check the type of an expression:
typeid(expression)
This operator returns a reference to a constant object of type type_info
that is defined in the standard header <typeinfo>
. A value returned by typeid
can be compared with another value returned by typeid
using operators ==
and !=
or can serve to obtain a null-terminated character sequence representing the data type or class name by using its name()
member.
#include <iostream>
#include <typeinfo>
int main()
{
int* a = 0;
int b = 0;
if (typeid(a) != typeid(b))
{
std::cout << "a and b are of different types:" << std::endl;
std::cout << "a is: " << typeid(a).name() << std::endl;
std::cout << "b is: " << typeid(b).name() << std::endl;
}
return 0;
}
When typeid
is applied to classes, typeid
uses the RTTI to keep track of the type of dynamic objects. When typeid
is applied to an expression whose type is a polymorphic class, the result is the type of the most derived complete object.
#include <iostream>
#include <typeinfo>
#include <exception>
class Base { virtual void f(){} };
class Derived : public Base {};
int main()
{
try
{
Base* a = new Base;
Base* b = new Derived;
std::cout << "a is: " << typeid(a).name() << std::endl;
std::cout << "b is: " << typeid(b).name() << std::endl;
std::cout << "*a is: " << typeid(*a).name() << std::endl;
std::cout << "*b is: " << typeid(*b).name() << std::endl;
}
catch ( const std::exception& e ) { std::cout << "Exception: " << e.what() << std::endl; }
return 0;
}
Note:
The string returned by member name
of type_info
depends on the specific implementation of your compiler and library. It is not necessarily a simple string with its typical type name.
Notice how the type that typeid
considers for pointers is the pointer type itself (both a and b are of type class Base*
). However, when typeid
is applied to objects (like *a
and *b
) typeid yields their dynamic type (i.e. the type of their most derived complete object).
If the type typeid
evaluates is a pointer preceded by the dereference operator (*), and this pointer has a null
value, typeid throws a bad_typeid
exception.