 
                            Signed character data must be converted to an unsigned  type char before being assigned or converted to a larger signed type. Because compilers have the latitude to define char This rule applies to both signed char and (plain) char characters on implementations where char is defined to have the same range, representation, and behavior behaviors as either signed char or unsigned char.
However, this rule should be applied to both signed char and (plain) char characters.This rule is only applicable is applicable only in cases where the character data may contain values that can be interpreted misinterpreted as negative valuesnumbers. For example, if the char type is represented by a two's complement 8-bit value, any character value greater than +127 is interpreted as a negative value.
...
This rule is a generalization of STR37-C. Arguments to character-handling functions must be representable as an unsigned char.
Noncompliant Code Example
This non-compliant noncompliant code example is taken from a vulnerability in bash versions 1.14.6 and earlier that resulted in the that led to the release of CERT Advisory CA-1996-22. This vulnerability resulted from the sign extension of character data referenced by the string c_str pointer in the yy_string_get() function in the parse.y module of the bash source code:
| Code Block | ||||
|---|---|---|---|---|
| 
 | ||||
| static int yy_string_get(void) { register char *stringc_str; register int c; stringc_str = bash_input.location.string; c = EOF; /* If the string doesn't exist, or is empty, EOF found. */ if (stringc_str && *stringc_str) { c = *stringc_str++; bash_input.location.string = stringc_str; } return (c); } | 
The string c_str variable is used to traverse the character string containing the command line to be parsed. As characters are retrieved from this pointer, they are stored in a variable of type int. For compilers implementations in which the char type defaults to is defined to have the same range, representation, and behavior as signed char, this value is sign-extended when assigned to the int variable. For character code 255 decimal (-1 −1 in two's complement form), this sign extension results in the value -1 −1 being assigned to the integer, which is indistinguishable from EOF.
Noncompliant Code Example
This problem was can be repaired by explicitly declaring the string c_str variable as unsigned char.:
| Code Block | ||||
|---|---|---|---|---|
| 
 | ||||
| static int yy_string_get(void) { register unsigned char *stringc_str; register int c; stringc_str = bash_input.location.string; c = EOF; /* If the string doesn't exist, or is empty, EOF found. */ if (stringc_str && *stringc_str) { c = *stringc_str++; bash_input.location.string = stringc_str; } return (c); } | 
This solutionexample, however, is in violation of STR07-Aviolates STR04-C. Use plain char for characters in the basic character dataset.
Compliant Solution
In this compliant solution, the result of the expression *stringc_str++ is cast to (unsigned char) before assignment to the int variable c.:
| Code Block | ||||
|---|---|---|---|---|
| 
 | ||||
| static int yy_string_get(void) { register char *stringc_str; register int c; stringc_str = bash_input.location.string; c = EOF; /* If the string doesn't exist, or is empty, EOF found. */ if (stringc_str && *stringc_str) { /* Cast to unsigned type */ c = (unsigned char)*string++; /* cast to unsigned type */c_str++; bash_input.location.string = c_str; } return (c); } | 
Noncompliant Code Example
In this noncompliant code example, the cast of *s to unsigned int can result in a value in excess of UCHAR_MAX because of integer promotions, a violation of ARR30-C. Do not form or use out-of-bounds pointers or array subscripts:
| Code Block | ||||
|---|---|---|---|---|
| 
 | ||||
| #include <limits.h> #include <stddef.h> static const char table[UCHAR_MAX + 1] = { 'a' /* ... */ }; ptrdiff_t first_not_in_table(const char *c_str) { for (const char *s = c_str; *s; ++s) { if (table[(unsigned int)*s] != *s) { bash_input.location.string = string; } return (c); } | 
Risk Assessment
| return s - c_str;
    }
  }
  return -1;
}
 | 
Compliant Solution
This compliant solution casts the value of type char to unsigned char before the implicit promotion to a larger type:
| Code Block | ||||
|---|---|---|---|---|
| 
 | ||||
| #include <limits.h>
#include <stddef.h>
 
static const char table[UCHAR_MAX + 1] = { 'a' /* ... */ };
ptrdiff_t first_not_in_table(const char *c_str) {
  for (const char *s = c_str; *s; ++s) {
    if (table[(unsigned char)*s] != *s) {
      return s - c_str;
    }
  }
  return -1;
}
 | 
Exceptions
STR34-C-EX1: This rule only applies to characters that are to be treated as unsigned chars for some purpose, such as being passed to the isdigit() function. Characters that hold small integer values for mathematical purposes need not comply with this rule.
Risk Assessment
Conversion of character data resulting in a value in excess of UCHAR_MAX is an often-missed error that can result This is a subtle error that results in a disturbingly broad range of potentially severe vulnerabilities.
| Rule | Severity | Likelihood | Detectable | 
|---|
| Repairable | Priority | Level | 
|---|---|---|
| STR34-C | 
2 (medium)
2 (probable)
| Medium | Probable | Yes | No | 
| P8 | L2 | 
Automated Detection
...
| Tool | Version | Checker | Description | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Astrée | 
 | char-sign-conversion | Fully checked | ||||||
| Axivion Bauhaus Suite | 
 | CertC-STR34 | Fully implemented | ||||||
| CodeSonar | 
 | MISC.NEGCHAR | Negative Character Value | ||||||
| Compass/ROSE | Can detect violations of this rule when checking for violations of INT07-C. | 
...
| Use only explicitly signed or unsigned char type for numeric values | |||||||||
| Coverity | 
 | MISRA C 2012 Rule 10.1 MISRA C 2012 Rule 10.2 MISRA C 2012 Rule 10.3 MISRA C 2012 Rule 10.4 | Implemented Essential type checkers | ||||||
| Cppcheck Premium | 
 | premium-cert-str34-c | |||||||
| 
 | CC2.STR34 | Fully implemented | |||||||
| GCC | 2.95 and later | Detects objects of type  | |||||||
| Helix QAC | 
 | C2140, C2141, C2143, C2144, C2145, C2147, C2148, C2149, C2151, C2152, C2153, C2155 C++3051 | |||||||
| Klocwork | 
 | CXX.CAST.SIGNED_CHAR_TO_INTEGER | |||||||
| LDRA tool suite | 
 | 434 S | Partially implemented | ||||||
| Parasoft C/C++test | 
 | CERT_C-STR34-b | Cast characters to unsigned char before assignment to larger integer sizes | ||||||
| PC-lint Plus | 
 | 571 | Partially supported | ||||||
| 
 | CERT C: Rule STR34-C | Checks for misuse of sign-extended character value (rule fully covered) | |||||||
| RuleChecker | 
 | char-sign-conversion | Fully checked | ||||||
| TrustInSoft Analyzer | 
 | out of bounds read | Partially verified (exhaustively detects undefined behavior). | 
Related Vulnerabilities
CVE-2009-0887 results from a violation of this rule. In Linux PAM (up to version 1.0.3), the libpam implementation of strtok() casts a (potentially signed) character to an integer for use as an index to an array. An attacker can exploit this vulnerability by inputting a string with non-ASCII characters, causing the cast to result in a negative index and accessing memory outside of the array [xorl 2009].
Fortify SCA Version 5.0 with CERT C Rule Pack can detect violations of this rule.
Related Vulnerabilities
Search for vulnerabilities resulting from the violation of this rule on the CERT website.
References
| Wiki Markup | 
|---|
| \[[ISO/IEC 9899-1999|AA. C References#ISO/IEC 9899-1999]\] Section 6.2.5, "Types"
\[[MISRA 04|AA. C References#MISRA 04]\] Rule 6.1, "The plain char type shall be used only for the storage and use of character values." | 
Related Guidelines
| CERT C Secure Coding Standard | STR37-C. Arguments to character-handling functions must be representable as an unsigned char STR04-C. Use plain char for characters in the basic character set ARR30-C. Do not form or use out-of-bounds pointers or array subscripts | 
| ISO/IEC TS 17961:2013 | Conversion of signed characters to wider integer types before a check for EOF [signconv] | 
| MISRA-C:2012 | Rule 10.1 (required) Rule 10.2 (required) Rule 10.3 (required) Rule 10.4 (required) | 
| MITRE CWE | CWE-704, Incorrect Type Conversion or Cast | 
Bibliography
...
STR33-C. Size wide character strings correctly 07. Characters and Strings (STR) STR35-C. Do not copy data from an unbounded source to a fixed-length array