CERT manifest files

The CERT manifest files (=E2=80=9Ccert_manifest.xml=E2=80=9D and= =E2=80=9Ccert_juliet.csv=E2=80=9D) are authored by Lori Flynn, David Svobo= da, and Andrew Kotov.

Download them (version 1) here.

These files can be used by static analysis tool developers to test their= coverage of (some of the) CERT Secure Coding Rules for C, using many of 61= ,387 test cases in the Juliet test sui= te v1.2. The format of =E2=80=9Ccert_manifest.xml=E2=80=9D is a sli= ghtly-modified version of the SARD man= ifest format (https://samate.nis= t.gov/SRD/resources/sard_schema.xsd), designed to enable users of t= he SARD manifest to easily also use this new CERT manifest.

To determine which test suite data could be used for CERT Secure Coding = Rules, we used precise mappings (see here and here for precise mapping information) between CERT= Secure Coding Rules and CWEs, combined with analysis of the Juliet Test Su= ite metadata, and occasionally examining the test suite files (e.g., to che= ck if the type of a variable was a short or a long).

The file =E2=80=9Ccert_manifest.xml=E2=80=9D is modeled after Juliet ent= ries in the SARD manifest. It differs in two major ways:

Our manifest is for CERT Secure Coding Rules (not CWEs)
Our manifest includes =E2=80=9Cfixed=E2=80=9D entries that indicate li= ne ranges that do not have a violation of the CERT secure c= oding rule identified (i.e., these identify line ranges for which a static = analysis alert for that CERT secure coding rule would be a false positive)<= /li>

Generally, we tried not to modify the order of existing attribute fields= , so users of the SARD manifest could most easily also use our manifest (in= case their parsers rely on attribute order). Also, we used test suite attr= ibutes with values as in the original Juliet SARD manifest, so the particul= ar version of Juliet files can be identified.

Additional details about how our manifest differs from the SARD manifest= , with reasons:

We added the following new attributes for the testcase field (attribut= e and entry values provided in parentheses):=20
1. alternate-taxonomy. (alternate-t= axonomy=3D=E2=80=9CCERT-C-Standard") Purpose is to indicate which al= ternate code flaw taxonomy (eg. CERT rules, CWEs, MISRA rules, etc.) that i= nformation will be provided for, as opposed to the code flaw taxonomy that = the test suite was originally designed to test.
2. SubmissionDate-alternate-taxonomy. (SubmissionDate-alternate-taxonomy=3D2018-09-28) Purpose is to in= dicate the date of submission of this manifest to SARD, for potential publi= cation on the NIST SARD test suite website= . The similarly-named attribute SubmissionDate is specific to = the testcase itself, and that was used for all manifest entries.
3. alternate-taxonomy-author. (alternate-taxonomy-author=3D"Lori Flynn and David Svoboda and = Andrew Kotov") Purpose is to identify authors of the new manifest en= tries. The similarly-named author attribute is specific to the= testcase itself, and that was used for all manifest entries.


  For the False verdicts, we did particular things fo=
r the following fields and attributes (in bold):=20
  
   We added a fixed field (same as in the=
 original SARD manifest) that identifies where the identified CERT secure c=
oding rule is not violated=20
    
      For the verdict attribute, we =
use the value False (verdict=3D=E2=80=9DFalse=E2=80=9D).
    
     For the file field, we added fields and values similar to tho=
se for the =E2=80=9Cmixed=E2=80=9D tag (i.e., True verdict entries for Juli=
et test cases, in the original SARD manifest Juliet entries). Many of the f=
iles did not have entries in the original SARD Juliet manifest entries.=20
    
     numberOfFiles. (numberOfFiles=3D"1")=
 The purpose of this field for file entries with True verdicts=
 is to indicate how many files are in a testcase. As an initial estimate, i=
n False verdicts, we assume this count is only the file identi=
fied, in each case a single file.
     checksum. (checksum =3D=E2=80=9D<SHA1_HA=
SH>=E2=80=9D) The purpose of this attribute is to uniquely identi=
fy the file. The other SARD file entries for checksum were derived using SH=
A1, so we derived a checksum value by running sha1sum.
     size. (size =3D=E2=80=9D<SIZE>=E2=80=
=9D) The purpose of this attribute is to identify the number of byte=
s in the file. To get this number, we ran the following command in a bash s=
hell: wc -c
    
     id, (id=3D"10000000") The purpose of this field is to uniquely identify the testcase ID. Init=
ially, we start with the first ID at 10000000 (a number larger than any id =
in the current SARD manifest), then increase each by 1. These are placehold=
ers, as SARD assigns their own testcase ids.

    We simply copied these attributes and values describing the te=
st suite, for the testcase field: id=3D=E2=80=9D86=E2=80=9D, submissionDate=3D"2=
013-05-20", status=3D"Candidate"
     We added the following new attributes for the testcase field,=
 the same as described above for the True (=E2=80=9Cmixe=
d=E2=80=9D) verdicts: alternate-taxonomy, SubmissionDate-alternate-taxonomy, and a=
lternate-taxonomy-author.


The file =E2=80=9Ccert_juliet.csv=E2=80=9D contains True and False infor=
mation for Juliet test suite and CERT Secure Coding Rules, but in a sparser=
 format than the .xml file.
It has entries of 2 types:

 <CERT_RULE>, True, <JULIET_FILEPATH>, <SINGLE_LIN=
E>, <CWE>
 <CERT_RULE>, False, <JULIET_FILEPATH>, <LINE_RANG=
E>, <CWE>

The filepaths for type-2 (False) are different from t=
he filepaths for type-1 (True), for the same files. The type-1=
 filepath is taken from the Juliet test suite's manifest XML file obtained =
from the SARD site (https://samate.nist.gov/SRD/testsui=
te.php) by downloading the SARD-type manifest, The type-2 filepath=
 is taken from the filepath starting at the "testcases" d=
irectory from the source code in the Juliet test suite obtained the Juliet-=
standalone-test-suite way. To explain: the key to getting =
the different versions (Juliet-standalone-test-suite or SARD-type) of sourc=
ecode and manifest is how you download (from the same webpage!) from here: =
https://samate.=
nist.gov/SRD/testsuite.php

 to download the SARD-type manifest and code, click on the icon for "ma=
nifest" in the "SARD Suites" section of =
the webpage, which is below the "Standalone Suites" section.
 to download the Juliet-standalone-test-suite type of manifest and=
 code, get it in the "Standalone Suites=
" section at the top of the page