
3
section III. Each of the 12 descriptive files discussed in this section are static, contain the latest year for
which information is available, and therefore do not have a time dimension. All of these files contain
the firm ID (or BVDID) in the first column and can be linked by merging on this identifier.
1.DATA CONTENTS
The following describes the contents of the 12 firm description RAR files.
1. All addresses.txt: contains detailed address information. The variables include the main firm
identifier BVDID, the first four lines of the street address (both in English and the native
language), city (both in English and native language), postcode, country, country ISO code,
region in country, type of region in country, telephone and fax numbers, and address type.
a. There may be multiple entries per BVDID because one firm can have multiple address
types. The most common address type is incorporation address, but others include
previous address, branch address, and postal address.
b. Depending on the purpose of the study, the user can create the dataset that only
contains one observation per BVDID. For this, the user can implement the following
steps. Identify cases where there is more than one entry per BVDID (using the
duplicates tag command in Stata). If a BVDID does have multiple entries:
i. First, keep the incorporation address
ii. Some firms do not report an incorporation address, and therefore multiple
entries per BVDID need to be dealt with using different criteria. Second, among
remaining multiple entries, drop the previous address
iii. The remaining cases of multiple entries usually have two types of addresses:
office and postal. As a last step, keep the office address.
2. Contact info.txt: this file is similar to All addresses.txt, but has only one entry per BVDID. That
is, while the All addresses.txt file contains information on a firm’s previous address, branch
address, etc., the Contact info.txt file only contains the latest address for each firm. The file
contains the firm name, first four lines of the street address (both in English and the native
language), postcode, city (in English and the native language), country, country ISO,
metropolitan area (for the US), state/province (in US and Canada), county (US and Canada), fax
and telephone number, website, email address, region in the country and region type.
3. Identifiers.txt: contains various firm identifiers for each BVDID, including a national ID number,
the label of that national ID, the national VAT/tax identifier, trade register number, European
VAT number, LEI (legal entity identifier), and ticker symbol.
a. This file often has multiple entries for each BVDID. One common reason is that a
country has more than one type of national identifier. For example, in many countries
firms are assigned both a VAT/tax identifier and a LEI. Since both are types of national
IDs, there will be two observations per firm. One observation where the national ID
variable is populated with the VAT/tax identifier and the other where the national ID
variable is populated with the LEI.