Past Ambulatory Care Monthly News Data Tips

Visit the National Ambulatory Medical Care Survey website at https://www.cdc.gov/nchs/namcs/ and the National Hospital Ambulatory Medical Care Survey at https://www.cdc.gov/nchs/nhamcs/ for more information about these surveys.

Note: the January editions of Ambulatory Care Monthly News do not feature data tips.

May 2024

Data Tip of the Month

Did you know . . .

Injury data in NHAMCS are edited using a program which reviews codes for reason for visit, diagnosis, and cause of injury. It then assigns injury and intentionality status accordingly. This way, records which do not specifically state an injury but for which injury codes for reason, diagnosis, and/or cause of injury are present are recoded appropriately. Meanwhile, records which state an injury but for which no supporting data could be found are assigned to a ‘questionable’ injury status. This allows data users to make their own determination as desired. Check the codebook to select the injury variables for your research.

April 2024

Data Tip of the Month

Did you know . . .

To analyze NHAMCS data in SAS, download the dataset(s) you are interested in from here. Save the datasets in a new folder on your local workstation, for example, C:\MyFiles\ED2021. The datasets are compressed and must be unzipped prior to use. After downloading, right-click on the file name from your directory screen; an option to unzip the files should appear. SAS input, label, and format files can be downloaded from here along with README files which include more information. To create a SAS dataset, you can use the following example:

filename ed21pub "C:\MyFiles\ED2021\ed2021"; /*unzipped ASCII data set*/
filename ed21for "C:\MyFiles\ED2021\ed21for.txt"; /*SAS format statement*/
filename ed21inp "C:\MyFiles\ED2021\ed21inp.txt"; /*SAS input statement*/
filename ed21lab "C:\MyFiles\ED2021\ed21lab.txt"; /*SAS label statement*/
%inc ed21for; /*reads in the formats*/
data test21;
infile ed21pub missover lrecl=9999;
%inc ed21inp; /*reads in the input statement*/
%inc ed21lab; /*reads in the labels*/
run;

March 2024

Data Tip of the Month

Did you know . . .

A method to analyze drug data involves the isolation of the records with drugs and the creation of a separate data file of drug mentions. Each Patient Record can have up to 30 drug mentions recorded, so whatever file is created would need to include all of them. This method can be used for obtaining estimates of drug mentions but is not recommended for variance estimation. Rather, the structure of the visit file should be kept intact when estimating variance. In order to do this, estimates of drug mentions can be obtained by creating a new weight variable (called DRUGWT in this example). This variable is created by multiplying PATWT (the patient visit weight) by NUMMED (the number of medications recorded at the sampled visit) or DRUGWT=PATWT*NUMMED. DRUGWT can then be used in place of PATWT to weight one’s data; it produces the estimated number of drug mentions rather than visits.
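The weight construction can be sketched in a few lines of Python. This is only an illustration: the three visit records are made up, and only the names PATWT, NUMMED, and DRUGWT come from the tip. Note that the visit file keeps all of its records, including the visit with no drugs.

```python
# Hypothetical visit records; only the variable names PATWT and NUMMED come from the survey file.
visits = [
    {"PATWT": 1500.0, "NUMMED": 2},
    {"PATWT": 2300.0, "NUMMED": 0},
    {"PATWT": 800.0, "NUMMED": 5},
]


def add_drug_weight(records):
    """Return copies of the records with DRUGWT = PATWT * NUMMED added."""
    return [dict(r, DRUGWT=r["PATWT"] * r["NUMMED"]) for r in records]


weighted = add_drug_weight(visits)

# Weighting by PATWT estimates visits; weighting by DRUGWT estimates drug mentions.
estimated_visits = sum(r["PATWT"] for r in weighted)
estimated_drug_mentions = sum(r["DRUGWT"] for r in weighted)
```

In a real analysis the same DRUGWT variable would be supplied as the weight to a survey procedure so that the variance is estimated from the intact visit file.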

February 2024

Data Tip of the Month

Did you know . . .

For the 2021 NHAMCS, four items were imputed: age (0.09% of unweighted visit records), sex (0.09%), race (21.4%), and ethnicity (14.4%). Age and sex were imputed using a hot deck based on 3-digit ICD-10-CM code for primary diagnosis, triage level, ED volume, and geographic region. Patient race and ethnicity imputations were performed using a model-based single, sequential regression method. The model used to impute race and ethnicity included the following variables:

Census variables for ZIP code level race and ethnicity population estimates and an indicator for whether it was patient or hospital ZIP (used when patient ZIP was not available).
Patient age, sex, race, and ethnicity.
Triage level.
Log of ED wait time.
Primary expected source of payment derived from a hierarchical recode of the expected source of payment question.
Grouped 3-digit ICD-10-CM codes for primary diagnosis.
Year of visit.
Type of emergency service area.
Provider’s metropolitan statistical area status.
ED weighting and volume variables.

December 2023

Data Tip of the Month

Did you know . . .

It is possible to combine two or more years of NHAMCS to increase the sample size. This allows you to produce estimates with greater statistical reliability for subgroup analyses (for example, age, sex, race, and Hispanic origin).

When combining years of data, it is important to:

Verify that the data items of interest are comparable in terms of how they were collected in each year.
Verify that the variable names have not changed over the years.
To obtain an annual average number of weighted visits using a combined file, divide the weighting variable PATWT by the number of years you are combining. For example, if you are combining data for 2020 and 2021, you can create a new variable called PATWT2, defined as PATWT divided by 2 (PATWT/2). Weighting the data with PATWT2 will yield the annual average estimate for 2020 and 2021. If you run data for combined years and use the original weight (PATWT), your result will reflect a 2-year visit total rather than an annual average.
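As a rough illustration of the arithmetic, the following Python sketch builds PATWT2 for a made-up combined 2020 and 2021 file. The records and the YEAR field are hypothetical; only PATWT and PATWT2 follow the tip.

```python
# Hypothetical combined file: two records from each of two survey years.
combined = [
    {"YEAR": 2020, "PATWT": 1000.0},
    {"YEAR": 2020, "PATWT": 3000.0},
    {"YEAR": 2021, "PATWT": 2000.0},
    {"YEAR": 2021, "PATWT": 4000.0},
]


def add_annual_average_weight(records, name="PATWT2"):
    """Add a weight equal to PATWT divided by the number of distinct years combined."""
    n_years = len({r["YEAR"] for r in records})
    return [dict(r, **{name: r["PATWT"] / n_years}) for r in records]


averaged = add_annual_average_weight(combined)

two_year_total = sum(r["PATWT"] for r in averaged)    # what the original weight yields
annual_average = sum(r["PATWT2"] for r in averaged)   # what the rescaled weight yields
```

Counting the distinct years in the file, rather than hard-coding 2, is a design choice that keeps the divisor in step with the data actually appended.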

November 2023

Data Tip of the Month

Did you know . . .

When analyzing NAMCS and NHAMCS data with R software, use either the R “surveytable” package (https://cdcgov.github.io/surveytable/) or the R “survey” package. In “surveytable”, use the “tab” function to generate tables with estimates of counts and percentages. In “survey”, use the “svytotal” function to generate estimates of counts and “svymean” to generate estimates of percentages.

October 2023

Data Tip of the Month

Did you know . . .

README files containing instructions to create SAS, SPSS, and Stata datasets using the 2021 NHAMCS public use file are available here:

Instructions for previous years for SAS, SPSS, and Stata are also available, along with pre-made SAS, SPSS, and Stata datasets for 2015-2021.

NHAMCS public use file documentation is available here, and public use data files in ASCII format are available here.

September 2023

Data Tip of the Month

Did you know . . .

The default confidence interval (CI) method commonly produced by statistical software is the Wald confidence interval [p ± 1.96 × SE(p) for a two-sided 95% CI]. The Wald CI is known to have limitations for proportions. The NCHS publication Data Presentation Standards for Proportions includes criteria based on the absolute width and the relative width of the Clopper-Pearson confidence interval, which was adapted for complex sample surveys by Korn and Graubard. The calculation of the Korn and Graubard CI incorporates information from the survey design, including the effective sample size and, when appropriate, the degrees of freedom. For proportions estimated for a subgroup, the degrees of freedom should be calculated as (the number of PSUs with sampled observations in the subgroup of interest) – (the number of strata with sampled observations in the subgroup of interest). PSU is primary sampling unit.
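To make the Wald formula concrete, here is a minimal Python sketch. The proportions and standard errors are made-up inputs, and the function name is hypothetical. It does not implement the Korn and Graubard interval; it only shows the Wald calculation and one of its known limitations for small proportions.

```python
def wald_ci(p, se, z=1.96):
    """Two-sided Wald interval p +/- z * SE(p); z = 1.96 gives a 95% CI."""
    half_width = z * se
    return p - half_width, p + half_width


# A mid-range proportion gives a sensible interval.
lower, upper = wald_ci(0.10, 0.02)

# A small proportion with a relatively large SE gives a lower limit below zero,
# which is impossible for a proportion.
rare_lower, rare_upper = wald_ci(0.01, 0.01)
```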

August 2023

Data Tip of the Month

Did you know . . .

Throughout the years, NAMCS and NHAMCS datasets and related documentation have been released in a variety of formats. When looking for a dataset to download or other survey information, the following pages could be useful:

The Datasets and Documentation page lists all the data files and documentation from the earliest to the newest releases. It also includes links for a Data Notices page as well as pages containing updates to the data files and documentation, if any.

The Survey Instruments page contains links to the NAMCS and NHAMCS Patient Record forms (called sample cards in recent years) and Induction Interview forms for physicians, community health center administrators and providers, and hospitals and emergency departments.

The Survey Methods and Analytic Guidelines page contains detailed information on survey methods, nonresponse bias, and the various classification systems used to code NAMCS and NHAMCS data.

July 2023

Data Tip of the Month

Did you know . . .

You can view counts and rates of emergency department visits from 2016-2021 for the 10 leading primary diagnoses and reasons for visit, by patient and hospital characteristics of your interest. Estimates in this visualization highlight and expand on information provided in the annual NHAMCS web tables, which can be used to assess how these categories and rankings changed over the evaluated years. The tabs at the bottom of the visualization allow you to select between “Primary Diagnosis” and “Reason for Visit,” and the drop-down menus at the top of the visualization allow you to select the estimate type, the estimate category, and the group breakdown of interest.

June 2023

Data Tip of the Month

Did you know . . .
To analyze NAMCS or NHAMCS public use files using SAS, two options are available. Option 1 is to download a pre-made SAS dataset along with the corresponding SAS format file to run the data. Option 2 is to download the public use file in ASCII format along with SAS input, label, and format statements. These can be used to create one’s own SAS dataset. An example of Option 1 using 2020 NHAMCS ED data is shown below:

Option 1:
Download the 2020 ED pre-made SAS dataset (ed2020_sas.zip) and format file (ed20for.txt) and save them to a folder of your choosing.

Right-click on the name of the zipped file from your directory screen. There should be an option to extract the file to a location of your choosing.

Use this SAS code to create a temporary working file. In this example, the data are saved to a folder called “c:\myfiles\nhamcs”.
%INC "c:\myfiles\nhamcs\ed20for.txt"; /*reads in the SAS formats from your downloaded format file*/
LIBNAME out1 "c:\myfiles\nhamcs"; /*points to the location of the downloaded data file*/
DATA test20; SET out1.ed2020_sas; RUN; /*creates a temporary working file copied from the unzipped file*/
PROC SURVEYFREQ DATA=test20;
TABLES sex*ager /clwt cl;
CLUSTER cpsum;
STRATA cstratm;
WEIGHT patwt;
RUN;

For instructions on how to use the ASCII file to create your own SAS dataset (Option 2), see readme2020-ed-sas-txt.

May 2023

Data Tip of the Month

Did you know . . .

For the NHAMCS ED public use file (PUF), only the first four digits of the diagnosis codes, based on the International Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM), are included. There is an implied decimal between the 3rd and 4th digits, and inapplicable 4th digits are dash-filled. For example: F321 = F32.1 Major depressive disorder, single episode, moderate. Because only 4 digits are included, more detailed codes are not available on the PUF. For instance, suicidal ideations (R45.851) is not an available code on the PUF. Instead, R45.851 would appear as R45.8 on the PUF (Other symptoms and signs involving emotional state), which includes conditions in addition to suicidal ideations. NCHS reports may present estimates using codes that are not available on the PUF. If access to the full ICD-10-CM codes is needed, it can be requested through the NCHS Research Data Center.
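The implied decimal and the dash fill can be handled with simple string operations. The Python sketch below is an illustration only: both function names are hypothetical, and it does not handle any special codes the file may use for blank or missing diagnoses.

```python
def format_puf_diagnosis(code):
    """Turn a 4-character PUF code such as 'F321' into dotted form ('F32.1').

    A dash in the 4th position means the digit is inapplicable, so only the
    3-character stem is returned.
    """
    code = code.strip()
    stem, fourth = code[:3], code[3:4]
    if fourth in ("", "-"):
        return stem
    return f"{stem}.{fourth}"


def truncate_to_puf(full_code):
    """Reduce a full ICD-10-CM code such as 'R45.851' to its 4-character PUF form."""
    compact = full_code.replace(".", "")
    return compact[:4].ljust(4, "-")


moderate_depression = format_puf_diagnosis("F321")
dash_filled = format_puf_diagnosis("I10-")
suicidal_ideation_on_puf = format_puf_diagnosis(truncate_to_puf("R45.851"))
```

The last line shows why R45.851 cannot be isolated on the PUF: after truncation it is indistinguishable from every other code that begins with R45.8.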

April 2023

Data Tip of the Month

Did you know . . .

The National Center for Health Statistics’ new data presentation standards for rates and counts have been released. The multistep NCHS data presentation standards for rates and counts are based on a minimum sample size and the relative width of a confidence interval (CI). Specific criteria for rates and counts, including the CI calculations used, differ between vital statistics and health surveys and may differ according to the source of the denominator. For specific components of NCHS data presentation standards for rates and counts, see Vital and Health Statistics, Series 2, No. 200.

March 2023

Data Tip of the Month

Did you know . . .

NAMCS and NHAMCS collect data on up to 30 medications provided, prescribed, or continued at the sampled visit. Each drug is first coded “as written” using an internal NCHS classification (MED1-MED30). Each drug also has associated characteristics added: the generic-equivalent code from Multum (DRUGID1-DRUGID30), prescription status (PRESCR1-PRESCR30), controlled substance status (CONTSUB1-CONTSUB30), composition status (COMSTAT1-COMSTAT30), and up to 4 Multum therapeutic categories (RX1CAT1-RX30CAT1, RX1CAT2-RX30CAT2, RX1CAT3-RX30CAT3, RX1CAT4-RX30CAT4). For each therapeutic category, there are also Level 1, Level 2 and Level 3 variables that together show the complete nested structure of the therapeutic category. For example, the drug rosuvastatin has a Level 1 code of metabolic agents, a Level 2 code of antihyperlipidemic agents, and a Level 3 code of HMG-CoA reductase inhibitors (statins). This enables you to run drugs at either the first, second, or third level of the therapeutic classification. More information can be found in the annual public use file documentation or on the website.

February 2023

Data Tip of the Month

Did you know . . .

The process for requesting access to restricted-use data from a federal statistical agency, including the National Center for Health Statistics (NCHS), has recently changed. For the first time, a standard application process (SAP) will be used by all 16 agencies comprising the U.S. Federal Statistical System.

This change makes NCHS and other federal data more accessible and usable for evidence-building purposes. Now data users only need to follow one process and complete a single application to request access to data from multiple agencies. Application review criteria have also been standardized through SAP.

The SAP portal houses the following resources:

Metadata catalog: Data users can search key words to determine if NCHS or other federal statistical agencies have data suited to their specific-use cases. NCHS has restricted-use data on a variety of public health topics, including vital statistics, health and nutrition, health status, access to care, and ambulatory care services.
Standard application: Data users can complete one application to request access to NCHS or other federal data sets using the SAP portal. In doing so, they must demonstrate that any data they access will be used for statistical purposes only. Data users can also track the status of their application as it moves through the review process. If approved, NCHS will guide applicants through the data access process.

If you have questions about SAP for accessing NCHS restricted-use data, contact rdca@cdc.gov.

December 2022

Data Tip of the Month

Did you know . . .

When analyzing NAMCS and NHAMCS data with R software, the R “survey” package should be used. The svydesign function combines a data frame and all the survey design information needed to analyze it. It is important to never subset the data frame before using the svydesign function. If you subset your data frame before defining your survey design object, you may produce incorrect variance estimates.

November 2022

Data Tip of the Month

Did you know . . .

When combining multiple years of NAMCS or NHAMCS data always check the contents of the data files because variable names may be different from year to year.  You can download the documentation for the years of interest here. The codebook section lists all the variables in the data file. If the labels of the variables of interest have changed, you should recode the variables to make their names and response categories consistent before appending the data.

October 2022

Data Tip of the Month

Did you know . . .

For a complex survey, the design degrees of freedom are calculated by subtracting the number of strata from the number of primary sampling units (PSUs). In an analysis on a subgroup, the degrees of freedom should be based on the number of strata and PSUs containing the observations of interest.  In SUDAAN, by using the PROC DESCRIPT procedure, the user can output the number of strata and PSUs represented in the subpopulation. In other packages, the user may need to calculate the number of PSUs and strata separately.
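For packages that do not report these counts, the calculation can be done by hand. The Python sketch below assumes the design variables are named CSTRATM (stratum) and CPSUM (PSU) and uses made-up records; the SEX values and the function name are hypothetical. A PSU is identified by its stratum and PSU code together, which is a cautious assumption in case PSU codes repeat across strata.

```python
# Hypothetical records: stratum, PSU, and one analysis variable.
records = [
    {"CSTRATM": 1, "CPSUM": 1, "SEX": 1},
    {"CSTRATM": 1, "CPSUM": 1, "SEX": 2},
    {"CSTRATM": 1, "CPSUM": 2, "SEX": 1},
    {"CSTRATM": 2, "CPSUM": 3, "SEX": 2},
    {"CSTRATM": 2, "CPSUM": 4, "SEX": 1},
    {"CSTRATM": 2, "CPSUM": 5, "SEX": 1},
    {"CSTRATM": 3, "CPSUM": 6, "SEX": 2},
]


def degrees_of_freedom(rows, in_subgroup=lambda r: True):
    """Number of PSUs minus number of strata, counting only units with subgroup records."""
    psus = {(r["CSTRATM"], r["CPSUM"]) for r in rows if in_subgroup(r)}
    strata = {stratum for stratum, _ in psus}
    return len(psus) - len(strata)


overall_df = degrees_of_freedom(records)
subgroup_df = degrees_of_freedom(records, lambda r: r["SEX"] == 1)
```

Here the full file has 6 PSUs in 3 strata, while the subgroup appears in only 4 PSUs across 2 strata, so its degrees of freedom are smaller.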

September 2022

Data Tip of the Month

Did you know . . .

NAMCS and NHAMCS have separate files of drug ingredients for each year of data, which can be merged with the public use files using a program provided on the survey website.  While each drug on the public use file includes up to four therapeutic categories, combination products are composed of multiple ingredients, and each one of those may have its own therapeutic categories. By adding the drug ingredient file to the main public use file, data users can access this additional information which is year- and survey-specific.

August 2022

Data Tip of the Month

Did you know . . .

When using SUDAAN remember to sort your input dataset by the design variables specified on the NEST statement. Your analysis dataset should be sorted in SAS by the strata and cluster variables before calling any SUDAAN procedures.

July 2022

Data Tip of the Month

Did you know . . .

To properly account for the sample design and obtain correct variance estimates, all patient visits with a positive sample weight should be included in your analysis. It is important not to drop any records from your dataset; for example, do not use an “IF” to subset your data. Instead, SAS provides domain or subgroup analysis, which allows you to include all observations while focusing on the subgroup of interest. It is also important to retain records with missing values for the variable of interest. To do this using SAS, always include NOMCAR in the PROC SURVEYFREQ statement to allow the missing values to be included in the standard error computations.
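The effect of dropping records can be seen in a small worked example. The Python sketch below uses a simplified with-replacement, PSU-level variance formula for a weighted total; it illustrates the principle and is not a substitute for a survey procedure. The records, the IN_DOMAIN flag, and the function name are made up, and the design variables are assumed to be named CSTRATM and CPSUM.

```python
from collections import defaultdict


def domain_total_and_variance(records, in_domain):
    """Weighted domain total and a with-replacement PSU-level variance estimate.

    Records outside the domain contribute zero but still count toward the
    number of PSUs in their stratum.
    """
    psu_totals = defaultdict(lambda: defaultdict(float))
    for r in records:
        contribution = r["PATWT"] if in_domain(r) else 0.0
        psu_totals[r["CSTRATM"]][r["CPSUM"]] += contribution

    total, variance = 0.0, 0.0
    for psus in psu_totals.values():
        values = list(psus.values())
        n = len(values)
        total += sum(values)
        if n > 1:
            mean = sum(values) / n
            variance += n / (n - 1) * sum((v - mean) ** 2 for v in values)
    return total, variance


# One stratum with three PSUs; the third PSU has no visit in the subgroup.
records = [
    {"CSTRATM": 1, "CPSUM": 1, "PATWT": 10.0, "IN_DOMAIN": True},
    {"CSTRATM": 1, "CPSUM": 2, "PATWT": 20.0, "IN_DOMAIN": True},
    {"CSTRATM": 1, "CPSUM": 3, "PATWT": 30.0, "IN_DOMAIN": False},
]


def is_in_domain(r):
    return r["IN_DOMAIN"]


# Domain analysis: keep every record and flag the subgroup.
full_design = domain_total_and_variance(records, is_in_domain)

# Subsetting first: the third PSU disappears from the design.
subset = [r for r in records if r["IN_DOMAIN"]]
after_subsetting = domain_total_and_variance(subset, is_in_domain)
```

Both approaches give the same estimated total, but the subsetted file yields a smaller variance because one PSU is no longer counted in the design.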

June 2022

Data Tip of the Month

Did you know . . .

When combining multiple years of NAMCS or NHAMCS data, you can produce averaged annual estimates by doing the following: create a new weight variable by dividing the patient visit weight (PATWT) by the number of the years in your analysis (e.g., if combining 3 years of data, new variable PATWT3=PATWT/3).

May 2022

Data Tip of the Month

Did you know . . .

Starting in 2014, up to 30 drugs are collected per visit in NAMCS and NHAMCS. Each MED code is associated with a DRUGID code (MED1 and DRUGID1, MED2 and DRUGID2, etc.). The MED code, based on an NCHS classification, represents the drug entry as reported in the survey instrument and can include brand names, generic names, or therapeutic effects (such as allergy relief). The DRUGID code, based on a proprietary classification, represents the generic composition of the drug. Each DRUGID can be associated with up to 4 therapeutic categories. For example, MED1 is assigned to DRUGID1, which is associated with RX1CAT1, RX1CAT2, RX1CAT3, and RX1CAT4. These RXCAT variables will always reflect the highest level therapeutic code available. SAS exercises on how to use the drug variables can be found here [PDF – 95 KB].
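The pairing of MED, DRUGID, and RXCAT variables by position can be sketched in Python as follows. The record values are placeholders rather than real codes, the function name is hypothetical, and the sketch assumes an unused drug slot is simply absent or None, which may not match how blank slots are coded in the actual file.

```python
def drug_mentions(visit, max_drugs=30):
    """List one entry per recorded drug, pairing MEDn with DRUGIDn and RXnCAT1-RXnCAT4."""
    mentions = []
    for n in range(1, max_drugs + 1):
        med = visit.get(f"MED{n}")
        if med is None:  # assumption: an unused slot is absent or None in this toy record
            continue
        categories = [
            visit[f"RX{n}CAT{k}"]
            for k in range(1, 5)
            if visit.get(f"RX{n}CAT{k}") is not None
        ]
        mentions.append(
            {"SLOT": n, "MED": med, "DRUGID": visit.get(f"DRUGID{n}"), "RXCATS": categories}
        )
    return mentions


# Placeholder values only; real files carry NCHS and Multum codes.
visit = {
    "MED1": "med-A", "DRUGID1": "drug-A", "RX1CAT1": "cat-1", "RX1CAT2": "cat-2",
    "MED2": "med-B", "DRUGID2": "drug-B", "RX2CAT1": "cat-3",
}
mentions = drug_mentions(visit)
```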

April 2022

Data Tip of the Month

Did you know . . .

When using NAMCS or NHAMCS annual public use data file documentation, you will sometimes find web links to other documents located on the NCHS FTP server. But when you click them, the link is broken. That’s because NCHS changed its FTP address and these older links will no longer work. There is an easy work-around, however. The steps may vary slightly depending on your browser, but the logic is the same: Simply right-click on the ‘bad’ link from the documentation and copy the hyperlink to your browser’s URL line. Change just the initial “ftp” in the URL address to “https”, and it should work correctly in most cases. There is an extra step to take when using older links pointing to the 2018 NHAMCS Public Use File Documentation. (Such links are included in Appendix II and Appendix III of the 2018 NAMCS Public Use File Documentation.) For these links, change the initial “ftp” in the URL address to “https” and also change the file name to doc18-ed-508.
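If you have many old links to repair, the basic substitution can be scripted. This Python sketch covers only the scheme change; the function name and the example address are hypothetical, and the extra file-name change for the 2018 documentation is not handled.

```python
def repair_ftp_link(url):
    """Replace a leading 'ftp' scheme with 'https'; leave other URLs unchanged."""
    prefix = "ftp://"
    if url.startswith(prefix):
        return "https://" + url[len(prefix):]
    return url


# Hypothetical address used only to show the substitution.
old_link = "ftp://example.gov/pub/docs/sample.pdf"
new_link = repair_ftp_link(old_link)
```

Only the scheme at the very start of the address is changed, so any "ftp" that appears later in the path is left alone.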

March 2022

Data Tip of the Month

Did you know . . .

If you wish to link visit characteristics with providers and produce aggregated statistics at the provider level you could follow these steps:

Organize the data in a DATA step, converting missing values for continuous variables to ‘.’ and creating 0, 1 variables out of categorical variables where necessary

Use PROC SUMMARY (or PROC MEANS) to create one record per provider along with the aggregate statistics for that provider.

Clean up the output file by converting proportions to percentages.

SAS code examples can be found here [PDF – 53 KB].
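For readers who want to see the logic of the three steps outside SAS, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the PROVIDER_ID field, the use of -9 as a missing-value code, the coding of SEX with 1 meaning female, and the made-up records. The summaries are unweighted.

```python
from collections import defaultdict

MISSING_CODES = {-9}  # hypothetical code marking a missing continuous value


def summarize_by_provider(visits):
    """Return one summary record per provider."""
    grouped = defaultdict(list)
    for v in visits:
        grouped[v["PROVIDER_ID"]].append(v)

    summary = {}
    for provider, rows in grouped.items():
        # Step 1: drop missing continuous values and build a 0/1 indicator.
        ages = [r["AGE"] for r in rows if r["AGE"] not in MISSING_CODES]
        female = [1 if r["SEX"] == 1 else 0 for r in rows]
        # Steps 2 and 3: aggregate, then express the proportion as a percentage.
        summary[provider] = {
            "N_VISITS": len(rows),
            "MEAN_AGE": sum(ages) / len(ages) if ages else None,
            "PCT_FEMALE": 100.0 * sum(female) / len(female),
        }
    return summary


visits = [
    {"PROVIDER_ID": "A", "AGE": 30, "SEX": 1},
    {"PROVIDER_ID": "A", "AGE": -9, "SEX": 2},
    {"PROVIDER_ID": "A", "AGE": 50, "SEX": 1},
    {"PROVIDER_ID": "B", "AGE": 20, "SEX": 2},
]
by_provider = summarize_by_provider(visits)
```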

February 2022

Data Tip of the Month

Did you know . . .

A new version of the 2019 NHAMCS Emergency Department public use data file was released which includes the ED weight variable (EDWT) only on the first record for each hospital (based on the HOSPCODE variable).  The initial file release included this variable on all ED records.  It is easier to produce facility-level estimates when the EDWT variable is present on only one record for each ED, and that is the way the file has traditionally been released.  To calculate facility-level estimates correctly, it is recommended that the revised version of the file be downloaded.  Visit-level estimates are unaffected.  Pre-made SAS, SPSS, and Stata datasets have also been updated to reflect this change.
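The difference between the two releases can be illustrated with a short Python sketch. The records are made up and mimic the initial release, where EDWT is repeated on every record; only the names HOSPCODE and EDWT come from the file, and the helper function is hypothetical.

```python
def first_record_per_hospital(records):
    """Keep only the first record encountered for each HOSPCODE."""
    seen = set()
    first = []
    for r in records:
        if r["HOSPCODE"] not in seen:
            seen.add(r["HOSPCODE"])
            first.append(r)
    return first


# Hypothetical visit records with EDWT repeated on every record of a hospital.
records = [
    {"HOSPCODE": 1, "EDWT": 50.0},
    {"HOSPCODE": 1, "EDWT": 50.0},
    {"HOSPCODE": 1, "EDWT": 50.0},
    {"HOSPCODE": 2, "EDWT": 80.0},
    {"HOSPCODE": 2, "EDWT": 80.0},
]

# Summing EDWT over all visit records counts each ED once per visit.
overcounted = sum(r["EDWT"] for r in records)

# Summing over one record per hospital gives the facility-level estimate.
estimated_eds = sum(r["EDWT"] for r in first_record_per_hospital(records))
```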

December 2021

Data Tip of the Month

Did you know . . .

In the International Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM), which is used to code NAMCS and NHAMCS data, diagnosis codes can have a maximum of seven digits. For the NHAMCS ED public use file, only the first four digits of the diagnosis codes are included. There is an implied decimal between the third and fourth digits and inapplicable fourth digits are dash-filled. For example: F321 = F32.1 Major depressive disorder, single episode, moderate; I10- = I10 Essential (primary) hypertension. Since ICD-10-CM uses non-numeric characters extensively, we are not able to provide numeric recodes for the diagnosis codes.
