NLM Scrubber: NLM's Software Application to De-identify Clinical Text Documents


Sponsor
National Library of Medicine (NLM)
Study ID
NCT02795806
Status
Enrolling By Invitation

Conditions

  • Personally Identifiable Information

Eligibility Criteria

Sex
ALL
Age
1 Day - N/A
Healthy Volunteers
Not accepted

Study Details

Background: Electronic health records contain a vast amount of data about diseases and treatments. Researchers could use this data to test their ideas, but they would need records from more than just their own group of patients, and access to those records is restricted to protect patient privacy. The U.S. National Library of Medicine (NLM) has created a computer tool called NLM Scrubber, which recognizes and deletes personal information from health records. The researchers who developed this program now need access to the original records so they can see how well the program removes personal information from patient records and how they can make it more accurate.

Objectives: To find ways to improve clinical text de-identification.

Eligibility: No new participants. Researchers will review data that have already been collected.

Design: Researchers will collect a random sample of reports from different doctors in different fields. They will manually remove personal information from the records and will also automatically remove personal information from the original records using NLM Scrubber. They will then compare the results of the computer program with the manual changes, noting both when the program fails to remove personal information and when it incorrectly deletes nonpersonal health information. Researchers will use the results to revise the program. They will keep testing it until the de-identification process is complete.
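The comparison described in the Design can be illustrated with a small sketch. This is not NLM Scrubber's code or the study's actual scoring method. It assumes that both the manual and the automatic redactions are available as character-offset spans, and it scores agreement per character. The names `covered_chars` and `compare_redactions`, the span representation, and the sample report text are all hypothetical and invented for illustration.

```python
from typing import Dict, Iterable, Set, Tuple

# Assumption: a redaction is a half-open [start, end) character span.
Span = Tuple[int, int]


def covered_chars(spans: Iterable[Span]) -> Set[int]:
    """Return the set of character offsets covered by the given spans."""
    chars: Set[int] = set()
    for start, end in spans:
        chars.update(range(start, end))
    return chars


def compare_redactions(manual: Iterable[Span],
                       automatic: Iterable[Span]) -> Dict[str, float]:
    """Compare automatic redactions against manual (reference) redactions."""
    gold = covered_chars(manual)      # what a human reviewer removed
    pred = covered_chars(automatic)   # what the program removed
    hit = gold & pred                 # personal information correctly removed
    missed = gold - pred              # personal information left in the text
    over = pred - gold                # nonpersonal text removed by mistake
    return {
        "recall": len(hit) / len(gold) if gold else 1.0,
        "precision": len(hit) / len(pred) if pred else 1.0,
        "missed_chars": len(missed),
        "over_redacted_chars": len(over),
    }


# Made-up report text; not taken from any real record.
report = "Pt John Smith seen 03/14/2015 for chest pain."
manual = [(3, 13), (19, 29)]      # "John Smith" and the date
automatic = [(3, 13), (30, 33)]   # finds the name, misses the date, removes "for"

result = compare_redactions(manual, automatic)
print(result)
```

In this sketch, `missed_chars` corresponds to cases where the program did not remove personal information, and `over_redacted_chars` to cases where it deleted nonpersonal information. A real evaluation might score whole tokens or whole identifiers instead of characters.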

Key Dates

Start date
May 25, 2016
Status verified
Dec 2025
Primary completion
Jan 31, 2027
Completion
Jan 31, 2027

Study Design

Enrollment
50,000 participants (estimated)

Arms

  • Arm: 1
    Everybody for whom a clinical narrative report is created.

Primary Outcome Measure

The rate of de-identification of PII [ Time Frame: 01/01/2017-01/31/2027 ]

Locations (1)

Facility
National Library of Medicine
City
Bethesda
State
Maryland
ZIP
-
Site coordinators
-
