Publications by authors named "Sergey D Goryachev"

Purpose: To develop and validate an algorithm to extract clinically relevant data elements for prostate cancer (PCa) from prostate biopsy reports and magnetic resonance imaging (MRI) reports.

Patients And Methods: MRI reports and biopsy pathology reports were extracted from a cohort of 1,360,866 patients with PCa in the VA Cancer Registry System or the VA Corporate Data Warehouse, with 155,570 patients having the relevant reports for inclusion. We hand-annotated a sample of these reports, which were used to develop a rule-based natural language processing (NLP) algorithm for extracting Gleason score, positive cores, and total cores taken during biopsy from biopsy pathology reports and Prostate Imaging Reporting and Data System (PI-RADS) score, prostate-specific antigen (PSA) density, prostate volume, and prostate dimensions from MRI reports.

View Article and Find Full Text PDF

Purpose: Stage in multiple myeloma (MM) is an essential measure of disease risk, but its measurement in large databases is often lacking. We aimed to develop and validate a natural language processing (NLP) algorithm to extract oncologists' documentation of stage in the national Veterans Affairs (VA) Healthcare System.

Methods: Using nationwide electronic health record (EHR) and cancer registry data from the VA Corporate Data Warehouse, we developed and validated a rule-based NLP algorithm to extract oncologist-determined MM stage.

View Article and Find Full Text PDF

Clinical trial enrollment is impeded by the significant time burden placed on research coordinators screening eligible patients. With 50,000 new cancer cases every year, the Veterans Health Administration (VHA) has made increased access for Veterans to high-quality clinical trials a priority. To aid in this effort, we worked with research coordinators to build the MPACT (Matching Patients to Accelerate Clinical Trials) platform with a goal of improving efficiency in the screening process.

View Article and Find Full Text PDF

Objective: Reducing risk of coronavirus disease 2019 (COVID-19) infection among healthcare personnel requires a robust occupational health response involving multiple disciplines. We describe a flexible informatics solution to enable such coordination, and we make it available as open-source software.

Materials And Methods: We developed a stand-alone application that integrates data from several sources, including electronic health record data and data captured outside the electronic health record.

View Article and Find Full Text PDF

We consider the task of producing a useful clustering of healthcare providers from their clinical action signature- their drug, procedure, and billing codes. Because high-dimensional sparse count vectors are challenging to cluster, we develop a novel autoencoder framework to address this task. Our solution creates a low-dimensional embedded representation of the high-dimensional space that preserves angular relationships and assigns examples to clusters while optimizing the quality of this clustering.

View Article and Find Full Text PDF

Background: Spinal manipulation has been associated with cervical arterial dissection and stroke but a causal relationship has been questioned by population-based studies. Earlier studies identified cases using International Classification of Diseases Ninth Revision (ICD-9) codes specific to anatomic stroke location rather than stroke etiology. We hypothesize that case misclassification occurred in these previous studies and an underestimation of the strength of the association.

View Article and Find Full Text PDF