This position is located in the Department of Epidemiology, which is jointly housed in the College of Public Health and Health Professions and the College of Medicine. This person will manage the HIV databases, perform data preprocessing and database management, ensure data security, and assist with data analysis and modeling, under the supervision of investigators, using high-level statistical and machine learning software packages. This person will also facilitate data collection, data requests, study coordination, and IRB-related activities. This position must maintain the highest degree of confidentiality and requires skill in information assurance as well as in interpersonal relationships.
Data Management and Preprocessing
Perform database management for both existing and new research projects involving a variety of data sources, including electronic health records, claims data, HIV surveillance data, administrative datasets, and other relevant research databases. Use SAS, SQL, or equivalent programming languages to carry out data preprocessing, transformation, linkage, and cleaning in response to modeling and analytical needs. Manage and organize data sources in appropriate storage environments, such as REDCap, the PHHP NAS server, ResVault, and HiPerGator. Maintain detailed documentation of all code and processes used to ensure reproducibility and data integrity.
Data Analysis and Modeling
Conduct data analysis and apply various machine learning models using statistical software such as SAS, under the guidance of the principal investigator. Utilize relevant packages in R and Python to support modeling and analytical tasks. Effectively communicate results to investigators and the research team, incorporating feedback to refine both the models and the data analysis plan as needed.
Facilitate Data Collection, Study Coordination, and IRB-Related Activities
Assist with primary data collection, participant recruitment, and study coordination. Collaborate with students and volunteers to support study-related activities and ensure smooth day-to-day operations. Manage Institutional Review Board (IRB) submissions, revisions, and approvals; handle data requests; and ensure compliance with data use agreements and ethical guidelines.
Data Reporting and Communication
Communicate regularly with investigators and the research team, providing updates on data management, analysis progress, and study recruitment, and reporting any challenges encountered along with proposed solutions. Attend team meetings as needed to support project coordination. Contribute to manuscripts by documenting data preprocessing and modeling procedures, and generate preliminary results to support grant applications.
Clinical Terminology and Data Coding
Gather and compile information to support the coding of data from electronic health records (EHRs). Responsibilities include identifying and organizing ICD and SNOMED codes to classify diagnoses of various health conditions. The position also entails compiling lab test names and mapping them to LOINC codes or other relevant coding systems, as well as determining appropriate clinical cutoff points for interpreting lab results. Additionally, the role requires identifying medication names and matching them to standardized coding systems such as RxNorm and NDC to ensure accurate classification of medication use.
Dataset Preparation and Research Data Sharing
Manage the creation of study-specific datasets for use by students and collaborators, which may involve direct communication with investigators, trainees, and research team members. Assist in developing policies and procedures for requesting and accessing shared datasets, and provide dataset overviews or introductions as needed. Ensure ethical use of research data by maintaining backup files and by adhering to data sharing and storage policies established by the HIV research team, the University, the IRB, Data Use Agreements (DUAs), and other applicable local guidelines.
Professional Development and Continuing Education
Stay up to date with best practices in database analysis, modeling, and data security by attending relevant training sessions, reading scientific literature, participating in online courses, and learning from colleagues with similar expertise.