Position Focus:
The Data Mining (DM) Lab at Yale University lead by Dr. Samah Fodeh has an opening for a research assistant to participate in a series of projects that focus on leveraging healthcare data to improve patient care. The work spans analysis of structured and unstructured data in the electronic health record (EHR) as well as MyChart data. The position requires strong programming skills and familiarity with large language models (LLMs). The position will support projects focused on clinical natural language processing, predictive modeling, and real-world healthcare applications. The position is open for a driven individual with a masters degree in data science, computer science, biomedical informatics, or a similar background with some experience working with large datasets. Prior experience with healthcare is not required but will be helpful. The ideal candidate will have an interest in broad career development in a dynamic environment that allows them to develop as a healthcare data scientist.
Responsibilities will also include participating in the design, implementation, and maintenance of data pipelines and leading/assisting in building algorithms for language processing and deep learning with close collaboration from the study team.
While programming experience with python and/or R is required, experience with one or more of the following skills will be an asset: large tabular data with python/R, applications in python with PyTorch/Huggingface/SpaCy, applications of computer vision, image analysis, and text analysis in Tensorflow or PyTorch, large language models such as Llama and GPT models.
Essential Duties:
1. Designs, plans, implements and evaluates complex multifaceted research projects/systems, including method development, adaptation and validation. 2. Collaborates with PI to define research endeavors and the development of research hypothesis and approach. 3. Solves complex methodology, protocol, procedural and research problems through design of techniques, procedures and policies that will achieve research goals. 4. Develops long range plans for supplies and equipment to ensure smooth operation of laboratory. 5. Investigates, analyzes and evaluates complex data, data collection systems and methods to reach scientific conclusions and ensures the integrity of research data. 6. Collaborates with PI to coordinate major lab renovations working with Facilities, Health and Safety, and Purchasing to ensure deadlines, safety standards and purchases are according to plans and within budgets. 7. Prepares scientific reports and papers for research proposals and published reports. 8. May perform other duties as assigned.
Required Education and Experience:
Masters Degree in a scientific discipline and three years of experience or an equivalent combination of education and experience.
The Data Mining (DM) Lab at Yale University lead by Dr. Samah Fodeh has an opening for a research assistant to participate in a series of projects that focus on leveraging healthcare data to improve patient care. The work spans analysis of structured and unstructured data in the electronic health record (EHR) as well as MyChart data. The position requires strong programming skills and familiarity with large language models (LLMs). The position will support projects focused on clinical natural language processing, predictive modeling, and real-world healthcare applications. The position is open for a driven individual with a masters degree in data science, computer science, biomedical informatics, or a similar background with some experience working with large datasets. Prior experience with healthcare is not required but will be helpful. The ideal candidate will have an interest in broad career development in a dynamic environment that allows them to develop as a healthcare data scientist.
Responsibilities will also include participating in the design, implementation, and maintenance of data pipelines and leading/assisting in building algorithms for language processing and deep learning with close collaboration from the study team.
While programming experience with python and/or R is required, experience with one or more of the following skills will be an asset: large tabular data with python/R, applications in python with PyTorch/Huggingface/SpaCy, applications of computer vision, image analysis, and text analysis in Tensorflow or PyTorch, large language models such as Llama and GPT models.
Essential Duties:
1. Designs, plans, implements and evaluates complex multifaceted research projects/systems, including method development, adaptation and validation. 2. Collaborates with PI to define research endeavors and the development of research hypothesis and approach. 3. Solves complex methodology, protocol, procedural and research problems through design of techniques, procedures and policies that will achieve research goals. 4. Develops long range plans for supplies and equipment to ensure smooth operation of laboratory. 5. Investigates, analyzes and evaluates complex data, data collection systems and methods to reach scientific conclusions and ensures the integrity of research data. 6. Collaborates with PI to coordinate major lab renovations working with Facilities, Health and Safety, and Purchasing to ensure deadlines, safety standards and purchases are according to plans and within budgets. 7. Prepares scientific reports and papers for research proposals and published reports. 8. May perform other duties as assigned.
Required Education and Experience:
Masters Degree in a scientific discipline and three years of experience or an equivalent combination of education and experience.
Job ID: 479478552
Originally Posted on: 6/2/2025
Want to find more Chemistry opportunities?
Check out the 17,615 verified Chemistry jobs on iHireChemists
Similar Jobs