John Pace, PhD

John Pace, PhD

Azle, Texas, United States
614 followers 500+ connections

About

First year dental student at the University of Iowa College of Dentistry

Articles by John

Activity

Join now to see all activity

Experience

Education

Licenses & Certifications

  • IBM Systems Business Partner Artificial Intelligence on IBM POWER9 Graphic
  • Nvidia Certified Deep Learning Institute (DLI) Instructor for Fundamentals of Deep Learning for Multiple Data Types Graphic

    Nvidia Certified Deep Learning Institute (DLI) Instructor for Fundamentals of Deep Learning for Multiple Data Types

    NVIDIA

    Issued
  • Certification in Nvidia Fundamentals of Deep Learning for Computer Vision from Nvidia Deep Learning Institute Graphic

    Certification in Nvidia Fundamentals of Deep Learning for Computer Vision from Nvidia Deep Learning Institute

    NVIDIA

    Issued
  • Certification in Nvidia Fundamentals of Deep Learning for Multi-GPUs from Nvidia Deep Learning Institute Graphic

    Certification in Nvidia Fundamentals of Deep Learning for Multi-GPUs from Nvidia Deep Learning Institute

    NVIDIA

    Issued
  • Certification in Nvidia Fundamentals of Deep Learning for Multiple Data Types from Nvidia Deep Learning Institute Graphic

    Certification in Nvidia Fundamentals of Deep Learning for Multiple Data Types from Nvidia Deep Learning Institute

    NVIDIA

    Issued
  • Nvidia Certified Deep Learning Institute (DLI) Instructor for Fundamentals of Deep Learning for Computer Vision Graphic

    Nvidia Certified Deep Learning Institute (DLI) Instructor for Fundamentals of Deep Learning for Computer Vision

    NVIDIA

    Issued

Publications

  • A role for host-parasite interactions in the horizontal transfer of transposons across phyla.

    Nature

  • Repair-mediated duplication by capture of proximal chromosomal DNA has shaped vertebrate genome evolution.

    PLoS Genetics

  • Horizontal SPINning of transposons.

    Communicative and Integrative Biology

  • Target site analysis of RTE1_LA and its AfroSINE partner in the elephant genome.

    Gene

  • Repeated horizontal transfer of a DNA transposon in mammals and other tetrapods.

    Proceedings of the National Academy of Sciences

  • The evolutionary history of human DNA transposons: evidence for intense activity in the primate lineage.

    Genome Research

  • The Evolutionary History And Genomic Impact Of Mammalian DNA Transposons. Published 2009.

    -

Projects

  • Using BERT (NLP) to Answer Questions about Medical Records

    - Present

    Project Goal: To use BERT, a state-of-the-art natural language processing (NLP) algorithm, to allow users to ask free text questions about medical records and obtain correct answers.

    Background and Project Description: In healthcare, specific patient data is often difficult to find. Chart notes written by doctors or nurses are unstructured and can be voluminous. With patient records easily being 50 pages or more, it is time consuming to manually look through the records to find the…

    Project Goal: To use BERT, a state-of-the-art natural language processing (NLP) algorithm, to allow users to ask free text questions about medical records and obtain correct answers.

    Background and Project Description: In healthcare, specific patient data is often difficult to find. Chart notes written by doctors or nurses are unstructured and can be voluminous. With patient records easily being 50 pages or more, it is time consuming to manually look through the records to find the answer to a question. In this project, I am utilizing BERT, a state-of-the-art NLP algorithm, to allow users to query the free text of chart notes and obtain answers quickly. Due to the unstructured nature of the data, significant pre-processing has to be done, such as dividing the notes into small chunks, or passages, that can be queried quickly. Additionally, BERT, by design, can only query small passages, so the data must be iteratively queried then the output data post-processed to display the correct answer.

    Languages and Frameworks: Python, Tensorflow

    AI and Machine Learning Techniques: Natural Language Processing

    Code on GitHub: https://github.com/pacejohn/BERT-Long-Passages

    Medium: https://medium.com/@johnpace_32927/question-and-answer-for-long-passages-using-bert-dfc4fe08f17f

  • Using AI to Transcribe American Sign Language

    - Present

    Project Goal: To develop an AI application that will transcribe American Sign Language (ASL) onto a screen so a non-ASL speaker can understand what is being signed.

    Background and Project Description: Many deaf people communicate using American Sign Language. They are able to understand what a non-deaf person is saying by reading lips. However, a person who does not speak ASL cannot understand what is being signed. Thus, communication only occurs one way. The goal of this project is to…

    Project Goal: To develop an AI application that will transcribe American Sign Language (ASL) onto a screen so a non-ASL speaker can understand what is being signed.

    Background and Project Description: Many deaf people communicate using American Sign Language. They are able to understand what a non-deaf person is saying by reading lips. However, a person who does not speak ASL cannot understand what is being signed. Thus, communication only occurs one way. The goal of this project is to allow for two-way communication. I am developing an application in which a person can use their phone camera to see a person who is signing. As the signs are formed, it will transcribe the signs onto the screen, allowing the person to understand what is being signed. This will allow for two-way communication. This project was showcased at the AI Summit Conference in San Francisco in September 2019 and will be showcased at the Super Computing 2019 conference in November 2019 and the AI Summit Conference in New York City in December 2019.

    Languages and Frameworks: Python, Tensorflow, OpenCV, IBM Power AI Vision

    AI Techniques: Computer Vision, Deep Learning (Convolutional Neural Networks, Recurrent Neural Networks, Natural Language Processing)

    YouTube video: https://youtu.be/gBQTo1hpM3Q

    See project
  • Creation of Computer Vision Workshop

    - Present

    Project Goal: Create a computer vision workshop to introduce basics of computer vision to diverse audiences.

    Background and Project Description: I have developed a workshop that introduces the basics of computer vision and allows the attendees to perform computer vision tasks such as image classification, object detection, and segmentation. There is also an overview of convolutional neural networks and how they are used in computer vision. I have taught the workshop around the country…

    Project Goal: Create a computer vision workshop to introduce basics of computer vision to diverse audiences.

    Background and Project Description: I have developed a workshop that introduces the basics of computer vision and allows the attendees to perform computer vision tasks such as image classification, object detection, and segmentation. There is also an overview of convolutional neural networks and how they are used in computer vision. I have taught the workshop around the country to data scientists, C-level customers, sales representatives, IT administrator, programmers, and even high school students.

    Languages and Frameworks: IBM Power AI Vision

    AI and Machine Learning Techniques: Computer Vision, Image Classification, Object Recognition, Image Segmentation, Convolutional Neural Networks

    See project
  • Using AI to Read Messy Text

    - Present

    Project Goal: To train convolutional neural networks to be able to read very “messy” text from images or video, transcribe the text, and output to usable formats.

    Background and Project Description: “Messy” text is defined as text within images that is typically poorly read by other image recognition software, due to varying background, shadows, dirt, obstructions, numerous challenging fonts, and weathering. Tombstones are a massive dataset of objects with "messy" text. Websites like…

    Project Goal: To train convolutional neural networks to be able to read very “messy” text from images or video, transcribe the text, and output to usable formats.

    Background and Project Description: “Messy” text is defined as text within images that is typically poorly read by other image recognition software, due to varying background, shadows, dirt, obstructions, numerous challenging fonts, and weathering. Tombstones are a massive dataset of objects with "messy" text. Websites like genealogy.com, ancestry.com, and findagrave.com allow people to upload pictures of tombstones, but they must be manually transcribed. In this project, I have developed several convolutional neural network models that will read images or video of tombstones and transcribe the letters into text. This speeds up transcription time and helps to reduce errors caused by human data entry of the information. This project was showcased at the IBM Think conference in San Francisco in February 2019. To see examples of the text recognition, see my YouTube videos at https://www.youtube.com/watch?v=RwOCFxAszfc&list=PLz8DxxKmiM0BLK9YWfxuGeX6-8yrsO7_B

    Languages and Frameworks: Python, Tensorflow, IBM Power AI Vision, OpenCV

    AI and Machine Learning Techniques: Convolutional Neural Networks

    See project
  • Predicting Blood Glucose Levels with Recurrent Neural Networks and ARIMA

    -

    Project Goal: Predict blood glucose levels using historical data for training.

    Background and Project Description: Predicting blood glucose levels for individuals with diabetes is important when trying to determine the amount of insulin that must be given at various time points. Insulin pump manufacturers have to train models to predict how much insulin a pump should give to patients. In this project I used historical data to train both deep learning and classical machine learning…

    Project Goal: Predict blood glucose levels using historical data for training.

    Background and Project Description: Predicting blood glucose levels for individuals with diabetes is important when trying to determine the amount of insulin that must be given at various time points. Insulin pump manufacturers have to train models to predict how much insulin a pump should give to patients. In this project I used historical data to train both deep learning and classical machine learning time-series models. For the deep learning part, I used a type of recurrent neural network called an LSTM (Long Short-Term Memory) for training and inference. For the machine learning part, I used SARIMA. For more details, see my blog post at https://www.ironmanjohn.com/home/time-series-benchmarks-between-the-ibm-power-9-ac922-server-and-the-nvidia-dgx-1


    Languages and Frameworks: Python, Tensorflow, Keras

    AI and Machine Learning Techniques: Recurrent Neural Networks, LSTM, SARIMA, Time-Series Analysis

    See project
  • Predicting Indycar Tire Usage with Microsoft Power BI

    -

    Project Goal: To use historical data to create a dashboard that allows data driven prediction of which tires to use during an Indycar race.

    Background and Project Description: In Indycar racing, all of the cars are almost identical, so no one team has an advantage due to the physical components of the car. In a race, a car must use 2 different types of tires, but it is up to the team to decide which tires to use at which times. Proper tire selection can be the difference between…

    Project Goal: To use historical data to create a dashboard that allows data driven prediction of which tires to use during an Indycar race.

    Background and Project Description: In Indycar racing, all of the cars are almost identical, so no one team has an advantage due to the physical components of the car. In a race, a car must use 2 different types of tires, but it is up to the team to decide which tires to use at which times. Proper tire selection can be the difference between winning and not winning. In this project, I used historical data to create a dashboard of tire usage over previous years. The team was able to find patterns of tire usage by winning teams that they had not seen before. This allowed them to make data driven decisions about which tires to use during the race and at what time. The data was in the form of dozens of log files that had to be significantly manipulated to extract useful information. The cleaned data was imported into Microsoft SQL Server which was the data source for the Microsoft Power BI dashboard.

    Languages and Frameworks: Python, SQL, SQL Server, Microsoft Power BI

  • Using Machine Learning to Predict Hospital Readmissions

    -

    Project Goal: To determine the likelihood of a patient being readmitted within a given time frame.

    Background and Project Description: Patient readmission is a significant challenge in the healthcare industry. Not only is it potentially harmful to patients, it is costly to healthcare institutions. In this project, I used classical machine learning techniques, such as XGBoost and logistic regression, to determine the likelihood of a patient being readmitted for the same condition…

    Project Goal: To determine the likelihood of a patient being readmitted within a given time frame.

    Background and Project Description: Patient readmission is a significant challenge in the healthcare industry. Not only is it potentially harmful to patients, it is costly to healthcare institutions. In this project, I used classical machine learning techniques, such as XGBoost and logistic regression, to determine the likelihood of a patient being readmitted for the same condition within a 30-day time frame. Patient data came from the de-identified MIMIC-III database. This project was particularly challenging in that the data for patients was often incomplete, miscoded, or missing. As a result, a large amount of data manipulation and cleaning was required before analysis could begin.

    Languages and Frameworks: Python, R, MySQL

    AI and Machine Learning Techniques: Logistic Regression, Decision Trees, Random Forests, Clustering, XGBoost

  • Computational analysis of DNA transposable elements in humans and 9 other mammalian species to determine their evolutionary history. Published in the journal "Genome Research" in 2007.

    -

  • Computational analysis of transposable element target sites in the elephant genome. Published in the journal "Gene" in 2008.

    -

  • Design of novel statistical test to determine if DNA duplication locations are statistically different than a null hypothesis of random insertion. Published in the journal "PLoS Genetics" in 2009.

    -

  • Discovery of horizontal transfer of transposable elements due to host-parasite interactions. Published in the journal "Nature" in 2010.

    -

  • Discovery, characterization, and statistical analysis of DNA expansion caused by duplication during DNA repair. Published in the journal "PLoS Genetics" in 2009.

    -

More activity by John

View John’s full profile

  • See who you know in common
  • Get introduced
  • Contact John directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses