Showing 8 Result(s)

Kaggle Leash Bio Therapeutics Competition

Source Code: GitHub Competition Overview In this competition, you’ll develop machine learning (ML) models to predict the binding affinity of small molecules to specific protein targets – a critical step in drug development for the pharmaceutical industry that would pave the way for more accurate drug discovery. You’ll help predict which drug-like small molecules (chemicals) …

Brain Region Enrichment Analysis

An exploration of the different biological enrichment algorithms and machine learning algorithms applied to an RNA expression dataset.  Source Code: GitHub Motivation This project is an exploration of RNAseq data from Kaggle. When I initially downloaded this dataset, it was because I wanted to learn how to do data analysis on high dimensional biological data. …

Kaggle ICR Competition – Age Related Conditions

Source Code: GitHub Competition Statement The goal of this competition is to predict if a person has any of three medical conditions. You are being asked to predict if the person has one or more of any of the three medical conditions (Class 1), or none of the three medical conditions (Class 0). You will …

Diabetes Readmission Classifier

Source Code: GitHub Summary (Not Completed!) Exploration of a dataset related to readmission of diabetes patients based on attributes the hospitals have collected. The goal of this workflow was to see if there is a way to successfully create a classification model that could differentiate between no admission, readmission before 30 days, and readmission after …

Marketing Analysis (Gradient Boosting)

Source Code: GitHub Background (Not Finished) This case is used for hiring Data Analysts for the iFood Brain team. The analysis was to predict whether a customer would accept the marketing campaign proposed by iFood in their grocery stored. Based on the customers profile, you must predict if they would accept one of the multiple …

CAFA 5 Protein Function Prediction (Kaggle Competition)

Competition Description: The goal of this competition is to predict the function of a set of proteins. You will develop a model trained on the amino-acid sequences of the proteins and on other data. Your work will help ​​researchers better understand the function of proteins, which is important for discovering how cells, tissues, and organs …

Kaggle Single Cell Perturbations

This Kaggle competition represents a significant opportunity to impact the future of medicine development. The goal of this competition was to predict chemical perturbation gene expression on new cell lines from 144 small molecule drugs. The dataset consisted of gene expression data from 6 different cell types. Two of the cell types were severely under represented, …