Skip to content
View Christ14n97's full-sized avatar

Block or report Christ14n97

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Christ14n97/README.md

Christ14n97

Christian Peralta Viteri

I am a biologist pasionate about coding and data science

I started coding in Python in 2019, after completing my bachelor's degree. During my MSc in Biochemistry at the University of Geneva, Switzerland, I learned R and strengthened my background in statistical modelling, machine learning, multivariate analysis, and computational approaches for biological research.

During my MSc thesis, my work focused on untargeted metabolomics and data-driven strategies to identify and annotate biomarkers. This was a highly enriching experience that allowed me to develop skills in data science and bioinformatics. To further gain experience in software development, I completed my MSc thesis with a final project: development of a Python library to automate unsupervised and supervised multivariate analysis.

After graduating, I had the opportunity to work at the Faculty of Medicine of the University of Geneva, where I joined the Bioinformatics Support Platform as a junior data scientist. This experience allowed me to broaden my expertise and learn how to analyse bulk RNA-seq and single-cell RNA-seq data. I also received in-depth training in machine learning, including running experiments with neural networks, implementing variational autoencoders in Torch, and conducting benchmarks for different prediction tasks. In addition, I gained practical experience with Docker, high-performance computing, version control, and collaborative software development workflows.

Since 2024, I have been a PhD candidate at the University of Würzburg, Germany. My current work is multidisciplinary. On one hand, I am developing a deep learning model for personalized drug prioritization. This involves model architecture design, data collection, leakage-proof model training, and benchmarking, with the goal of achieving robust generalization to patient-derived samples. On the other hand, I am working on the discovery of molecular subtypes in oral squamous cell carcinoma (OSCC). In this context, I analyse bulk RNA-seq, single-cell RNA-seq, and spatial transcriptomics data. I apply statistical learning models, test hypotheses using robust statistical approaches, and interpret the results in their biological context.

Overall, my work is tailored toward translational impact, aiming to help OSCC patients benefit from more precise drug recommendations and a better understanding of the disease.

I hope to share some of my latest work soon.

Resume

Projects:

Skills

  • 🐍 Python

    • numpy
    • pandas
    • sckit-learn
    • PyTorch
    • plotly
    • seaborn
  • 📈 Machine Learning

    Imbalanced data sampling strategies:

    • SMOTE
    • TomekLink
    • Random Under-Sampling

    Models:

    Regression Classification
    OLS with LASSO - RIGDE regularization Logistic Regression
    Gradient Boosting Random Forest Classifier
    Random Forest Multi Layer Perceptron
    PLS-RA PLS-DA
  • 📊 R

    • Bioconductor
    • survival
    • ggplot2
    • caret
  • 💻 BASH scripting

bash git linux pandas python scikit_learn seaborn tensorflow

Contact me:

linkedin

Popular repositories Loading

  1. R_package_2020 R_package_2020 Public

    R package developed to wrap-up tools for data manipulation as well as visualization functions in the context of multivariate analysis of metabolomics data.

    R 1

  2. MSc_Untarg_Metabo_Workflow MSc_Untarg_Metabo_Workflow Public

    My MSc thesis focused on the improvement of metabolite biomarkers annotation. The resulting workflow combined common unsupervised-supervised ML models with graph-based analysis of metabolic reconst…

    1

  3. Christ14n97 Christ14n97 Public

  4. Bash_mastery_2021 Bash_mastery_2021 Public

    Computer Skill for Biologycal Research course gave practical lessons to get started with GNU/Linux environment to work with shell and shell scripting. The practical where focus on data file manipul…

    HTML

  5. Python_4_LifeScience_2019 Python_4_LifeScience_2019 Public

    Python Class for DNA-RNA manipulation

    Python

  6. Machine_Learning_Competition_2022 Machine_Learning_Competition_2022 Public

    ML competition hosted on kaggle. Binary classification problem to predict the income of USA population.

    HTML