Pablo A. Millan Arias

University of Waterloo


I am a postdoctoral researcher working under the supervision of Dr. Lila Kari. My main research areas are machine learning and bioinformatics, in particular self-supervised learning for taxonomic identification and biologically inspired foundation models. I am also passionate about information theory and theoretical computer science.

Huge fan of América de Cali and the Colombian national soccer team. Amateur runner in pursuit of a sub-90-minute half-marathon and a sub-3:30 marathon.

education

  • 2019.04 - 2024.05: Ph.D. in Computer Science, University of Waterloo. [Thesis]
  • 2015.07 - 2019.03: B.Sc. in Math, Javeriana University. [Thesis]
  • 2013.01 - 2018.07: B.Sc. in Electronic Engineering, Javeriana University.

news

Jun 20, 2024 ✨ New paper uploaded to arXiv: The BIOSCAN-5M dataset
Jun 16, 2024 I raced the Waterloo 10KM Classic
May 27, 2024 🎓 I finished my PhD
Feb 22, 2024 Our research was featured in The Conversation newsletter
Dec 01, 2023 I will be presenting our latest paper at the 4th Workshop on Self-Supervised Learning at NeurIPS.

selected publications

  1. PLoS
    DeLUCS: Deep Learning for Unsupervised Clustering of DNA Sequences
    Pablo Millán Arias, Fatemeh Alipour, Kathleen Allen Hill, and 1 more author
    PLoS ONE, 2022
  2. Sci. Rep.
    Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
    Pablo Millán Arias, Joseph Butler, Gurjit S. Randhawa, and 3 more authors
    Scientific Reports, Sep 2023