Submission Number: 147
Submission ID: 341
Submission UUID: f955ae90-f829-4dd2-95e3-ac4b6321de24
Submission URI: /form/project

Created: Wed, 05/25/2022 - 13:12
Completed: Wed, 05/25/2022 - 13:12
Changed: Wed, 07/06/2022 - 15:34

Remote IP address: 130.215.45.247
Submitted by: Guillaume Lamoureux
Language: English

Is draft: No
Webform: Project
Project Title: Development of TorchProteinLibrary, a library of differentiable primitives for deep learning models of protein structure
Program:
CAREERS (323)

Project Image: {Empty}
Tags:
{Empty}

Status: {Empty}
Project Leader
--------------
Project Leader:
Guillaume Lamoureux

Email: guillaume.lamoureux@rutgers.edu
Mobile Phone: {Empty}
Work Phone: {Empty}

Project Personnel
-----------------
Mentor(s):
{Empty}

Student-facilitator(s):
{Empty}

Mentee(s):
{Empty}


Project Information
-------------------
Project Description:
The project consists of optimizing and expanding TorchProteinLibrary, our library of differentiable primitives for deep neural network models of protein structure (see our preprint “TorchProteinLibrary: A computationally efficient, differentiable representation of protein structure”, https://arxiv.org/abs/1812.01108). The library implements the functionality needed for end-to-end learning of protein structure prediction.

The recent success of DeepMind’s AlphaFold2 has shown the need for an open software platform for developing machine learning (ML) models of molecular structure and function, so that a diverse ecosystem of new methods can grow on top of a stable software base. Research interest in this area is rapidly moving beyond protein structure prediction, towards integrating greater molecular diversity (DNA, RNA, small ligands, etc.) and towards understanding the interactions, dynamics, and context-specific details responsible for a given function or activity.

TorchProteinLibrary (TPL), our own open-source initiative, is well positioned to become a central tool for the future development of deep learning models of biomolecular structure. The library is meant to develop along three axes: 1) performance optimization (CUDA implementation and refactoring), 2) implementation of new differentiable layers (protein sequence/structure conversion and transformation), and 3) extension to nucleic acids (DNA and RNA).

Project Information Subsection
------------------------------
Project Deliverables:
{Empty}

Student Research Computing Facilitator Profile:
{Empty}

Mentee Research Computing Profile:
{Empty}

Student Facilitator Programming Skill Level: {Empty}
Mentee Programming Skill Level: {Empty}
Project Institution: {Empty}
Project Address:
{Empty}

Anchor Institution: CR-Rutgers
Preferred Start Date: {Empty}
Start as soon as possible: No
Project Urgency: Already behind / Start date is flexible
Expected Project Duration (in months): {Empty}
Launch Presentation: {Empty}
Launch Presentation Date: {Empty}
Wrap Presentation: {Empty}
Wrap Presentation Date: {Empty}
Project Milestones:
{Empty}

Github Contributions: {Empty}
Planned Portal Contributions (if any):
{Empty}

Planned Publications (if any):
{Empty}

What will the student learn?:
{Empty}

What will the mentee learn?:
{Empty}

What will the Cyberteam program learn from this project?:
{Empty}

HPC resources needed to complete this project?:
{Empty}

Notes:
{Empty}



Final Report
------------
What is the impact on the development of the principal discipline(s) of the project?:
{Empty}

What is the impact on other disciplines?:
{Empty}

Is there an impact on physical resources that form infrastructure?:
{Empty}

Is there an impact on the development of human resources for research computing?:
{Empty}

Is there an impact on institutional resources that form infrastructure?:
{Empty}

Is there an impact on information resources that form infrastructure?:
{Empty}

Is there an impact on technology transfer?:
{Empty}

Is there an impact on society beyond science and technology?:
{Empty}

Lessons Learned:
{Empty}

Overall results:
{Empty}