Submission Number: 180
Submission ID: 3985
Submission UUID: 399fee51-c671-4acd-8572-060c448fd732
Submission URI: /form/project

Created: Sat, 08/26/2023 - 06:41
Completed: Sat, 08/26/2023 - 06:44
Changed: Thu, 06/13/2024 - 14:36

Remote IP address: 104.28.39.76
Submitted by: Gaurav Khanna
Language: English

Is draft: No
Webform: Project
Project Title: Computational pipelines for the analysis of plastic-degrading genes
Program:
CAREERS (323)

Project Image: https://support.access-ci.org/system/files/webform/project/3985/anthrax-image-protein-units-drug-factor-unit.jpg
Tags:
bioinformatics (277), biology (515), workflow (365), workforce-development (337)

Status: Complete
Project Leader
--------------
Project Leader:
Ying Zhang

Email: yingzhang@uri.edu
Mobile Phone: {Empty}
Work Phone: {Empty}

Project Personnel
-----------------
Mentor(s):
{Empty}

Student-facilitator(s):
Aidan McCrillis (19738)

Mentee(s):
{Empty}


Project Information
-------------------
Project Description:
The prevalence of microplastics is posing increasing problems to the environment and human health. Despite the identification of several plastic-degrading pathways, their presence and evolution among diverse microorganisms is largely unexplored. 

In this project, we will develop a computational pipeline using the Snakemake workflow management system to assemble bioinformatics tools for the identification and evolutionary analysis of plastic-degrading proteins. We will also examine the abundance of these proteins by mining metagenomic data. Snakemake workflow management system is a python-based tool to create reproducible and scalable data analyses. The supported student will work with PI and other lab researchers to understand the analysis workflow and bioinformatics tools and build a computational pipeline using Snakemake. This effort will include elements of HPC, bioinformatics, python programming, and git version controls.


Project Information Subsection
------------------------------
Project Deliverables:
{Empty}

Project Deliverables:
{Empty}

Student Research Computing Facilitator Profile:
{Empty}

Mentee Research Computing Profile:
{Empty}

Student Facilitator Programming Skill Level: {Empty}
Mentee Programming Skill Level: {Empty}
Project Institution: University of Rhode Island
Project Address:
{Empty}

Anchor Institution: CR-University of Rhode Island
Preferred Start Date: {Empty}
Start as soon as possible.: No
Project Urgency: Already behind3Start date is flexible
Expected Project Duration (in months): 6
Launch Presentation: https://support.access-ci.org/system/files/webform/project/3985/Aidan.Careers.project.launch.pptx
Launch Presentation Date: {Empty}
Wrap Presentation: https://support.access-ci.org/system/files/webform/project/3985/2024-06-12-Careers_Wrap_Presentation.pptx
Wrap Presentation Date: {Empty}
Project Milestones:
- Milestone Title: Milestone #1
  Milestone Description: Determine project scope, Snakemake work environment, launch presentation.
  Completion Date Goal: 2023-10-30
- Milestone Title: Milestone #2
  Milestone Description: Bioinformatics tool assembly and testing, finalize pipeline design.
  Completion Date Goal: 2023-11-30
- Milestone Title: Milestone #3
  Milestone Description: Complete working prototype, project git
  Completion Date Goal: 2023-12-30
- Milestone Title: Milestone #4
  Milestone Description: Extensive testing and debugging, application and data analysis.
  Completion Date Goal: 2024-02-28
- Milestone Title: Milestone #5
  Milestone Description: Wrap up development, update project git and documentation, wrap presentation
  Completion Date Goal: 2024-03-30

Github Contributions: {Empty}
Planned Portal Contributions (if any):
{Empty}

Planned Publications (if any):
{Empty}

What will the student learn?:
{Empty}

What will the mentee learn?:
{Empty}

What will the Cyberteam program learn from this project?:
{Empty}

HPC resources needed to complete this project?:
{Empty}

Notes:
{Empty}



Final Report
------------
What is the impact on the development of the principal discipline(s) of the project?:
This project will help to streamline the identification of potential plastic degrading genes and proteins and help to create reproducible results that can easily be built upon by other researchers.

What is the impact on other disciplines?:
This probably won't have a large impact on other disciplines but will allow people outside bioinformatics to use this pipeline without having a comprehensive knowledge of the packages and modules that were used in the pipeline.

Is there an impact physical resources that form infrastructure?:
{Empty}

Is there an impact on the development of human resources for research computing?:
{Empty}

Is there an impact on institutional resources that form infrastructure?:
{Empty}

Is there an impact on information resources that form infrastructure?:
{Empty}

Is there an impact on technology transfer?:
{Empty}

Is there an impact on society beyond science and technology?:
This project will help to further knowledge about plastic degrading proteins with the aim of helping to solve the problem of plastic pollution and finding a way to degrade the plastics that are in the ocean.

Lessons Learned:
I think I gained a lot of valuable knowledge in the process of building a workflow and what considerations should be taken to make a product that is usable for other people. I also gained a lot of experience with dealing with large sets of bioinformatics data and how to make sense of such a large dataset.

Overall results:
The overall result of the project I worked on is a workflow that allows for the identification and sorting of potential plastic degrading genes.