Submission information
Submission Number: 15
Submission ID: 32
Submission UUID: 210df84e-dc4e-489a-bb56-d7bea262913b
Submission URI: /form/project
Created: Tue, 09/03/2019 - 13:08
Completed: Tue, 09/03/2019 - 13:08
Changed: Thu, 05/05/2022 - 05:00
Remote IP address: 130.215.55.243
Submitted by: Northeast Cyberteam
Language: English
Is draft: No
Webform: Project
Project Title: Student-led Development of Open Source Materials for Hadoop Program: Northeast (308) Project Image: {Empty} Tags: big-data (4), ceph (56), data-wrangling (6), hadoop (12), storage (47) Status: Complete Project Leader -------------- Project Leader: Christopher Bennet Email: chris.bennett@maine.edu Mobile Phone: 2073331609 Work Phone: 2077787114 Project Personnel ----------------- Mentor(s): {Empty} Student-facilitator(s): {Empty} Mentee(s): {Empty} Project Information ------------------- Project Description: As part of a system-wide Data Science Degree, numerous modules have been developed that can be offered at a distance. These include VBA in Excel, SQL, R, and others. No module currently exists for Hadoop, nor does an instance of Hadoop that can be used for student training. This project aims to create a suitable Hadoop environment on University of Maine System resources and to create materials for a one credit micro-course that can be delivered at a distance. Project Information Subsection ------------------------------ Project Deliverables: Course materials including relevant assignments, readings, lectures, and tutorials for a class on Hadoop will be produced. The course is expected to be offered during or before the Fall of 2019. Project Deliverables: {Empty} Student Research Computing Facilitator Profile: Steve Nutting, Undergraduate at University of Maine Farmington Mentee Research Computing Profile: {Empty} Student Facilitator Programming Skill Level: {Empty} Mentee Programming Skill Level: {Empty} Project Institution: University of Maine Farmington Project Address: 228 Main St Brinkman House Farmington, Maine. 04938 Anchor Institution: NE-University of Maine Preferred Start Date: 09/07/2018 Start as soon as possible.: No Project Urgency: Already behind3Start date is flexible Expected Project Duration (in months): {Empty} Launch Presentation: {Empty} Launch Presentation Date: {Empty} Wrap Presentation: {Empty} Wrap Presentation Date: {Empty} Project Milestones: {Empty} Github Contributions: {Empty} Planned Portal Contributions (if any): Materials that can be added to the NE Cyberteam Portal include all written materials, relevant code, and tutorials covering Hadoop installation and exploration. Planned Publications (if any): A publication detailing the course is planned. What will the student learn?: The student will gain knowledge of setting up a Hadoop instance as well as a deeper understanding of Data Science. What will the mentee learn?: {Empty} What will the Cyberteam program learn from this project?: The Cyberteam will create better ties between research and education related to data science in the New England region. HPC resources needed to complete this project?: The hadoop cluster will initially be deployed on the OpenStack cloud run by the Advanced Computing Group of the University of Maine System. It will be migrated to the HPC cluster if deemed necessary. Notes: {Empty} Final Report ------------ What is the impact on the development of the principal discipline(s) of the project?: This work will positively impact the ability of the University of Maine System to train the next generation of data scientists. What is the impact on other disciplines?: Any discipline that uses big data can benefit from the proposed materials. Is there an impact physical resources that form infrastructure?: {Empty} Is there an impact on the development of human resources for research computing?: The proposed work will increase the number of people with a background in data science. Is there an impact on institutional resources that form infrastructure?: {Empty} Is there an impact on information resources that form infrastructure?: This work will increase the use of computing and data resources in Maine. Is there an impact on technology transfer?: {Empty} Is there an impact on society beyond science and technology?: {Empty} Lessons Learned: {Empty} Overall results: This will be publicly viewable on portal.