Submission information
Submission Number: 187
Submission ID: 4251
Submission UUID: 802b435b-48c3-4f1e-8c57-67713360aeba
Submission URI: /form/project
Created: Fri, 12/08/2023 - 15:21
Completed: Fri, 12/08/2023 - 15:21
Changed: Tue, 07/02/2024 - 10:51
Remote IP address: 131.128.76.34
Submitted by: Gaurav Khanna
Language: English
Is draft: No
Webform: Project
Project Title | Statistical Analysis of criminal cases in the United States District Court of Puerto Rico |
---|---|
Program | CAREERS |
Project Image | |
Tags | |
Status | Complete |
Project Leader | Michael Chou |
mchou@providence.edu | |
Mobile Phone | |
Work Phone | |
Mentor(s) | Michael Chou |
Student-facilitator(s) | Emily Gelchie |
Mentee(s) | |
Project Description | For the purposes of submitting an amicus brief to the US Supreme Court, the Puerto Rico Association of Criminal Defense Lawyers (PRACDL) compiled several indictments and docket sheets from the PACER system. Data from these documents were extracted and analyzed with sociodemographic data from the US Census. The wealth of data contained in these documents is not easily accessible for statistical study. The goal of this project is two-fold. First, to write script to data mine these documents for information including but not limited to: the length of time that the case is "open", the percentage of persons represented by a court-appointed attorney, the average length of sentences, the number of persons granted bail, the number of persons with bail violations and the reasons for those violations, among others. Secondly, data science techniques will be used to provide insightful visualizations and detect correlation between these various categories. An understanding of these data will facilitate related future social justice projects in this jurisdiction, as well as apply to other indictment and docket sheets from the PACER system at large. |
Project Deliverables | |
Project Deliverables | |
Student Research Computing Facilitator Profile | |
Mentee Research Computing Profile | |
Student Facilitator Programming Skill Level | |
Mentee Programming Skill Level | |
Project Institution | Providence College |
Project Address | |
Anchor Institution | CR-University of Rhode Island |
Preferred Start Date | |
Start as soon as possible. | Yes |
Project Urgency | Already behind3Start date is flexible |
Expected Project Duration (in months) | 6 |
Launch Presentation | |
Launch Presentation Date | |
Wrap Presentation | |
Wrap Presentation Date | |
Project Milestones |
|
Github Contributions | |
Planned Portal Contributions (if any) | |
Planned Publications (if any) | |
What will the student learn? | |
What will the mentee learn? | |
What will the Cyberteam program learn from this project? | |
HPC resources needed to complete this project? | |
Notes | |
What is the impact on the development of the principal discipline(s) of the project? | This project is at the forefront of large-scale data processing in the legal field. Our work allows for the analysis of court dockets, revealing macro trends that enhance transparency and shed light on biases in the legal system. |
What is the impact on other disciplines? | The impact on other disciplines is limitless. Although we focused on the interaction between law data science and social science for this project, there are court cases reflecting many disciplines. Now that we have created the technology for large-scale analysis, we are able to expand upon this project in any discipline which there are court cases for. |
Is there an impact physical resources that form infrastructure? | |
Is there an impact on the development of human resources for research computing? | |
Is there an impact on institutional resources that form infrastructure? | |
Is there an impact on information resources that form infrastructure? | |
Is there an impact on technology transfer? | |
Is there an impact on society beyond science and technology? | Once we can identify more macro trends in court cases, we can present these findings to government officials to reform the US justice system and mitigate legal biases. |
Lessons Learned | Throughout this project, I learned a lot about not only the biases present in the legal system, particularly charge 846, but also learned about working on a team of developers, conducting research, how to use GitHub, and enhance my data science skills while also learning about machine learning techniques. |
Overall results | Overall, we were able to begin to find trends in the court dockets we analyzed, suggesting some biases between wealth and preferential legal treatment. Looking forward, we will dig deeper into these initial findings to create a deeper proof of this discovery. |