For students in Professor Sam Ransbothamās Machine Learning and Artificial Intelligence class, there was more at stake in their final projects than just letter grades and GPAs. This semester, the undergraduates worked with real data from the . Students developed models that they hope will help guide the nonprofitās fundraising, volunteer engagement, and early childhood interventions.
āThere are real-life consequences to this project. Itās not just a homework assignment,ā said Melissa Walters ā23, one of 27 students in the course offered by the Carroll Schoolās Information Systems Department. Walters, who studies Business Analytics and Finance, spent the semester building a model to impute data in one of the data sets that United Way provided. Imputing data involves creating a predictive model that uses existing information to accurately fill in the gaps that are common with real-world data sets.
One of United Wayās priorities is early childhood intervention, and its work with over 50 childcare providers and other partners has created a vast database of information on educational screening, geographic trends, and developmental stages. However, like practically all real data sets, the database has inconsistencies and missing information, which make it difficult for the nonprofit to do any comprehensive analysis. Citing one example that her imputing project worked to correct, Walters says that the Boston neighborhood of Jamaica Plain is spelled and formatted in 13 different ways throughout the data set, splitting information that should be categorized uniformly into multiple sections.
In contrast to the synthetic data sources that professors typically generate for educational purposes, real-world data sets tend to be rife with omissions, typos, and incomplete information. In some ways, that added layer of difficulty was the point of the project, says Ransbotham, who is the William S. McKiernan ā78 Family Faculty Fellow. āIām trying to put students in a position of being able to work with the imperfect real world. Weāre in a business school, so students need to be prepared to make real decisions with imperfect data,ā he pointed out.
Interdisciplinary Collaboration + Big Data = Social Impact
Machine Learning and Artificial Intelligence was a new class, but this particular partnership with United Way began over a year and a half ago, when Ransbotham met Ellen Dickenson, vice president of analytics at theĀ . (The School of Social Work has collaborated with the organization for years to promote research and practice partnerships, including projects to reduce homelessness amongĀ local familiesĢż²¹²Ō»åĢżchildrenĀ in the Boston Public School system.) RansbothamāsĀ meeting with Dickenson was thanks to the Data Analytics for Social Impact research group at Boston College, which began with aĀ three-year grantĀ from the UniversityāsĀ Institute for the Liberal Arts. The interdisciplinary group, which Ransbotham is part of, includes nine faculty members from across Boston College whose specialties span social work, economics, nursing, education, and business. Despite their disparate backgrounds, all of the membersā research involves using āBig Dataā to solve social problems.
Associate Professor Summer Sherburne Hawkins of Boston Collegeās School of Social Work, who spearheaded the research group with her dean, Gautam N. Yadama, cited āBoston Collegeās strong commitment to social justiceā as setting the stage for the project. āThe point of the grant is to say, āHow can we have an impact on the community? How can we produce research thatās going to move the needle for the people who are involved with these services?āā explained Hawkins, who serves as the projectās principal investigator. The three-year Data Analytics for Social Impact grant will eventually result in the launch of an interdisciplinary graduate course during the 2021-2022 academic year, but students are already feeling its impact in the classroom in the form of Ransbothamās partnership with United Way.Ā
Throughout the first year of the grant, members of the research group heard from a variety of speakers, including Dickenson, who presented about United Wayās needs and opportunities for data and analytics. Like many organizations, United Way generates vast amounts of data that need analyzing: from early childhood education test scores to fundraising metrics to volunteer engagement information. And, like all professors who teach data analysis, Ransbotham needed data for his classes to analyze. The mutual benefit was clear.
āMessy, Real-World Dataā
In spring 2020, Ransbotham piloted the partnership with his graduate Data Analytics 3 class. Groups of full-time M.B.A. students analyzed two data sets provided by United Way: one with early childhood intervention data and another with information on volunteer engagement and fundraising efforts. At the end of the course, the groups presented their findings to Dickenson as well as to AJ Spagnuolo ā17, a director of analytics at the United Way. āIn so many cases I said, āOh yes, you were spot on,āā Dickenson said, referring to insights the groups had about the data.
Working through the āmessy, real-world dataā benefited the students too, said Kailey Rappaport, M.B.A. ā21, a member of the spring 2020 graduate Data Analytics 3 class. Rappaport, who plans to pursue a career in data analytics or marketing after receiving her M.B.A., emphasized the importance of business school classes offering experience with real data sets and real clients to prepare students for the workforce. Data Analytics 3 is the final course in in the M.B.A. curriculum, so Rappaport and her group used their prior analytics experience as a foundation and took the United Way project as a welcome opportunity to āchallenge ourselves a little bit.ā
āWe like to talk a lot in business classes about these theoretical constructs, but itās really different to actually look at a table that came from a real-world source,ā said Aidan Lena ā23, an undergraduate in this semesterās Machine Learning and Artificial Intelligence course. The small classes were held in person and students spent the semester studying theories of artificial intelligence and algorithm-building techniques. Interwoven with the complex information systems concepts were deeper discussions about the ethical implications of machine learning and the importance of implementing technology in a way that doesnāt reinforce existing inequality. After learning about business ethics during his freshman year Portico class, Lena said it felt natural to continue to focus on āthe broader ethical implications of the effect of AI on everything and how itās going to shift the world.ā
Those theoretical effects were made tangible during the two class sessions that Dickenson attended virtually. Instead of abstract discussions about synthetic data sets, students heard firsthand about the impact of data on United Wayās work. In addition to answering data-specific questions, she also provided context for the two data sets, sharing about the impact of their early childhood intervention programs and the organizational significance of building a strong donor and volunteer network.Ā
Giving Insights, Gaining Knowledge
Outside of the classroom, students worked to combine techniques and theory learned in class with the messy reality of real-world data. As their homework assignments developed from initial proposals to final refined models, students used in-class examples to fine-tune their models to fit United Wayās needs. And, although this semesterās projects were individualāin keeping with COVID protocols and in contrast to last springās M.B.A. group projectsāthere was still a sense of camaraderie among the undergraduates. āTo have dozens of students working on the same database and simultaneously discovering things about it is really a great opportunity for everyone involved,ā said Walters.Ā Ā
The undergraduates worked with the same two data sets as the M.B.A. students, although they presented a different type of analysis to Dickenson at the end of the semester. Unlike the M.B.A. data analytics class, which focused on analyzing past data and looking for historical patterns, Machine Learning and Artificial Intelligence involves predictive models that provide a framework for looking at future data. āPart of our objective this semester is to set [United Way] up to give value in the future. Rather than thinking about these as one-off analyses, United Way is going to generate the same type of data on an ongoing basis. Just because my semester is over doesnāt mean that their work stops,ā says Ransbotham, indicating the ongoing value of a well-tuned computer model for analyzing future data.
In the spirit of the Data Analytics for Social Impact group, Ransbotham is glad that those models are going toward a good cause: āIf students are going to put in effort for a learning experience, rather than yet another analysis of a synthetic data set, why donāt we try to use that effort towards something that can help?ā
The effort that students are putting in is substantial. āIāll probably spend upwards of 30 hours working with this model,ā said Walters, ābut itās worth it.ā She measures the value by not only the skills sheās learning but also the chance to leverage this know-how for others in the wider community. āThe class does speak to ¾«¶«Ó°Ņµās identity,ā she notes, referring to the accent onĀ social justiceĀ at a Jesuit university. āWeāre learning and itās educational for us, but weāre also helping people in the end. That feels really important.ā
Rachel Bird ā23 is a content development specialist (and part-time M.B.A. student) at the Carroll School of Management.