Skip to main content

MBDH Learning Innovation Fellows program – first cohort projects

The Midwest Big Data Innovation Hub Learning Innovation Fellows Program, housed at the University of Michigan School for Environment and Sustainability, enables teams to form for work toward better understanding of the intersections of the Hub’s “Cyberinfrastructure and Data Sharing” and “Data Science Education and Workforce Development” themes.

Our fellows work with faculty and teaching staff to create innovative interactive data analysis activities that can nest within sustainability science case studies. They design, prototype, and pilot these features in classrooms within the MBDH network. The program leverages talent and resources from two existing, open-source science learning environments. Gala (www.learngala.com) is a community-based, responsively designed sustainability science learning environment. Quantitative Undergraduate Biology Education and Synthesis (QUBESHub, or Qu) is a virtual center for faculty development and open educational resource sharing (https://qubeshub.org) that has had long-term support from NSF, formalizing and professionalizing open educational resources.

Through a series of virtual “Networkshops,” we connect undergraduate data science majors, graduate/professional students, faculty, and professionals. We can thus be inclusive, incorporating into classrooms problem-driven, data-rich material that speaks to lived infrastructural and environmental challenges from a range of communities across our region, and beyond. The team includes the following:

Leadership—

Rebecca Hardin (PI) is an anthropologist and Associate Professor at the University of Michigan School for Environment and Sustainability (UMSEAS), where she leads collaborations on the open-source, open-access learning platform Gala (www.learngala.com) and research group on Digital Justice. Rebecca also coordinates the Environmental Justice Field of Specialization and related Certificate program at UMSEAS.



Ann E. Russell (Co-PI) is an ecosystems ecologist, with special expertise in the biogeochemistry of tropical ecosystems. She is an Associate Adjunct Professor in the Department of Natural Resource Ecology and Management at Iowa State University, and PI of the NSF Research Collaborative network ALIVE: Authentic Learning in Virtual Environments.





M. Drew Lamar (Co-PI) is a mathematician and Associate Professor of Biology at William & Mary. His teaching and research are highly interdisciplinary in nature, using techniques and concepts from mathematics, statistics, biology, and computational sciences. Drew is Co-PI and Director of Cyberinfrastructure for the Quantitative Undergraduate Biology Education and Synthesis (QUBES) virtual center, with an interest and passion in open-source software development, quantitative biology education, and development of education gateways.

Ed Waisanen (Program Manager) is Program and Platform Lead for Gala (learngala.com). He has a master’s degree in Natural Resources and Environment from the University of Michigan, with a focus in Environmental Informatics and a background in multimedia production. Ed is focused on developing tools and communities that emphasize curation, open exchange, and narrative approaches to deepen learning.





Teams—

Data Learning for Restoration Ecology

Kyra Hull (Fellow) is a native of Grand Rapids, Michigan, and a first-year graduate student at Grand Valley State University, studying Biostatistics. Kyra is working on the following case about forest restoration, which is bilingual (Spanish and English versions): https://www.learngala.com/cases/a3224235-cdc0-44fc-a98b-46735dfef6c9




Karen Holl (Faculty Advisor) is a Professor of Environmental Studies at the University of California, Santa Cruz. Her research focuses on understanding how local and landscape-scale processes affect ecosystem recovery from human disturbance and using this information to restore damaged ecosystems. She advises numerous public and private agencies on land management and restoration; recently, she has been working to improve outcomes of the effort of the many large-scale tree-growing campaigns.




Data Learning to Address Groundwater Contamination

Saba Ibraheem (Fellow) is a second-year Health Informatics student at the University of Michigan, focusing on data analytics and research in health care. Saba is working on the following case, which is bilingual (English and French versions): https://www.learngala.com/cases/dioxane-plume





Rita Loch-Caruso (Faculty Advisor) is a toxicologist in the Department of Environmental Health Sciences at the University of Michigan School of Public Health, with a research focus in female reproductive toxicology and, in particular, mechanisms of toxicity related to adverse pregnancy outcomes such as premature birth.





Alan Burton (Faculty Advisor) is a Professor at the School for Environment and Sustainability and the Department of Earth and Environmental Sciences at the University of Michigan. His research focuses on sediment and stormwater contaminants and understanding contaminant bioavailability processes, effects, and ecological risk at multiple trophic levels. He is also a specialist in ranking stressor importance in human-dominated watersheds and coastal areas.





Data Learning in Livestock Ecologies

Daniel Iddrisu (Fellow) is a second-year student in Masters in International and Regional Studies, with a specialization in Africa, at the University of Michigan. He earned a BA degree in Integrated Community Development from the University for Development Studies, Tamale, Ghana. His research focuses on health, development, gender, and environmental health. The case he is working on takes place on the Greek Island of Naxos, but comprises skills for modeling and analyzing human/livestock interactions more broadly: https://www.learngala.com/cases/livestock-grazing

Johannes Foufopoulos (Faculty Advisor) is an Associate Professor at University of Michigan’s School for Environment and Sustainability, who focuses his lab research on fundamental conservation biology questions and on issues related to the ecology and evolution of infectious diseases. Major research projects examine how habitat fragmentation, invasive organisms, and global climate change result in species extinction.





Data Learning on Safari

Rahul Agrawal Bejarano (Fellow) has a background in computer science and he is currently working on a master’s degree at the University of Michigan School of Environment and Sustainability, with a concentration in Sustainable Systems. Rahul uses data from a diverse range of sources to shed light on today’s environmental challenges and develop innovative solutions, and is working on identifying climate-related vulnerabilities to our supply chains. He is working on this case, about the interactions of various wildlife species in the Serengeti: https://www.learngala.com/magic_link?key=oOTYOXyDRpmY_yM4AFlnXQ


Charles Willis (Faculty Advisor) is a Teaching Assistant Professor, Biology Teaching and Learning at the University of Minnesota. He is currently interested in the research and development of pedagogy practices for non-major biology students. In particular, he is focused on studying student-student and instructor-student feedback in online spaces. His research is also concerned with understanding how changing environments shape plant diversity on both evolutionary and ecological time scales. Currently, he is focused on using historical specimen data to study how historic climate change (over the past century) has impacted plant phenology and diversity across North America.

Jeffrey A. Klemens (Faculty Advisor) is an Assistant Professor of Biology at Thomas Jefferson University, where he serves as program director for the undergraduate biology curriculum. His current research activities are focused on the use of agent-based models to describe habitat use by organisms in the urban environment and the role of active learning in science education, particularly the use of systems thinking and other modeling techniques to improve student understanding of complex phenomena.




Data Learning in Detroit’s Eastern Market

Ghalia Ezzedine (Fellow) is a second-year master’s student studying Health Informatics. She is interested in leveraging data and digital tools to improve population health. In her free time, she likes to try new recipes, work out, and occasionally jump off a bridge or airplane. She chose this case study because of her interest in nutrition, and the shift in foods available at this iconic marketplace: https://www.learngala.com/cases/2b92db37-de87-4321-a531-510dea225189



Josh Newell (Faculty Advisor) is an Associate Professor in the School for Environment and Sustainability at the University of Michigan. He is a broadly trained human-environment geographer, whose research focuses on questions related to urban sustainability, resource consumption, and environmental and social justice. His research approach is often multiscalar and integrative and, in addition to theory and method found in geography and urban planning, he draws upon principles and tools of industrial ecology and spatial analysis.


Midwest Big Data Hub co-leads local events for 4th Annual Global Women in Data Science Conference

The Midwest Big Data Hub co-led local participation in the 4th annual Global Women in Data Science (WiDS) Conference, with sponsorship from the National Center for Supercomputing Applications (NCSA) and the University of Illinois. The event was free and open to all. The WiDS Conference, hosted on March 4th at 150 locations around the world, seeks to unite and connect women working in data science fields.

“We were very excited to co-sponsor this with NCSA, and support this inaugural Illinois event for Stanford’s Global Women in Data Science Day,” said Melissa Cragin, Executive Director of the Midwest Big Data Hub. “Partnering with others on events such as the Illinois WiDS allows us to best use our human resources and experts network to broaden participation in data science and Big Data research and education. I was honored to participate and have the opportunity to moderate such a terrific panel of accomplished leaders, who shared their perspectives on data science, data-enabled research, and opportunities for women in this space.”

panel discussion
Faculty panel moderated by MBDH Executive Director Melissa Cragin

The WiDS local events, hosted this year at NCSA, featured a variety of speakers from diverse backgrounds presenting sessions on opportunities for women in data science, technical vision talks, and the variety of data science and technology careers available in the Midwest.

“I always enjoy telling my story about how I got started working big data research,” said Ruby Mendenhall, Illinois Professor of Sociology and African-American Studies and NCSA faculty affiliate. “My story also demonstrates the importance of doing outreach to groups that are not traditionally represented in data science such as African American Studies.”

As part of her 2017-2018 NCSA Faculty Fellowship, Mendenhall and NCSA research programmer Kiel Gilleade completed a pilot study called the Chicago Stress Study that examines how the exposure to nearby gun crimes impacted African American mothers living in Englewood, Chicago. Mendenhall and Gilleade developed a mobile health study which used wearable biosensors to document 12 women’s lived experiences for one month last fall. As part of their research, Mendenhall, Gilleade, and their team were able to create an exhibit based on the study data they collected in order to bring the unheard, day-to-day stories of these mothers to life.

panel discussion
Panel discussion moderated by iSchool Professor Catherine Blake

Professor Donna Cox, Director of NCSA’s Advanced Visualization Lab, was a panelist at this year’s local conference, and praised the insights of the other speakers while emphasizing the importance of the larger WiDS conference. “It was valuable to hear other panelists,” said Cox. “The future of Women in Data Science should include raising awareness about important issues emerging in data science, especially socially-relevant issues. We need more women actively involved in the ethics of data science.”

Alice Delage, Associate Project Manager for NCSA and Program Coordinator for the MBDH, said, “Hosting WiDS Urbana-Champaign at Illinois was an opportunity to highlight the campus expertise around data science led by women.” Delage, who co-chairs the local Women@NCSA group, said, “Data science and technologies are increasingly impacting our lives and society, and it is imperative that women and minorities be part of these transformations. We wanted to showcase the groundbreaking work being done in that area by Illinois female data scientists and to inspire more women and underrepresented communities to engage in the field.”

There are also opportunities to expand the event next year by better incorporating student work in the program, Delage said, or running a datathon, for example. Some of this year’s participants have already volunteered to help with next year’s event.

A full list of this year’s speakers at the WiDS Conference at NCSA is here. For more information about the global WiDS conference and ways to get involved, please visit https://www.widsconference.org.

The MBDH is one of four regional Big Data Innovation Hubs with support from the National Science Foundation (award # 1550320), and works to build capacity and skills in the use of data science methods and resources in the 12-state U.S. Midwest Census region. Learn more about the Hub at https://midwestbigdatahub.org.

Thanks to NCSA Public Affairs for contributing to an earlier draft of this post.

Guest post – Data Science Education at Two-Year Colleges

By Matt Fall

Executive Director, Center for Data Science, Lansing Community College

Recently, the American Statistical Association (ASA), with support from the National Science Foundation (NSF), hosted a two-day summit in Washington D.C. to discuss outcomes and curricula for data science programs at two-year colleges. The Two-Year College Data Science Summit (TYCDSS) was intended to help spur the growth of data science programs at these institutions and included representatives from two and four-year institutions, government, and industry.

Sallie Keller (Virginia Tech) plenary talk (photo: Nicholas Horton)

The summit included several plenary talks discussing the role of two-year colleges in addressing the need for data scientists as well as a brief presentation from a graduate of a community college data science program. The majority of the summit, however, was devoted to a series of working sessions where the participants discussed ideal outcomes and competencies for three categories of students:

  • Category 1: students intending to complete an Associate’s degree and begin working
  • Category 2: students intending to earn an Associate’s degree and transfer to a 4-year program
  • Category 3: students seeking a certificate

The working discussions provided an opportunity for the summit participants to discuss what was expected and feasible for a student from each category to complete. The discussions were captured by a designated writing group and there will be a forthcoming write-up summarizing the recommendations of the summit participants with guidelines for two-year college data science programs.

This summit was particularly timely for my colleagues at Lansing Community College (LCC) as we have recently begun development of a data science program. Prior to the summit, participants were provided access to a list of resources that included relevant research, reports from related workshops, and sample syllabi. Of particular interest to us, as we design the layout of our program, were the Park City Math Institute’s Curriculum Guidelines for Undergraduate Programs in Data Science (2016) [PDF], the Oceans of Data Profile of the Data Practitioner (2016), and the Oceans of Data workshop report on Building Global Interest in Data Literacy (2016). The resources provided, candid discussions with other two-year colleges regarding their programs, and the discussions about realistic competency expectations were also of interest and informative to our program design.

The intent of the TYCDSS directly supports the MBDH’s priority area of interest in data science, education and workforce development. Two-year colleges provide higher education accessibility to many students who could not or would not otherwise pursue an advanced degree. An increasing number of these schools are offering certificate and Associate’s degree programs in data science and analytics to support growing workforce demand. Growth in these types of programs should naturally lead to an increase in data competency, enrollment in university programs, and larger hiring pools for data science based careers.

Related information:

Midwest Big Data Summer School 2018

Midwest Big Data Summer School reveals how big data can advance research efforts

By Paula Van Brocklin, Office of the Vice President for Research, Iowa State University

Iowa State University logo
The Midwest Big Data Summer School, held May 14-17 at Iowa State University, helped nearly 140 academic and industry researchers, graduate students and post-docs from nine states broaden their understanding of big data and its ability to advance their research interests. Iowa State has organized and hosted the event since 2016.

 
“The summer school seeks to bridge the gap between scientists and engineers using data science technology by introducing them to data science techniques and vocabulary,” said Hridesh Rajan, lead organizer of the Midwest Big Data Summer School and professor of computer science at Iowa State. “The idea is to help these individuals better communicate and leverage their data-science needs.”

The curriculum

The school’s first three days introduced attendees to a range of big data topics, including data acquisition, data preprocessing, exploratory data analysis, descriptive data analysis, data analysis tools and techniques, visualization and communication, ethical issues in data science, reproducibility and repeatability, and understanding domain/context.

On the final day, participants selected one of four tracks, which focused on a sub-area of big data analysis. The tracks were:

  • Foundations of Data Science
  • Software Analytics
  • Digital Agriculture
  • Big Data Applications

Several individuals at Iowa State were instrumental in developing and organizing the tracks’ curricula. Click here for a list of those involved.

Speakers

Keynote presenters at this year’s summer school were:

  • Chid Apte, director, Mathematical Sciences and Blockchain Solutions, IBM Research
  • Tom Schenk, chief data officer, City of Chicago
  • Jacek Czerwonka, principal software engineer, Microsoft Research
  • Will Snipes, principal scientist, ABB Research

A complete list of speakers, including their bios, is available here.

Data science evolving quickly

The field of big data, also referred to as data science, is relatively new yet advancing quickly. For this reason, organizers encourage researchers and scientists to learn as much as they can through resources like the Midwest Big Data Summer School.

“Our aim is for early career researchers and professionals – both in academia and industry – to get a taste of what it’s about, what the state of the art is and how they can start thinking about using data science in their own domains,” said Chinmay Hegde, assistant professor of electrical and computer engineering at Iowa State and a co-organizer of the summer school.

Many thanks

Rajan recognizes the summer school would not be possible without the help of many.

“We are especially thankful for the Midwest Big Data Hub, the National Science Foundation, the Office of the Vice President for Research, Iowa State’s College of Liberal Arts and Sciences, and the departments of computer science and statistics for providing both funding and personnel support for this event.”

Next year
Plans are in the works for the 2019 Midwest Big Data Summer School, though no dates have been set. Rajan said more application-specific tracks may be added to next year’s curriculum. Watch the Midwest Big Data Summer School website for more details in the spring of 2019.

——

Reposted from Iowa State University’s Research News blog. View the original post here.

MBDH partners on US Ignite Reverse Pitch challenge

part of Hub’s focus on Smart, Connected, and Resilient Communities

US Ignite Hackathon
UIUC collaborators and mentors meet with HackIllinois teams on US Ignite Challenge

The University of Illinois at Urbana-Champaign (UIUC) was awarded a $20,000 grant from US Ignite to host a Smart Gigabit Communities Reverse Pitch Challenge. The MBDH, along with other local partners (see below), contributed towards matching the grant, bringing to $40,000 the total resources available to support the development of smart gigabit applications for the benefit of the local community. Read More

New Report on “Keeping Data Science Broad”

A new report on the “Keeping Data Science Broad: Negotiating the Digital and Data Divide Among Higher Education Institutions” initiative was released by the South Big Data Hub and collaborators, including the Midwest Hub. This initiative brought together the BD Hubs community and other stakeholders to explore pathways for keeping data science education broadly inclusive. Read More