•  Location: Smith-Buonanno HallRoom: 106

    Join us for a Brown2026 lecture on how access to information in the digital age can affect civic participation.

    We are presenting field data evaluating real-time broadband and cellular transfer rates across Rhode Island—a study that serves as a modern audit of our state’s “informational infrastructure.”

    In the 18th century, the pulse of the American Revolution was maintained through the physical distribution of the printed word. Today, reliable broadband has superseded the postal road, becoming a civic necessity as fundamental as electricity or running water. Just as the Committees of Correspondence once relied on reliable horse-bound routes to organize the colonies, modern Rhode Islanders rely on digital lanes to access community services, local news, and the inner workings of our democracy.  Historically, the withholding of information has served as a silent form of disenfranchisement.

    Without equitable access to broadband, the modern constituent faces barriers reminiscent of pre-revolutionary information blackouts:

    • Informational Isolation: A lack of clarity on ballot initiatives, leading to the same “panic voting” or confusion seen in eras of unreliable news.
    • Candidate Obscurity: The inability to vet the records and qualifications of those seeking power.
    • Logistical Barriers: Historical hurdles to voter registration and polling location awareness, now exacerbated by the “speed” of digital-first requirements.

    Common limiting factors—socioeconomic status, urban proximity, and geographic isolation—are not new phenomena. In Rhode Island, these hurdles often mirror historical patterns of infrastructure degradation. Whether it was a washed-out bridge in 1776 or a lack of fiber-optic cables in 2026, the result remains the same: the isolation of the citizen from the state.

    This event will feature a conversation on the historical lineage of information access, drawing direct parallels between the revolutionary power of the printing press and the democratic mandate for universal broadband today.

    Speakers include:  Suresh Venkatasubramanian, CNTR Director; Timothy Edgar, PoP Computer Science, and undergraduate students Amber Zhao and Siddarth Nareddy

    Dinner will be provided to registered attendees.

    RSVP on Eventbrite
    View Full Event  
  •  Location: 164 Angell Street, 3rd floorRoom: 302

    Join the Data Science Institute for our next Data & Donuts session featuring Serena Booth, Assistant Professor of Computer Science at Brown University, where she studies how people design, understand, and evaluate AI systems. Her research focuses on human-AI and human-robot interaction, particularly how users specify goals and behaviors for machine learning systems. She also examines the societal impacts of AI, including how these technologies affect workers and consumers.
    The Data & Donuts series offers an informal space for students, faculty, and staff to connect over short talks by Brown faculty and data scientists on topics and campus resources related to data science. These roundtable-style sessions are open to all members of the Brown community with an interest in data science.

    View Full Event  
  •  Location: 164 Angell Street

    Join us for an afternoon of fun trivia with categories related to Brown, DSI, CNTR, and Rhode Island!

    Welcome to the DSI Trivia Game happening on March 14, 2026 at 2:00 PM! Get ready to put your knowledge to the test and have a blast with fellow trivia enthusiasts. Join us for an exciting afternoon filled with fun facts and friendly competition.

    Don’t miss out on the chance to showcase your trivia skills.  Food will be provided. Mark your calendars and spread the word to your friends – this is an event you won’t want to miss!

    RSVP Here
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Join the Brown Center for Computational Molecular Biology for our seminar with Dr. Yang Lu (UW Madison), on March 11, 2025:

    Advancing Trustworthy and Data-Driven Hypothesis Generation in Biomedical Research

    Abstract: Rapid developments in high-throughput sequencing have enabled biologists to collect large volumes of multi-omics data with unprecedented resolution. However, interpretation of such an increasing amount of heterogeneous biological data becomes highly nontrivial. In my talk, I will present a data-driven research paradigm to discover testable hypotheses directly from biological data in an interpretable and trustworthy fashion. In particular, the talk will focus on three recent works that address key aspects of biomedical research: analyzing data, generating and prioritizing hypotheses:

    (1) An interpretation method that generates testable biological hypotheses from machine learning (ML) models. Specifically, I developed an uncertainty-aware method to identify from single-cell RNA-seq data a combinatorial gene set signature to characterize the single-cell type. This method pioneers efforts to streamline existing single-cell analysis pipelines through a unified framework for easy interpretation.

    (2) A statistical method that prioritizes non-additive interactions from any ML models. The prioritized interactions, treated as hypotheses, are rigorously controlled for statistical errors without relying on p-values. This method was the first to demonstrate to the community that higher-order interpretations of ML models can be achieved with confidence guarantees.

    (3) A critical reevaluation of problematic statistical estimation of the Basic Alignment Search Tool (BLAST), a cornerstone tool used in daily biomedical analysis over the past 30 years. We have introduced an alternative method to address this issue, ensuring that it does not yield inflated estimates of significance. Our study has the potential to influence and reshape numerous conclusions drawn by researchers.

    Speaker Bio: 

    Yang Lu is an assistant professor at the University of Wisconsin-Madison. He was an assistant professor at University of Waterloo. Prior to that, he was a postdoctoral researcher in Prof. William Noble’s group at the University of Washington.He obtained his Ph.D. in Computational Biology and Bioinformatics under the supervision of Prof. Fengzhu Sun from University of Southern California.

    Before moving to the United States, he received M.S. and B.S. degrees in Computer Science and Engineering from Shanghai Jiao Tong University. Yang Lu’s research focuses on developing machine learning and statistical methods for genomics and proteomics data analysis. He is particularly interested in developing interpretation methods to find scientifically interesting and statistically confident hypotheses from complex biological data.

    Affiliations:

    (Primary) Department of Biomedical Engineering, UW-Madison

    Department of Biostatistics & Medical Informatics, UW-Madison

    –
    The mission of the Center for Computational Molecular Biology is to promote the development, implementation, and application of analytical and computational methods to foundational questions in the biological and medical sciences. The research programs of the core faculty in the CCMB lie foundationally at the intersection of computer science, evolutionary biology, mathematics, and molecular and cell biology. 
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Join the Data Science Institute and the Brown Quantum Initiative for a conversation with Celia Merzbacher, the Executive Director Quantum Economic Development Consortium at Semiconductor Research International (the largest semiconductor manufacturer consortium in the US). 

    Student Lunch (1:30pm):

    Celia will host a lunch with students and answer questions about quantum and computational career pathways.

    Talk (3:00pm):

    Celia will discuss quantum computation and semiconductor research, describing administration priorities and funding at agencies like NSF and DOE, as well as the legislation working its way through Congress to expand the National Quantum Initiative.

    –

    Speaker Bio:

    Dr. Celia Merzbacher is Executive Director of the Quantum Economic Development Consortium (QED-C), a global, industry-driven consortium managed by SRI International that aims to grow the quantum ecosystem. Dr. Merzbacher has more than two decades of experience as a leader of large multidisciplinary partnerships and programs at the intersection of government, industry, and academia. She was named a member of the Quantum 100 as part of the International Year of Quantum for her dedication and contributions. She has testified to Congress twice at hearings on the National Quantum Initiative and is a member of the GAO Expert Network that advises Congress. Merzbacher is a member of the U.S. delegation to the NATO Transatlantic Quantum Community and the G7 Quantum Working Group and, in partnership with UK Quantum, co-leads the UK-US Quantum Industry Exchange program. Celia is the host of “Been There, Done That”, a podcast where founders and executives from companies in the quantum sector share their experiences and lessons learned.

    Previously, Dr. Merzbacher was Assistant Director for Technology R&D in the White House Office of Science and Technology Policy and Executive Director of the President’s Council of Advisors on Science and Technology. She is a Fellow of the AAAS and serves on the National Academies of Science, Engineering, and Medicine (NASEM) Standing Committee on Horizon Scanning and was Chair of the NASEM National Materials and Manufacturing Board. Dr. Merzbacher began her career as a materials scientist at the U.S. Naval Research Laboratory in Washington D.C., where her research led to six patents and more than 50 technical publications. She holds a BS degree in geological sciences from Brown University and MS and PhD degrees in geochemistry and mineralogy from The Pennsylvania State University.

     

    RSVP to Student Lunch with Celia Merzbacher (1:30pm)

    RSVP to afternoon talk below:

    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Join the Brown Center for Computational Molecular Biology for our seminar with Dr. Pei Wang (Mt. Sinai), on February 25, 2025:

    Pan-cancer proteogenomics characterization of tumor immunity

    Abstract: Despite the successes of immunotherapy in cancer treatment over recent decades, less than <10%-20% cancer cases have demonstrated durable responses from immune checkpoint blockade. To enhance the efficacy of immunotherapies, combination therapies suppressing multiple immune evasion mechanisms are increasingly contemplated. To better understand immune cell surveillance and diverse immune evasion responses in tumor tissues, we comprehensively characterized the immune landscape of more than 1,000 tumors across ten different cancers using CPTAC pan-cancer proteogenomic data. We identified seven distinct immune subtypes based on integrative learning of cell type compositions and pathway activities. We then thoroughly categorized unique genomic, epigenetic, transcriptomic, and proteomic changes associated with each subtype. Further leveraging the deep phosphoproteomic data, we studied kinase activities in different immune subtypes, which revealed potential subtype-specific therapeutic targets. Insights from this work will facilitate the development of future immunotherapy strategies and enhance precision targeting with existing agents.

    Bio: Dr. Pei Wang is a Professor of Genetic and Genomic Sciences at Icahn School of Medicine at Mount Sinai. She obtained her B.S. in Mathematics from Peking University, China, in 2000; and her Ph.D. in Statistics from Stanford University in 2004. Between 2004-2013, Dr. Wang served as a faculty in Program of Biostatistics, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center and Department of Biostatistics, University of Washington, Seattle, WA. In Oct 2013, Dr. Wang joint Icahn Medical School at Mount Sinai, New York to lead an integrative proteogenomic research program. Dr. Wang’s research focuses on developing statistical and computational tools that translate billions of data points about diseases like cancer into answers about their causes. She is a team leader of the NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) that aims to understand the molecular basis of cancer through large-scale proteome and genome analysis. Dr. Wang is the principle investigators of the CPTAC Proteogenomics Data Analysis Center at Mount Sinai. The Center focuses on identify potential biomarkers and drug targets for cancer that will help accelerate cancer research.

    –
    The mission of the Center for Computational Molecular Biology is to promote the development, implementation, and application of analytical and computational methods to foundational questions in the biological and medical sciences. The research programs of the core faculty in the CCMB lie foundationally at the intersection of computer science, evolutionary biology, mathematics, and molecular and cell biology. 
    View Full Event  
  •  Location: Salomon Center for TeachingRoom: 202

    Join the Data Science Institute for a Data Science Industry Talk with DSI alum, Annie Phan ’21:

    Designing AI That Survives the Real World: Lessons from Enterprise Practice and the AI Maturity Mandate

    This hands-on session connects the “messy middle” of enterprise AI to the realities of data science work for career professionals and emerging leaders in the Brown Data Science ScM program. Drawing on case studies from Diligent, Fanatics Collectibles, and earlier consulting work, I’ll walk through what actually makes AI initiatives succeed or stall—data foundations, stakeholder alignment, translation muscle, and operating models, not just model performance. We’ll reverse‑engineer several real enterprise AI patterns from my upcoming book AI Maturity Mandate, then map those patterns onto participants’ project contexts: clarifying problem statements, scoping minimal viable data products, defining success metrics, and stress‑testing risks and failure modes.

    Speaker Bio:
    Annie Phan is a data and AI leader and Staff AI Solution Architect at Diligent, where she designs and deploys secure, enterprise-scale AI systems across go-to-market, legal, and finance functions. Previously, she led major AI-driven transformation initiatives at Fanatics Collectibles and served as a Data & AI expert at McKinsey & Company. A Forbes Business Council member and published thought leader, she has written on data strategy and AI maturity for outlets such as Forbes and Analytics Insight, served as a judge for programs including the Globee Awards, Business Intelligence Group, AI Forge, Hackathon Raptors, and Stevie Awards, and spoken at industry‑wide events alongside leaders from top AI and technology companies, including the Velric Hiring & Leadership Summit and Brown’s Data Science Institute. Annie holds a Master’s in Data Science and a Bachelor’s in Economics from Brown University.

    –

    This event is hosted in-person and on Zoom. Brown students and staff may attend in-person or online; community members external to Brown should attend online. 

    View Full Event  
  •  Location: 121 South Main StreetRoom: 375

    Claudio Battiloro, Research Associate at Harvard University, will offer a lecture entitled “Healthy Climate Adaptation in the AI Era.” The lecture is co-sponsored by the Department of Biostatistics in the School of Public Health and the Data Science Institute.

    Lunch will be provided, and in-person attendance is encouraged. For those unable to attend in person, the lecture will also be available on Zoom at the following link: https://brown.zoom.us/j/96777800137

    View Full Event  
  •  Location: 121 South Main StreetRoom: 375

    In this talk, Kan Chen (Postdoctoral Fellow, Harvard University) will present research on causal inference for large-scale, complex data, motivated by high-stakes decisions in health and biomedical settings where standard causal assumptions often fail. Rather than viewing data scale and complexity as obstacles, Chen’s work shows how structured, high-dimensional data can be leveraged to improve causal identification and support better policy and strategic decisions. Chen will first introduce FAMA (Factor Analysis-based Mediation Analysis), a framework for large-scale mediation analysis under unmeasured confounding that uses latent factor structure to uncover causal pathways in high-dimensional molecular data. He will then show how protein structure representations from AlphaFold can be integrated with causal models to estimate the effects of genetic and protein-level interventions on clinically relevant outcomes. Chen will conclude by extending these ideas into a broader research agenda at the intersection of causal inference and machine learning, aimed at turning large-scale biomedical data into actionable evidence for health policy and precision medicine.

    Refreshments will be provided, and in-person attendance is encouraged. For those unable to attend in person, the lecture will also be available on Zoom at the following link: https://brown.zoom.us/j/98261712136

    View Full Event  
  •  Location: Data Science InstituteRoom: 3rd floor open space

    This event will bring together panelists from DEEPS, Sociology, and Data Science:
    Dr. Chris Horvat
    Dr. Jung Eun Lee
    Dr. Kim Cobb
    Dr. Karianne Bergen
    to explore how data-driven approaches inform climate and environmental research, highlighting the range of climate modeling and analytical work happening across Brown.

    We will begin with a moderated panel in the DSI open space, followed by small-group roundtable discussions and Q+A in breakout rooms – with food from Kabob and Curry!

    We hope this event offers insight into how data and computation advance climate modeling, the tools and methods behind groundbreaking work, the intersections between data science and geoscience, and opportunities for students to get involved in climate research!

    RSVP here
    View Full Event  
  •  Location: 164 Angell StreetRoom: Open Area

    The CCMB Graduate Research Symposium is an opportunity for Center for Computational Molecular Biology (CCMB) graduate students to learn about their peers’ research and to practice sharing their work with colleagues. Senior PhD students will give 15-minute presentations on their work.

    This event will give new graduate students exposure to the type of work their colleagues do and foster collaboration and connection between our graduate students.

     

    Spring 2026 Presenters:

     

    Ananya Pavuluri

    Advisors: Erica Larschan, Ritambhara Singh

    “Let’s talk about sex: investigating differences between males and females across diverse species.”

     

    Ria Vinod

    Advisor: Lorin Crawford

    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    The Data Matters Seminar Series is hosted by the Data Science Institute, covering why data matters across the physical, biological, computational, and social sciences. Data Matters is intended to stimulate conversations and collaboration by bringing multiple perspectives to challenging data-driven problems.

    Light refreshments will be provided. 

     

    GEOPACHA AI

    South American Archaeology, at Scale?: Using Vision Foundation Models to Map Sites en masse in GeoPACHA-AI

    Abstract: Archaeologists excel at documenting the material dimensions of social life at local and regional scales but we often struggle to generate national and continental-sale datasets with continuous, systematic coverage. Collecting such data is important for at least three reasons: 1) macro-regional datasets provide context for understanding the variation that we observe at local and regional scales; 2) synthetic databases stitched together out of regional projects often amplify sampling biases; and 3) past peoples themselves frequently understood and acted in their worlds through large scale networks, including empires, circuits of interregional trade, and long-distance seasonal migration. In this presentation, I discuss the development and deployment of one solution to this problem: GeoPACHA-AI (Geospatial Platform for Andean Culture, History, and Archaeology), an international collaborative project that develops and harnesses Vision Transformer-based AI models to conduct continental-scale archaeological imagery survey. GeoPACHA’s project area covers nearly the entirety of the central Andes, from northern Ecuador through southern Chile, and we seek to enable new perspectives on interregional social networks, the broad impacts of imperial expansions, and long-term responses to climate change.

    Parker VanValkenburgh’s research and publications employ archaeological methods to address anthropological research questions, with a particular focus on the long-term impacts of colonialism and imperialism on Indigenous people and environments in the Peruvian Andes. Through the study of diverse materials and media––including architecture, ceramics, environmental datasets, and archival documents––he seeks to understand how relationships between people, institutions, and environments are transformed in the course of imperial histories, as well as how the strategies of survival and resilience that communities develop to deal with empires are passed down and reworked across generations. In the course of doing so, he strives to generate approaches that are widely applicable to the study of empire(s) beyond the Andean region and which contribute to interdisciplinary understanding of imperial legacies in the modern world. In this work, he draws amply on digital methodologies, including the tools of geographic information systems (GIS), to map and analyze social, political, and environmental change in space and time. He also applies a critical lens to the study of digital media and methodologies, asking not just how these techniques facilitate archaeological scholarship, but how digital mediation transforms the ways we work with collaborators, research subjects, students, and public audiences.

    __

    This seminar and discussion will be mediated by Karianne Bergen (DSI, DEEPS)

    View Full Event  
  • Register
    View Full Event  
  •  Location: 121 South Main StreetRoom: 375

    Larry Han, Assistant Professor of Public Health and Health Sciences at Northeastern University, will offer a lecture entitled “Trustworthy Uncertainty Quantification for Biomedical Decision-Making: Fair and Causal Conformal Inference with Efficiency Guarantees.” The lecture is co-sponsored by the Department of Biostatistics in the School of Public Health and the Data Science Institute.

    Refreshments will be provided, and in-person attendance is encouraged. For those unable to attend in person, the lecture will also be available on Zoom at the following link: https://brown.zoom.us/j/94375178485

    View Full Event  
  • *Spring 2026 workshops will be held virtually (https://brown.zoom.us/j/96158269691)*

    Presenter: Ref Bari, Masters student in Physics and PI for T-REX

    “The data science behind imaging black holes with T-REX: Time-Resolving Explorer Satellite”

     

    In 2019, the Event Horizon Telescope (EHT) directly imaged a black hole for the first time, heralding a new era in astronomy. The T-REX (Time-Resolving Explorer Satellite) mission extend the EHT to space, achieving sub-ISCO temporal resolution and enabling the first time-resolved movies of Sgr A*’s accretion flow at 86 GHz.

    This workshop will cover the data science aspect of T-REX: imaging a black hole generates petabytes of data and involves sophisticated fourier transforms, solving inverse problems, and data compression.

    This talk will be of interest to physicists and data scientists alike.

    Level: Some experience

    –

    DSCoV (Data Science, Computation, and Visualization) workshops are lunchtime introductions to basic data science and programming skills and tools, offered by and for Brown staff, faculty, and students (with occasional presenters from outside Brown). The workshops are interactive, so bring a laptop. All are welcome.

    These workshops will be held virtually on Zoom: https://brown.zoom.us/j/96158269691

    DSCoV Workshops
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Are you interested in the Data Fluency undergraduate certificate?

    Join the Data Science Institute (DSI) on Thursday, January 29th at 12:00pm for an informational session with the certificate’s Director of Undergraduate Studies (DUS), DSI academic staff, and current certificate students. Come learn about the certificate requirements, important deadlines, and more! A light lunch will be provided. The session will be recorded for students who can’t attend.

    Register Here
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    The Brown Center for Computational Molecular Biology invites Dr. Rajiv McCoy (Johns Hopkins) to speaker in our seminar series on January 28, 2025.

    Human genome evolution within and across generations

    Abstract: Genetic variation at the level of DNA sequence mediates much of the phenotypic diversity that exists in nature, both within and between species, including humans. My lab uses the tools of computational and statistical human genetics to answer questions about how germline and somatic evolution shape 1) genome function and 2) reproduction and development. Our work spans diverse systems but is unified around the goal of developing methods for elucidating the evolutionary forces that impact our genomes across scales of biological organization.

    Importantly, much of the phenotypic variation in nature traces not to differences in amino acid sequences, but to regulatory variation influencing transcription and splicing of RNA. I will describe my lab’s recent work generating and analyzing a large gene expression dataset from globally diverse human individuals, toward a more complete view of the mechanisms driving gene expression diversity and evolution within our species.

    In addition to the evolutionary processes operating across generations, my lab is also interested in the recurrent forces of natural selection that shape human development. For example, it is estimated that less than half of all human conceptions survive to birth, primarily due to chromosome mis-segregation during meiosis and mitosis. I will describe our latest work repurposing clinical genetic testing data from in vitro fertilized embryos to understand the genetic basis of variation in human chromosome abnormalities.

    Rajiv McCoy is an Associate Professor in the Department of Biology, with a secondary appointment in the Department of Genetic Medicine, at Johns Hopkins University. He received his PhD in Biology from Stanford University and completed postdoctoral training at the University of Washington and Princeton University before joining Johns Hopkins in 2018. His lab applies computational and statistical approaches to large-scale human genetic datasets. Research accomplishments include contributing to analysis of the first complete human genome, development of functional genomic resources for globally diverse populations, and discovery and characterization of common genetic variants associated with aneuploidy—the leading cause of human pregnancy loss. The McCoy lab is supported by funding from the NIH, NSF, Lalor Foundation, and American Society of Hematology and is actively engaged in teaching and mentoring in computational genetics at both the graduate and undergraduate levels.

    The mission of the Center for Computational Molecular Biology is to promote the development, implementation, and application of analytical and computational methods to foundational questions in the biological and medical sciences. The research programs of the core faculty in the CCMB lie foundationally at the intersection of computer science, evolutionary biology, mathematics, and molecular and cell biology. 

    View Full Event  
  • Please join us for our upcoming virtual information session for admitted students of Brown’s online Master’s in Data Science. Attendees will gain further insight into the program structure, student resources and next steps.

    Register Now
    View Full Event  
  •  Location: 164 Angell StreetRoom: Open Area

    Join the Data Science Institute as we recognize the recipients of our inaugural Early Career Breakthrough Research Awards and Seed Grant Awards.

    We look forward to celebrating the outstanding the outstanding research at Brown that leverages data science to propel research and innovation across disciplines. 

    This celebration will recognize each award recipient. Refreshments will be provided. All are welcome to attend. 

    This event can also be viewed over Zoom: https://brown.zoom.us/j/95629661736

     

    2026 Early Career Breakthrough Research Award Recipients:

    • In the Biological Sciences: Ying Ma, Edens Family Assistant Professor of Healthcare Communications and Technology and Assistant Professor of Biostatistics
    • In the Physical Sciences & Engineering: Loukas Gouskos, Christopher M. Barter Assistant Professor of Physics
    • In the Humanities & Social Sciences: Kim Fernandes, Assistant Professor of Anthropology

    2026 Data Science Seed Grant Recipients:

    • Qualitative Content Analysis Using R and Large Language Models (Jacy Weems, Emmanuelle Belanger)
    • Creating the Trypanosoma brucei digital cell (Laura Smithson, Ana Sofia Cepeda-Diaz, and Chris de Graffenried)

    • Leveraging Rhode Island Linked Administrative Data to Understand the Effects of Paid Leave on Children’s Health and Educational Outcomes (Margot Jackson, Jiwan Lee, Michael Silverstein and Lauren Schlichting)

    • Developing an Oxford Nanopore Whole-Genome Sequencing and Analysis Pipeline for Plasmodium falciparum Drug Resistance Surveillance (Abebe Fola)

    • How Does the Dissolution of the US Department of Education Impact the Education Ecosystem? (Lindsey Kaler, Cameron Arnzen, Niamh Stull and Susan Moffit)

    • The Jewish Studies Citation Network (Michael Satlow)

    • Bloom: A Smart Plant System Integrating Sensor-Driven Data Analytics to Promote Urban Sustainability and Human-Plant Interaction (Jenny Zhu, Savvas Koushiappas, Ayaan Jamal, Bruno Rodriguez-Mendez, Ryan Pang)

    • iPREDiCT: A toolkit for cancer recurrence prediction using integrated genomics and clinical data (Ece Uzun, Jeremy Warner, Seema Nagpal and Samuel Rubinstein)

    • Data Science Collaborative for Real-World Evidence in Cancer Therapeutics (Rebecca Hubbard)

    • From Engagement to Outcomes: Predicting and Improving Student Career Readiness at Brown (Viktor Gavrielov, Amanda Khoo)

    • Brown Efficient AI Compression Network (BEACON) (Richard Gaitskell, Gaetano Barone, Ian Dell’Antonio, Matt LeBlanc, Jonathan Pober and Jennifer Roloff)

    • Machine Learning Conference on Justice in Education, Healthcare, and Public Policy (Kenya Andrews)

    View Full Event  
  • Are you considering continuing your education at Brown through the Fifth-Year Master’s program? Join us for this information session to learn how you can extend your Brown experience and earn a master’s in data science in just one additional year.
    During the session, admissions staff and program leadership will share insights on:

    • The structure and benefits of the data science fifth-year master’s program
    • Application requirements and tips for submitting a strong application
    • Resources and support available to master’s students at Brown

    This session is designed to answer your questions—your participation will guide the conversation. We look forward to helping you explore the next step in your academic journey at Brown.

    Register Here!
    View Full Event  
  •  Location: 164 Angell StreetRoom: Open Area

    Yule definitely want to attend this event.

    Join us for the DSI Winter Wonderland Social, where we’ll be spreading cheer, serving treats, and snow-stalling your finals stress with fun activities and cozy vibes.

    It’s the perfect chance to take a study break, mingle, and get your jingle on before the semester wraps up.

    When: Thursday, December 11
    Time: 4:00 – 6:00 PM
    Where: Data Science Institute, 164 Angell St, Floor 3
    What: Food, fun, and lots of festive energy

    We’ll be sleighing the season together, don’t flake out on us.

    View Full Event  
  • Register
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Join the Data Fellows Program for presentations from our Fall 2025 cohort on their projects this semester!

    The Data Science Fellows program offers a unique opportunity for students to collaborate with a participating faculty member to infuse data science tools and practice in existing undergraduate courses. Undergraduates interested in data science and collaborative work with a faculty member have the opportunity to enroll in the Data Science Fellows course (DATA 1150, offered each fall semester). This course will prepare the Data Science Fellows to serve as consultants for faculty wishing to enhance data science curricula at Brown. This program is offered in collaboration with the Sheridan Center for Teaching and Learning. Students learn a core set of data science practices, active learning pedagogies, and collaborative communication skills.

    See past projects on the Data Fellows website

    View Full Event  
  •  Location: Watson Center for Information Technology (CIT)Room: 241

    Anti-Regulatory AI

    For almost 30 years, Lessig’s declaration that “code is law” has shaped how we regulate the technology du jour. Artificial intelligence (AI) is today’s technology, but its code may no longer be regulatory. AI may, in fact, be anti-regulatory by virtue of its opaque unpredictability–and this has introduced rippling clashes with longstanding legal rules. So the story goes.

    This dissertation puts forth an appraisal of AI’s code as a different kind of anti-regulatory mechanism – one that minimizes the costs of laws, or bypasses them altogether.

    First, we consider the surveillant consequences of how privacy-enhancing technologies (PETs) are layered as part of AI development in attempts to place data accumulation under the radar of data protection and privacy laws. We conclude with a re-imagination of data regulations to guide a development of PETs that protect the subjects of AI systems.

    Secondly, we examine how offerings like red-teaming and sovereign AI function as mechanisms of regulatory influence in the legitimization of technical self-governance. We conclude by considering how technological expertise can be leveraged to evaluate these offerings’ anti-regulatory functions.

    Finally, we present a taxonomy of “avoision” for AI regulation, with a particular eye to how the European Union’s AI Act (AIA) could be avoided in ways that subvert its spirit. We highlight how AI design decisions to the same technological effect can produce differing compliance determinations. We conclude with a discussion of how these tactics can be addressed through policy levers built within the AIA.

    Taken together, our analysis suggests that tensions between AI and regulation do not merely arise from AI’s inherent technical properties. They can also result from design decisions that minimize the costs that existing and emerging regulations pose. Consequently, the pursuit of effective AI regulation necessitates a careful account of the regulatory incentives that guide AI production. This dissertation maps an incentive-aware frame for AI governance in service of that pursuit.

    Host: Professor Suresh Venkatasubramanian

    View Full Event  
  •  Location: 164 Angell Street (DSI 3rd Floor)Room: 302

    Join the Data Science Institute for our next Data & Donuts session featuring Professor Björn Sandstede, Alumni-Alumnae University Professor of Applied Mathematics at Brown University. Professor Sandstede is a leading scholar in applied dynamical systems whose research connects mathematical theory with data-driven modeling across fields such as computational biology, pattern formation, and nonlinear dynamics. His work advances our understanding of complex systems while bridging rigorous analysis with modern data science.

    The Data & Donuts series offers an informal space for students, faculty, and staff to connect over short talks by Brown faculty and data scientists on topics and campus resources related to data science. These roundtable-style sessions are open to all members of the Brown community with an interest in data science.

    DSI DUG: Data & Donuts Events Interest Form
    View Full Event  
  •  Location: Rhode Island HallRoom: 108

    Data Science Institute Industry Talk

    In this fireside chat, Professor Lipman shares his experience working as a Data Scientist and Tech Lead Manager at Google. Peter will summarize his contributions at Google, where he managed cross-functional analytics-focused teams, and discuss his views on the skills needed to succeed in big tech. 

    Peter J. Lipman, PhD, recently joined Brown’s faculty as an Associate Professor of the Practice of Biostatistics at the School of Public Health. He holds a BA in Mathematics from Johns Hopkins University and a PhD in Biostatistics from Harvard University. Peter joins us after spending over a decade at Google; his work largely focused on labelled data generation for machine learning models in both Ads (2013-2022) and the Play Store (2022-2025). 

    View Full Event  
  •  Location: 164 Angell StreetRoom: Open Seminar Area

    “Statistical and Computational Advances for Detecting Nonlinear Contributions to Complex Traits”

    Advisors: Lorin Crawford, Ph.D. and Daniel Weinreich, Ph.D.

    Statistical genetics has improved our understanding of how genetic variation shapes human complex traits and diseases. Growing biobanks, improved sequencing technologies, and efforts of scientific collaboration have enabled the field to make progress in estimating heritability, detecting single-variant associations, and elucidating polygenic architectures. Genome-wide association studies (GWAS), rare variant tests, and models for gene-by-gene (GxG) and gene-by-environment (GxE) interactions have established a foundation for translating the insights from statistical genetics to clinical risk prediction and drug target discovery.

    Despite ever-increasing scale and availability of data, many questions about the genetic architecture of complex traits remain open. Traits may be driven by thousands of variants with small effects, while linkage disequilibrium patterns, pleiotropy, epistasis, and heterogeneity in environmental exposures obscure genetic inference. GWAS signals frequently are located in non-coding regions, complicating biological interpretation, and predictive models lose accuracy across ancestries likely due to differing linkage disequilibrium and environments. Many methodological challenges remain in order to better understand nonlinear contributions to phenotypic variation, disease susceptibility, and frailty.

    The present dissertation addresses these challenges by extending the marginal interaction framework to multivariate analysis and time-to-event traits, and by providing a framework for integration of multi-omic information into genetic association studies. Chapter 1 introduces a multivariate marginal epistasis test (mvMAPIT) that leverages genetic correlations between traits to detect genetic variants involved in GxG interactions. This method is implemented as a multivariate linear mixed model and improves power for detecting marginal epistasis in multivariate data. Chapter 2 presents the Cox proportional hazards gene-by-environment interaction test (CphGxE). This model enables the detection of GxE interactions in time-to-event traits by partitioning the heritable variance of frailty into genetic, environmental, and GxE interaction components. Chapter 3 proposes the Sparse Marginal Epistasis (SME) test. This approach enables the integration of functional data as biological priors into genome-wide association studies of epistasis, achieving substantial improvements in power and scalability of the marginal interaction framework to biobank-scale data.

    Together, these methods present computational and statistical advances by providing scalable and interpretable approaches for studying nonlinear genetic architectures in complex traits.

    View Full Event  
  •  Location: School of Public Health at Brown University, 121 South Main Street, Providence, RI 02912Room: 245

    Andriy Norets, Ph.D.,
    Professor of Economics
    Department of Economics at Brown University

     

    Talk Title: Bayesian Nonparametric Model for Nonseparable Instrumental Variable Regression (joint work with Simone Martinalli)

    Abstract: We propose a flexible Bayesian model for estimation of nonseparable instrumental variable regressions with a univariate endogenous covariate and a univariate unobservable responsible for endogeneity.
    Our model uses recently developed nonparametric priors for conditional distributions based on covariate dependent mixtures that deliver optimal adaptive posterior contraction rates in settings with discrete and continuous variables under possible smoothness and sparsity. We develop posterior concentration results and a Markov chain Monte Carlo algorithm for the model. The proposed methodology provides a simple parametric baseline model that can be gradually extended to more flexible parametric models and ultimately a nonparametric one by increasing the number of mixture components or setting a prior on it. Thus, robustness to nonlinearities, nonseparabilty, heteroskedasticity and other deviations from the baseline can be gradually introduced in a computationally tractable way. We illustrate the model performance in simulations and applications.

    View Full Event  
  •  Location: School of Public Health at Brown University, 121 South Main Street, Providence, RI 02912Room: 245

    Stephen Bach, Ph.D.,
    DSI Campus Advisory Board Co-Chair, Eliot Horowitz Assistant Professor of Computer Science at Brown University

     

    Talk Title: A Data-Centric Approach to AI Adaptation and Alignment

    Abstract: Training generative AI is not a one-step process. In the case of large language models (LLMs), self-supervision is often followed by supervised and reinforcement learning stages to improve instruction following, safety, and other desirable qualities. This multi-stage process that has emerged in the last 3 years has led to huge leaps in model capabilities. It has also led to new challenges and risks. In this talk, I will overview some of our group’s work to identify and address such challenges by focusing on the training data used at different stages. First, I will discuss the problem of adapting LLMs to new, specialized domains and the role that synthetic, i.e. LLM-generated, training data can play. Then, I will share some of our work showing how mismatches in training data at different stages can lead to safety alignment risks. In one case, LLMs with inadequate safety training can be more likely to respond to harmful queries when presented in languages with less abundant data like Swahili or Scots Gaelic. In another case, LLMs trained to reason about solving math problems can then deploy those same reasoning skills to reason out of their own safety guardrails. Together, these findings highlight the importance of careful training data management at all stages of AI development.

    View Full Event  
  •  Location: 164 Angell StreetRoom: Carney Innovation Area, Floor 4

    Presenter: Alice Paul, Assistant Professor of Biostatistics

    “Intro to Simple Data Analysis in R”

    This workshop will start by introducing how to load data into R and conduct a short exploratory analysis, including generating summary statistics, creating summary tables, and generating simple plots. Additionally, we will practice implementing some common statistical tests.

    Level: Beginner

    Required Prior Knowledge: Introductory statistics

    DSCoV (Data Science, Computation, and Visualization) workshops are lunchtime introductions to basic data science and programming skills and tools, offered by and for Brown staff, faculty, and students (with occasional presenters from outside Brown). The workshops are interactive, so bring a laptop. All are welcome.

    These workshops can be attended in person (Carney Innovation Hub, 164 Angell St, Floor 4) or on Zoom: https://brown.zoom.us/j/96158269691

    Food will not be served; please bring your own lunch.

    DSCoV Workshops
    View Full Event  
  • Please join us for our upcoming Virtual Information Session. Attendees will gain further insight into the program structure, curriculum highlights and application process. Please note, all times listed are Eastern (ET).
    Register Here!
    View Full Event  
  •  Location: 164 Angell StreetRoom: Carney Innovation Area, Floor 4

    Presenter: Paul Xu, Center for Computation and Visualization

    “Hyper-Modern Python Project Development with uv”

    Blazing fast and feature-rich, uv has emerged as the go-to tool for modern python development. We will cover how to use uv to manage Python dependencies and build/publish Python packages

    Level: Beginner

    Required Prior Knowledge: Basic python proficiency

    ___

    DSCoV (Data Science, Computation, and Visualization) workshops are lunchtime introductions to basic data science and programming skills and tools, offered by and for Brown staff, faculty, and students (with occasional presenters from outside Brown). The workshops are interactive, so bring a laptop. All are welcome.

    These workshops can be attended in person (Carney Innovation Hub, 164 Angell St, Floor 4) or on Zoom: https://brown.zoom.us/j/96158269691

    Food will not be served; please bring your own lunch.

    DSCoV Workshops
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Join the Brown Center for Computational Molecular Biology for our seminar with Dr. Weihan Li, Assistant Professor of Molecular Biology, Cell Biology and Biochemistry at Brown, on November 12, 2025:

    Imaging a single mRNA molecule throughout its life cycle

    Abstract:

    The Li Lab seeks a molecular understanding of how cells control the spatial organization of gene expression. We use advanced single-molecule imaging techniques to study the precise localization of RNAs and proteins. For instance, in yeast, approximately 20% of all the mRNAs are targeted to the surfaces of the endoplasmic reticulum and mitochondria, where they are translated to support organelle biogenesis and maintenance. Our lab aims to uncover the mechanisms that regulate the spatial organization of gene expression, and determine the consequences of its dysregulation.

    Dr. Weihan Li is an Assistant Professor at Brown University. He received his PhD from UCSF, and completed his postdoctoral training at Albert Einstein College of Medicine. The research of the Li lab focuses on dissecting the spatial organization of gene expression.
    –
    The mission of the Center for Computational Molecular Biology is to promote the development, implementation, and application of analytical and computational methods to foundational questions in the biological and medical sciences. The research programs of the core faculty in the CCMB lie foundationally at the intersection of computer science, evolutionary biology, mathematics, and molecular and cell biology. 
    View Full Event  
  • Presenter: Meredith Mendola, Center for Technological Responsibility, Re-imagination, and Redesign

    “Accessibytes: An A11y Toolkit”

    This workshop is designed to equip you with the practical tools and knowledge needed to build more inclusive digital content. We’ll demystify digital accessibility by moving beyond abstract concepts and focusing on concrete, actionable steps you can take in your daily work, from creating presentations and documents to designing applications. You’ll learn to identify common accessibility barriers and apply simple, effective techniques to ensure your work is usable by everyone. This is a toolkit for anyone who designs, creates, or codes—no prior accessibility knowledge required.

    Level: Beginner

    Required Prior Knowledge: None

    This workshop will be held primarily virtually. Our regular meeting location (Carney Innovation Hub) will be used as a viewing location for those who would like to attend in person. 

    ___

    DSCoV (Data Science, Computation, and Visualization) workshops are lunchtime introductions to basic data science and programming skills and tools, offered by and for Brown staff, faculty, and students (with occasional presenters from outside Brown). The workshops are interactive, so bring a laptop. All are welcome.

    These workshops can be attended in person (Carney Innovation Hub, 164 Angell St, Floor 4) or on Zoom: https://brown.zoom.us/j/96158269691

    Food will not be served; please bring your own lunch.

    DSCoV Workshops
    View Full Event  
  •  Location: 164 Angell StreetRoom: Floor 3

    Join DSI, CCMB, and CNTR for a casual Q&A session with current Brown PhD students about applying to grad school!

    This event is for students at Brown who are or may be applying to graduate school to ask questions of current grad students from DSI / CCMB / CNTR about the process of applying and the experience of grad school! If you are applying to PhD programs this year or are thinking about applying in the future, come have your questions answered and hear about our grad students’ experiences with applying and starting graduate school.

    We will have a handful of current PhD students from Computational Biology, Computer Science, and other computational fields represented at DSI gathered to answer questions and chat with curious undergrads and masters students.

    If you are a current PhD student and would like to participate as one of our grad student panelists, please reach out to aspensc@brown.edu to confirm.

    All Brown students are welcome to attend. We will provide a limited amount of refreshments for this event.

    View Full Event  
  •  Location: 164 Angell StreetRoom: Open Seminar Space

    Probabilistic Characterization of Transcriptomic States in Single-Cell Data

    Advisor: Lorin Crawford, PhD

    Understanding how cells adopt and transition between transcriptional states is central to deciphering tissue organization, disease progression, and therapeutic response. Single-cell RNA sequencing (scRNA-seq) provides a powerful platform for characterizing this heterogeneity at unprecedented resolution. However, current computational methods for identifying and interpreting cell states face several major challenges. They often rely on clustering heuristics that assume a fixed number of states, separate the inference of states from the discovery of their defining gene markers, and fail to account for how cellular state composition varies across biological contexts such as time, perturbation, or patient background. These limitations hinder the interpretability and actionability of inferred states, constraining their potential as biomarkers in precision medicine.

    This dissertation develops a family of scalable, interpretable, and probabilistically grounded models for the unsupervised characterization of transcriptomic cell states in single-cell data. The proposed framework addresses three key methodological needs: (i) the joint inference of cell states and their associated gene markers, (ii) the relaxation of assumptions regarding the number of latent states, and (iii) the probabilistic modeling of state persistence and context-dependent variation over biological time. The models build on Bayesian nonparametric principles—particularly the Dirichlet and dependent Dirichlet process mixture frameworks—and employ sparse priors to identify gene-level markers that distinguish cell states within high-dimensional expression space. Variational inference algorithms are derived to enable scalable estimation on datasets containing millions of cells.

    In the first part of the thesis, a sparse nonparametric Bayesian clustering model is
    developed that infers both the number of cell states and their characteristic gene expression patterns directly from expression data, without requiring prior cluster specification. The second part introduces NCLUSION, a scalable variational implementation that jointly performs clustering and marker selection, providing statistically robust and biologically interpretable results. The final part presents i-NCLUSION, a hierarchical extension that integrates experimental structure via a directed acyclic graph representation, allowing for dynamic modeling of cell state preferences across related contexts such as time points or patient samples. Applied to large-scale datasets—including an 11-million-cell mouse developmental time series and longitudinal profiles of human breast milk—these methods
    outperform structure-agnostic baselines and uncover biologically consistent patterns of cellular adaptation.

    Collectively, this work contributes a unified probabilistic framework for defining, interpreting, and contextualizing cell states as actionable biomarkers. By combining statistical rigor, scalability, and biological interpretability, the models developed here lay the foundation for more transparent and clinically relevant analyses of single-cell data, advancing the broader goals of precision medicine and systems-level understanding of cellular behavior.

    View Full Event  
  •  Location: 164 Angell StreetRoom: Carney Innovation Area, Floor 4

    Presenter: Xiran Liu, Postdoctoral Researcher, Center for Computational Molecular Biology (CCMB)

    Uncovering Latent Structures in Data: An Introduction to Topic Modeling and Its Applications Beyond NLP

    Topic modeling is a family of techniques used to uncover hidden themes or latent structures within large collections of data—originally developed for natural language processing (NLP) and later extended to diverse domains such as genomics, finance, and social sciences. In this workshop, we will explore the fundamentals of topic modeling, popular methods, and their implementation in Python. We will also see demonstrations of how topic modeling can be applied to extract meaningful biological signals from bioinformatics datasets.

    ___

    DSCoV (Data Science, Computation, and Visualization) workshops are lunchtime introductions to basic data science and programming skills and tools, offered by and for Brown staff, faculty, and students (with occasional presenters from outside Brown). The workshops are interactive, so bring a laptop. All are welcome.

    These workshops can be attended in person (Carney Innovation Hub, 164 Angell St, Floor 4) or on Zoom: https://brown.zoom.us/j/96158269691

    Food will not be served; please bring your own lunch.

    DSCoV Workshops
    View Full Event  
  •  Location: 164 Angell Street (DSI 3rd Floor)Room: R302

    Join the Data Science Institute for our next Data & Donuts session featuring Professor Ellie Pavlick, Assistant Professor of Computer Science and Linguistics at Brown University and Research Scientist at Google DeepMind. Professor Pavlick leads the LUNAR Lab, where her research explores how humans and AI systems understand language, concepts, and reasoning, bridging computational modeling with insights from cognitive science, neuroscience, and philosophy.

    The Data & Donuts series offers an informal space for students, faculty, and staff to connect over short talks by Brown faculty and data scientists on topics and campus resources related to data science. These roundtable-style sessions are open to all members of the Brown community with an interest in data science.

    DSI DUG: Data & Donuts Events Interest Form
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

     

    Data & Donuts

    Join the DSI at 4:00 pm on Thursday, October 30 for Data & Donuts!

    Ellie Pavlick

    Ellie Pavlick is the Briger Family Distinguished Associate Professor of Computer Science, an Associate Professor of Cognitive and Psychological Sciences, and a Research Scientist at Google Deepmind. She leads the Language Understanding and Representation (LUNAR) Lab, which seeks to understand how language “works” and to build computational models which can understand language the way that humans do. Her lab’s projects focus on language broadly construed, and often includes the study of capacities more general than language, including conceptual representations, reasoning, learning, and generalization. Her lab is interested in understanding how humans acheive these things, how computational models (especially large language models and similar types of “black box” AI systems) achieve these things, and what insights can be gained from comparing the two. They often collaborate with researchers outside of computer science, including cognitive science, neuroscience, and philosophy.

    The format of this series allows colleagues to connect informally with students and will feature short talks by Brown faculty and data scientists on research or campus resources related to data science.

    These talks are open to students, faculty, and staff of all levels with an interest in data science and are set up as a community roundtable to engage everyone.

    Donuts will be served!

    View Full Event  
  •  Location: 164 Angell StreetRoom: Floor 3

    Join the Data Science Institute for an afternoon of spooky fun, delicious food, and friendly competition as we celebrate the season together. 

    DSI, CCMB, and CNTR faculty, staff, researchers, and students are all welcome to join us for this social event. 

    Enjoy pizza, snacks, and Magical refreshments while participating in a lineup of fun Halloween-themed activities:


    Activities include:

    • Ghost Cup Stacking - Test your speed and steady hands!

    • Pin the Skeleton - A classic party favorite with a spooky twist.

    • Costume Contest - Show off your creativity at 4:45 PM!
      Bonus points for Data Science-inspired costumes (think haunted algorithms, spooky datasets, or ghostly graphs!).

    Come for the pizza, stay for the fright, and don’t forget your costume!

    We can’t wait to celebrate the spooky season with you. 

    View Full Event  
  • Ting Ye, Ph.D.,
    Assistant Professor of Biostatistics
    School of Public Health at University of Washington

     

    Talk Title: From Estimands to Robust Inference of Treatment Effects in Master Protocol Trials

    Abstract: A master protocol trial is an innovative clinical trial design that uses a single overarching protocol to evaluate multiple treatments, diseases, or disease subtypes, where participants are often randomized to different subsets of treatment arms based on individual characteristics, enrollment timing, and treatment availability. While offering increased flexibility, this constrained and non-uniform treatment assignment poses inferential challenges, with two fundamental ones being the precise definition of treatment effects and robust, efficient inference on these effects. Such challenges arise primarily because some commonly used analysis approaches may target estimands defined on populations inadvertently depending on randomization ratios or trial operational format, thereby undermining interpretability. This article, for the first time, presents a formal framework for constructing a clinically meaningful estimand with precise specification of the population of interest. Specifically, the proposed entire concurrently eligible (ECE) population not only preserves the integrity of randomized comparisons but also remains invariant to both the randomization ratio and trial operational format. Then, we develop weighting and post-stratification methods to estimate treatment effects under the same minimal assumptions used in traditional randomized trials. We also consider model-assisted covariate adjustment to fully unlock the efficiency potential of master protocol trials while maintaining robustness against model misspecification. The SIMPLIFY trial, a master protocol assessing continuation versus discontinuation of two common therapies in cystic fibrosis, is utilized to further highlight the practical significance of this research. All analyses are conducted using the R package RobinCID.

    View Full Event  
  •  Location: 164 Angell Street (DSI 3rd Floor)Room: R302

    Join the Data Science Institute for our Data & Donuts session featuring Professor Jamie Trost, Assistant Teaching Professor of Cognitive and Psychological Sciences at Brown University. Professor Trost specializes in statistics and quantitative methodologies, with research that bridges attention studies and the Scholarship of Teaching and Learning.

    The Data & Donuts series offers an informal space for students, faculty, and staff to connect over short talks by Brown faculty and data scientists on topics and campus resources related to data science. These roundtable-style sessions are open to all members of the Brown community with an interest in data science.

    DSI DUG: Data & Donuts Events Interest Form
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

     

    Data & Donuts

    Join the DSI at 4:00 pm on Thursday, October 23 for Data & Donuts!

    Dr. Jamie Trost

    Dr. Jamie Trost is a lecturer of statistics whose roots lie in cognitive psychology. While pursuing her doctoral degree studying attention and perception at the University of Notre Dame, Jamie became heavily interested in the Scholarship of Teaching and Learning (SoTL), particularly in regard to teaching quantitative statistics and methodologies. At Brown, Jamie leads courses in statistics at both undergraduate and graduate levels and will serve as a statistical consultant to graduate students in the CoPsy department. 

    The format of this series allows faculty to connect informally with students and will feature short talks by Brown faculty and data scientists on research or campus resources related to data science.

    These talks are open to students, faculty, and staff of all levels with an interest in data science and are set up as a community roundtable to engage everyone.

    Donuts will be served!

    View Full Event  
  • Please join us for our upcoming Virtual Information Session. Attendees will gain further insight into the program structure, curriculum highlights and application process. Please note, all times listed are Eastern (ET).
    Register Here!
    View Full Event  
  •  Location: Nelson Center for EntepreneurshipRoom: Davis Family Venture Lab (4th Floor)

    Join the Nelson Center for Entrepreneurship for a Founder Talk with Yeshimabeit “Yeshi” Milner ’12, founder of Data for Black Lives.

    Learn how Yeshimabeit took her love for data to make change in the lives of Black people. Data For Black Lives is a nonprofit organization and a movement of activists, organizers, and scientists committed to the mission of using data to create concrete and measurable change in the lives of Black people.

    This event is held in collaboration with the Data Science Institute, Brown Center for Students of Color, and the Center for the Study of Race and Ethnicity in America.

    RSVP Here!
    View Full Event  
  •  Location: 164 Angell StreetRoom: 302

    Join the Brown Center for Computational Molecular Biology for our seminar with Dr. Sendurai Mani on October 22, 2025:

    Reactivation of Embryonic EMT Programs to Unlock Cancer Stem Cell Traits, Metastasis, and Resistance

    Abstract:
    Epithelial–mesenchymal transition (EMT) is a key developmental process that cancer cells often hijack to promote progression and evade therapy. Reactivation of EMT programs allows cancer cells to gain stem cell–like traits that facilitate metastasis and contribute to therapy resistance. In this talk, I will present evidence and demonstrate how reactivating embryonic EMT circuits underpins gaining stemness traits, cellular plasticity, enabling tumor cells to switch between epithelial and mesenchymal states. Additionally, I will explore the mechanistic connections between EMT and cancer stem cells and how these interrelated processes contribute to tumor progression. Finally, I will discuss ways to target EMT–stemness pathways to overcome metastasis and the development of resistance to treatments.

    Sendurai A. Mani is a Professor in the Department of Pathology and Laboratory Medicine at Brown University. He is also the Associate Director of Translational Oncology at Brown University Legorreta Cancer Center. Dr. Mani earned a Ph.D. from The Indian Institute of Science, Bangalore, India and then did postdoctoral work with Dr. Robert A. Weinberg at the Whitehead Institute/Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, USA. He then joined the University of Texas MD Anderson Cancer Center, Houston, Texas as an Assistant Professor in December of 2007 and has been promoted to Associate Professor with tenure in 2013. In late 2022, Dr. Mani joined Brown University Legorreta Cancer Center as a professor and Associate Director of Translational Oncology. Dr. Mani has received numerous prizes and awards for his research, including a Jimmy V foundation’s V-Scholar Award and The American Cancer Society Research Scholar award. Dr. Mani’s original finding demonstrating the cancer cells acquire stem cell properties by activating latent embryonic epithelial-mesenchymal transition (EMT) program provided the foundation and explanation for the presence of plasticity within the tumor as well as the development of resistance to various treatments.

    The mission of the Center for Computational Molecular Biology is to promote the development, implementation, and application of analytical and computational methods to foundational questions in the biological and medical sciences. The research programs of the core faculty in the CCMB lie foundationally at the intersection of computer science, evolutionary biology, mathematics, and molecular and cell biology. 

    View Full Event  
  •  Location: Sidney E. Frank Hall for Life SciencesRoom: Marcuvitz Lobby

    The Data Science Institute is excited to bring together our affiliated faculty community to meet and mingle at our annual Affiliated Faculty Mixer! 

    Please join us for an evening with colleagues from across campus to make connections and get involved with the Data Science Institute.

    If you are a DSI affiliated faculty member, please let us know that you’re coming in advance!

    Register
    View Full Event  
  • Join the Data Science Online Master’s Program for an engaging conversation with David Firrincieli, Adjunct Faculty Member and AI Engineer in Deloitte’s Government & Public Services practice, where he currently consults full-time for a large federal health agency on their data infrastructure and AI capabilities. David will share strategic insights on how federal agencies architect, deploy, and manage enterprise data systems, exploring the intersection of technical implementation, regulatory compliance, and cross-agency collaboration. This session will examine how data infrastructure decisions shape government capabilities and public outcomes.

    Register to attend this online webinar.

    Register Now
    View Full Event  
  •  Location: 164 Angell StreetRoom: Carney Innovation Area, Floor 4

    Presenter: Frank Donnelly, Head of GIS and Data Services and the coordinator of GeoData@SciLi.

    “Data Rescue and Screenscraping with Python”

    Learn about the Data Rescue Project, an international volunteer effort led by librarians, archivists, non-profits, and data scientists who are downloading and preserving US federal government datasets. Discover how you can screen scrape and automate data downloads with Python.

    Level: Beginner

    Required Prior Knowledge: The first half of the session will be a presentation. Some basic programming experience is helpful if you would like to do the hands-on exercises in the second half.

    DSCoV (Data Science, Computation, and Visualization) workshops are lunchtime introductions to basic data science and programming skills and tools, offered by and for Brown staff, faculty, and students (with occasional presenters from outside Brown). The workshops are interactive, so bring a laptop. All are welcome.

    These workshops can be attended in person (Carney Innovation Hub, 164 Angell St, Floor 4) or on Zoom: https://brown.zoom.us/j/96158269691

    Food will not be served; please bring your own lunch.

    DSCoV Workshops
    View Full Event  
  •  Location: 164 Angell StreetRoom: Open Area

    The CCMB Day is an opportunity for Center for Computational Molecular Biology (CCMB) graduate students to learn about their peers’ research and to practice sharing their work with colleagues. Senior PhD students will give 15-minute presentations on their work.

    We hope this event will give new graduate students exposure to the type of work their colleagues do and foster collaboration and connection between our graduate students.

     

    Fall 2025 Presenters:

    Name: Cecile Meier-Scherling

    Advisor: Jeff Bailey and Lorin Crawford

    Presentation Title: Predicting Drug Resistance: Computational Modeling of Artemisinin Resistance in Africa

     

    Name: Jazeps Medina-Tretmanis

    Advisor: Emilia Huerta-Sanchez

    Presentation Title: TBA

     

    Name: Tuan Pham

    Advisor: Ritambhara Singh

    Presentation Title: TBA



    Name: Leah Darwin

    Advisor: David Rand

    Presentation Title: Selective response of mitochondrial and nuclear genomes to an OXPHOS inhibitor in experimental populations of Drosophila

    View Full Event