Fifth National Science Data Fabric (NSDF)
All-Hands Meeting “Maximizing Returns on Investment of the National Science Data Fabric: Driving Practical Impact and Sustainable Innovation”
San Diego, CA on May 5 - 7, 2025

The National Science Data Fabric (NSDF) is an NSF-supported pilot that connects an open network of institutions, including minority-serving institutions, by constructing a continuous integration platform for equitable data use and data-driven sciences.

For the fifth NSDF All-Hands Meeting, we gather new and existing members of the NSDF community in San Diego, CA to address challenges of data democratization. This year’s theme will be “Maximizing Returns on Investment of the National Science Data Fabric: Driving Practical Impact and Sustainable Innovation” and we will meet from May 5-7, 2025.



Attendees to the Fifth National Science Data Fabric (NSDF) AHM (2025).

KEYNOTE

Keynote Speaker: Miron Livny, director of the Center for High Throughput Computing (CHTC), the John P. Morgridge Professor of Computer Science and a Vilas Research Professor at the University of Wisconsin-Madison

Title: Is your Data Handler Trustworthy?

Abstract

The PATh project offers a variety of distributed data handling services. Researchers who place their workloads at PATh Access Points leverage these services to bring input data to their applications and to ship results back to archival sites. They also employ them to copy the applications, in most cases in the form of container images, to execution sites provided by the Open Science Pool (OSPool). Owners of Object Stores who federate datasets they host via the Open Science Data Federation (OSDF) use these services to make locally stored objects available to remote consumers for frequent reuse.

In the past 12 months PATh services have performed more than 3B Object transfers across more than 100 institutions. The distributed nature of the infrastructure requires objects to be copied across institution boundaries and to be buffered in memory or disk at multiple locations. These operations clearly introduce concerns about data integrity and access control. Do the researchers and Object providers who use these services believe that the PATh services are trustworthy?

The talk will review our ongoing effort to develop data handling services that we believe to be trustworthy. A key element of our current development work is transitioning our software tools and infrastructure into capability-based authorization. We hope that facilitating trust relationships through capabilities will help researchers and data providers to entrust their objects with the PATh services. Namely, to believe that the OSDF and the OSPool data handling services are trustworthy.

Bio

Miron Livny received a B.Sc. degree in Physics and Mathematics in 1975 from the Hebrew University and M.Sc. and Ph.D. degrees in Computer Science from the Weizmann Institute of Science in 1978 and 1984, respectively. Since 1983 he has been on the Computer Sciences Department faculty at the University of Wisconsin-Madison, where he is currently the John P. Morgridge Professor of Computer Science and a Vilas Research Professor. He serves as the director of the Center for High Throughput Computing (CHTC), is leading the HTCondor Software Suite effort and serves as the technical director of the OSG. He is a member of the scientific leadership team of the Morgridge Institute of Research where he leads the Research Computing theme.

Dr. Livny’s research focuses on distributed processing and object management systems and involves close collaboration with researchers from a wide spectrum of disciplines. He pioneered the area of High Throughput Computing (HTC) and developed frameworks and software tools that have been widely adopted by academic, government and commercial organizations around the world.

Livny is the recipient of the 2006 ACM SIGMOD Test of Time Award the 2013 HPDC Achievement Award the 2020 IEEE TCDP Outstanding Technical Achievement Award, 2020 IEEE TCDP High Impact Paper Award and the 2023 UW-Madison Vilas Research Professor award.

KEYNOTE

Keynote Speaker: Ryan Coffee

Title: Breaking the fourth wall: experimental introspection via audience participation.

Abstract

Science is undergoing a transformation; it is moving away from slow iteration of test, observe, report, and repeat. Given the rate of experimental phase space search that is becoming prevalent in today’s scientific detector systems, real-time adaptation is finding its way into accelerator controls and detector autonomy. This development raises an interesting question regarding the role of human intellect in real-time decision making for experimental campaigns of the not-so-distant future. As we survey a couple of current examples for some of the emerging and envisioned methods of stream processing along the stages of data acquisition, we will reveal some benefits and some dangers of incorporating domain specific analysis for autonomous information extraction. By envisioning some of the enabling technologies, we hope to reveal where magic should be avoided and mechanics adopted. We hope to use such technologies to break the fourth wall of observation, allowing scientists to become live participants in the exponential acceleration of our endeavor to innovate; live participation may even help us avoid the Theatre of the Absurd.

Bio

Ryan earned his Bachelor of Arts in Philosophy and Bachelor of Science in Physics from the University of Arkansas followed by a PhD in Atomic, Molecular and Optical (AMO) Physics from the University of Connecticut. He joined the PULSE Institute at Stanford/SLAC in 2006 and led the first laser pumped, x-ray probed experiment at the Linac Coherent Light Source (LCLS) in 2009. Since then, he has become Senior Research Scientist in PULSE and LCLS with an emphasis on AMO science and novel instrumentation and the requisite computational methods for streaming data processing at the sensor edge, in particular targeting the million frames per second LCLS-II.

In that context he has been a core member of the SLAC AI Initiative since its inception with particular emphasis on Machine Learning for real-time information extraction. With projects ranging from x-ray spectroscopy in molecules, ultrafast materials response, radiographic medical imaging, and tokamak plasma fusion, he has become an adamant proponent of data and model marketplaces for cross-domain innovation sharing with built in provenance and value tracking for an intelligent adaptive data and model retention.

Beyond SLAC, Ryan is driving an integrative approach to instrumentation and co-design of computing infrastructure across the portfolio of Department of Energy labs and facilities as well as technologies across the computing industry. From diagnostic and detector development to algorithms and AI accelerators, from the sensor Edge to Leadership Computing Facilities, he is leveraging his hobbies and his passions to drive the bleeding edge of basic science to address the emerging challenges of automation in industry and agriculture for a better future for his daughters.

SCHEDULE

You can find the program here

LOCATION

University of California, San Diego
San Diego Supercomputer Center
9836 Hopkins Dr, La Jolla, CA 92037

HOTELS

The meeting hotels include:

VENUE

SDSC is located on the UC San Diego Campus in La Jolla. Address: 9836 Hopkins Dr, La Jolla, CA 92037.

San Diego Supercomputer Center’s Auditorium E-B212 located on the ground floor of SDSC’s east entrance, just off the driveway on Hopkins Dr, close to the Hopkins Parking Structure, Northwest end of UC San Diego campus.

Google maps exact location

The San Diego International Airport (SAN) is the closest airport to UC San Diego and SDSC.

For driving directions see the visitors page on the SDSC website

TRANSPORTATION

VISITOR PARKING: PARKING PERMITS ARE REQUIRED TO PARK ON UC SAN DIEGO CAMPUS

  • Visiting UC faculty and staff who hold annual permits at their home campuses are eligible for up to 5 consecutive days of complimentary parking at UC San Diego. Visit this link for more details.
  • For non-UC visitors, please see more details here. Purchase permits at a very reasonable rate.
  • The most convenient parking location is the Hopkins parking structure located on Hopkins Dr and Voigt Dr, just south of SDSC.
  • Parking legally is the attendee’s responsibility. The penalty for an improperly parked car is at least $65 per day. We cannot be held responsible for citations issued for parking in an incorrect space or improperly displaying your permit. More information on Parking Citation Penalties

For cab or shuttle Pick-up/Drop-off: 9836 Hopkins Dr, La Jolla, CA 92037

SDSC is located just south of the RIMAC Arena.

Rideshare: Lyft or Uber Use “San Diego Supercomputer Center” as destination.

A taxi or transportation service can be used from the airport. Helpful link for taxi service in San Diego:

Public Transportation surrounding UC San Diego

EXPRESSION OF INTEREST IN PARTICIPATING

The meeting is an invitation only event. Still, we are looking for new members. Are you new to NSDF or interested in joining us? We are currently gathering expressions of interest to participate. Please complete the form by March 1, 2025, to let us know if you would like to attend.

Expressions of Interest Form: https://forms.gle/idjUT269RGTQkfZF7.

QUESTIONS

If you have any questions, please email Michael Sutherlin atmsuther5@utk.edu.




This material is based upon work supported by the National Science Foundation under Grant No. 2138811.

Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Copyright © 2024 National Science Data Fabric