Science Mesh in High Energy Physics and Endangered Linguistics - Open Data Systems & Data Science Environments

Science Mesh in High Energy Physics and Endangered Linguistics - Open Data Systems & Data Science Environments

The first CS3MESH4EOSC webinar, entitled “Science Mesh in High Energy Physics and Endangered Linguistics - Open Data Systems & Data Science Environments” took place on 6th May from 11:00 am-12:00 pm CEST.  The webinar was showcasing how the Science Mesh is integrating both data science environments and open data repositories into the federated Science Mesh, supporting collaboration of distributed science teams across disciplines. Exemplary benefits of this integration in Humanities and High Energy Physics fields were presented by real use-cases that are already getting societal and scientific benefits from those Science Mesh’s features.

The Science Mesh is a rich ecosystem for frictionless scientific collaboration and access to research services. It aims to be an interoperable platform to easily sync & share, and deploy applications and software components within the full CS3 community to extend functionalities of the service.  This will offer researchers opportunities to assemble an efficient, reliable, collaborative and transparent research tool chain, while retaining control over their datasets, while becoming FAIR compatible.

 

Watch the recorded video.

 

Target Audience

  • Researchers, data curators and analysts aiming to assembling an efficient, reliable, collaborative and transparent research tool chain, to more easily articulate the impact of their research in the upcoming altimetrics policy environment;
  • Institutional operators and services looking forward to using an easier long-term data custodianship and seamless access the functionality or the data they require in a transparent way, fully integrated with the in-house services;
  • Commercial software developers that aim to develop new product based on complex, large scale data in virtually all sectors;
  • Non-commercial software developers looking for an easier time writing research workflow code against a standardised API to access metadata;
  • Policy makers & citizens to get to know standarised tools that facilitate collaboration of citizen scientists and Citizen Observatories, as well as make make FAIR data relevant in the day to day science practice of all researchers.

Agenda

  • 5minWelcome and overview of CS3MESH4EOSC (Jakub Moscicki - CS3MESH4EOSC coordinator and Deputy Group Leader for Storage at CERN) (Presentation)
  • 5minScience Mesh: Unlocking scientific collaboration through technology (Pedro - Technical Coordinator of CS3MESH4EOSC and Product Manager at CERN) (Presentation)
  • 5minScience Mesh core applications - the federated layer (Maciej Brzezniak - Storage and Cloud Systems Architect at PSNC) (Presentation)
  • 12minCollaborative data science: High Energy Physics and beyond (Marcin Sieprawski - Head of Big Data Lab at Software Mind) (Presentation)
  • 12minGuarantee long-term usefulness thanks to metadata: Endangered Linguistics with PARADISEC (Guido Aben - director of eInfrastructure Partnerships at AARNet) (Presentation)
  • 10minQuestions and Answers
  • 5minWhat's next & Closing Remarks (Jakub Moscicki - CS3MESH4EOSC coordinator and Deputy Group Leader for Storage at CERN)

Speakers

Jakub Moscicki - Deputy Group Leader for Storage at CERN

Jakub T. Mościcki is a researcher and software engineer at CERN, Geneva, Switzerland. He obtained the PhD in Computer Science from the University of Amsterdam (UvA). He was involved in designing and developing frameworks for supporting distributed computing for the LHC experiments as well for multidisciplinary applications in theoretical physics, medical and radiation studies, bio-informatics, drug design and telecommunications. He now works in the Data Storage Services group which designs and operates large-scale disk and tape storage for LHC physics data as well as the general purpose filesystems services such as AFS. Jakub is also the coordinator of CS3MESH4EOSC project.

Pedro Ferreira - Product Manager at CERN

Pedro holds a Master's degree in Software Engineering from the University of Porto in Portugal. He joined CERN in 2007, where he led, from 2014 to 2020, the Indico Project, an Open Source event management system which is used to host more than 800.000 events at CERN alone and is trusted by more than 250 other institutions world wide. Since 2020, he is the Technical Coordinator and Leader of Work Package 1 of the CS3MESH4EOSC Project.

 

Maciej Brzezniak - Storage and Cloud Systems Architect at PSNC

Maciej Brzeźniak leads the Data and Storage Management Group @PSNC. He is a storage and cloud computing systems architect, senior systems administrator. His interests include scalable data management and storage technologies, storage systems performance and data security. He co-led development of the National Data Storage system and its deployment the production long-term storage service in Poland. In CS3MESH4EOSC project Maciej, on behalf of PSNC, coordinates WP4 (Users and applications) and the tasks on Common application requirements and deployment mechanisms as well as the sync & share service integration task with WP3 (Interoperability platform).

Marcin Sieprawski - Head of Big Data Lab at Software Mind

Experienced System Architect and R&D Project Manager with 20+ years of enterprise software design and development. Founder and leader of Big Data Lab, focused on R&D, Data Science, data driven innovation and high quality agile software development. His current focus in Big Data innovation is state of the art solution for management of geo-located data and low-latency services, including GPU-based acceleration of geospatial indexes.

Guido Aben - AARNet eInfrastructure Partnerships Director

Guido Aben is AARNet’s director of eInfrastructure partnerships. He holds an MSc in physics from Utrecht University. In his current role at AARNet, he is responsible for building partnerships and fostering collaborations between like-minded entities in international eScience and eInfrastructure, all in order to build services to researchers’ requirements. His current prime interest is in federated large data collaboration systems, but he’s always on the lookout for new diamonds in the rough.