HDF
Town Hall
ESIP Summer Meeting
July 9, 2013
4/4/2013

HDF Briefing to NASA

Changes in The HDF Group
• New Staff
• Earth Science Program Director (Habermann)
• Earth Science Project Manager (Plutchak)
• Project Management Office Coordinator
• Quality Engineer

7/9/2013

ESIP Summer 2013

Earth Science Program
• Director (Ted)
• Project manager (Joel)
• Earth Science Team: Ted Habermann, Larry Knox, Joe Lee, Joel Plutchak, Elena Pourmal, Kent Yang, Albert Cheng
• Projects: ESDIS HDF, JPSS HDF
• Work areas: Maintenance, QA; IDPS support; Tools and applications; High Level Libraries; Studies, Analyses; JPSS Tools; Operations Support; NASA Metadata; Outreach

Mailing lists and archives
• news@lists.hdfgroup.org
• http://hdfgroup.org/news/

• hdf-forum@lists.hdfgroup.org
• http://mail.hdfgroup.org/pipermail/hdfforum_hdfgroup.org/

• New mailing list for NASA DAACs
• hdf-nasa-daac@lists.hdfgroup.org

HDF Releases

Maintenance Releases 2012–2013
2012
• HDF4: 4.2.7, 4.2.8
• HDF5: 1.8.9, 1.8.10
• HDFJava: 2.9
• h4h5 tools: 2.2.1
2013
• HDF4: 4.2.9
• HDF5: 1.8.11, 1.8.12
• HDFJava: 2.10
• h4CF: 1.0 beta

HDF4 maintenance releases

HDF 4.2.9 (February 2013)
• Support for Mac OS X 10.8 with Intel and Clang compilers
• Support for Cygwin version 1.7.7 and higher

HDF5 maintenance releases

HDF5 1.8.10 (Nov 2012) and patch1 (Jan 2013)
• Interoperability between h5dump and h5import
• Performance improvements in h5diff for files with many attributes
• Support for I/O larger than 2 GB on Mac OS X

HDF5 maintenance releases

Future releases
• Request to support wide character filenames (MathWorks)
• Request to support UTF-32 encoding (h5py)
• Request to support parallel compression

New OSs and Compilers
• HDF software is now supported on:
• SunOS 5.11 (SPARC) with Studio 12 compilers
• CentOS 6 with GCC and Intel compilers
• Mac OS X 10.8.* with Clang and Fortran, Java 1.7
• Cygwin 1.7.7
• Windows 7 with VS 12 and Intel 13
• Windows 8 with VS 12 and Intel 13

Java maintenance releases
2.9 release (December 2012)
• Show groups/attributes in creation order
• Export data to a binary/ASCII file without having to open the object in the TableView
• Reload feature to close/open file
• Improvements for installation

Java maintenance releases
2.10 release (December 2013)
• 0- or 1-based indexing when displaying arrays
• Displaying long names of files (“…” in names)
• Ability to modify HDF4 compressed datasets
• Support for netCDF-4 files with VL attributes

Tools

HDF and netCDF interoperability tools
• HDF4/HDF-EOS2 to CF conversion toolkit - June
• HDF-EOS5 augmentation tool (maint) - Dec 2013
• HDF-EOS2 dumper tool (maint) - every other year
• HDF-EOS5 to netCDF-4 conversion tool (retired)
• HDF4 & HDF5 Handlers – May, to synchronize w/ Hyrax release

HDF Visualization tool assessment

• To evaluate The HDF Group’s data viewing tools and user needs, and to explore, recommend, and prioritize improvements.

Other activities

Prototype Studies

• Apache Open Source Incubator Pilot Project
• Digital Object Identifier (DOI) support in HDF5

HPC R&D
• HDF5 Virtual Object Layer
• Allows apps to store and access HDF5 objects in arbitrary storage methods and formats
• Allows HDF5 apps to migrate to future storage systems with no source code modifications

• HDF5: Asynchronous I/O
• Application doesn’t wait for I/O

• Fault Tolerance:
• Prevent crash from corrupting HDF5 file

• End-to-End Data Integrity:
• Verify integrity of data from birth to death of file

• I/O Autotuning
• Runtime framework that dynamically determines optimal application I/O strategy
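
The Virtual Object Layer idea above (same application code, interchangeable storage underneath) can be illustrated with a small conceptual sketch. This is not the HDF5 VOL API: `StorageBackend`, `MemoryBackend`, `JsonFileBackend`, and `application` are hypothetical names invented for illustration, and the sketch uses only the Python standard library.

```python
import json
import os
import tempfile
from abc import ABC, abstractmethod


class StorageBackend(ABC):
    """Hypothetical plugin interface: decides how named objects are stored."""

    @abstractmethod
    def put(self, name: str, values: list) -> None: ...

    @abstractmethod
    def get(self, name: str) -> list: ...


class MemoryBackend(StorageBackend):
    """Keeps objects in a dictionary (stands in for one storage method)."""

    def __init__(self):
        self._objects = {}

    def put(self, name, values):
        self._objects[name] = list(values)

    def get(self, name):
        return list(self._objects[name])


class JsonFileBackend(StorageBackend):
    """Stores each object as a JSON file (stands in for another format)."""

    def __init__(self, directory):
        self._dir = directory

    def _path(self, name):
        # Flatten the object path into a single file name.
        return os.path.join(self._dir, name.strip("/").replace("/", "__") + ".json")

    def put(self, name, values):
        with open(self._path(name), "w") as f:
            json.dump(list(values), f)

    def get(self, name):
        with open(self._path(name)) as f:
            return json.load(f)


def application(store: StorageBackend) -> float:
    """Application code: identical regardless of the backend passed in."""
    store.put("/run1/temperature", [280.5, 281.0, 279.5])
    data = store.get("/run1/temperature")
    return sum(data) / len(data)


# The same application runs unmodified against both backends.
mem_result = application(MemoryBackend())
with tempfile.TemporaryDirectory() as tmp:
    file_result = application(JsonFileBackend(tmp))
```

Only the object passed to `application` changes between the two runs, which is the property the slide describes as migrating "with no source code modifications".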

Parallel I/O and Analysis of a Trillion Particle VPIC Simulation
• Problem: Support I/O and analysis needs for state-of-the-art plasma physics code

• Novel Accomplishments:
• Ran trillion-particle VPIC simulation on 120,000 Hopper cores and generated a 350 TB dataset
• Parallel HDF5 obtained peak 35 GB/s I/O rate and 80% sustained bandwidth
• Developed hybrid parallel FastQuery using FastBit to utilize multicore hardware
• FastQuery took 10 minutes to index and 3 seconds to query energetic particles
• SC12 paper, XLDB 2012 poster

Figure: I/O bandwidth utilization for parallel writes (blue) with HDF5 on 120,000 cores

• CS Impact
• Demonstrated software scalability for writing and analyzing ~40 TB HDF5 files
• Enabled novel discoveries in plasma physics (next slide)

Figure: A comparison of indexing (top table) and query times (bottom) for hybrid and MPI-FastQuery

Science Impact: Multiple Scientific Discoveries in Plasma Physics
• Preferential acceleration along magnetic field
• Discovered power-law in energy spectrum
• Energetic particles are correlated with flux ropes
• Discovered agyrotropy near the reconnection hot-spot

Other projects of interest
• ITER – International fusion research project
• Architecture for HDF5 for ITER data life cycle

• Particle accelerators and instrument vendors
• Faster I/O for compressed data
• Let apps send pre-compressed chunks directly to file.

• Dynamic filter loading in HDF5
• Let apps read data compressed with a non-standard filter.

• SWMR
• Single Writer/Multiple Readers
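
The "pre-compressed chunks" item above can be sketched conceptually as follows. This toy example does not use the HDF5 library: `ChunkStore`, `write`, `write_precompressed`, and `read` are hypothetical names, and deflate via Python's `zlib` is assumed as the compression filter. It only shows why the direct path is faster: the container skips its own compression step when the application already holds compressed bytes.

```python
import struct
import zlib


class ChunkStore:
    """Toy chunked container: maps chunk index -> deflate-compressed bytes."""

    def __init__(self):
        self._chunks = {}
        self.compress_calls = 0  # how often the container itself compressed

    def write(self, index, values):
        """Regular path: the container packs and compresses the chunk."""
        raw = struct.pack(f"<{len(values)}d", *values)
        self.compress_calls += 1
        self._chunks[index] = zlib.compress(raw)

    def write_precompressed(self, index, blob):
        """Direct path: store the bytes as given, skipping compression."""
        self._chunks[index] = bytes(blob)

    def read(self, index):
        """Readers decompress every chunk the same way on both paths."""
        raw = zlib.decompress(self._chunks[index])
        return list(struct.unpack(f"<{len(raw) // 8}d", raw))


store = ChunkStore()
store.write(0, [1.0, 2.0, 3.0, 4.0])

# Assumption: a data source (e.g. an instrument) that already emits
# deflate-compressed chunks of little-endian doubles.
blob = zlib.compress(struct.pack("<4d", 5.0, 6.0, 7.0, 8.0))
store.write_precompressed(1, blob)

chunk0 = store.read(0)
chunk1 = store.read(1)
```

Both chunks read back identically, but only the first one cost the container a compression call. The direct path relies on the application producing bytes in exactly the format the reader's filter expects.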

Other projects of interest
• Digital Twin
• “Digital Twin integrates ultra-high fidelity simulation with the vehicle’s on-board integrated vehicle health management system, maintenance history and all available historical and fleet data to mirror the life of its flying twin and enable unprecedented levels of safety and reliability.”

Thanks


HDF Town Hall

  • 1. HDF Town Hall ESIP Summer Meeting July 9, 2013
  • 3. Changes in The HDF Group • New Staff • • • • 7/9/2013 Earth Science program Director (Habermann) Earth Science Project Manager (Plutchak) Project Management Office Coordinator Quality Engineer ESIP Summer 2013 3
  • 4. Earth Science Program Director (Ted) Project manager (Joel) Earth Science Team ESDIS HDF JPSS HDF Maintenance, QA IDPS support Tools and applications Ted Habermann Larry Knox Joe Lee Joel Plutchak Elena Pourmal Kent Yang Albert Cheng High Level Libraries Studies, Analyses 7/9/2013 JPSS Tools Operations Support NASA Metadata Outreach ESIP Summer 2013 4
  • 5. Mailing lists and archives • news@lists.hdfgroup.org • http://hdfgroup.org/news/ • hdf-forum@lists.hdfgroup.org • http://mail.hdfgroup.org/pipermail/hdfforum_hdfgroup.org/ • New mailing for NASA DAACs • hdf-nasa-daac@lists.hdfgroup.org 7/9/2013 ESIP Summer 2013 5
  • 7. Maintenance Releases 2012–2013 2012 HDF4 Jan Feb Mar Apr May Jun Jul 4.2.7 HDF5 Aug Sep Oct Nov 4.2.8 1.8.9 1.8.10 HDFJava h4h5 tools 2013 HDF4 HDF5 2.9 2.2.1 Jan Feb Mar Apr May Jun Jul Aug Sep Oct 7/9/2013 Nov Dec 4.2.9 1.8.11 1.8.12 HDFJava h4CF Dec 2.10 1.0 beta ESIP Summer 2013 7
  • 8. HDF4 maintenance releases HDF 4.2.9 (February 2013) • Support for Mac 10.8 with Intel and Clang compilers • Support for Cygwin version 1.7.7 and higher 7/9/2013 ESIP Summer 2013 8
  • 9. HDF5 maintenance releases HDF5 1.8.10 (Nov 2012) and patch1 (Jan 2013) • Interoperability between h5dump and h5import • Performance improvements in h5diff for the files with many attributes • Support for I/O bigger than 2GB on Mac OS X 7/9/2013 ESIP Summer 2013 9
  • 10. HDF5 maintenance releases Future releases • Request to support wide character filenames (MathWorks) • Request to support UTF-32 encoding (H5Py) • Request to support parallel compression 7/9/2013 ESIP Summer 2013 10
  • 11. New OSs and Compilers
      • HDF software is now supported on
          • SunOS 5.11 (Sparc) with Studio 12 compilers
          • CentOS 6 with GCC and Intel compilers
          • Mac OS X 10.8.* with Clang and Fortran, Java 1.7 Cygwin 1.7.7
          • Windows 7 with VS 12 and Intel 13
          • Windows 8 with VS 12 and Intel 13
  • 12. Java maintenance releases: 2.9 release (December 2012)
      • Show groups/attributes in creation order
      • Export data to a binary/ASCII file without having to open the object in the TableView
      • Reload feature to close/open file
      • Improvements for installation
  • 13. Java maintenance releases: 2.10 release (December 2013)
      • 0 or 1-based indexing when displaying arrays
      • Displaying long names of files (“…” in names)
      • Ability to modify HDF4 compressed dataset
      • Support netCDF-4 files with VL attributes
  • 15. HDF and netCDF interoperability tools
      • HDF4/HDF-EOS2 to CF conversion toolkit - June
      • HDF-EOS5 augmentation tool (maint) - Dec 2013
      • HDF-EOS2 dumper tool (maint) - every other year
      • HDF-EOS5 to netCDF-4 conversion tool (retired)
      • HDF4 & HDF5 Handlers – May, to synchronize w/ Hyrax release
  • 16. HDF Visualization tool assessment
      • To evaluate the HDF Group’s data viewing tools and user needs, and to explore, recommend, and prioritize improvements.
  • 18. Prototype Studies
      • Apache Open Source Incubator Pilot Project
      • Digital Object Identifier (DOI) support in HDF5
  • 19. HPC R&D
      • HDF5 Virtual Object Layer
          • Allows apps to store and access HDF5 objects in arbitrary storage methods and formats
          • Allows HDF5 apps to migrate to future storage systems with no source code modifications
      • HDF5: Asynchronous I/O
          • Application doesn’t wait for I/O
      • Fault Tolerance:
          • Prevent crash from corrupting HDF5 file
      • End-to-End Data Integrity:
          • Verify integrity of data from birth to death of file
      • I/O Autotuning
          • Runtime framework that dynamically determines optimal application I/O strategy
  • 20. Parallel I/O and Analysis of a Trillion Particle VPIC Simulation
      • Problem: Support I/O and analysis needs for state-of-the-art plasma physics code
      • Novel Accomplishments:
          • Ran trillion-particle VPIC simulation on 120,000 Hopper cores and generated a 350 TB dataset
          • Parallel HDF5 obtained peak 35GB/s I/O rate and 80% sustained bandwidth
          • Developed hybrid parallel FastQuery using FastBit to utilize multicore hardware
          • FastQuery took 10 minutes to index and 3 seconds to query energetic particles
          • SC12 paper, XLDB 2012 poster
      • CS Impact
          • Demonstrated software scalability for writing and analyzing ~40TB HDF5 files
          • Enabled novel discoveries in plasma physics (next slide)
      • [Figure: I/O bandwidth utilization for parallel writes (blue) with HDF5 on 120,000 cores]
      • [Figure: A comparison of indexing (top table) and query times (bottom) for hybrid and MPI-FastQuery]
  • 21. Science Impact: Multiple Scientific Discoveries in Plasma Physics
      • Preferential acceleration along magnetic field
      • Discovered power-law in energy spectrum
      • Energetic particles are correlated with flux ropes
      • Discovered agyrotropy near the reconnection hot-spot
  • 22. Other projects of interest
      • ITER – International fusion research project
          • Architecture for HDF5 for ITER data life cycle
      • Particle accelerators and instrument vendors
          • Faster I/O for compressed data
              • Let apps send pre-compressed chunks directly to file.
          • Dynamic filter loading in HDF5
              • Let apps read data compressed with non-standard filter.
          • SWMR
              • Single Writer/Multiple Readers
  • 23. Other projects of interest
      • Digital Twin
          • “Digital Twin integrates ultra-high fidelity simulation with the vehicle’s on-board integrated vehicle health management system, maintenance history and all available historical and fleet data to mirror the life of its flying twin and enable unprecedented levels of safety and reliability.”

Editor's Notes

  1. Joe will take care of it.
  2. Release notes:
     - HDF5 1.8.7 – 1.8.9: Fortran 2003 support, support for Fortran dimension scales
     - HDF4 releases in support of the H4 mapping project
     - Support for Powerpc64 platform (big-endian)
     - Java: addressed all ESDIS requests; based on the latest available HDF4 and HDF5
     - H4h5tools: updated to 18 APIs, no 18 features were added
  3. Up to here Elena fixes. Add QA person.
  4. Joe moved this slide after maintenance plan.
  5. Java HDF4.
  6. Java HDF4.
  7. Does this belong to Goal #5?
  8. HDFView is more than 10 years old. Since it was first implemented, new technologies and techniques have emerged that could help improve HDFView. We surveyed HDFView users last year, and a lot of good ideas came out of that. We will not just look at Java, but also at alternatives such as Qt. This is an internally funded project led by Cao, Heber, Readey (Amazon). This group will:
     - Review our vision for vis tools and how they are aligned with our mission.
     - Review company goals as regards support for vis tools.
     - Identify needs and opportunities based on current and potential customers and their needs and desires.
     - Review technologies and tools currently available that can help us develop new tools if needed, how the new tools compare with current HDF tools, and what they might offer in terms of improvements.
     - Develop a set of guiding principles for going forward.
     - Recommend activities, perhaps leading to a roadmap to long-term goals for the visualization tool(s).
  9. The slide highlights recent accomplishments from the ExaHDF5 project funded by the DOE/ASCR Exascale Scientific Data Management award.
     1) Parallel I/O with HDF5
     - We ran a trillion-particle simulation on 120K cores on Hopper. The code produced 30 TB of particle data per timestep and over 350 TB of data in total.
     - To the best of our knowledge, this is the first time that anyone has demonstrated writes to a single, shared 30 TB HDF5 file.
     - We hit peak I/O rates on Hopper (~35GB/s) during the run and sustained an average of ~23GB/s, which is a new record for parallel HDF5 performance.
     2) FastBit-based analysis
     - We developed a novel hybrid parallel version of FastBit to do the indexing/querying on the dataset.
     - This was the first time that we used FastBit and FastQuery to index and query a dataset with a trillion entries.
     - We were able to index the dataset in 10 minutes and query the dataset in 3 seconds.
     DOE researchers: Prabhat (PI), Suren Byna, Oliver Rubel and John Wu (LBNL). Scientific collaborators: Homa Karimabadi (UCSD), Vadim Roytershteyn (UCSD) and Bill Daughton (LANL). The simulation code used in the study is VPIC, developed at LANL. Please address any questions to Prabhat (prabhat@lbl.gov).
  10. 3) Scientific insights
     - This is the first time that our science collaborators have been able to examine the trillion-particle dataset. Earlier, they had largely ignored the particle data or looked at a coarse-grained version.
     - Our collaborators discovered a power-law distribution in the energy spectrum of the particles. This is the first kinetic plasma physics simulation to demonstrate a power-law distribution; our analysis capabilities directly facilitated this discovery.
     - Our collaborators had made a number of conjectures and hypotheses regarding the interplay between particles and the magnetic fields and the multi-dimensional phase-space distribution of particles. Using these new tools, they were able to confirm these hypotheses quantitatively. More specifically, the scientists found:
         - a preferential acceleration of particles in a direction parallel to the magnetic field
         - a predominant distribution of energetic particles in the current sheet, suggesting that flux ropes can confine these particles
         - an agyrotropic (asymmetric) distribution of particles near the magnetic reconnection event
  11. Kent needs to update this.
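
As a rough sanity check on the figures quoted in note 9 (a 30 TB file per timestep, ~35GB/s peak and ~23GB/s sustained), the time to write one timestep can be estimated with a few lines of Python. This is only a back-of-envelope sketch: the function name `write_minutes` is made up for illustration, and decimal units (1 TB = 1000 GB) are an assumption, since the notes do not say which convention was used.

```python
# Back-of-envelope estimate from the numbers quoted in note 9.
# Assumption (not stated in the notes): decimal units, 1 TB = 1000 GB.

FILE_SIZE_TB = 30      # particle data written per timestep
PEAK_GB_S = 35         # approximate peak I/O rate on Hopper
SUSTAINED_GB_S = 23    # approximate average sustained I/O rate


def write_minutes(size_tb: float, rate_gb_per_s: float) -> float:
    """Return the minutes needed to write size_tb terabytes at rate_gb_per_s GB/s."""
    return size_tb * 1000 / rate_gb_per_s / 60


if __name__ == "__main__":
    # One 30 TB timestep at the sustained rate: about 21.7 minutes.
    print(f"sustained: {write_minutes(FILE_SIZE_TB, SUSTAINED_GB_S):.1f} min")
    # The same file at the peak rate: about 14.3 minutes.
    print(f"peak:      {write_minutes(FILE_SIZE_TB, PEAK_GB_S):.1f} min")
```

Under these assumptions, each 30 TB timestep takes on the order of 20 minutes of wall-clock time to write at the sustained rate.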