SlideShare a Scribd company logo
1 of 1
Download to read offline
Open Chemistry: Realizing Open Data, Open Standards and Open Source
                                                                                                                         Marcus D. Hanwell, Kyle Lutz, David Lonie, Chris Harris, and David Cole
                                                                                                               Website: http://openchemistry.org/                     Email: marcus.hanwell@kitware.com, kyle.lutz@kitware.com
                                                                                                                             Scientific Computing, Kitware, Inc, 28 Corporate Drive, Clifton Park, NY 12065.


                                                Avogadro                                                                                                            Open Chemistry                                                                                        Chemical Data Explorer
The Avogadro project is a cross-platform, open-source approach to building chemical                                       The Open Chemistry project is developing a suite of applications and support libraries                         The Chemical Data Explorer is an cross-platform, open-source application that
structures. It uses external simulation packages in addition to integrated analysis and                                   to improve the workflow in computational chemistry, biology, materials science and                              builds on the capabilities of the Visualization Toolkit, Qt and MongoDB. It can
visualization routines. The work presented here illustrates a workflow for quantum                                         related areas. A set of open, connected components that can tackle small problems                              connect to a local or remote database, ingest new data from various sources and
mechanical calculations, allowing the preparation of chemical structures, rough                                           on the desktop, and big research projects requiring significant time on the world’s top                         make that data semantically rich. It can apply informatics techniques to the data
optimization, and subsequent calculation of electron density isosurfaces, molecular                                       supercomputers.                                                                                                it contains to search for structures with particular properties. Work is ongoing to
orbitals, etc.                                                                                                                                                                                                                           more tightly integrate computational job storage and search.
                                                                                                                                               Log File                                                          Input File
                                                                                                                                                                             Simulation



                                                                                                                                                Results                      Informatics                     Job Submission




                                                                                                                                                                          HPC integration


                                                                                                                                                    Local                                                      Cloud
                                                                                                                                                                            Supercomputer



                                                                                                                            Figure 5: The workflow that the Open Chemistry components are being developed for.



  Figure 1: Avogadro application (left), ray-traced molecule (center) and the periodic table widget (right).                                                              OpenQube
Avogadro allows the user to prepare jobs for quantum packages, such as NWChem,                                            OpenQube is a small, open-source C++ library that reads key quantum data from                                    Figure 3: The user interface showing a query and structures (top-left), a scatter plot matrix (top-right), scatter
GAMESS, Gaussian and Q-Chem. Due to the plugin-based nature of the Avogadro                                               calculations produced by codes such as NWChem, GAMESS and Gaussian. It can                                       plot with tooltip (bottom-left), and K-means clustering (bottom-right).
project, many specialized functions can be added for a large range of applications,                                       read in basis sets, eigenvectors and density matrices, and calculate the magnitude
such as molecular docking, surface modeling and electronic structure.                                                     of the molecular orbitals and electron density on regularly-spaced grids. The data
                                                                                                                          produced can be used for further analysis and visualization of electronic structure.
                                                                                                                                                                                                                                                             Visualization Toolkit and ParaView
                                                                                                                                                                                                                                         The Visualization Toolkit (VTK) is an open-source, C++ toolkit for 2D and
                                              MoleQueue                                                                                                                    Chemkit                                                       3D graphics, volume rendering, image processing, visualization and modeling.
The MoleQueue application provides a graphical interface that integrates high-                                                                                                                                                           Development began in 1993, and it now has a large community of developers
                                                                                                                          Chemkit is an open-source, C++ library for molecular modeling, cheminformatics,
performance computing (HPC) resources on the desktop. It offers a seamless                                                                                                                                                                distributed around the world in a diverse set of fields. VTK processes data using
                                                                                                                          and molecular visualization. It features a modular, plugin-based architecture and
integration layer for applications, such as Avogadro, to submit jobs to local and                                                                                                                                                        a data flow graph (pipeline) in which each algorithm takes zero or more inputs
                                                                                                                          includes over 40 plugins that implement 15 file formats, 6 line formats, 4 force-fields,
remote computational resources. Job lifetime is managed by MoleQueue, and results                                                                                                                                                        and produces zero or more outputs. VTK is scalable to large data because it has
                                                                                                                          2 partial charge models, 2 aromaticity models, 8 atom typers and 30 molecular
can be opened in any external program.                                                                                                                                                                                                   distributed algorithms that use MPI to execute on large computing clusters.
                                                                                                                          descriptors. In addition, Chemkit includes an integrated visualization library built
                                                                                                                          on OpenGL/Qt, with Python bindings for easy scripting.




                                                                                                                                                                                                                                           Figure 4: Volume rendered molecular orbital with sliced contour (left), and library dependency graph (right).


                                                                                                                            Figure 6: Cartoon rendering of protein (left), surface rendering (center), and molecule rendering (right).
                                                                                                                                                                                                                                         ParaView is an open-source, cross-platform data analysis and visualization
                                                                                                                                                                                                                                         application. It is one of the flagship open-source projects developed by Kitware,
  Figure 2: The MoleQueue program configuration dialog for a PBS remote system.
                                                                                                                                                                  Software Process                                                       building on VTK and Qt to provide a client-server application that allows users
• Graphical configuration of queues and programs                                                                           These projects are open-source, targeting multiple platforms and architectures. A                              to quickly build visualizations to analyze their data. ParaView was developed to
                                                                                                                          quality-inducing software process is employed using best-of-breed technologies such                            analyze extremely large data sets using distributed memory computing resources.
• Support for Sun Grid Engine, PBS and running calculations locally
                                                                                                                          as Git for distributed version control, Gerrit for code review, CMake for cross-                               It can be used interactively with the cross-platform GUI, or scripted from Python.
• JSON-RPC protocol for interprocess communication over local sockets or ZeroMQ                                                                                                                                                          VTK and ParaView are being augmented with additional functionality for chemistry
                                                                                                                          platform building, CTest for unit/regression testing and CDash for software quality
• C++ and Python client libraries                                                                                         feedback. Most code is BSD licensed, and designed with reuse in mind.                                          through projects such as the Google Summer of Code and Open Chemistry.

More Related Content

What's hot

Evolution of database access technologies in Java-based software projects
Evolution of database access technologies in Java-based software projectsEvolution of database access technologies in Java-based software projects
Evolution of database access technologies in Java-based software projectsTom Mens
 
LDV: Light-weight Database Virtualization
LDV: Light-weight Database VirtualizationLDV: Light-weight Database Virtualization
LDV: Light-weight Database VirtualizationTanu Malik
 
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryMarcus Hanwell
 
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...University of California, San Diego
 
PTU: Using Provenance for Repeatability
PTU: Using Provenance for RepeatabilityPTU: Using Provenance for Repeatability
PTU: Using Provenance for RepeatabilityTanu Malik
 
GEN: A Database Interface Generator for HPC Programs
GEN: A Database Interface Generator for HPC ProgramsGEN: A Database Interface Generator for HPC Programs
GEN: A Database Interface Generator for HPC ProgramsTanu Malik
 
Ipaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, IanIpaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, IanBoris Glavic
 
ExSchema - ICSM'13
ExSchema - ICSM'13ExSchema - ICSM'13
ExSchema - ICSM'13jccastrejon
 
GlobusWorld 2015
GlobusWorld 2015GlobusWorld 2015
GlobusWorld 2015Tanu Malik
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014aceas13tern
 
Reproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter NotebookReproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter NotebookKeiichiro Ono
 
07 data structures_and_representations
07 data structures_and_representations07 data structures_and_representations
07 data structures_and_representationsMarco Quartulli
 
GeoDataspace: Simplifying Data Management Tasks with Globus
GeoDataspace: Simplifying Data Management Tasks with GlobusGeoDataspace: Simplifying Data Management Tasks with Globus
GeoDataspace: Simplifying Data Management Tasks with GlobusTanu Malik
 
Volunteer Computing using BOINC
Volunteer Computing using BOINCVolunteer Computing using BOINC
Volunteer Computing using BOINCPooyan Mehrparvar
 
Open-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKitOpen-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKitGreg Landrum
 
EUDAT Generic Execution Framework
EUDAT Generic Execution FrameworkEUDAT Generic Execution Framework
EUDAT Generic Execution FrameworkEUDAT
 
ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...
ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...
ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...Michela Taufer
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Valery Tkachenko
 

What's hot (20)

Evolution of database access technologies in Java-based software projects
Evolution of database access technologies in Java-based software projectsEvolution of database access technologies in Java-based software projects
Evolution of database access technologies in Java-based software projects
 
LDV: Light-weight Database Virtualization
LDV: Light-weight Database VirtualizationLDV: Light-weight Database Virtualization
LDV: Light-weight Database Virtualization
 
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
 
Bioinformatics on Azure
Bioinformatics on AzureBioinformatics on Azure
Bioinformatics on Azure
 
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...The Materials Project Ecosystem - A Complete Software and Data Platform for M...
The Materials Project Ecosystem - A Complete Software and Data Platform for M...
 
PTU: Using Provenance for Repeatability
PTU: Using Provenance for RepeatabilityPTU: Using Provenance for Repeatability
PTU: Using Provenance for Repeatability
 
GEN: A Database Interface Generator for HPC Programs
GEN: A Database Interface Generator for HPC ProgramsGEN: A Database Interface Generator for HPC Programs
GEN: A Database Interface Generator for HPC Programs
 
Ipaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, IanIpaw14 presentation Quan, Tanu, Ian
Ipaw14 presentation Quan, Tanu, Ian
 
ExSchema - ICSM'13
ExSchema - ICSM'13ExSchema - ICSM'13
ExSchema - ICSM'13
 
GlobusWorld 2015
GlobusWorld 2015GlobusWorld 2015
GlobusWorld 2015
 
Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014Tim Pugh-SPEDDEXES 2014
Tim Pugh-SPEDDEXES 2014
 
Reproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter NotebookReproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter Notebook
 
07 data structures_and_representations
07 data structures_and_representations07 data structures_and_representations
07 data structures_and_representations
 
GeoDataspace: Simplifying Data Management Tasks with Globus
GeoDataspace: Simplifying Data Management Tasks with GlobusGeoDataspace: Simplifying Data Management Tasks with Globus
GeoDataspace: Simplifying Data Management Tasks with Globus
 
Volunteer Computing using BOINC
Volunteer Computing using BOINCVolunteer Computing using BOINC
Volunteer Computing using BOINC
 
Open-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKitOpen-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKit
 
Cluster Computing Web2 Sept2009
Cluster Computing Web2 Sept2009Cluster Computing Web2 Sept2009
Cluster Computing Web2 Sept2009
 
EUDAT Generic Execution Framework
EUDAT Generic Execution FrameworkEUDAT Generic Execution Framework
EUDAT Generic Execution Framework
 
ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...
ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...
ExSciTecH: Expanding Volunteer Computing to Explore Science, Technology, and ...
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...
 

Similar to Open Chemistry: Realizing Open Data, Open Standards, and Open Source

Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Anubhav Jain
 
IRJET- A Workflow Management System for Scalable Data Mining on Clouds
IRJET- A Workflow Management System for Scalable Data Mining on CloudsIRJET- A Workflow Management System for Scalable Data Mining on Clouds
IRJET- A Workflow Management System for Scalable Data Mining on CloudsIRJET Journal
 
Elastic r sc10-tutorial
Elastic r sc10-tutorialElastic r sc10-tutorial
Elastic r sc10-tutorialArden Chan
 
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors IndiaBEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors IndiaTutors India
 
2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...
2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...
2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...SBGC
 
SOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITS
SOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITSSOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITS
SOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITSvivatechijri
 
Implementing load balancing algorithm in middleware system of volunteer cloud...
Implementing load balancing algorithm in middleware system of volunteer cloud...Implementing load balancing algorithm in middleware system of volunteer cloud...
Implementing load balancing algorithm in middleware system of volunteer cloud...Gargee Hiray
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningAnubhav Jain
 
An Efficient Hardware Implementation of Canny Edge Detection Algorithm
An Efficient Hardware Implementation of Canny Edge Detection AlgorithmAn Efficient Hardware Implementation of Canny Edge Detection Algorithm
An Efficient Hardware Implementation of Canny Edge Detection Algorithmijtsrd
 
OpenACC and Open Hackathons Monthly Highlights May 2023.pdf
OpenACC and Open Hackathons Monthly Highlights May  2023.pdfOpenACC and Open Hackathons Monthly Highlights May  2023.pdf
OpenACC and Open Hackathons Monthly Highlights May 2023.pdfOpenACC
 
OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019OpenACC
 
research Paper face recognition attendance system
research Paper face recognition attendance systemresearch Paper face recognition attendance system
research Paper face recognition attendance systemAnkitRao82
 
Between Simulator and Prototype: Crossover Architecture for Testing and Demon...
Between Simulator and Prototype: Crossover Architecture for Testing and Demon...Between Simulator and Prototype: Crossover Architecture for Testing and Demon...
Between Simulator and Prototype: Crossover Architecture for Testing and Demon...Jaroslaw Domaszewicz
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsIRJET Journal
 
A Review: Metaheuristic Technique in Cloud Computing
A Review: Metaheuristic Technique in Cloud ComputingA Review: Metaheuristic Technique in Cloud Computing
A Review: Metaheuristic Technique in Cloud ComputingIRJET Journal
 
Thesies_Cheng_Guo_2015_fina_signed
Thesies_Cheng_Guo_2015_fina_signedThesies_Cheng_Guo_2015_fina_signed
Thesies_Cheng_Guo_2015_fina_signedCheng Guo
 

Similar to Open Chemistry: Realizing Open Data, Open Standards, and Open Source (20)

Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...Software Tools, Methods and Applications of Machine Learning in Functional Ma...
Software Tools, Methods and Applications of Machine Learning in Functional Ma...
 
IRJET- A Workflow Management System for Scalable Data Mining on Clouds
IRJET- A Workflow Management System for Scalable Data Mining on CloudsIRJET- A Workflow Management System for Scalable Data Mining on Clouds
IRJET- A Workflow Management System for Scalable Data Mining on Clouds
 
Elastic r sc10-tutorial
Elastic r sc10-tutorialElastic r sc10-tutorial
Elastic r sc10-tutorial
 
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors IndiaBEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
BEST IMAGE PROCESSING TOOLS TO EXPECT in 2023 – Tutors India
 
2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...
2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...
2012 ieee projects software engineering @ Seabirds ( Trichy, Chennai, Pondich...
 
SOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITS
SOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITSSOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITS
SOFTWARE BASED CALCULATION OF CAPACITY OUTAGE OF GENERATING UNITS
 
Implementing load balancing algorithm in middleware system of volunteer cloud...
Implementing load balancing algorithm in middleware system of volunteer cloud...Implementing load balancing algorithm in middleware system of volunteer cloud...
Implementing load balancing algorithm in middleware system of volunteer cloud...
 
Software tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data miningSoftware tools for high-throughput materials data generation and data mining
Software tools for high-throughput materials data generation and data mining
 
An Efficient Hardware Implementation of Canny Edge Detection Algorithm
An Efficient Hardware Implementation of Canny Edge Detection AlgorithmAn Efficient Hardware Implementation of Canny Edge Detection Algorithm
An Efficient Hardware Implementation of Canny Edge Detection Algorithm
 
OpenACC and Open Hackathons Monthly Highlights May 2023.pdf
OpenACC and Open Hackathons Monthly Highlights May  2023.pdfOpenACC and Open Hackathons Monthly Highlights May  2023.pdf
OpenACC and Open Hackathons Monthly Highlights May 2023.pdf
 
ThesisProposal
ThesisProposalThesisProposal
ThesisProposal
 
CV
CVCV
CV
 
OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019OpenACC Monthly Highlights Summer 2019
OpenACC Monthly Highlights Summer 2019
 
research Paper face recognition attendance system
research Paper face recognition attendance systemresearch Paper face recognition attendance system
research Paper face recognition attendance system
 
Between Simulator and Prototype: Crossover Architecture for Testing and Demon...
Between Simulator and Prototype: Crossover Architecture for Testing and Demon...Between Simulator and Prototype: Crossover Architecture for Testing and Demon...
Between Simulator and Prototype: Crossover Architecture for Testing and Demon...
 
Partial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather ConditionsPartial Object Detection in Inclined Weather Conditions
Partial Object Detection in Inclined Weather Conditions
 
Defense
DefenseDefense
Defense
 
A Review: Metaheuristic Technique in Cloud Computing
A Review: Metaheuristic Technique in Cloud ComputingA Review: Metaheuristic Technique in Cloud Computing
A Review: Metaheuristic Technique in Cloud Computing
 
B040101007012
B040101007012B040101007012
B040101007012
 
Thesies_Cheng_Guo_2015_fina_signed
Thesies_Cheng_Guo_2015_fina_signedThesies_Cheng_Guo_2015_fina_signed
Thesies_Cheng_Guo_2015_fina_signed
 

Recently uploaded

Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Open Chemistry: Realizing Open Data, Open Standards, and Open Source

  • 1. Open Chemistry: Realizing Open Data, Open Standards and Open Source Marcus D. Hanwell, Kyle Lutz, David Lonie, Chris Harris, and David Cole Website: http://openchemistry.org/ Email: marcus.hanwell@kitware.com, kyle.lutz@kitware.com Scientific Computing, Kitware, Inc, 28 Corporate Drive, Clifton Park, NY 12065. Avogadro Open Chemistry Chemical Data Explorer The Avogadro project is a cross-platform, open-source approach to building chemical The Open Chemistry project is developing a suite of applications and support libraries The Chemical Data Explorer is an cross-platform, open-source application that structures. It uses external simulation packages in addition to integrated analysis and to improve the workflow in computational chemistry, biology, materials science and builds on the capabilities of the Visualization Toolkit, Qt and MongoDB. It can visualization routines. The work presented here illustrates a workflow for quantum related areas. A set of open, connected components that can tackle small problems connect to a local or remote database, ingest new data from various sources and mechanical calculations, allowing the preparation of chemical structures, rough on the desktop, and big research projects requiring significant time on the world’s top make that data semantically rich. It can apply informatics techniques to the data optimization, and subsequent calculation of electron density isosurfaces, molecular supercomputers. it contains to search for structures with particular properties. Work is ongoing to orbitals, etc. more tightly integrate computational job storage and search. Log File Input File Simulation Results Informatics Job Submission HPC integration Local Cloud Supercomputer Figure 5: The workflow that the Open Chemistry components are being developed for. Figure 1: Avogadro application (left), ray-traced molecule (center) and the periodic table widget (right). OpenQube Avogadro allows the user to prepare jobs for quantum packages, such as NWChem, OpenQube is a small, open-source C++ library that reads key quantum data from Figure 3: The user interface showing a query and structures (top-left), a scatter plot matrix (top-right), scatter GAMESS, Gaussian and Q-Chem. Due to the plugin-based nature of the Avogadro calculations produced by codes such as NWChem, GAMESS and Gaussian. It can plot with tooltip (bottom-left), and K-means clustering (bottom-right). project, many specialized functions can be added for a large range of applications, read in basis sets, eigenvectors and density matrices, and calculate the magnitude such as molecular docking, surface modeling and electronic structure. of the molecular orbitals and electron density on regularly-spaced grids. The data produced can be used for further analysis and visualization of electronic structure. Visualization Toolkit and ParaView The Visualization Toolkit (VTK) is an open-source, C++ toolkit for 2D and MoleQueue Chemkit 3D graphics, volume rendering, image processing, visualization and modeling. The MoleQueue application provides a graphical interface that integrates high- Development began in 1993, and it now has a large community of developers Chemkit is an open-source, C++ library for molecular modeling, cheminformatics, performance computing (HPC) resources on the desktop. It offers a seamless distributed around the world in a diverse set of fields. VTK processes data using and molecular visualization. It features a modular, plugin-based architecture and integration layer for applications, such as Avogadro, to submit jobs to local and a data flow graph (pipeline) in which each algorithm takes zero or more inputs includes over 40 plugins that implement 15 file formats, 6 line formats, 4 force-fields, remote computational resources. Job lifetime is managed by MoleQueue, and results and produces zero or more outputs. VTK is scalable to large data because it has 2 partial charge models, 2 aromaticity models, 8 atom typers and 30 molecular can be opened in any external program. distributed algorithms that use MPI to execute on large computing clusters. descriptors. In addition, Chemkit includes an integrated visualization library built on OpenGL/Qt, with Python bindings for easy scripting. Figure 4: Volume rendered molecular orbital with sliced contour (left), and library dependency graph (right). Figure 6: Cartoon rendering of protein (left), surface rendering (center), and molecule rendering (right). ParaView is an open-source, cross-platform data analysis and visualization application. It is one of the flagship open-source projects developed by Kitware, Figure 2: The MoleQueue program configuration dialog for a PBS remote system. Software Process building on VTK and Qt to provide a client-server application that allows users • Graphical configuration of queues and programs These projects are open-source, targeting multiple platforms and architectures. A to quickly build visualizations to analyze their data. ParaView was developed to quality-inducing software process is employed using best-of-breed technologies such analyze extremely large data sets using distributed memory computing resources. • Support for Sun Grid Engine, PBS and running calculations locally as Git for distributed version control, Gerrit for code review, CMake for cross- It can be used interactively with the cross-platform GUI, or scripted from Python. • JSON-RPC protocol for interprocess communication over local sockets or ZeroMQ VTK and ParaView are being augmented with additional functionality for chemistry platform building, CTest for unit/regression testing and CDash for software quality • C++ and Python client libraries feedback. Most code is BSD licensed, and designed with reuse in mind. through projects such as the Google Summer of Code and Open Chemistry.