Unlocking breeding potential of African crops through data management:
an example with CASSAVABASE
Guillaume Bauchet
Plant and Animal Genome Conference
San Diego January 2016
gjb99@cornell.edu
OUTLINE
http://nextgencassava.org/
CASSAVABASE, what for?
CASSAVABASE, a user perspective
CASSAVABASE: search, manage, analyze
CASSAVABASE, a view
The central data store for NEXTGEN CASSAVA:
Genomic selection in African cassava breeding programs
http://nextgencassava.org/
NEXTGEN CASSAVA
What are the major challenges?
● Multi-trait, multi-environment phenotypic data collection for cassava
● Large-scale production of genomic data using genotyping-by-sequencing (GBS)
● Integrating a Genomic Selection tool via a web interface
● Making the most of this resource for cassava breeders: speeding up analysis and decision making
What are the needs?
● Search various data types (phenotypes and germplasm) in a large datastore
● Manage data and daily breeding activity through a comprehensive interface
● Analyse and retrieve data for genomics-assisted breeding
What are our solutions?
● Integrate phenomic & genomic data with breeding tools
● Use Perl with Bio::Chado::Schema and the Natural Diversity module as the database architecture
● Retrieve genomic information
● Sequence visualization
● Open source
https://github.com/solgenomics/
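To make the database-architecture bullet concrete, here is a minimal sketch in Python with an in-memory SQLite database. It is not the CASSAVABASE schema or Perl code: the table names (`stock`, `cvterm`, `nd_experiment`, `phenotype`) follow Chado naming, but the columns are heavily reduced and the accession, trial and trait values are hypothetical.

```python
import sqlite3

# Simplified, hypothetical subset of a Chado-style layout. Table names follow
# Chado conventions; the columns are reduced for illustration only.
SCHEMA = """
CREATE TABLE stock (stock_id INTEGER PRIMARY KEY, uniquename TEXT UNIQUE);
CREATE TABLE cvterm (cvterm_id INTEGER PRIMARY KEY, name TEXT UNIQUE);
CREATE TABLE nd_experiment (
    nd_experiment_id INTEGER PRIMARY KEY,
    stock_id INTEGER REFERENCES stock(stock_id),
    trial TEXT
);
CREATE TABLE phenotype (
    phenotype_id INTEGER PRIMARY KEY,
    nd_experiment_id INTEGER REFERENCES nd_experiment(nd_experiment_id),
    cvterm_id INTEGER REFERENCES cvterm(cvterm_id),
    value REAL
);
"""


def build_demo_db():
    """Create an in-memory database holding a few made-up observations."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO stock VALUES (?, ?)",
                     [(1, "ACC_A"), (2, "ACC_B")])
    conn.execute("INSERT INTO cvterm VALUES (1, 'fresh root yield')")
    conn.executemany("INSERT INTO nd_experiment VALUES (?, ?, ?)",
                     [(1, 1, "trial_1"), (2, 1, "trial_2"), (3, 2, "trial_1")])
    conn.executemany("INSERT INTO phenotype VALUES (?, ?, ?, ?)",
                     [(1, 1, 1, 30.0), (2, 2, 1, 34.0), (3, 3, 1, 25.0)])
    return conn


def trait_means(conn, trait_name):
    """Return (accession, mean value) pairs for one trait, sorted by accession."""
    rows = conn.execute(
        """
        SELECT s.uniquename, AVG(p.value)
        FROM phenotype p
        JOIN nd_experiment e ON e.nd_experiment_id = p.nd_experiment_id
        JOIN stock s ON s.stock_id = e.stock_id
        JOIN cvterm c ON c.cvterm_id = p.cvterm_id
        WHERE c.name = ?
        GROUP BY s.uniquename
        ORDER BY s.uniquename
        """,
        (trait_name,),
    )
    return rows.fetchall()
```

Calling `trait_means(build_demo_db(), "fresh root yield")` joins observations to accessions through the experiment table, which is the general idea of keeping germplasm, trait vocabulary and measurements in separate linked tables.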
http://cassavabase.org/
New search bar
Navigation bar always visible on top Expandable search box
Carousel
New responsive design
CASSAVABASE
by numbers
2016: 80,000+ accessions, 2.5 billion genetic observations
2014:
+360 registered users
From Phenotype to Genotype to Breeding:
Harvesting the fruits of CASSAVABASE
CASSAVABASE, an Office perspective: Search
Search breeding program, location, trial, trait, year, accession
CASSAVABASE, a field perspective: Manage Phenotypes
● Define phenotypic traits via the Cassava trait dictionary in CASSAVABASE
● Data collection via the FieldBook app*
● Design trials, barcodes & field maps in CASSAVABASE*
● Data uploading in CASSAVABASE via .xls and .txt files*
● Data analysis in CASSAVABASE: summary statistics, ANOVA, BLUP, GS
*See Alex Ogbonna's PAG presentation, “Managing Phenotypic Data through Cassavabase with Fieldbook App”
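As an illustration of the upload and summary-statistics steps on this slide, the sketch below parses a tab-delimited phenotype file and reports n, mean and standard deviation per trait. The column names (`plot_name`, `accession`, trait columns) are hypothetical and do not reproduce the CASSAVABASE upload template.

```python
import csv
import io
import statistics


def summarize_traits(text, id_columns=("plot_name", "accession")):
    """Summarize each trait column of a tab-delimited phenotype table.

    Columns named in id_columns are treated as identifiers; every other
    column is treated as a numeric trait. Empty cells count as missing.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    values = {}
    for row in reader:
        for column, cell in row.items():
            # Skip identifier columns, surplus fields and missing cells.
            if column is None or column in id_columns:
                continue
            if cell is None or cell.strip() == "":
                continue
            values.setdefault(column, []).append(float(cell))
    summary = {}
    for trait, observed in values.items():
        summary[trait] = {
            "n": len(observed),
            "mean": statistics.mean(observed),
            # Sample standard deviation needs at least two observations.
            "sd": statistics.stdev(observed) if len(observed) > 1 else 0.0,
        }
    return summary
```

Missing cells are skipped rather than treated as zero, so a trait scored on only some plots gets a smaller n instead of a biased mean.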
CASSAVABASE, a lab perspective: Manage Genotypes
● Design genotyping trial in CASSAVABASE
● TASSEL pipeline
● Data filtering & imputation
● GBS data uploading in CASSAVABASE
● GS analysis & visualization in CASSAVABASE
● GBS facility @ Cornell
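The data filtering and imputation step can be sketched as follows. This is not the TASSEL pipeline or the method used for CASSAVABASE: it assumes genotypes coded as allele dosages (0, 1, 2, with `None` for missing calls), uses illustrative thresholds, and substitutes simple per-marker mean imputation for whatever imputation method is actually applied.

```python
def filter_and_impute(markers, max_missing=0.2, min_maf=0.05):
    """Filter markers by missing rate and minor allele frequency, then impute.

    markers maps a marker name to a list of dosage calls (0, 1, 2 or None).
    Markers that pass both filters are returned with missing calls replaced
    by the mean dosage of the observed calls at that marker.
    """
    kept = {}
    for name, calls in markers.items():
        observed = [c for c in calls if c is not None]
        if not observed:
            continue  # nothing to learn from a marker with no calls
        missing_rate = (len(calls) - len(observed)) / len(calls)
        # Each diploid sample carries two alleles, hence the factor 2.
        allele_freq = sum(observed) / (2 * len(observed))
        maf = min(allele_freq, 1 - allele_freq)
        if missing_rate > max_missing or maf < min_maf:
            continue
        mean_dosage = sum(observed) / len(observed)
        kept[name] = [mean_dosage if c is None else float(c) for c in calls]
    return kept
```

With the default thresholds, a monomorphic marker and a marker missing in 40% of samples are both dropped, while a marker missing in 20% of samples is kept and its gaps are filled with the mean dosage.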
CASSAVABASE, an office perspective: Manage
Breeding programs, trial, accession
CASSAVABASE: Analyze with SolGS
Phenotypic values, population structure, GEBV vs phenotypes
See Isaak Tecle's PAG presentation & poster 342, “solGS: A Web-based Solution for Genomic Selection”
CASSAVABASE from the Office: Analyze phenotypes
QC of phenotypes: single trial
CASSAVABASE tools: Analyze pedigree
CASSAVABASE from the Office: Analyze phenotypes
QC of phenotypes: multiple trials (ANOVA, h2, BLUP, GxE)
[Figure: pairwise correlation plots between trials data_2011_B1, data_2011_B2, data_2011_B3, data_2012_B1 and data_2012_B2 (r = 0.63 to 0.79, all p<0.001)]
[Figure: regression diagnostic plots: Residuals vs Fitted, Normal Q-Q, Scale-Location, Residuals vs Leverage (with Cook's distance)]
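For readers unfamiliar with the h2 and BLUP quantities named on this slide, here is a minimal sketch for the simplest possible case: a balanced one-way design with genotypes as a random effect. It is an assumption-laden illustration, not the analysis CASSAVABASE runs; it estimates variance components from ANOVA mean squares and reports entry-mean broad-sense heritability, and the genotype names and values in the usage note are made up.

```python
from statistics import mean


def heritability_and_blups(plots):
    """Entry-mean heritability and genotype BLUPs for a balanced design.

    plots maps a genotype name to its list of replicate values; every
    genotype must have the same number of replicates (at least two).
    """
    rep_counts = {len(values) for values in plots.values()}
    if len(rep_counts) != 1:
        raise ValueError("this sketch assumes equal replicates per genotype")
    r = rep_counts.pop()
    g = len(plots)
    if g < 2 or r < 2:
        raise ValueError("need at least 2 genotypes and 2 replicates")

    geno_means = {name: mean(values) for name, values in plots.items()}
    grand_mean = mean(list(geno_means.values()))

    # One-way ANOVA mean squares: between genotypes and within genotypes.
    ms_geno = r * sum((m - grand_mean) ** 2 for m in geno_means.values()) / (g - 1)
    ms_error = sum(
        (y - geno_means[name]) ** 2 for name, values in plots.items() for y in values
    ) / (g * (r - 1))

    # Method-of-moments variance components; negative estimates are set to 0.
    var_geno = max((ms_geno - ms_error) / r, 0.0)
    denominator = var_geno + ms_error / r
    h2 = var_geno / denominator if denominator > 0 else 0.0

    # In the balanced case the BLUP shrinks each genotype deviation by h2.
    blups = {name: h2 * (m - grand_mean) for name, m in geno_means.items()}
    return h2, blups
```

For example, `heritability_and_blups({"A": [10.0, 12.0], "B": [14.0, 16.0], "C": [18.0, 20.0]})` gives a heritability of 0.9375, so each genotype's deviation from the grand mean is shrunk by about 6% to obtain its BLUP. Unbalanced, multi-environment trials would need a mixed-model fit instead.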
CASSAVABASE tools: Analyze sequence
● JBrowse
● Variant effects prediction
● VIGS tool
● BLAST
CASSAVABASE, a User perspective: support & interaction
-> Provide support on technical issues (data management)
-> Gather user requests for tool improvements and new developments (pedigree queries, VIGS)
-> 2016: Install mirror site @ IITA Ibadan, Nigeria
Weekly meetings with users in Africa; wiki, FB pages & mailing list
CASSAVABASE: Upcoming developments
Search: integrate traits & values in the wizard search
Manage: extract data subsets according to phenotypic values, conditional choices
Analyze:
- Phenotypic analysis developments (ANOVA, GxE)
- Pedigree analysis
- JBrowse: mutation prediction of genetic variants
- SolGS: job queuing, trial selection improvement
ACKNOWLEDGEMENTS
Lukas Mueller, Alex Ogbonna, Bryan Ellerbrock, Naama Menda, Isaak Tecle, Nick Morales, Jeremy Edwards, Chiedozie Egesi, Peter Kulakow, Robert Kawuki, Ismail Rabbi
BMGF
Questions?
