3 de julio de 2023

My notes on Plant Biology Europe 2023 at Marseille

Hi, in this and following posts I will be sharing my notes of the sessions of the Plant Biology Europe 2023 at Marseille: https://europlantbiology2023.org

day2 day3 day4

Note: access here the slides of the "Learning to build and interrogate the pangenome of Brachypodium distachyon" talk.

03072023 

Uncovering the hidden half of plants: discovering novel ways roots sense and adapt to heterogeneous environments. Malcolm Bennet.

Talks about xerobranching (https://pubmed.ncbi.nlm.nih.gov/30270188), a mechanism to control root branching in water-deprived soil layers. This is initially controlled by phloem-produced ABA [compamion cells] that travels to the root and triggers an auxin response in the epidermis [plasmodesmata] (https://doi.org/10.1126/science.add3771). They have used air-gap experiments to show this is conserved across angiosperms.

In the 2nd part of the talk he talks about how soil compaction inhibits root growth and produces actually wider roots. However, in mutants unsensitive to ethylene, roots can actually overcome compaction. It seems that growth arrest is actually an ethylene response. Plants sense compaction by gauging gaseous ethylene diffusion in the soil (root radar signal). He is asked whether growing through compact soil actually damages the root. His answer is that natural variability suggests that damage is not the main driver or these mechanisms.

He goes on to stress we need to consider plant + soil + microbes to answer any root questions, so we should all talk to soil and microbe scientists.

 

A central role of root symbionts: the plant response to environmental stresses. Raffaella Balestrini.

Explains how strigogalactones produce by the root trigger the fungus infection during mycorrhiza establishment. Transcriptional landscape changes before -> after, revealing that [plant, fungus]  redundant transcripts encoding P-transporters ate upregulated. She shows experiments with Lotus japonica + Gigaspora. She moves to explain that AM symbiosis increases drought resilience, but it’s a process we still know little about, as responses depend on genotype interactions. She mentions https://doi.org/10.1126/science.aaz5192 and https://nph.onlinelibrary.wiley.com/doi/full/10.1111/nph.18281. She explains that infection in tomato affects also stomata density.

They are now working in a consortium that will produce tomato seeds coated with priming chemicals [including salicylic acid, which affect spore germination] and fungal spores to maximize stress resilience of adult plants.  She mentions https://doi.org/10.1016/j.tplants.2022.06.004 .

 

Fusing genome simulation and crop models for computer-aided breeding in future environments – Arnaud DESBIEZ-PIAT

Talks about training genomic prediction models with 10-cross validation as part of a virtual breeding platform. They get the genotypes ranked correctly, but has a 3d biass in anthesis date. Explores the effect of simulated and real recombination and models (rrBLUP vs Bayes) on the correlation of traits. He says that automatic phenotyping of germination rates is very hard. The goal is to keep genetic gain > errors.

Pervasive Under-Dominance in Gene Expression Underlying Emergent Growth Trajectories in Arabidopsis thaliana Hybrids– Wei YUAN

As Daniela Ristova, uses Biorender.com to make summaries. In her experiments comparing F1 hybrids to inbreds she finds less additive genes than dominant ones [hybrids have larger rosette areas]. She finds that promoters with more rare alleles have lower expression [allele freq as a proxy of deleteriousness], with hybrids with less rare alleles performing best. Paper soon out in Genome Biol?

Reconsidering photoperiod-sensitivity (PS) for maize adaption to climate change– Justine DROUAULT

Hypothesis: tropical maize could be re-introduced into temperate maize to increase adaptation in the face of climate change. She shows that selection towards PS indirectly selected also for other traits, such as plant height, etc. She classifies haplotypes according to their PS. She finds that tropical genotypes can reduce thermal time to flowering, but effect depends on background genotype.

Plasticity of root permeability for nutrient acquisition. Marie Barberon.

She explains the plastic role of suberin diffusion barrier in the root, citing https://www.pnas.org/doi/10.1073/pnas.2101730118.  A set of at least 4 MYB factors control (ectopic) suberization: 41,53,92,93. They cannot do single-cell as they cannot produce viable protoplasts of their root cells.

Polygenic selection and the evolution of gene expression in Arabidopsis Iyrata. Juliette de Meaux.

Inspiring talk on how her group is taking advantage of the Arabidopsis genomes (thaliana, halleri and lyrata) to study gene dominance. She starts by explaining that most genetic variance is deleterious and probably cannot be removed due to drift and the inability of selection to eliminate it. She shows that genes that show dominance display less Ks (more constrained) and that, conversely, additive genes, with smaller phenotypic effects, show larger Ks values (https://europepmc.org/article/ppr/ppr453330). She refers to https://www.nature.com/articles/s41467-021-23558-2, where they compare sterile hybrids of those species to classify cis variation. In outlokk, she favors future focus on many genes with small additive effect but large collective weight.

Changes in competitive ability over the course of durum wheat domestication are mediated by plant functional traits– Taïna LEMOINE

She studies Triticum monococcum, T. dicoccoides, T. turgidum landraces and T. turgidum elite varieties to measure their i) performance and ii) trait plasticity in the face of stress. She finds that domesticated materials performs better overall, but in terms of individual trait plasticity, wild material show larger variability, which suggests that breeding with a view of individual traits can still make gains from wild materials.

Horizontal gene transfer in Hordeum species– Marek SZECÓWKA

Hordeum species acquired DNA from panicoid sources via at least 9 events 5-1My ago. These lineages diverged 50Myr ago (https://pubmed.ncbi.nlm.nih.gov/33484020/). He asks how many accessory genes are from panicoids and what are their roles. So he chops Hordeum  genomes in 1Kb bits which are then mapped onto reference panicoid genomes, requiring at least 500b matches (BLASTN, 175b for BLASTX) and high % identity. He then requires >=70% identity and fragments >=5Kb for further analysis.  

They have also tracked panicoid TEs in Hordeum genomes and they seem to follow the segments and chromosomes previously shortlisted. They behave effectively as additional confirmation. They have built consensus panicoid TE library for this work.


8 de junio de 2023

Janet Thornton, se jubila la madre de la bioinformática estructural

Hola, ayer escuché por videoconferencia un rato de la última charla que dio Janet Thornton en el EMBL-European Bioinformatics Institute antes de jubilarse.

Woman standing at podium
fuente: EMBL

Janet ha sido sin duda una de las madres de la bioinformática, sobre todo en el área de la bioinformática estructural. Por ejemplo, tiene su propio modelo de sustitución de aminoácidos para hacer filogenias (JTT). Podéis ver su enorme influencia en la literatura en EuropePMC, o las palabras que le dedican Alfonso Valencia o Roland Dunbrack. Entre la larga lista de discípulos están por ejemplo David Jones (la J del modelo JTT, parte del equipo de AlphaFold y examinador de mi tesis), Christine Orengo o Nick Luscombe, todos ellos autores a los que he citado innumerables veces.

Yo la conocí personalmente en un congreso en Brasil, el ISMB2006, donde tuve la fortuna de tener una reunión cara a cara con ella donde me dio consejos y ánimos para mi incipiente carrera en la ciencia. Hace 4 años volvimos a coincidir en la cafetería del EMBL-EBI y recordando ese rato me dijo algo como "no te ha ido tan mal, verdad?".

En su charla recordaba observaciones que ella y su grupo habían hecho en las últimas décadas sobre la lista de aminoácidos importantes para explicar la catálisis de las enzimas. Eso le dio pie a repasar los resultados de los últimos años de trabajo, liderados por Antonio Ribeiro, donde se han centrado en sistematizar las reglas y en medir de manera objetiva la similitud entre mecanismos enzimáticos, ganando capacidad predictiva por el camino (ver por ejemplo https://europepmc.org/article/MED/36659981 y https://europepmc.org/article/PPR/PPR540240). Terminó esta parte de la charla, la última que pude seguir, diciendo que lo más difícil de jubilarse era no poder planear nuevos experimentos y estudios para todo lo que queda por saber. Creo que esa curiosidad es el motor para muchos de nosotros, no tengo nada más que añadir,

hasta pronto,

Bruno


 


 

 


 

 



2 de junio de 2023

Aprende Bioinformática en España en 2023

Hola,

hoy comparto un documento que ha publicado la asociación de jóvenes bioinformáticos de Granada (BioInfoGRX).

https://bioinformaticsgrx.es/wp-content/uploads/2021/11/Main-web2.png

Se trata de la guía "Aprende BioInformática", que se ocupa de:

  • Estudiar BioInformática en España (grados y másters)
  • Cursos online de BioInformática (biología, programacion y analisis de datos)


Es un documento dinámico que actualizará su contenido periódicamente. Puedes sugerir cambios en bioinformaticsgrx@gmail.com y descargarlo en:

https://bioinformaticsgrx.es/wp-content/uploads/2022/02/Gui%CC%81a-formativa-en-bioinforma%CC%81tica.docx.pdf


Espero sea útil,

Bruno


10 de mayo de 2023

Meeting notes: "At the forefront of plant research 2023" (III)

This last session is shared with the next meeting "Genomics-assisted breeding for boosting crop and livestock improvement".

Wednesday 10 may

Imagen
https://twitter.com/CRAGENOMICA/status/1656225214785568768

Cristobal Uauy  ‘Crop breeding as a DNA-assembly problem‘

Pictures breeding as creating mosaics genomes from genotypes that bring in traits of interest. Refers to the work of Brinton et al (https://www.nature.com/articles/s42003-020-01413-2) to build haplotypes. SNP chips fail to tell apart pangenomes entries as they target mostly nearly identical regions; instead haplotypes reconstruct the history of Watkins landraces accurately. They also found that landraces can improve the yield of elite cultivars such as Paragon, although with awful plant architectures.

They are also reconstructing haplotypes from raw reads in a scalable way. The goal is to find regions identical by state (IBS). Their window-based K-mer approach is https://github.com/Uauy-Lab/IBSpy; they use read of the reference genome as negative control. In their WGA benchmarks they find that 1SNP in 5Kbp ~ identical by state. 30% of their A-genome data have a divergence that you would expect for wild wheat. In fact they confirmed that T. monococcum had 1% IBS regions with hexaploidy wheat, which are evidence of introgressions. They have also found introgressions of T. timopheevii. Work at U Nottingham suggests any region of the hexploid genome is susceptible to introgression. The method works with barley and maize inbred lines.

They use affinity propagation clustering to simplify haplotypes. They obtain robust haplotypes by comparing K-mers to all pangenome assemblies. If several genotypes have the same pattern against the same reference, they probably are IBS in that region.

Up to 55% of modern cultivars can be traced back to <10 Watkins landraces. In the UK they realized the need to go back crossing to landraces to improve yield.

 

Nils Stein  ‘The barley pan-genome: large structural genome variation is not rare’

Starts by explaining how genomes of barley have powered applications from SNP calling up to genomic selection.  He then explains how 22K barleys were analyzed by GBS and intersected to core collections to pick n=20 genotypes, including one wild barley. He shows alternative ways to depict and summarize pangenomes, mentioning that graphs seem to work well with human genomes, but current tools don’t work well with barley (ie pggb). They had to mask geneless regions and build single-copy gene clusters.

They are now working towards a second version of the pangenome adding 23 wild barley and reaching a total of 76 genotypes, not yet T2T assemblies (talking with Sanger for that). This set seem to plateau for domesticated barleys, not yet for wild ones.  

 

Bruno Contreras-Moreira ‘Upgrading the gene annotation at the population level reveals the diverse pan-gene set of Asian rice’

My talk, a related preprint can be found at: https://www.biorxiv.org/content/10.1101/2023.01.03.520531v1

Code, documentation and examples at: https://github.com/Ensembl/plant-scripts

I paste here the abstract: Oryza sativa and Arabidopsis thaliana are the best sources of gene function information among plant genomes, probably for their role as model species. In rice, two independent efforts (RAP-DB and MSU - the latter no longer updated) curated the Nipponbare genes that researchers refer to in their papers. However, these two gene sets are different, and more importantly, do not include gene models found in other rice cultivars. To address these limitations, our consortium  produced high quality genome assemblies for 15 cultivars representative of all cultivated Asian rice, plus the reference genome (IRGSP), which were then annotated using the same protocol and gene expression data. The goal was to produce a consistent catalog of gene models that could be used by rice breeders around the world and distributed by key resources such as Ensembl Plants, Gramene and UniProt. A software prototype was developed to find collinear genes annotated in the rice set, producing a total of 84,530 gene clusters. Of these,  26,357 were found in 15+ rices (soft-core). We then confirmed that the soft-core set i) contains 94.6% of all the BUSCO protein domains of order Poales, ii) 89% of genes of agronomic value curated by RAP-DB and iii) has the largest support from protein mass spectrometry experiments. Further inspection of collinear genes revealed a large degree of diversity in terms of gene boundaries and a significant number of missing genes and potential loss of function alleles. This work highlights the challenges of defining a consensus gene annotation for a crop when different cultivars and populations are considered.

Dani Zamir  ‘Epistasis time’

Epistasis is the surprise you get when you combine two genes and get something you did not expect, not additive. Based on his recent tomato work with two interacting QTLs in different tomato  chromosomes, where they found significant heterosis for 10% of normal water input (https://www.pnas.org/doi/abs/10.1073/pnas.2205787119) he pushed us to move to pairs of markers, triplets of markers instead of one at a time. 

 

Danelle Seymour  ‘Integrating scales to traverse the genotype-phenotype divide in citrus’

[Scion = injerto] She does a pangenome survey of NLR gene models across Citrus species in the context of the HLB insect-transmited bacterial disease. Then she moves to explain their field trials in FL, US, along 3 years. They also do automatic phenotyping of several traits by taking 30 images per fruit.