Here you will find the Lupin genome in fasta format along with the associated data. This includes annotation, protein sequences, sequencing data...

Lupin Genome version 1.0 (latest)

Here you will find the latest genomic sequence of white lupin, including the annotation of 38258 Genes and 3129 non coding RNA. You can also download the sequences of all the annoted elements. The sequencing data that have been used for the genome assembly are also available.

Genome Annotations Fasta sequences

Pangenome assembly
Variety Genome Annotation Genes mRNA CDS Proteins
P21525 P21525.fasta.gz P21525.gff3.gzgenesmRNA CDSproteins
Volodia Volodia.fasta.gz Volodia.gff3.gzgenesmRNA CDSproteins
ALB01 ALB01.fasta.gz ALB01.gff3.gzgenesmRNA CDSproteins
Amiga Amiga.fasta.gz Amiga.gff3.gzgenesmRNA CDSproteins
Badajoz Badajoz.fasta.gz Badajoz.gff3.gzgenesmRNA CDSproteins
Batsi_Wild Batsi_Wild.fasta.gz Batsi_Wild.gff3.gzgenesmRNA CDSproteins
Clovis Clovis.fasta.gz Clovis.gff3.gzgenesmRNA CDSproteins
Dieta Dieta.fasta.gz Dieta.gff3.gzgenesmRNA CDSproteins
Dogan Dogan.fasta.gz Dogan.gff3.gzgenesmRNA CDSproteins
EGY6484B EGY6484B.fasta.gz EGY6484B.gff3.gzgenesmRNA CDSproteins
Energy Energy.fasta.gz Energy.gff3.gzgenesmRNA CDSproteins
Feodora Feodora.fasta.gz Feodora.gff3.gzgenesmRNA CDSproteins
Figaro Figaro.fasta.gz Figaro.gff3.gzgenesmRNA CDSproteins
Gerelta-2 Gerelta-2.fasta.gz Gerelta-2.gff3.gzgenesmRNA CDSproteins
GR38 GR38.fasta.gz GR38.gff3.gzgenesmRNA CDSproteins
GRC5262B GRC5262B.fasta.gz GRC5262B.gff3.gzgenesmRNA CDSproteins
Gyulatanya Gyulatanya.fasta.gz Gyulatanya.gff3.gzgenesmRNA CDSproteins
Hansa Hansa.fasta.gz Hansa.gff3.gzgenesmRNA CDSproteins
Kalina Kalina.fasta.gz Kalina.gff3.gzgenesmRNA CDSproteins
Kiev Kiev.fasta.gz Kiev.gff3.gzgenesmRNA CDSproteins
LD37 LD37.fasta.gz LD37.gff3.gzgenesmRNA CDSproteins
Lucky Lucky.fasta.gz Lucky.gff3.gzgenesmRNA CDSproteins
Luxe Luxe.fasta.gz Luxe.gff3.gzgenesmRNA CDSproteins
Magnus Magnus.fasta.gz Magnus.gff3.gzgenesmRNA CDSproteins
Murringo Murringo.fasta.gz Murringo.gff3.gzgenesmRNA CDSproteins
N3507 N3507.fasta.gz N3507.gff3.gzgenesmRNA CDSproteins
Nahrquell Nahrquell.fasta.gz Nahrquell.gff3.gzgenesmRNA CDSproteins
Neuland Neuland.fasta.gz Neuland.gff3.gzgenesmRNA CDSproteins
Neutra Neutra.fasta.gz Neutra.gff3.gzgenesmRNA CDSproteins
Orus Orus.fasta.gz Orus.gff3.gzgenesmRNA CDSproteins
P27174-4 P27174-4.fasta.gz P27174-4.gff3.gzgenesmRNA CDSproteins
Poutignano Poutignano.fasta.gz Poutignano.gff3.gzgenesmRNA CDSproteins
Primorsky Primorsky.fasta.gz Primorsky.gff3.gzgenesmRNA CDSproteins
Shinfield Shinfield.fasta.gz Shinfield.gff3.gzgenesmRNA CDSproteins
Start Start.fasta.gz Start.gff3.gzgenesmRNA CDSproteins
SYR6258B SYR6258B.fasta.gz SYR6258B.gff3.gzgenesmRNA CDSproteins
Tombowskij Tombowskij.fasta.gz Tombowskij.gff3.gzgenesmRNA CDSproteins
Ulysses Ulysses.fasta.gz Ulysses.gff3.gzgenesmRNA CDSproteins
Wado-2 Wado-2.fasta.gz Wado-2.gff3.gzgenesmRNA CDSproteins
Sequencing data

Illumina Reads PacBio Reads Bionano Genomics Irys :

For convenience, all the Pacbio reads (from the subreads.bam files) have been merged into a single big fasta file.

Illumina Sequencing data (First 15 Varieties batch) Illumina Sequencing data (Second 24 Varieties batch) Raw genomic variation from 39 accessions Oxford Nanopore Sequencing data Other genome assembly

Here you will find all the raw sequencing data that were used for this study. In addition, for RNA-seq, the raw count table is provided. Just click on the button you are intereted in to display the downloadable corresponding data.

Data Sequencing Data

Data Sequencing Data