From b711f8723524f2fab8ee7e84645f4a9724a8576c Mon Sep 17 00:00:00 2001 From: Lan Hui Date: Tue, 28 Oct 2025 15:30:12 +0800 Subject: Update doc --- brain.documentation/QUICKSTART.rst | 26 ++++++++++++++++++++++++-- 1 file changed, 24 insertions(+), 2 deletions(-) (limited to 'brain.documentation') diff --git a/brain.documentation/QUICKSTART.rst b/brain.documentation/QUICKSTART.rst index 8a862df..c63f358 100644 --- a/brain.documentation/QUICKSTART.rst +++ b/brain.documentation/QUICKSTART.rst @@ -150,10 +150,32 @@ Use *cron job* to schedule running these two scripts on December 25 and December download_ena_records.py will download the following files to folder Data/information: - ena_read_run.xml + + run accession + + title + + experiment accession + - ena_read_experiment.xml -- ena_sample.xml -- ema_study.xml + + experiment accession (e.g., SRX19576837) + + study accession (e.g., SRP425891) + + sample descriptor (e.g., SRS16960021) + + title + + library strategy (e.g., RNA-Seq) + + library source (e.g., TRANSCRIPTOMIC) +- ena_sample.xml + + sample accession, e.g., SRS17013367 + + taxon id (e.g., 3702) + + title + +- ena_study.xml + + study accession, e.g., SRP425891 + + study abstract + +The above four files can be linked through accessions. +Given a run accession in ena_read_run.xml, we can get its associated experiment accession. +Given an experiment accession in ena_read_experiment.xml, we can get its associated sample accession and study accession. +Given a sample accession, we can find its taxon id in ena_sample.xml. +Given a study accession, we can find its abstract in ena_study.xml. parse_end_xlm.py will use the above four XML files as input, and output the following two files to Data/information: - rnaseq_info_database.json (containing run IDs, library_source and library_strategy) -- cgit v1.2.1