Available Features Explained

Welcome to the Available Features Explained section of the GenomeSet documentation. This section provides an in-depth explanation of every core function and feature available on the GenomeSet platform — from data uploading to advanced genomic analysis tools, result management, and data sharing.


Data Management Features

User Registration and Profiles

  • Create a personal account to access data upload, saving, and sharing features.
  • Manage personal information and security settings.
  • Access your uploaded files, saved projects, analysis results, and sequence subsets.

Upload Data

  • Upload your personal genomic data in FASTA (.fa/.fna/.fasta) or tabular (CSV/TSV) format.
  • Assign a data type during upload for proper parsing.
  • Uploaded files are stored under the Uploaded Files section in your profile.
  • Easily select and manage uploaded files for analysis using the Select button and buffer window.

Data Exploration Tools

Species Explorer

  • Browse a rich, interactive taxonomic tree of publicly available organisms from NCBI.
  • Select organisms by kingdom, division, class, order, or species.
  • Use checkboxes to select up to 5 organisms for comparative analysis.
  • Visualize organism lineages and navigate taxonomic relationships.
  • Selected organisms appear in a buffer for quick analysis or removal.

GEO Dataset Explorer

  • Search and browse publicly available gene expression datasets from the GEO (Gene Expression Omnibus) repository.
  • Select datasets based on study criteria or data type.
  • Analyze gene expression data using GenomeSet’s integrated tools.

Analysis Functions (Analyzer Page)

Once data or organisms are selected, the Analyzer Page becomes available. Here, you can run a wide array of genomic analyses.

Parsing & Sequence Type Selection

  • For NCBI organisms, the system downloads .fna (genomic sequences) and .gff3 (annotation) files.
  • Extracts specific regions: cDNA, CDS, UTR 5’, UTR 3’, and Gene sequences.
  • Checkboxes allow you to choose which sequence types to include in your analysis.

Core Analysis Functions

  • Length Analysis:

    Measures and visualizes sequence length distributions for selected regions.

  • GC-Content:

    Calculates the GC (Guanine + Cytosine) percentage for sequences or sequence regions.

  • CpG-Island Detection:

    Identifies high-density CpG regions — typically associated with gene regulation areas.

  • K-Mer Frequency Analysis:

    Counts and analyzes all possible k-length subsequences within genomic sequences.

Projects Tools

GFF3 Parser: Coming soon — parse and explore gene annotations.
Massive Genome Research Tool: Coming soon — analyze large genome batches.
Genomes Map: Future feature for visualizing genome structures interactively.