Sequence Retrieval & Management
This is AI Scientist’s most differentiated core capability — instead of “discussing” cloning strategies with text, you can directly complete the entire workflow from sequence retrieval to assembly simulation within a conversation.
Multi-Source Retrieval
Section titled “Multi-Source Retrieval”AI Scientist uses a priority decision tree to find the best sequence source automatically:
| Priority | Source | Description |
|---|---|---|
| 1 | Component Memory Library | Search your previously saved sequences first |
| 2 | NCBI | Fetch DNA sequences by gene name or Accession number (batch supported) |
| 3 | UniProt | Fetch protein sequences by protein name or ID (batch supported) |
| 4 | Addgene | Search and retrieve public plasmids and their sequences |
| 5 | Standard Vector Library | 2,800+ built-in standard vectors ready to use |
When you request a sequence, AI Scientist checks each source in order — starting with your own library to reuse previously curated sequences, then falling back to public databases. Batch retrieval is supported for NCBI and UniProt, allowing you to fetch multiple sequences in a single request.
Sequence Operation Toolbox
Section titled “Sequence Operation Toolbox”Once sequences are loaded into your session, you have access to a comprehensive set of manipulation tools:
- Create / Rename / Delete — manage sequences in your workspace
- Extract subsequences — pull out specific regions by coordinates or feature annotations
- Reverse complement — generate the complementary strand
- Concatenate — join multiple sequences end-to-end
- Point mutations — introduce specific nucleotide or amino acid changes
- Region insertions / deletions — add or remove sequence segments at specified positions
- Restriction enzyme site search — find cut sites for specific enzymes or common cloning enzyme sets, useful for domestication workflows
- Sequence export — download in GenBank, FASTA, or JSON formats for use in external tools