The program structure is a software package for using multi-locus genotype data to investigate population structure. Its uses include inferring the presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed.
It can be applied to most of the commonly-used genetic markers, including SNPS, microsatellites, RFLPs and AFLPs.
- Structure is free to use and open source, but no license specified.
- Strauto is free to use and open source, but no license specified.
- strasuto-puhti is free to use and open source, but no license specified.
- structureHarvester is free to use and open source under its own License
- CLUMPP is free to use, but no license specified.
- Structre 2.3.4 is available in Puhti
To use Structure in Puhti, run first following setup commands:
module load biokit module load strcture
In Puhti, we recommend that you submit your structure jobs using help tool called
This tool is a modified version of strauto structure
job sumbission tool. Note that many details in the strauto manual do not apply to
Next move to the scratch directory of your project. Any subdirectory inside your scratch area will do. For example:
strauto-puhtiprogram. The parameter file must always be named as
input.py. The name of the actual datafile is defined in
input.py. The data file name should end with
A sample file, provided by strauto can be copied to your current directory with commands:
cd structure_job1 cp /appl/soft/bio/structure/strauto/input.py ./ cp /appl/soft/bio/structure/strauto/sim.str ./
When the input file has been prepared, the strcture job can be launched with command:
You can leave the monitor process running, but if you want to stop it, press:
The structure jobs will still continue their execution in the batch job system of Puhti. If you run the command:
Note, that strauto-puhti does not use the internal, gnu-parallel based, parallelization. Instead, parallelization is based on array jobs. Because of this, you should not change the parallel parameter value to True in the structure input file.
Last edited Mon Mar 29 2021