Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Metrics structuring for contribution

Note

Prerequisites

This article requires that you understand how data is stored on SynDB, we recommend reading through the overview article if you are uncertain.

This article is a guide for contributors who wish to upload their data to SynDB. Please don’t hesitate to ask for help on the Discord channel if you have any questions; this part can be challenging.

Data structuring

Schema

Each SynDB table has its own schema, which can be found on our GitHub repository. The schema defines the supported column names and their data types. The data must be structured in a way that is compatible with the schema of the table you are contributing to.

The column names and the data stored under them must be in a format that is compatible with the type. You can find the supported column names for each SynDB table in the . You may use the glossary at the end of this article for reference.

Note

Nano

We use nanometers as the unit for all measurements; includes volume, radius, and distance.

Sourcing raw data

You may upload the sourcing raw data files including meshes or SWL to SynDB. Place the absolute path to the file in your table file. The following are supported:

  • Meshes in .glb format, column name: mesh_path
  • SWC files, .swc, column name: swc_path

This list is the main tracker for the supported formats. You may request additional formats on the Discord channel. The SynDB team will review the request and consider adding the new format to the platform.

Columns

Most column types are self-explanatory, but some require additional explanation.

Identifiers and relations

The CID column defined in your table can have any unique hashable value, it will be replaced by a UUID when uploaded to SynDB. When uploading a relational dataset, the cid column in the parent will be used to correlate the relations to the children by their parent_id; meaning the hashable value in the parent cid column must match the parent_id in the child. parent_enum can be omitted as the compartments are defined at the tabular level, and will, therefore, be added automatically.

Example

Notice the parent_id column in the child table, this is the cid of the parent table. The parent_enum column is not present in the child table, as it is defined at the tabular file name.

vesicle.csv, child

cidneurotransmittervoxel_radiusdistance_to_active_zoneminimum_normal_lengthparent_idcentroid_zcentroid_xcentroid_y
0glutamate26.9129705.24502314505.2321996.2244953.6
1glutamate25.5388615.02132314505.2321996.2244953.6
2glutamate29.5260513.07012314505.2321996.2244953.6
3glutamate30.5131479.92242314505.2321996.2244953.6
4glutamate28.3977454.82482314505.2321996.2244953.6
5glutamate30.2033459.75572324505.2321996.2244953.6
6glutamate33.4548374.81312324505.2321996.2244953.6
7glutamate32.0890455.92932344505.2321996.2244953.6

axon.csv, parent

voxel_volumemitochondria_counttotal_mitochondria_volumecid
385668034.56193208043.521
1492089016.324412054179.842
327740497.92004

Glossary

KeyDescription
dataset_idThe unique identifier for the dataset, of type uuid.
cidThe unique identifier for a SynDB unit within the dataset, of type uuid.
parent_idThe CID of the parent component, of type uuid.
parent_enumAn integer representing the type or category of the parent component, of type int.
polarityThe polarity of the neuron, of type ascii.
voxel_volumeThe volume of the voxel, of type double.
voxel_radiusThe radius of the voxel, of type double.
s3_mesh_locationThe location of the mesh in S3 storage, of type smallint.
mesh_volumeThe volume of the mesh, of type double.
mesh_surface_areaThe surface area of the mesh, of type double.
mesh_area_volume_ratioThe ratio of the surface area to the volume of the mesh, of type double.
mesh_sphericityThe sphericity of the mesh, of type double.
centroid_zThe z-coordinate of the centroid, of type double.
centroid_xThe x-coordinate of the centroid, of type double.
centroid_yThe y-coordinate of the centroid, of type double.
s3_swb_locationThe location of the SWB in S3 storage, of type smallint.
terminal_countThe count of terminals, of type int.
mitochondria_countThe count of mitochondria, of type int.
total_mitochondria_volumeThe total volume of mitochondria, of type double.
neuron_idThe unique identifier for the associated neuron, of type uuid.
vesicle_countThe count of vesicles, of type int.
total_vesicle_volumeThe total volume of vesicles, of type double.
forms_synapse_withThe unique identifier of the synapse that the component forms with, of type uuid.
connection_scoreThe score representing the strength or quality of the connection, of type double.
cleft_scoreThe score for the synaptic cleft, of type int.
GABAThe concentration or presence of GABA neurotransmitter, of type double.
acetylcholineThe concentration or presence of acetylcholine neurotransmitter, of type double.
glutamateThe concentration or presence of glutamate neurotransmitter, of type double.
octopamineThe concentration or presence of octopamine neurotransmitter, of type double.
serineThe concentration or presence of serine neurotransmitter, of type double.
dopamineThe concentration or presence of dopamine neurotransmitter, of type double.
root_idThe external root identifier from the source platform (e.g. FlyWire), of type int.
pre_idThe unique identifier of the pre-synaptic component, of type uuid.
post_idThe unique identifier of the post-synaptic component, of type uuid.
dendritic_spine_countThe count of dendritic spines, of type int.
neurotransmitterThe type of neurotransmitter present in a vesicle, of type ascii.
distance_to_active_zoneThe distance from the vesicle to the active zone, of type double.
minimum_normal_lengthThe minimum normal length, of type int.
ribosome_countThe count of ribosomes within the endoplasmic reticulum, of type int.