Step 1: Load a data file¶
Tip
It is possible to try BioPAN using a demonstration file. Select the small dataset to understand how BioPAN works and the complete one to perform a more complex analysis 1.
How to prepare your data¶
BioPAN takes an input file in CSV (Comma-Separated Values) format containing quantitative data. The file structure should be as follows:
The first row contains sample labels (e.g.: ‘wild_type_1’, ‘wild_type_2’, ‘control_1’, ‘control_2’). The file must contain at least two conditions (e.g.: ‘wild_type and ‘control’) and two samples per condition.
The first column contains lipid molecular species. The lipid subclasses recognised by BioPAN are provided in the table below.
Columns 2 to n contain molecular concentration quantification.
Abbreviation in BioPAN |
Subclass in LIPID MAPS |
MG |
|
DG |
|
TG |
|
PA |
|
PC |
|
PE |
|
PS |
|
PG |
|
PI |
|
PIP |
|
PIP2 |
|
PIP3 |
|
LPA |
|
LPC |
|
LPE |
|
LPS |
|
LPG |
|
LPI |
|
O-DG |
|
O-PA |
|
O-PC |
|
O-PE |
|
O-LPA |
|
O-LPC |
|
O-LPE |
|
P-PC |
|
P-PE |
|
P-LPA |
|
P-LPC |
|
P-LPE |
|
CL |
|
Cer |
|
dhCer |
|
SM |
|
dhSM |
|
SPB |
|
dhSPB |
|
SPBP |
|
dhSPBP |
|
FA |
|
FaCoA |
Note
The subclass
term is used to refer to the lipid names in the above table and the molecular species
one is used to refer
to the complex subclass/structure. The structure corresponds to the number of carbons and bonds separated by a colon.
For example the ‘DG’ subclass and the ‘DG 30:0’ molecular species.
Following is an example of an input file:
sample,condition1_1,condition1_2,condition2_1,condition2_2
FaCoA(16:1),87.86,146.49,1041.98,1082.65
FaCoA(17:1),71.98,132.04,354.17,399.86
FA(16:0),25006.43,25477.98,32934.67,33290.69
FA(18:0),19293.23,21057.76,19763.06,20423.42
MG(16:0),28.27,29.62,68.35,66.21
MG(16:1),2.19,2.77,5.00,4.78
MG(18:0),61.31,66.28,92.66,94.77
MG(18:1),67.79,60.46,86.52,91.67
MG(18:2),26.96,24.59,36.34,30.88
MG(20:4),0.53,0.84,2.59,4.95
MG(22:6),0.17,0.49,0.53,0.15
DG(30:0),206.01,159.52,497.23,550.29
DG(30:1),109.54,93.64,246.66,228.54
DG(32:0),1539.57,1764.44,3237.65,3941.12
DG(32:1),1954.26,1976.83,2972.57,2845.93
DG(32:2),575.92,632.22,1216.70,1114.80
DG(34:0),98.05,118.36,649.18,910.87
TG(46:3),1.69,4.28,1.77,0.85
TG(46:4),0.19,0.52,0.27,0.16
TG(48:0),4.72,16.70,11.85,4.09
TG(48:1),30.94,65.80,40.25,21.59
TG(48:2),36.42,79.39,40.63,19.11
TG(48:3),16.73,41.24,14.45,6.31
O-PC(32:0),13.99,18.52,35.82,21.02
O-PC(32:1),10.25,10.70,25.98,12.10
It is also possible to download the demonstration files 1 to understand the required structure.
Note
After loading a dataset, LipidLynxX 2 is launched in the background. LipidLynxX takes the input file and converts the nomenclature of the lipid molecular species to BioPAN nomenclature. It will also equalise the structure of the lipid molecular species to the Bulk level if it not already (e.g.: PE(34:1)).
Warning
Currently, BioPAN:
Does not analyse oxidised lipids.
Only consider the sphingoid bases with 18 carbons and 0 (dhCer/dhSM) or 1 double bond (Cer/SM) for the Sphingolipids. For example: it means that the lipids molecular species ‘Cer 24:0’ corresponds to ‘Cer(d18:1/24:0)’ and ‘dhCer 20:4’ corresponds to ‘Cer(d18:0/20:4)’ (same for SM/dhSM).
Data summary¶
The data summary table summarises the data extracted from the imported file:
Unrecognised species: molecular species whose subclass is not recognised by BioPAN. The recognised lipid subclasses are available in the table from the section How to prepare your data.
Processed species: molecular species that are involved in at least one reaction (read more: Which lipids and reactions appeared on the graph?).
Unprocessed species: molecular species that are not involved in any reactions.
It is possible to see the list of molecular species “unrecognised”, “processed” and “unprocessed” by clicking on the associated numbers. For the categories “processed” and “unprocessed”, you can also visualise the nomenclature used by BioPAN from your input dataset.
Note
A distinction is made between fatty acids
and lipids
because BioPAN stores reactions in a different way. It stores the reactions at the level of the molecular species for FA (example FA (16: 0) -> FA (16: 1)) and at the level of the subclass for lipids (example DG -> PA).