User Tools

Site Tools


data_matrix_expression_file

This is an old revision of the document!


Data matrix expression file

Data matrix expression file is a Tab-separated values file.

A tab-separated values (TSV) file is a simple text format for storing data in a tabular structure (e.g. database or spreadsheet data). Each record in the table is one line of the text file. Each field value of a record is separated from the next by a tab stop character. More about TSV...

This file has two columns if there is only one sample, and more than two columns if there are many samples. First line is a header and must contain the names of the samples. The first column correspond to gene, probe or protein Ids from:

  1. Ensembl gene
  2. HGNC symbol
  3. Entrez id
  4. Affy HG U133A probeset
  5. Affy HG U133B probeset
  6. Affy HG U133-PLUS-2 probeset
  7. Affy HTA 2.0

The next columns correspond to gene expression values in numeric format from each sample.

Here is an example of a file with only one example:

expression.txt
id	sampleName
1	0.3
2	1
3	0.7

And here is another example with more than one sample:

expression.txt
id	sample1	sample2	sample3
1	0.3	0.6	0.2
2	1	0.8	0.9
3	0.7	0.9	0.3
data_matrix_expression_file.1459953340.txt.gz · Last modified: 2017/05/24 15:27 (external edit)