Pre-processing data

The purpose of pre-processing is to improve the overall quality of the data. The raw data contains information, such as foreground and background signals as well as quality statistics, about all the spots detected on an array. The quality of the spots may vary. Information from poor quality spots is hard to trust and should therefore be removed from the dataset. This can be done by passing the raw data through different filters. There may also be errors associated with the data, including systematic errors from the technical or experimental procedures. Systematic errors can be reduced by normalising the data. Normalisation is therefore an important step in pre-processing data.

  1. Select an array by clicking on a Array
  2. Click the Process tab
  3. Filter and normalisation procedures are added by clicking on the Add Process button and choosing the wanted process. The raw data files decide which filters you can apply. This may vary from different formats and different array platforms. To get an idea of the possibilities, it is a good idea to make a copy of one of the data files and then open the copy in e.g. Excel. This will allow you to get familiar with the different columns available, as well as the type of values they contain. If you for instance want to remove control spots, you must find out which column contains this information and use the names for the filter. Suggestions for filtering steps for different arrayplatforms can be found here:
  4. To see what the different filtering and normalisation methods does to the data, it is always good to have a plot

    Preprocessing batch

  5. Save the process batch when you are happy with it. This way you can load it later to make changes or use it as it is on later datasets. It saves you from having to do all the work again later.
  6. The processes you have added have currently only been added to the sample you selected before clicking the Process tab. Click the Copy to all button to copy the processing steps to the other arrays.
  7. You have now nearly finished all pre-processing. There is a User Info and Post Compilation tab.
  8. When you are satisfied with the sample setup and filters press Compile.