2. If there are samples from previous sequencing runs to be included in this analysis, the FASTQ for these samples needs to be in the `$READS_DIR/$project_id` folder.
```
mkdir $READS_DIR/$project_id
cd $READS_DIR
```
Check to see if the reads have already been merged and are available in a previous run's folder. For each sample:
```
find . -name '<sample_id>'
cp <prev_project_id>/*<sample_id>* $project_id/
```
If not found, look in the original data folder and copy the original files.
Move to the project reads folder and rename the files to match the `<sample_id>_<R[12]>.gz` pattern if there is only one file per read end. If there are two files per read end, merge the files with the given script.
3. In the params folder, create the symlinks to the reads and the bcbio configuration files. If specifying a common sample suffix, ensure it includes any joining characters, e.g. “-“ or “_”, so that the family identifier can be cleanly separated from the suffix. Get the number of families from the batch.