Input file must be in CSV format and packaged as a zip file.
The input file consists of two columns named "cdr3_aa" and "count" respectively.
The "cdr3_aa" column represents the type of TCR cdr3 aa sequences.
The "count" column represents the number of each sequence, and the sum should be 30,000.
The input must be from a lung nodule patient.