fuzzyfastq-rs

Introduction

The fuzzyfastq is a command-line tool written in Rust to processes FASTQ files (including gzipped FASTQ files) to identify and count reads that match given nucleotide sequences. This tool supports mismatch tolerance, allowing users to specify the percentage of mismatches allowed in the sequence matching process. Results are reported as raw counts and a percentage of total reads. It is intended for determining presence of sequence components and does not take into account multiple sequence matches on a single read.

Features

Process standard and gzipped FASTQ files. Match sequences with an allowance for mismatches. Handle both direct sequence input and sequences provided in a CSV file.

Installation

git clone https://github.com/rnabioco/fuzzyfastq-rs

cd fuzzyfastq-rs

cargo install --path .

Usage

fuzzyfastq <mode> <sequence_or_path_to_csv> <fastq_directory> [mismatch_percentage]

The tool accepts the following command line arguments:

Mode (--seq or --csv): Specifies the mode of operation. --seq: Directly use a provided sequence. --csv: Use sequences from a specified CSV file.
Sequence or Path to CSV File: Depending on the mode, provide either a nucleotide sequence or the path to a CSV file containing sequences.
FASTQ Directory: Path to the directory containing FASTQ files. Input as FASTQ files (.fastq, .fq, .fastq.gz, or .fq.gz formats).
Mismatch Percentage (optional): The allowable mismatch percentage as a decimal (e.g., 0.1 for 10% mismatches). Defaults to 0 if not provided.

CSV format

#Name, Sequence
Barcode01,ATGCTACGCTAGCTACGTCAGTCGAT
Barcode02,TGCTCGCTAGTCGCATCGATCGATCG

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md
example_sequences.csv		example_sequences.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fuzzyfastq-rs

Introduction

Features

Installation

Usage

CSV format

About

Releases

Packages

Languages

rnabioco/fuzzyfastq-rs

Folders and files

Latest commit

History

Repository files navigation

fuzzyfastq-rs

Introduction

Features

Installation

Usage

CSV format

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages