Extraneous Lines in `samples.txt` by RyanBerger98 · Pull Request #13 · bioforensics/ezfastq

RyanBerger98 · 2026-04-02T12:45:30Z

This PR fixes a recently encountered bug in which repeated runs add empty lines to samples.txt.

When ezfastq runs, it automatically generates a samples.txt file containing the sample names of all the files that were copied over. If ezfastq is run multiple times with the same working directory, it appends sample names to this file rather than overwriting it, and it avoids writing duplicate sample names. However, if ezfastq is run multiple times with the same samples, empty lines are appended to the file.

Closes #12

RyanBerger98

@standage this is ready for review!

RyanBerger98 · 2026-04-02T12:52:45Z

+    if len(added_samples) > 0:
+        with open(workdir / "samples.txt", "a") as fh:
+            print(*added_samples, sep="\n", file=fh)


This is the fix: only open samples.txt to write sample names if at least one sample was copied over.
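The likely source of the empty lines is that `print` still writes its line terminator when it receives no positional arguments, so unpacking an empty list appends a single blank line. The sketch below illustrates this with an in-memory buffer standing in for samples.txt. `append_samples` and its `guard` flag are hypothetical names used only for this illustration; they are not part of ezfastq.

```python
import io


def append_samples(fh, added_samples, guard=True):
    """Append sample names to an open file handle, one per line.

    Hypothetical helper for illustration only; not part of ezfastq.
    """
    if guard and len(added_samples) == 0:
        return  # nothing new to record, so leave the file untouched
    print(*added_samples, sep="\n", file=fh)


# Without the guard, an empty list still emits a newline.
unguarded = io.StringIO("test1\ntest2\ntest3\n")
unguarded.seek(0, io.SEEK_END)  # emulate append mode
append_samples(unguarded, [], guard=False)

# With the guard, the contents are unchanged.
guarded = io.StringIO("test1\ntest2\ntest3\n")
guarded.seek(0, io.SEEK_END)
append_samples(guarded, [], guard=True)
```

Here `unguarded` ends with an extra empty line, while `guarded` keeps exactly the three original sample names.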

RyanBerger98 · 2026-04-02T12:53:07Z

+def test_duplicate_samples(tmp_path):
+    seq_path = files("ezfastq") / "tests" / "data" / "flat"
+    arglist = [seq_path, "test1", "test2", "test3", "--workdir", tmp_path]
+    cli.main(arglist)
+    with open(tmp_path / "samples.txt", "r") as fh:
+        num_lines = len(fh.readlines())
+        assert num_lines == 3
+    cli.main(arglist)
+    with open(tmp_path / "samples.txt", "r") as fh:
+        num_lines = len(fh.readlines())
+        assert num_lines == 3


Regression test. This fails on the main branch but passes here.

RyanBerger98

@standage now this is ready for review

RyanBerger98 · 2026-04-02T16:56:24Z

+                if line.strip():
+                    old_name, new_name = cls.parse_name(line, sep="\t")
+                    name_map[old_name] = new_name


The fix for #12: check that the line still has content after whitespace is stripped, so blank lines are skipped instead of being passed to parse_name.
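As a rough illustration of this check, the sketch below builds a name map from tab-separated lines while skipping blank or whitespace-only lines. `parse_name_map` and the standalone `parse_name` here are hypothetical stand-ins; the real `cls.parse_name` in ezfastq may behave differently. In particular, mapping a single-field line to itself is an assumption made for this example.

```python
def parse_name(line, sep="\t"):
    """Split a line into (old_name, new_name).

    Hypothetical stand-in for ezfastq's parse_name. Assumes a line with a
    single field maps the name to itself.
    """
    fields = line.strip().split(sep)
    old_name = fields[0]
    new_name = fields[1] if len(fields) > 1 else old_name
    return old_name, new_name


def parse_name_map(lines, sep="\t"):
    """Build a mapping of old to new sample names, ignoring blank lines."""
    name_map = {}
    for line in lines:
        if line.strip():  # skip empty and whitespace-only lines
            old_name, new_name = parse_name(line, sep=sep)
            name_map[old_name] = new_name
    return name_map


lines = ["test1\tsampleA\n", "\n", "test2\tsampleB\n", "   \n"]
name_map = parse_name_map(lines)
```

Only the two non-blank lines contribute entries to `name_map`.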

RyanBerger98 · 2026-04-02T16:57:13Z

@@ -86,3 +87,16 @@ def test_fq_command(tmp_path):
    arglist = ["ezfastq", seq_path, "test1", "test2", "test3", "--workdir", tmp_path]
    run(arglist)
    assert len(list((tmp_path / "seq").glob("*_R?.fastq.gz"))) == 6


Updated the test to ensure that empty lines in the input sample file don't break ezfastq.

standage

LGTM, thanks!

implemented fix; updated tests

a71a351

RyanBerger98 requested a review from standage April 2, 2026 12:53

fix for reading empty lines

0937f14

standage approved these changes Apr 3, 2026

standage merged commit 2bc7267 into main Apr 3, 2026
4 checks passed

standage deleted the duplicate_samples_txt branch April 3, 2026 17:39

standage merged 2 commits into main from duplicate_samples_txt
