Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rubicon #123

Merged
merged 33 commits into from
Mar 12, 2022
Merged

Rubicon #123

merged 33 commits into from
Mar 12, 2022

Conversation

DrYak
Copy link
Member

@DrYak DrYak commented Feb 26, 2022

I think that's about how much polishing we'll do on that upload feature for now.

Care to have a look at how it is organized?

uweschmitt and others added 27 commits February 10, 2022 22:13
 - coreutil on non-linux platforms (realpath, sha<nnn>sum, etc.)
 - optional inputs (dehuman, fasta checksums, etc.)
 - allow user to tailor the script to their needs
   e.g. uploading to an SFTP server, etc.
 - comment in indented rule block inside condition
 - reuse rejects from bwa/bowtie/ngshmmalign
 - option to redo a (bwa) align to rejects,
   WITHOUT forcing to reprocess everything else
 - "local" option in dehuman.smk
   (was using wrong config property)
 - "results" and "samples" are separate in common.smk
 - keep the linter happy
 - runs tests on upload.smk, too
 - lint Docker
 - ref_{xxx} fasta files now follows "consensus" property,
   like consensus.bcftools.fasta

 - RegEx expanded to cover more corner cases to convert
   Illumina index "1:N:0:..." into SAM comment "BC:Z:..."
   - support dual index: ATCG+CGAT
   - support single index: GATTA
   - support numerical indicator: 180
 - compressing unfiltered raw read into cram
 - use dehuman ressource for larger alignements
   (host, etc.)
 - options to exec in demo upload script
 - typos
 - unfiltered cram in tests
  - use temp_prefix-ed temporary files for sorting
  - keep only aligned raw reads in host_aln.cram
@github-actions
Copy link

github-actions bot commented Feb 26, 2022

MegaLinter status: ✅ SUCCESS

Descriptor Linter Files Fixed Errors Elapsed time
✅ BASH shellcheck 6 0 0.23s
✅ DOCKERFILE hadolint 1 0 0.39s
✅ MARKDOWN markdownlint 5 5 0 0.82s
✅ PERL perlcritic 1 0 1.05s
✅ PYTHON black 11 0 0 1.32s

See errors details in artifact MegaLinter reports on CI Job page

 - it is mandatory for some targets like GISAID
DrYak and others added 4 commits March 12, 2022 17:11
 - When _NOT_ in catchup mode, re-use the extracted_data/R?.fastq
   that is produced as part of the normal processing.
   (Note: unlike prinseq, the sorting isn't required for BWA)

 - Change text to "host" instead of "human" or "H. Sapiens"
 - Can use an URL for host genome (Homo Sapiens)
 - re-organising bwa indexing code to support this
 - can still crash in various circumstance, considered unstable
 - small cosmetic fixes (mostly uppercase and final dot)
 - sync HTML manual
@DrYak DrYak merged commit f730d88 into master Mar 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants