Export mapped variants and add README for public data dump#728
Open
bencap wants to merge 4 commits into
Open
Conversation
4b3c99a to
a66674d
Compare
Emit a va/{urn}.va.ndjson file per mapped score set in the public data
export, one record per current mapped variant carrying its highest
materialized VA-Spec layer.
- add variant_highest_level_annotation to resolve the highest available
layer (pathogenicity > functional statement > study result), returning
None for variants without a post-mapped allele
- extract get_current_mapped_variants_for_annotation as the shared
eager-load source of truth for the annotated-variant endpoints and the
export, and route the three streaming routers through it
- document the va/ output, layer ladder, and the functional-evidence-only
caveat in the dump README
- cover the resolver with unit tests across the uncalibrated, functional,
pathogenicity, and unmapped cases
- Omit the score-calibration "Baseline score" extension when no baseline score exists. Extension.value is required, so a null value was stripped by model_dump(exclude_none=True) and the object no longer re-parsed through the VA-Spec models. This also corrects the API's VA-Spec streaming endpoints, which share the builder. - Gate dump annotation files on the presence of current mapped variants, so score sets whose mappings are all superseded no longer emit empty or stale annotation files. - Newline-terminate every NDJSON record to match the API streams and keep line-based consumers happy. - Add regression tests covering the baseline-score extension round-trip.
5d6e150 to
5c155f4
Compare
7 tasks
Coverage Report for CI Build 27652871621Coverage decreased (-0.2%) to 88.96%Details
Uncovered Changes
Coverage Regressions25 previously-covered lines in 4 files lost coverage.
Coverage Stats
💛 - Coveralls |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Reopens the erroneously closed #711.