The repetitive nature and complexity of some medically relevant genes poses a challenge for their accurate analysis in a clinical setting. The Genome in a Bottle Consortium has provided variant benchmark sets, but these exclude nearly four hundred medically relevant genes due to their repetitiveness or polymorphic complexity. Here we characterize 273 of these 395 challenging autosomal genes using a haplotype-resolved whole-genome assembly. This curated benchmark reports over 17,000 single nucleotide variations, 3,600 INDELs, and 200 structural variations each for human genome reference GRCh37 and GRCh38 across HG002. We show that false duplications in either GRCh37 or GRCh38 result in reference-specific, missed variants for short- and long-read technologies in medically relevant genes including
CBS
,
CRYAA
, and
KCNE1
. When masking these false duplications, variant recall can improve from 8% to 100%. Forming benchmarks from a haplotype-resolved whole-genome assembly may become a prototype for future benchmarks covering the whole genome.
Abstract:We examined the hypothesis that learning to write Chinese characters influences the brain's reading network for characters. Students from a college Chinese class learned 30 characters in a characterwriting condition and 30 characters in a pinyin-writing condition. After learning, functional magnetic resonance imaging collected during passive viewing showed different networks for reading Chinese characters and English words, suggesting accommodation to the demands of the new writing system through shortterm learning. Beyond these expected differences, we found specific effects of character writing in greater activation (relative to pinyin writing) in bilateral superior parietal lobules and bilateral lingual gyri in both a lexical decision and an implicit writing task. These findings suggest that character writing establishes a higher quality representation of the visual-spatial structure of the character and its orthography. We found a greater involvement of bilateral sensori-motor cortex (SMC) for character-writing trained characters than pinyin-writing trained characters in the lexical decision task, suggesting that learning by doing invokes greater interaction with sensori-motor information during character recognition. Furthermore, we found a correlation of recognition accuracy with activation in right superior parietal lobule, right lingual gyrus, and left SMC, suggesting that these areas support the facilitative effect character writing has on reading. Finally, consistent with previous behavioral studies, we found character-writing training facilitates connections with semantics by producing greater activation in bilateral middle temporal gyri, whereas pinyin-writing training facilitates connections with phonology by producing greater activation in right inferior frontal gyrus.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.