The RADAR project involves a collection of machine learning research thrusts that are integrated into a cognitive personal assistant. Progress is examined with a test developed to measure the impact of learning when used by a human user. Three conditions (conventional tools, Radar without learning, and Radar with learning) are evaluated in a large-scale, betweensubjects study. This paper describes the RADAR Test with a focus on test design, test harness development, experiment execution, and analysis. Results for the 1.1 version of Radar illustrate the measurement and diagnostic capability of the test. General lessons on such efforts are also discussed. Report Documentation PageForm Public reporting burden for the collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to Washington Headquarters Services, Directorate for Information Operations and Reports, 1215 Jefferson Davis Highway, Suite 1204, Arlington VA 22202-4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to a penalty for failing to comply with a collection of information if it does not display a currently valid OMB control number. OverviewThe RADAR (Reflective Agents with Distributed Adaptive Reasoning) project 1 within the DARPA PAL (Personalized Assistant that Learns) program is centered on research and development towards a personal cognitive assistant. The underlying scientific advances within the project are predominantly within the realm of machine learning (ML). These ML approaches are varied and the resulting technologies are diverse. As such, the integration result of this research effort, a system called Radar, is a multi-task machine learning system. Annual evaluation on the integrated system is a major theme for the RADAR project, and the PAL program as a whole. Furthermore, there is an explicit directive to keep the test consistent throughout the program. As such, considerable effort was devoted towards designing, implementing, and executing the evaluation. This document describes this process, protocol, and some of the results for the Radar 1.1 test. Note that this document is not centered on Radar features or the actual machine learning methods used.It is also important to note that the RADAR project differs from the bulk of its predecessors and its companion PAL program project, CALO 2 , in that humans are in the loop for both the learning and evaluation steps. Radar is trained by junior members of the team who are largely unfamiliar with ML methods. Generic human subjects are then recruited to use Radar while handling a simulated crisis in a conference planning domain. This allows concrete measurement of performance using a h...
Purpose To examine the two-year test-retest reliability of Immediate Post-concussion Assessment and Cognitive Testing (ImPACT) in a sample of collegiate athletes. Methods A total of 163 college athletes (63.8% female) completed two pre-season baseline assessments two years apart as mandated by their athletic programs. Participants completed the first baseline prior to their first year of competition (M age=18.38, M edu=12.13). Individuals with baselines flagged as invalid by ImPACT were excluded. No diagnosed concussions occurred between baseline assessments. Results Pearson r correlations for ImPACT composite scores between assessments ranged from .32 to .70. Paired samples t-tests indicated significant differences between verbal memory (t(162)=-4.61, p<.001, MD=-3.6) and visual-motor speed (t(162)=-4.10, p<.001, MD=-1.5) at time 1 and time 2. Intraclass correlation coefficients (ICCs) with a “two-way mixed” model and “consistency” type indicated higher reliability for all composite scores. Visual-motor speed (mean ICC=.83; .76–.87 95% confidence interval [CI]) was the most reliable composite score, followed by visual memory (.66; 95% CI .53-.75), impulse control (.62; 95% CI .48–.72), verbal memory (.58; 95% CI .43–.69), and reaction time (.49; 95% CI .30-.62). The total symptom score showed moderate reliability (.63; 95% CI .50–.73). Conclusion Overall, ImPACT composite and total symptom scores appear to maintain relative long-term stability in a collegiate sample across a two-year period. ImPACT composite scores showed moderate to good reliability, and total symptom scores showed moderate reliability. Results of this study corroborate previous research indicating moderately stable ImPACT scores across a 2-year test-retest period in collegiate athletes (Schatz, 2010).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.