Introduction. Binaural beats (BBs) are phantom sound illusions perceived when two sounds of slightly different frequencies are separately transmitted to the ears. It is suggested that some BB frequencies might entrain the brain and enhance certain cognitive functions such as working memory or attention. Nevertheless, studies in this regard are very scarce, quite controversial, and merely covering a very small portion of this vast field of research (e.g., testing only a few BB frequencies), not to mention adopting some limited methodologies (e.g., no assessment of the loudness of the BB sound, adopting only between-subject analyses, and testing only one perceptual modality). Hence, we aimed to assess the potential effects of alpha, beta, and gamma BBs on cognitive-behavioral parameters of working memory and attention examined simultaneously in two different modalities (visuospatial and auditory-verbal). Methods. This within-subject five-arm randomized placebo-controlled clinical trial included 155 trials in 31 healthy right-handed subjects (17 women, 14 men,
30.84
±
6.16
years old). Each subject listened to 8-minute sessions of 10 Hz, 16 Hz, and 40 Hz binaural beats versus 240 Hz pure tone and silence (in random orders). In each 8-minute block, they played a dual 2-back task with feedback enabled. Their cognitive-behavioral parameters (working memory capacities, signal detection measures (hit rate, false alarm rate, sensitivity, and response bias), and reaction speed measures (response time and intrasubject response time variability)) were calculated. The effects of the sound interventions and short-term training on these working memory and attention measures were assessed statistically using mixed-model linear regressions, repeated-measures ANOVAs and ANCOVAs, Bonferroni post hoc tests, and one-sample
t
-tests (
α
=
0.05
). Results. The following are some major statistically significant findings (
P
≤
0.05
): In the visuospatial modality, the 10 Hz BB reduced the response time and intrasubject response time variability and reduced the extent of decline over time in the case of visuospatial working memory, sensitivity, and hit rate. In the auditory-verbal modality, the 10 Hz intervention reduced the hit rate, false alarm rate, and sensitivity. The 10 Hz intervention also caused the lowest intermodality discrepancies in hit rates and false alarm rates, the highest response time discrepancies, and negative discrepancies in working memories and sensitivities (indicating the superiority of the visuospatial modality). The response biases tended to be liberal-to-neutral in the verbal modality and rather conservative in the visuospatial modality. Reactions were faster in the visuospatial modality than the auditory-verbal one, while the intrasubject variability of reaction times was smaller in the auditory-verbal modality. Short-term training can increase the hit rate, working memory, and sensitivity and can decrease the false alarm rate and response time. Aging and reduced sound intervention volume may slow down responses and increase the intrasubject variability of response time. Faster reactions might be correlated with greater hit rates, working memories, and sensitivities and also with lower false alarm rates. Conclusions. The 8-minute alpha-band binaural beat entrainment may have a few, slight enhancing effects within the visuospatial modality, but not in both modalities combined. Short-term training can improve working memory and some cognitive parameters of attention. Some BB interventions can affect the intermodality discrepancies. There may be differences between the two modalities in terms of the response speeds and intrasubject response time variabilities. Aging can slow down the response, while increasing the volume of audio interventions may accelerate it.