Did you hear that existentialist percussion piece? It was highly cymballic.
We have previously discussed arrhythmic music in reference to ambient and minimalist compositions. There are many other types of music without a pulse, such as noise, ambient, and drone music. These fall either in the realm of outliers, or serve the same purpose as minimalism, and can be analyzed similarly as the previously mentioned works.
So, we come to a fruitful realm of neurological study, namely how the brain responds to rhythm. Life itself happens rhythmically, and some sort of rhythmic action or resonance can be found within the basic functions of all life on earth. Advanced beat perception is a direct indicator of higher brain function, in particular how a nervous system relates auditory rhythmic stimuli with locomotion. In other words, very few species can dance.
In scientific parlance, synchronizing motor processes with auditory cues is called entrainment. When you tap your hand along with a metronome, for example, you have entrained to a repetitive auditory stimulus. The word comes from literally stepping onto a train from a stationary platform. To remember its meaning initially, I imagined the train cars as a beat clacking by, then stepping onto the train as a way to synchronized with that steady movement.

Entrainment with a beat is difficult within the animal kingdom, for reasons still not entirely understood – but there are some well-researched clues. In this excellent and succinct presentation by Drs. Aniruddh Patel and Ola Ozernov-Palchik, they postulate that vocal mimicry is the key to whether an organism is capable of the kind of temporal perception/anticipation necessary for accurate rhythmic entrainment. Their research shows a strong connection between aptitude at reading, speaking, and rhythmic discrimination in children. After all, language itself is temporal, and processing the time-information of speech is inherent to understanding what has been said.
Researchers in the last few years were a little surprised to discover that non-human primates are pretty bad at rhythmic entrainment to music. Even those that are capable of it take a long time to learn the skill, and tend to always be a little a late. We think this is due to a lack of neural motor region coupling to auditory regions, meaning these animals can only judge a beat by determining the interval between pulses, instead of predicting/anticipating like humans do.
Also, accurate beat processing may require a vocal mimicking or learning ability. This means that animals born with only innate vocalizations, and do not obtain more throughout their lives by mimicking what they hear, will be unable to accurately entrain movement to music.
The most prominent non-human vocal learners in nature are songbirds, which have quite adorably been proven to be able to spontaneously dance to unfamiliar music at varying beats. An example is Snowball the Dancing Cockatoo, who deservedly went viral a few years ago dancing to the Backstreet Boys. Snowball caught the attention of Dr. Patel, thus beginning a lively ongoing discussion as to what animals can and can’t perform this feat, and why or why not.
The list of dancing animals is currently humans, songbirds, elephants, and most recently sea lions. Bonobos are an interesting soft exception, which you can read more about in this article, Beasts That Keep the Beat. There’s also a little more here regarding the relationship between motor anticipation, motor learning, temporal perception, and social engagement, and how these factors relate to beat processing skill in various species.
These points are important, because they represent direct and testable connections of how rhythmicity and beat processing is acheived in the mammalian brain. The emerging story is that, while musicality has long been associated with neurologic linguistic processes [1][2][3][4], it is by no means the full story. Music activates very deep brain structures that are not all tied to language, but rather only tied to language in part.
For now, we can summarize human rhythm perception as relating to vocally-motivated temporal discrimination, motor planning and execution, and social engagement. Each of this elements can exist separately from one another in non-human animal species, but exist intertwined in a, dare I say it, cognitive dance in the human brain.
Social Drumming
As was mentioned previously in this series, drum circles have a long and well-studied history of helping foster positive communal feelings, and have been shown to help with such issues as at-risk behaviors, depression, anxiety, and addiction. The history of group drumming and dancing touches virtually every culture on Earth, long before the advent of modern psychology, and were often considered to be explicitly therapeutic or cleansing in nature.
As I’ve also mentioned, while we know communal dancing and hand drumming have therapeutic effects for many individuals and communities, we don’t know how or why that particular ritual differs from, say, group discussion, eating, or other communal events. What is it about the act of rhythmic communal cooperation that works so effectively? I’ve not been able to find a single rigorous controlled study that distinguishes why (or if!) the rhythmic nature of such communal therapies is necessary for why they work. But we can infer some of that relationship by starting with what’s called the auditory brainstem response. This is a phenomenon in which we can directly correlate auditory stimulus with corresponding electric signals in the brainstem. The relationship is electrically direct, and, in my opinion, totally awesome.

Pay particular attention to the example labeled Piano Melody. Notice that the electric impulses in the brainstem line up perfectly with the steady pulse of the piano notes. Such synchronized impulses mark the beginning of a vast connection to many centers of the brain, as laid out in this paper by our trusty friend, Dr. Thaut, pictured below for no reason.

I can in no way completely map the path of a stimulus from the moment it travels through the cochlear mechanism into the brainstem in a humble blog post such as this. I will, however, mention some key points about the process.
The Auditory Brainstem Response
After synchronized pulses to auditory stimuli occur in the brainstem, electric signals travel on to regions such as the spinal cord, the subcortex and cortex, strongly interacting with the motor system. The brainstem itself regulates most of the basic cyclical body functions, such as cardiac and respiratory processes. It’s also pivotal in maintaining consciousness and regulating the sleep cycle [1].
The Chanda/Levitin paper lists several ways we know the tempo of music affects our physiology. Like so many other studies, they could find no direct neurochemical relationship to “relaxing music,” they did find strong correlation between pulse tempo and rhythmic bodily processes.
“These effects are largely mediated by tempo: slow music and musical pauses are associated with a decrease in heart rate, respiration and blood pressure, and faster music with increases in these parameters. This follows given that brainstem neurons tend to fire synchronously with tempo.”
“The Neurochemistry of Music, Chanda, Levitin, 2013
Heart rate and breathing are directly related to our emotional state at a given time, not only for speed but also regularity. Unfortunately, much of the texts related to are pseudoscience, particularly with regards to treatment, which coincidentally often relates to making money. However, some highly regarded researchers including Bessel van der Kolk have observed that PTSD is strongly correlated to heart rate and breathing variability, resilience, and coherence. The Chanda/Levitin paper demonstrates that these metrics are directly regulated by rhythmic musical stimulus. In other words, cultures have been utilizing rhythmic music to regulate physiological, neurological, and neurochemical stress responses for many, many years via group drumming as a form of therapy.
Also worth mentioning here is a type of rhythmic therapy called EMDR, which uses eye movements or knee/thigh tapping to impose repetitive bilateral distraction on neural processes. Again, as is common with such things, the concept has been appropriated by pseudoscience, quackery, and exploitative monetization. But there is a growing body of sound evidence showing its efficacy, although how and why it works is still not fully understood [1][2][3].
EMDR stands for “Eye Movement Desensitization Reprocessing.” In it, the therapist moves their fingers back and forth in a steady rhythm while the patient follows this with their eyes. This has been shown to reduce the amount of stress a person feels while recounting traumatic memories. Tapping refers to repetitively tapping on the body, usually the knees or thighs, to achieve a similar goal. I spent some time watching various videos of EMDR and tapping sessions, such as this EMDR one from Dr. Jamie Marich and this resource tapping one from Dr. Laura Parnell (a major proponent of tapping-style EMDR).
I paid a lot of attention to the tempi used in these demonstrations, clocking the rates as within a narrow range usually hovering around 85-95 BPM. Dr. Marich completes one cycle (back-and-forth) at about 88-90 BPM, while Dr. Parnell is slightly faster, at about 90-94 BPM. This makes sense, since using two hands to tap is more efficient than using one oscillating hand. The slowest tempo I could find was about 70 BPM, and none went faster than 100 BPM. So the tempo is always slow, which as we know can help keep things like heart rate and breathing down. I can only imagine it would be stressful if someone started hammering on your knees at 160 BPM and asked you to recall particular memories. Woefully, I can’t find much rigorous literature discussing the different effects various speeds have on this technique.
The crux of this technique and how it functions is called bilateral stimulation. This means continually activating both hemispheres of the brain using a left-right alternating pattern. Eye movements are a very easy way to activate different regions of the brain, which is (partly/mostly/largely) why they happen as we dream in REM sleep [1][2].
Bilateral tactile sensations have a similar effect to moving the eyes back and forth. If you tap on your left leg with your left hand, and your right leg with your right, alternating at a steady pace, it will also alternate hemisphere activation. There is some evidence that this can effectively reduce the stress response as well. All of which is summarized in this quote from Dr. Robert Strickgold:
“We propose that the repetitive redirecting of attention in EMDR induces a neurobiological state, similar to that of REM sleep, which is optimally configured to support the cortical integration of traumatic memories into general semantic networks.”
EMDR: A putative neurobiological mechanism of action, Strickgold, 2001
The repetitive distraction induces a dreamlike state, is what he’s saying. Remember all that talk about attention earlier in this blog series? Contemporary psychotherapists are developing a verified method of neural engagement that denies the brain the resources it needs to drum up (har har) fearful or anxious thoughts. To reference the loudness paper again, takes up the neural space that might otherwise promote negative arousal.
There are many, many studies on the ability of repetitive drumming, whether listening, dancing, or playing, to induce a trance-like state [1][2][3] that is similar to hypnosis, all of which falls under temporary, non-pathological dissociation as a form of self-regulation. Which is a hell of a sentence.
“Music emerges as a particularly versatile facilitator of dissociative experience because of its semantic ambiguity, portability, and the variety of ways in which it may mediate perception, so facilitating an altered relationship to self and environment.”
An empirical study of normative dissociation in musical and non-musical everyday life experiences, Herbert, 2011
Now, imagine hand drumming in a circle of friends, colleagues, similarly trained musicians, communally associated peers, or whatever. Hand drumming, by definition, is a self-controlled rhythmic bilateral stimulation in which the player’s hands tactically engage in a steady alternating pattern. This regulates the player’s nervous system by entrainment, which is also synchronous with the surrounding players and audience. One can visually confirm this by observing head bobbing, side-to-side swaying, foot-tapping, etc. in surrounding participants, all of which are also bilateral movements. I can’t find any good papers specifically about how this confirmation of social synchronization affects the brain, which is surprising. Human social behavior is deeply rooted in our brain [1][2][3], so it seems like such a study would be a… er… no-brainer. But one can at least guess that this process involves temporal anticipation/prediction/discrimination, as well as motor planning and execution, all confirmed visually, aurally, and thus socially, which cannot be achieved by similar but arrythmic activities.
The point is that we are already using hidden versions of engaging and distracting rhythms during psychotherapy as a way to deal with overwhelming stress and other negative emotions, memories, and tendencies – a contemporary refinement of what we were already doing for millennia. I personally wonder if nervous rhythmic actions that produce sound, like idle/anxious knee or desk tapping for example, fits into this picture somewhere.
All this suffices to say that rhythmicity in music arises from a deeply innate human architecture, the so-called glue that binds together the vast majority of human-organized sound.
Types of Rhythm
I’ll now switch track to discuss an infinitesimally small pool of examples showing the various ways rhythm arises within music. Rhythm in a sonic work can be explicitly percussive – i.e. in drum ensembles – such as in an Ewe, samba, or taiko ensemble. You’ll also find it in solo pieces this one for Korean jang-go, as well as this contemporary percussion composition in the raga style, Piru Bole. However, rhythm exists much more than percussion music. Rather, it’s a necessary element in anything with a meter or pulse, whether or not it’s actually percussive in timbre. This makes the effect of rhythm on the brain extremely diverse. Let’s try and characterize some examples using the cake method.
In the above example, we hear a rhythm played via rock drum kit solo. It possesses no pitched or harmonic content. The tempo is very fast, accelerating to around 150 BPM, and contains rapid subdivision and snare rolls. The timbre is obviously percussive, utilizing the full range of the kit, creating a wide spectral texture from the bass drum to the cymbal crashes.
Something rather surprising is that no existing scientific study I can find relates brain activity to drum solo listening. The reason is probably that a drum kit is actually a collection of instruments played simultaneously, making it too difficult to extrapolate meaningful data. Instead, researchers tend to look at rhythmic sounds one at time. This presents the problem of deciding which texture to use when studying rhythm. There’s probably a big difference between how the brain receives the same beat pattern depending on whether it’s middle C on a piano, a sine tone, a metronome click, a bass drum, or a crash cymbal. Indeed, simply the volume of a kick drum has an effect on bodily entrainment. This is is why Garth’s solo up there, as it currently stands, is completely outside of the realm of existing research.
Rhythmic content is fast and complex, accelerando to ~150 BPM. No pitched, melodic, tonal, or harmonic content. Standard drumkit timbres with full spectral range. Quite loud. Structure is through-composed freestyle solo, short in overall length. Live improvisatory rock context.
The first ~1:40 of this track is an arpeggiated melodic line generated with a Roland TB-303. Despite having no sounds that are traditionally percussive, the rapid pace, fast-attack wave shapes, and jumps between low and high pitches creative a highly rhythmic, dancey feel without the need for synthesized drums. Which, of course, makes the composition all the more effective when the drums do come in, then go out, then come in again several times.
Rhythmic content is rapid 4-to-the-floor dance beat. Melody is repetitive, fast, and arpeggiated, containing large leaps, making non- or nearly non-singable. Harmonic content is enharmonic slow, soft pads. All timbres are computer generated/synthesized in the style of acid house techno. Dynamics alternate between quieter sections and loud beat sections, which play out over an unusually long runtime of 16 minutes. Club/raver/electronic dance tradition.
The second movement of Ravel’s sole string quartet composition is a really great example of why music selection is so important when conducting research. The beginning section is highly rhythmic and exciting, featuring constant pizzicato techniques and arpeggiation, just like the previous acid techno track. It also counts as a classical composition and employs so called “relaxing” timbres, namely, strings. When a research paper says it used classical music as a relaxing control, what if this piece snuck its way in? Compare this to Adagio for Strings by Barber. If we don’t know the tempo, timbre, or general feel of each individual piece, both might be characterized as similar in a research environment, although I can guarantee both would activate quite differently in the brain. Indeed, this is a perfect example of why each aspect of a musical selection must be indicated in research, because, generally, this track shares major similarities in some ways with the acid techno track, and other major similarities with Adagio for Strings, but all three pieces would be received quite differently by a listener.
Focusing only on the first movement: Highly rhythmic, achieved via constant pizzicato eight, sixteenth, and 32nd notes. Highly melodic, recurring singable theme. Enharmonic key structure with Euroclassical modulations achieved mostly with arpeggiations between players. Timbres are full-spectrum pizzicato and arco strings as in a traditional quartet. Structure, dynamics, and context are variable as is consistent with the Impressionist Eurotraditional artistic epoch.
Percussive melodic instruments such as the marimba demonstrated above, as well as the xylophone, gamelan, or vibraphone (to name only a few) are very pure combinations of rhythm and pitch. They function by producing a tone with a relatively pure timbre (i.e. closer to a sine wave) and seem to be used about as often as metronome clicks when studying beat processing. I can’t find a study that looks at the difference between pitch and unpitched perception along this axis. Which is weird.
Rhythmic content is fast and steady, highly melodic, enharmonic Eurotraditional key structure, wooden malletophonic timbre and spectral range, relatively consistent medium volume, Baroque tradition.
And so, with Steve Reich’s Clapping Music, we finally venture into the realm of postminimalism and phasing music. We’ll get slightly more into what that means in a second. But I want to focus on this particular piece because it’s another really good example of the difficulty of music selection without being very specific. First of all, this is a highly rhythmic piece with no vocals or pitched content. However, it also very clearly has an human element. The brain may react completely differently when it can tell a percussive sound comes from a human rather than a manufactured instrument, in the same way the brain responds completely differently to vocal sounds compared to otherwise similar sound content.
Instead of breaking it down, let’s instead take a look at how different this piece sounds based on the recording, like this fast-paced ensemble version of Clapping Music. It sounds completely different from this version. Or look at Evelyn Glennie performing it on woodblocks, or these super cool jugglers playing it slowly with bouncy balls, or, my favorite, actress Angie Dickinson performing the piece in the 1967 film Point Blank. They’re all the same piece, but some feature extremely different tempi and timbres, likely provoking strong variations in the neurological listening experience. So, even if a researcher actually mentions the name of the composition (which they don’t always do, at all), unless we know which specific recording of that piece is used, we still can’t rely on any data presented.
Postminimalism
We began with Cage’s silent music, then added arrhythmic, meditative harmonies (with some naturalistic textures) to build a minimalist composition. Now, by adding a rhythmic element to minimalist practices, we create the genre (or subcategory) known as postminimalism, also sometimes called phasing music.
In most minimalist music, just like the Pisaro piece or any of the ambient works, slow harmonies progress over a long period of time. In postminimalism, the same process occurs but with an added pulse, and usually short snippets of interlacing melodies that phase in and out. The intro section of Steve Reich‘s Electric Counterpoint is a really perfect example of how this compositional technique arises directly from a true minimalist approach.
The introductory choppy chords eventually move into melodic snippets that fade (or phase) in and out, giving sensation of movement while retaining the slow-moving harmonic structure valued by minimalism. However, by avoiding any longform, easily singable melodies (short, quick notes with large leaps) we avoid true ostinati/thematic content like in other genres.
As a note, if you read descriptions in the links, you’ll notice that the term postminimalism is sometimes lumped in with minimalism, though in contemporary music practice this is considered inaccurate, or perhaps just plain old lazy.
I am very personally interested in the difference between rhythm as accorded by something like Electric Counterpoint vs. a West African Ewe ensemble or Garth’s drum solo up there. Both contain strong rhythmicity, but one comes only from pitched/chordal content and the pulse is very clear, i.e. comprised entirely of eighth notes and eighth rests. The others are unpitched and feature far more complexly interlaced syncopation. Would there be some marriage of stimulus in the Reich piece that we wouldn’t see in the unpitched one? Is the strength of response different by the nature of how rhythmicity is achieved in one or the other? Or the spectral range or waveshape of either? Does the harmonic content neurally allow one to be more repetitive than the other before boredom sets in?
Absolutely none of these questions have been addressed in any research I can find. I think it has a lot to do with the fact that the above questions don’t directly relate to rehabilitation of disease/disorder, and are rather purely theoretical. But still, I can’t imagine having that kind of information would tangentially inform therapeutic musical neuroscience research. Anyway.
Every time we add a new element to our cake, it begins to look more like a cake. Cakeness is subjective, but few would look at Electric Counterpoint and deny that it is a piece of music, which some actually might about the Pisaro piece, and many still do with 4’33”.
Now that the overall point of this exercise is becoming more clear, and just because I love postminimalim, I’ll share some examples of postminimalist recordings below, all of which have similar properties. If you were to characterize each using the elements of music previously described, how would they differ between each work? What would be similarly or identically described?
Summary
If lazily described, much of the above music might sound identical to one another – “percussive music,” “rhythmic music,” “fast music,” “electronic music,” or “classical music” as examples. However, I hope it’s growing clear just how variable a listener’s experience of each recording might be. Koyaanisqatsi, for example, is often called minimalist because its tempo is so much slower than the other two examples. But this piece contains more musical elements than the Pisaro. It has a strong rhythmic pulse, a wide-ranging spectral character, and far more timbral information, including the game-changing element of human voices. Even without taking individual bias and tastes into consideration (like if someone happened to know and like the film, for example), the two pieces will objectively have a quite different neurological effect on the listener despite a casual description running the risk of lumping them together as the same.
In the next section, we will discuss pitch, melody, and harmony in greater depth, the latter of which will lead to the discussion of structure in music. As this will lead us squarely into the territory of folk and pop music, instead of a parting image, I leave you with a really chill remix of Reich’s Electric Counterpoint by RJD2 from the album Deadringer. Also feel free to enjoy Jonny Greenwood of Radiohead performing another version of the same piece.