The Origin of the European Folk Music Scale: A New Theory

Over a century ago, musicians noticed that European folk music seemed to exhibit certain uniform traits. They tried to analyse the music based upon the vast musical knowledge that they believed they had. The result was that folk music was described as tetratonic (four notes per octave), pentatonic (five notes per octave), or hexatonic (six notes per octave) because the music seemed to be “gapped” in some way. Musicologists also applied the ecclesiastical modal naming scheme,[i] which is a bit odd since that modal naming convention requires seven notes per octave, and the modes are defined by the precise placement of two half-steps; yet, European folk music is missing one (hexatonic) or both (tetratonic and pentatonic) half steps. All of these terms and approaches originate from the frame of reference of the diatonic scale, which demonstrates the characteristic of octave equivalency; that is, there are an equal number of notes per octave. If the terms used by musicologists to describe folk music don’t seem to be successful, could it be that the folk music scale is based on a non-diatonic scale? If so, how can we conjure up a solution from the undocumented musical tradition of more than a millennium ago to solve the problem?

I have attempted to answer this question by looking at the material culture available at that time. Music was mainly sung.[ii] Instruments were extremely rare.[iii] What instruments that did exist were those that assisted in the accomplishment of work. If you examine marginalized societies today where there are still shepherds working in relative isolation in the Carpathians (Russia/Ukraine), Poland, the Balkans, Estonia, Romania, Sweden, Norway, etc., you will still see them carrying an instrumental tool. It was made by taking a branch or the trunk of a small tree, splitting it, hollowing it out, putting the two halves back together secured with sap and birch-bark or osier (willow) rings. It was a wooden, valve-less, finger-hole-less trumpet; in short, it was a wooden shepherd’s trumpet. These shepherds’ trumpets had various names such as trambita, trâmbiţa, trembita (tрембіта), trombita, tulnic, trimitis, taures, torvi, touhitorvi, näverlur, neverlur, alphorn, vallehorn, karjapasun, bucium (from which we have in English “bugle”), etc. Such names are obvious cognates to modern forms such as trompeta (Spanish), tromba (Italian), trompette (French), trompete (German), “trumpet” and the earlier “trump” (English). These trumpets played the harmonic or overtone series (I prefer the term natural scale); most people are familiar with this from hearing bugle calls such as “Taps” and “Reveille.” 

So, what is the natural scale? When you pluck a string or buzz your lips on a tube, the resulting sound is actually a series of notes played at once. You’re hearing the frequency produced by the movement of, say, the whole length of a string on a violin displacing air when plucked. Yet the string is flexible, and there can be two, three, four, or more waves that can fit along the length of the string; those waves produce pitches as well. They are very faint on a stringed instrument, but on a wind instrument, you can blow harder to make one overtone louder than another and clearly hear it (that overtone is all that you hear, really). Here are the pitches that can be produced on an 8’ long shepherd’s trumpet (Figure 1, below). A “partial” is the number of whole waves that that fit within its tube.

Figure 1: The Natural Scale as it is Often Presented

The first thing that stands out is that the notes are gapped at the lower end (left side) and become closer together as the pitches ascend (to the right). The black notes are pitches that are very much different in intonation from the diatonic scale; arrows point in the direction the pitch sounds in relation to the equally tempered scale. The first partial is nearly impossible to play as are C2 and C3; however, G3, is attainable. After that, the lower pitches are much easier to play than the higher pitches, C4 perhaps being the easiest.

Imagine a transcriber in the past listening to a tune played on a shepherd’s trumpet. Envision a trumpeter playing the 7th partial and ending the tune end on C; the transcriber would not consider the pitch of B♭4 for the 7th partial since B♭4 is not in the key of C major. The choice would be between A4 and B♮4. Since the 7th partial is closer to A4, the transcriber would probably write A4 when it’s heard. Also, in the figure above, the 11th partial is written as F♯5; it could also be written as F♮5 since it is almost exactly halfway between the two notes. Again, given the choice, a transcriber would probably write F♮5 since F♯5 is not in the key of C major. The other possibility is to write F♯5 if the tune ended in G (or E), but this condition is easy to identify.

If the natural scale’s representation on the diatonic staff is adjusted with these concerns in mind, the playable notes might be represented as this (Figure 2, below):

Figure 2: The Natural Scale as Transcribers May Have Used It

It should be pointed out that although the natural scale looks unevenly placed on the figure above, it is in fact linear with respect to frequency; if available notes are plotted against those notes’ frequencies, the following results (see Figure 3, below):

Figure 3: The Natural Scale; Notes Available vs. Frequency (Cycles per Second)

The x-axis is the note value (here the partial number, but it is just the integer value of the notes available, in order); the corresponding frequency value in Hertz (cycles-per-second) is placed on the y-axis. The diatonic scale is much different. In the chart below, I have placed the notes of the diatonic scale in order as integer values and plotted that against the frequency of the pitches beginning on C4 (C4, D4, E4, F4, G4, A4, etc.; see Figure 4, below):

Figure 4: The Diatonic Equally Tempered Scale; Note Number vs. Frequency

Here the x-axis reflects the note value (1=C, 2=D, 3=E, etc.); the corresponding frequency is placed on the y-axis. I deliberately placed the notes equidistantly on the x-axis in order to highlight the uneven nature of the diatonic scale. If I had placed the half steps halfway between the whole steps, you would see that the points trace a smooth parabolic curve (in equal temperament), not a straight line.

A comparison of Figure 3 and Figure 4 evidence that the diatonic scale and the natural scale are vastly different. One is exponential with respect to frequency (diatonic), the other linear (natural). If you make the diatonic scale linear in your mind (as most people do), then the natural scale appears to be logarithmic, as in Figure 2. Attempting to transcribe tunes played on instruments using the natural scale with the diatonic scale notational system is like trying to fit a square peg in a round hole: it just doesn’t work.

Consider Figure 2 again. If a trained musician hears a shepherd’s trumpet playing in the lower range (G3, C4, E4, G4, A4, C5), the musician will classify the tune as tetratonic (four notes per octave; that is, four notes ascending including and starting on C4, and then the octave, C5). If the tune happens to rise to D5, the diatonically, octave-equivalent trained musician will then assume that D4 is in existence and declare that the tune is pentatonic (missing F4 and B4, but having five notes per octave); the pitches assumed to be present are G3, C4, (D4), E4, G4, A4, C5, D5, etc., when in fact D4 does not appear in the tune at all. A similar erroneous assumption may occur again on a different note (11th partial, F5) as a shepherd’s trumpet ascends in pitch. A tune that would be classified as pentatonic (missing all Fs and Bs) would be declared hexatonic once F5 is found, because the lower octave F4 is assumed to be in existence (having C4, (D4), E4, (F4), G4, A4, C5, D5, E5, F5, etc.). This classification occurs even though both D4 and F4 do not in fact exist.[iv]

Simply put, because diatonically trained musicians think in terms of seven notes per octave or “octave equivalency,” it has never occurred to them that they were dealing with a system that was not octave-based. This is understandable since the two systems have become linked and intertwined today.

If my theory is true, then it becomes clear why the ecclesiastical modes were used to describe folk music;[v] it also explains the percentages with which each individual mode is alleged to have occurred. If a tune played on a shepherd’s trumpet ends on the 3rd, 4th, 5th, etc. partial, transcribers would suggest that the tune would be, in order: G major (3rd partial; I have never seen this occur and will ignore this possibility), C major (4th partial), E minor (5th partial, as the relative minor to G major with the 11th partial heard as F♯5), G major or G mixolydian (6th partial), A minor (7th partial, as the relative minor to C major), C major (8th partial), D dorian (9th partial), E phrygian (10th partial), etc., from easiest to play to more difficult to play (which is a reflection of airstream flow and how we speak as well). In general, this matches what musicologists have said of folk tunes of the past. For example, concerning Celtic melodies, Bradley suggested that 60% were ionian (major), 20% were mixolydian, 12% were aeolian (natural minor), and 8% were dorian.[vi] Breathnach suggested 60% were ionian, 15% were mixolydian, 10% were dorian, and aeolian tunes were the least numerous.[vii]

I have found similar results. While I was taking a break from the University of Otago and living in a Gaelic-speaking region of North America, I was approached by a publisher and asked to re-set The Celtic Lyre,[viii] a set of four music books from about 1885; he wanted me to bring them up to modern standards. I decided to place the melodies so that the natural scale was apparent if applicable. Of the sixty-eight tunes, sixty-six could be easily played on a shepherd’s trumpet (that’s 97%); the other two were known to have been created on the bagpipes. This turned into a monumental project, but I eventually finished it.[ix] I found the tunes ended on the following notes with these percentages: C: 54%, G: 21%, D: 13%, A: 12%. This is roughly in-line with the ecclesiastical modal attribution percentage that Bradley and Breathnach had given to Celtic folk songs. If this is written as the percentages of The Celtic Lyre/Bradley/Breathnach, we have: ionian (54%/60%/60%), mixolydian (21%[x]/20%/15%), aeolian (12%/12%/least numerous), and dorian (13%/8%/10%). That’s quite a coincidence considering that 97% of the tunes of The Celtic Lyre could easily be played on a shepherd’s trumpet. This implies that the tunes analysed by Bradley and Breathnach could also be played by shepherds’ trumpets.

If we match the natural scale to the ecclesiastical modes, we get in order of theoretical likelihood:[xi] ionian (C4, C5, also G4), aeolian (E4, A4), mixolydian (G4), dorian (D5).[xii] This would suggest why “[T]he modes on A and C were excluded from the ecclesiastical list, their effect judged as being ‘lascivious’ and ‘worldly’ and unsuitable for religious use since their influence would be pernicious.”[xiii] So if you assume that my theory is correct, everything makes sense. Pastoral melodies were played on shepherds’ trumpets that trained musicians would describe as being the equivalent to the ionian and aeolian (and some mixolydian and some dorian) modes. The church banned those modes and pastoral instruments from church because they were linked to coarse, immoral behaviour. Over time, the forbidden modes became increasingly acceptable until they overwhelmed the ecclesiastical modes.[xiv] By that time, the culture and the instruments that created that fulcrum of change had all but disappeared.[xv]  

I should also point out that although my Ph.D. thesis is focused on medieval Gaelic narrative song, barriers placed between cultures, such as between Gaels and Saxons, are often created by modern nationalistic notions. Shepherds’ trumpets were spread all over Europe; folk music from various cultures, including those of the English, Welsh, Irish, Scottish, French, German, Italian, Spanish, and others, exhibit the same pitch-traits and have been described in the same way and with the same ending-note percentages as I found in The Celtic Lyre. The various languages represented by the names of shepherds’ trumpets listed above testify to their widespread use throughout Europe.

You may not believe me, but try to find the natural scale in old folk tunes for yourself. I gave directions on how to do this in The Celtic Lyre[xvi] and online,[xvii] but here it is in a nutshell: transpose the tune so that there are no accidentals in the key signature. Transpose the tune up or down a fourth or fifth (this doesn’t change the key signature in pentatonic tunes) so that the “missing” notes are on F4 and B4 or B4 and F5. For hexatonic tunes, place the “missing” note on B4. Transpose the tune up and down until you see the lower pitches lock in to the natural scale. If this process doesn’t work, check the tune’s provenance. Often, tunes were composed on a piano to mimic older songs. The composers made the songs pentatonic, but didn’t gap them in the lower octave; for example, D4 might fall on a stressed beat, and you can’t transpose it away.[xviii]

While this theory is a simple solution to a long-standing problem, a deeper issue is how we as ethnomusicologists approach the music of other cultures. We have the diatonic scale driven into our bones by our schooling; therefore, we have a propensity to unconsciously direct this on the cultures that we are studying with the greatest sincerity and the best of intentions. Unfortunately, there are unseen, hidden tendencies that are a part of our everyday thoughts and behaviours that cause us to make choices that can have destructive results. The diatonic scale is one of them. Be on guard.

Click here for a full explanation of this theory with many examples

Click here for a video of this tradition as it is alive today

Photo courtesy of Martin Kleibl.

Aindrias Hirt is currently pursuing a Ph.D. in Music and Irish & Scottish Studies at the University of Otago, Dunedin, New Zealand through a full international scholarship. His topic is investigating heroic narrative song (the origin of recitative) still found in marginalized cultures in Europe, particularly the songs of Fionn Mac Cumhaill. He is analyzing song recordings made from the 1950s which have survived in a living tradition; they date from the Late Middle Ages.

[i] Some of the many researchers include Cecil Sharp, (Sharp, Cecil J. 1932 English Folk Songs from the Southern Appalachians, Maud Karpeles, 2 vols., London: Oxford University Press), Kennedy-Fraser (Kennedy-Fraser, Marjory 1909 Songs of the Hebrides, 3 vols., London: Boosey and Co.), Gilchrist (Gilchrist, Annie G. 1911 Note on the Modal System of Gaelic Tunes, Journal of the Folk-Song Society 4, no. 16, 150-53), William H. Grattan Flood (Grattan-Flood, William H. 1905 A History of Irish Music, Book, Dublin: Browne and Nolan Limited), Donald MacDonald (MacDonald, Donald c.1900 Irish Music and Irish Scales, London: Breitkopf and Härtel), Bertrand H. Bronson (Bronson, Bertrand H. 1946 Folksong and the Modes, The Musical Quarterly 32, no. 1, 37-49), Francis Collinson (Collinson, Francis 1966 The Traditional and National Music of Scotland, London: Routledge and Kegan Paul), James Culwick, Bradley and Breathnach, Ó Boyle, Finlay Dauney, E. Cecil Curwen, etc. As popular as the ecclesiastical modes may be for researchers, there is doubt as to their suitability when applied to folk music. For example, Cazden (Cazden, Norman 1971 A Simplified Mode Classification for Traditional Anglo-American Song Tunes, Yearbook of the International Folk Music Council 3, 45-78: 47) states, “[F]amiliar mode classifications and categories are historically a travesty and systematically both inept and unproductive.” The consensus seems to be (Bronson, Bertrand H. 1972 Are the Modes Outmoded?, Yearbook of the International Folk Music Council 4, 23-31) to use the ecclesiastical system until a better system is found. This paper’s theory seems to have fulfilled this and Cazden’s requirements.

[ii] I am now thoroughly convinced that music is a pitch-structuring of speech, just as poetry is a structuring of rhythm in order to elevate the importance of what is being said by raising the register, or “style” as Labov termed it. You do not need to reduce the rhythm of speech to simplistic multiples (2/4, 3/4, 4/4, etc.) in order to call it music. It is useful to reduce the metre to simplistic multiples when there is accompaniment of more than two instruments or else risk a cacophony of pitch attacks on unison notes. If there is no accompaniment, there is no need for such a reduction. As a singer, I was taught that I needed accompaniment; now I find it annoying and disruptive.

[iii] The rise of instrumental music accompanied the rise of instrument production with the growth of cities and the middle class. Before the 16th century, instruments were, by today’s standards, extremely rare.

[iv] In all fairness, F5 is often missing from folk tunes as well, which often make the tunes appear to be pentatonic. I believe this is due to complex droning harmony being present. Throat singers often omit the 11th partial.

[v] The ecclesiastical modes were brought into Europe from Greece albeit in a distorted form. When a tune ended on a particular note of the diatonic scale, it was thought to impart a particular sentiment to the listener; the Church found that this purported trait made it very useful in orchestrating the emotions of the congregation. The original, permissible ecclesiastical modes were dorian (ended on D), phrygian (ended on E), lydian (ended on F), and mixolydian (ended on G). Ionian (ended on C, the major mode) and aeolian (ended on A, the natural minor) were originally proscribed; the ionian (major) mode acquired the epithet of the modus lascivus (lustful mode).

[vi] Bradley, Seóirse and Breathnach, Breandán, "Ireland," in The New Grove Dictionary of Music and Musicians, ed. Stanley (London: MacMillan Publishers, 1980), 317.

[vii] Breathnach, Breandán 1996 Folk Music and Dances of Ireland, Cork: Ossian Publications, Ltd., 10.

[viii] There is an important distinction between “Gaelic” and “Celtic” but this differentiation is esoteric to most readers and has been ignored in this short paper due to length constraints.

[ix] Whyte, Henry 1885 The Celtic Lyre, Ed. Truman Matheson & Musical Ed. Aindrias Hirt, 2012 ed., St. Andrews, Canada: Sìol Cultural Enterprises.

[x] I have given the notes ending in G to be that of the mixolydian mode since the 11th partial is halfway between F♯5 and F♮5. It is also seen as G major with F♯5 acting as a very flat leading tone. The Irish call this note “F-flat” (F♭) not because it is lower than F♮5, but because it is a flat F♯5 in what most people are told is G major.

[xi] It is generally more likely for tunes to begin and end on lower pitches. I believe that this is due to the effects of reduced airflow (as explained by the Bernoulli Effect) toward the beginning and ending of exhalations when speaking (I believe that music is an imitation of speech). With this in mind, if the diatonic scale was the origin of the folk music scale, the order of precedence (including the forbidden modes) of the ecclesiastical modes applied to folk music should be: ionian, dorian, phrygian, lydian, mixolydian, aeolian. That is not actually the case if folk tunes are analyzed by using the ecclesiastical modes. In fact, the phrygian and lydian modes are very rare; dorian is more frequent, but then the natural scale has some correlation to the dorian mode.

[xii] This explains why the early Christian Church eschewed both the ionian (major) and aeolian (minor) modes. They wouldn’t want the rowdy influences of pastoral music intruding on the “serious” business of religion.

[xiii] Vincent, J. 1951 The Diatonic Modes in Modern Music University of California Press 225.

[xiv] Instruments that played the natural scale also had a strong effect on the history of art music concerning harmonies and tunings. With Pythagorean tuning that was used for the diatonic scale and ecclesiastical modes, major thirds were 22 cents sharp. Major thirds on instruments that produce the natural scale are exactly in-tune, as are fifths. Church musicians were forbidden to play triads since they were terribly out-of-tune, but they knew triads could be played in-tune since they heard shepherds’ trumpets that played thirds and triads in-tune. So they tried to temper the diatonic scale to make thirds as in-tune as on shepherds’ trumpets (called just intonation). In essence, we have major and minor modes and harmonic progressions of triads because of natural instruments that existed in Medieval (and Antiquity) Europe, not because of what was imported from Greece.

I have found it helpful to think in terms of two warring “camps” in order to organize things in my mind. In one camp is the music of the Christian Church; the Church was trying to convert the uncouth pagans. The Church brought in new music not only to direct the emotions of the congregations, but also to be different from what currently existed; they associated this music to dancing and drunkenness (partying). So the Church introduced the diatonic scale, the diatonic modes (that is supposed to direct the congregations’ emotions, but they actually altered the modes by one step when they brought them in from Greece, which effectively refuted the theory that listening to the modes resulted in a particular emotion), the diatonic scale, octave equivalency, in-tune fifths, but out-of-tune thirds. In the other camp, the heathen had the natural scale, in-tune thirds and fifths, but only a gamut of notes. They fought and plundered from each other.  

[xv] Bach’s BWV 118 had part indications for two litui (a lituus was a 3’ Roman cavalry trumpet). The music required natural trumpets about 9’ long. It seems apparent that Bach used two alphorns and just called them litui so no one could complain that he was using proscribed pastoral instruments in church.

[xvi] Whyte The Celtic Lyre.

[xvii] Hirt, Aindrias The European Folk Music Scale: A New Theory

[xviii] For example, the Walt Disney movie Brave featured the song “Tha mo ghaol air àird a' chuain” (My Love is on the High Seas). It appeared to be a traditional song, but it didn’t match the natural scale. Research showed that Bernard Covert composed the tune in 1847.


"Sounding Board" is intended as a space for scholars to publish thoughts and observations about their current work. These postings are not peer reviewed and do not reflect the opinion of Ethnomusicology Review. We support the expression of controversial opinions, and welcome civil discussion about them. We do not, however, tolerate overt discrimination based on race, sex, gender, sexual orientation, or religion, and reserve the right to remove posts that we feel might offend our readers.