$\Phi \sqcup \Psi$ has index set $T_\Phi \sqcup T_\Psi$, and the atlas corresponding to the coproduct surjection $T_\Phi \sqcup T_\Psi \to I$.
CHAPTER 13. GLOBAL COMPOSITIONS
isomorphism K : G ~ K.G, and therefore introduce a new chart domain (G,O@OP) of this isomorphism. We can proceed in this way until we obtain a minimal set of eight chart domains (A, O@OP), (AI, O@OP), (B, O@OP), (BI' O@OP) (G,O@OP),((6
g). G,O@OP),(C,O@OP),(H,O@OP)
together with 15 corresponding isomorphisms $\psi_i$, defining a second, equivalent atlas $\Psi$ for $Boul^I$. This second atlas makes evident the typology of the internal structure of Boul.

Exercise 21 Show that the relation of atlas equivalence in definition 36 is in fact an equivalence relation.
13.2.1
The Nerve of an Objective Global Composition
Since the covering $G^I$ is a fixed datum of an objective global composition, we should immediately introduce a visualization tool: the nerve of an objective global composition. In the course of the theory, this construction will be extended, but we should look at its most elementary aspect as soon as possible. Given a global objective composition $G^I$, the (abstract) nerve or simplicial complex $n(G^I)$ of $G^I$, as well as its geometric realization $N(G^I)$, are defined⁶. Recall that $N(G^I)$ is a union of affine simplexes $|\sigma|$ associated with the (abstract) simplexes $\sigma \in n(G^I)$ such that for any two different simplexes $|\sigma|, |\tau|$, their interiors⁷ are disjoint, i.e., $|\sigma|^\circ \cap |\tau|^\circ = \emptyset$.

Example 20 For our example $Boul^I$, we start with the zero-dimensional skeleton $N_0(Boul^I)$, which is a set of 15 points, the 0-simplexes $|J|$, one for each chart $J \in I$, which we distribute in three-space $\mathbb{R}^3$; see figure 13.7 a) for this procedure. For the moment, it is not important where to place these points; the only condition is that they be distinct if their charts⁸ $J \in I$ are. Now, we look at all couples $J, J'$ with non-empty intersection. Their 0-simplexes $|J|, |J'|$ are connected by a straight line, a 1-simplex $|J, J'|$, see 13.7 b). This visualization idea stems from combinatorial topology; however, musicology has been aware of such a construction. In his treatise on harmony [479], Schönberg talks about the harmonic strip⁹ between two chords which have one or several tones (pitch classes) in common. If we view chords as being charts in the framework of harmony (see also section 13.4.2 and chapter 27), 1-simplexes are precisely the formalization of Schönberg's harmonic strip. In combinatorial topology, one goes one step further. There is no deeper reason to stop at 1-simplexes and not to proceed with a test for common tones in three or more charts. In our next step, we look at all triples $J, J', J''$ of mutually distinct charts such that they have common elements. We then add 2-simplexes $|J, J', J''|$, i.e., triangular surfaces with the 0-simplexes $|J|, |J'|, |J''|$ as vertices, see 13.7 c). And the last step consists in joining each group of four distinct 0-simplexes $|J|, |J'|, |J''|, |J'''|$ by a full tetrahedron if the intersection $J \cap J' \cap J'' \cap J'''$ is non-empty, see 13.7 d).

⁶ See appendix H.2 for these concepts.
⁷ The interior of an affine simplex is the simplex minus its faces. For example, the interior of an affine 1-simplex is the straight line minus the two endpoints.
⁸ This is a typical abuse of language: identification of the chart with its codomain. In fact, we are only looking at the codomains in the covering $I$.
⁹ "Harmonisches Band."

Figure 13.7: Construction of the nerve of the Boulez series Boul for its internal structure defined in figure 13.1. a) Every chart corresponds to a point. b) Two points are connected by a straight line iff the corresponding charts have non-empty intersection. c) Three different charts having common points are connected by triangular surfaces. d) Four different charts with common tones define a full tetrahedron.

So the internal structure of Boulez' series appears as a complex intertwining of local charts which overlap as shown by the geometric nerve in 13.7 d). Observe that in general, this procedure will not stop at three-dimensional simplexes; it does so here only because no five charts have common notes.

The shape of the nerve of an objective global composition is a good measure for the complexity of the global configuration. There are two extremal situations of this perspective: On one hand, a nerve may be discrete, i.e., reduced to the 0-dimensional skeleton. This happens if we just draw disjoint groups of notes on a given composition (as principally suggested by Jackendoff/Lerdahl), which is called a discrete interpretation. In this case, there are only isolated local objective compositions around, and the connectivity of the global construction is trivial. The extreme case of such a "strategy" happens if we just draw circles around every tone of a composition; this will completely destroy its gestalt, and we essentially boil down everything to counting notes. This is quite silly (though not superfluous for accounting purposes), and we
shall call such a global composition a silly interpretation. On the other hand, we may just take one single local chart to cover the entire composition, and in this case, we have a huge set of notes which does not observe any of the more local similarities, symmetries, or other relations. This will be a very difficult, cryptic object, also because the classification of large local compositions is quite intricate (see our discussion in chapter 11); we call it the indiscrete interpretation. Neither extreme is an intelligent solution for human cognition: Good global structures lie somewhere in the middle between intractable monsters and insignificant atomized data. But it would be nonsense to set up artificial limits (like Jackendoff/Lerdahl [243]) to the grouping formalism, because one of the most important objectives is to make all grouping perspectives accessible and formally tractable. The breakdown of the Jackendoff/Lerdahl Generative Theory of Tonal Music (GTTM) happens where their grouping levels have to rely on traditional music theory in order to work. The GTTM is built on a competence which is, both formally and semantically, out of the reach of the GTTM, and it therefore boils down to an interface between psychology and music theory which neither resolves the formal inconsistency of music theory nor respects the variability of psychological grouping: a bad glue of two unresolved components.

Often, an objective global composition is not given in advance, but only results from a compatible gluing of local charts. The next theorem describes a necessary and sufficient condition for a system of local compositions to become an atlas for a global composition.

Theorem 12 Let $A$ be an address, $T$ a set of indexes. Suppose we are given a family of $A$-addressed objective local compositions
$$Rel^\Phi = ((K_{i,j}, A@F_j))_{i,j \in T} \tag{13.1}$$
of forms $F_j$, together with a family
$$Iso^\Phi = (\phi_{i,j}/1 : (K_{i,j}, A@F_j) \xrightarrow{\sim} (K_{j,i}, A@F_i))_{i,j \in T} \tag{13.2}$$
of isomorphisms in ObLoc whenever $K_{i,j}$ is non-empty. And we assume that each $(K_{i,j}, A@F_j)$ is a sub-composition of the diagonal element $(K_{j,j}, A@F_j)$, i.e., we have inclusion morphisms $\rho_{i,j}/1 : K_{i,j} \subset K_{j,j}$, and that for all indexes,
13.2. WHAT ARE GLOBAL COMPOSITIONS?
Observe that objective global compositions need not be derived from given local compositions by coverings as in the above Boulez example. The concept is a proper extension to the local framework, and the general construction method is described by the above theorem. We shall develop necessary and sufficient criteria for covering constructs in section 16.1.
Exercise 22 Let us define a zero-addressed objective global composition by data corresponding to the above theorem. The index set is $T = \{1, 2, 3\}$. We take a constant space form $F_i = Onset \oplus Pitch$ with functor $@\mathbb{R}^2$. The family $Rel^\Phi$ of local compositions is defined as follows: Consider the four (zero-addressed) points $c_1 = (0,0)$, $c_2 = (0,1)$, $c_3 = (1,0)$, $c_4 = (1,1)$ in $\mathbb{R}^2$ and set
$$K_{i,i} = (\{c_1, c_2, c_3, c_4\}, 0@\mathbb{R}^2), \quad i = 1, 2, 3,$$
$$K_{1,2} = K_{2,1} = (\{c_3, c_4\}, 0@\mathbb{R}^2),$$
$$K_{1,3} = K_{3,1} = (\{c_1, c_2\}, 0@\mathbb{R}^2),$$
$$K_{2,3} = (\{c_3, c_4\}, 0@\mathbb{R}^2), \quad K_{3,2} = (\{c_1, c_2\}, 0@\mathbb{R}^2),$$
$$K_{i,j} = \emptyset \text{ else}.$$
The family $Iso^\Phi$ is given by
$$\phi_{1,2} = Id_{K_{2,1}}, \quad \phi_{1,3} = Id_{K_{3,1}}, \quad \phi_{2,3} = \phi_{3,2} = e^{(1,1)} \cdot \begin{pmatrix} -1 & 0 \\ 0 & -1 \end{pmatrix}.$$
Show that the conditions of the theorem are satisfied and try to draw a picture of the global composition. We shall see in section 16.1 that this global composition cannot be constructed by a covering of a local composition.

We should observe a hidden subtlety of the global composition context: global ambient spaces. In fact, every local composition is connected to an ambient space form $F$. But if we are given an atlas of local compositions, the ambient spaces are no longer uniquely determined. In fact, the construction of a global composition makes chart names exchangeable: it is 'homonymous'. More precisely, we have as many names for a given point $x$ of the support as we have chart names on any of the compatible atlases. So the name of the 'ambient space' of $x$ is an entire collection of local chart names. On the level of local supports, univocal naming would not be a good idea: on the overlaps of two different covering sets, which name should one select? Homonymy is unavoidable for global compositions. One should, however, not refrain from naming or sweep names away in favor of anonymity. Global naming is interchangeable, but not irrelevant:
Definition 37 The name $Name(x)$ of an element $x \in G$ of an objective global composition $G$ is the set of names of all the local chart forms (in any compatible atlas) which hit this element.
13.3
Functorial Global Compositions
Summary. Corresponding to functorial local objects, the (functorial) local compositions in the category Loc, global functorial compositions are introduced as global objects by an atlas of charts, composed of (functorial) local compositions.
The definition of functorial global compositions is the transposition of the objective case with everything made functorial:
Definition 38 A (global) functorial composition is defined by the following data:
(i) A functor $G \in Mod^@$ and a finite, non-empty, generating set $I$ of subfunctors of $G$, i.e., $G = \bigcup I$,
(ii) an address $A$,
(iii) a family $(K_t, @A \times F_t)_{t \in T}$ of $A$-addressed functorial local compositions,
(iv) a surjection $h : T \to I : t \mapsto I_t$,
(v) an isomorphism of functors $\phi_t : K_t \xrightarrow{\sim} I_t$ for each $t \in T$,
(vi) for each couple $s, t \in T$ such that $I_s \cap I_t \neq \emptyset$, the induced isomorphism (restricted to the respective domains and codomains) defines an isomorphism of functorial local compositions.

The data (iii) to (v) are called an $A$-addressed atlas.
To begin with, we have a family of functorial local compositions $(K_t, @A \times F_t)_{t \in T}$, as well as a system of subcomposition inclusion monomorphisms
$$\rho_{s,t} : (K_{s,t}, @A \times F_t) \subset (K_t, @A \times F_t),$$
defined by the inverse images $K_{s,t} = \phi_t^{-1}(I_s \cap I_t)$ for any two different indexes $s, t$. We also have induced isomorphisms
$$\phi_{s,t} : K_{t,s} \xrightarrow{\sim} K_{s,t}$$
of functorial local compositions. Let $G$ be the colimit functor of the system $\phi_{s,t}, \rho_{s,t}$ of isomorphisms and inclusions. For an address change $x : B \to A$, we have the formula
$$x@G \xrightarrow{\sim} \mathrm{colim}(x@K_s, x@K_{s,t}) = \mathrm{colim}(K_s.x, K_{s,t}.x) \tag{13.3}$$
on the slices of the colimit. Because of the right colimit expression, the slice $x@G$ is also denoted by $G.x$. In general, $G$ is not a global functorial composition; in fact, the canonical morphisms $K_t \to G$ are not even injective. To obtain a global composition, we suppose that for all address changes $f : B \to A$, the maps $A@F_s \to B@F_s$ are injective for all indexes $s$. This is the case, for example, if $A = 0_{\mathbb{Z}}$, or if $F_s$ is constant. Then it follows that the system of isomorphisms and inclusions $\phi_{s,t}, \rho_{s,t}$ satisfies the conditions of theorem 12, and therefore, the canonical morphisms
$$i_t : K_t \to G$$
are mono, i.e., injective for all addresses. Further, the canonical squares
$$\begin{array}{ccc} K_{t,s} & \xrightarrow{\ \rho_{t,s}\ } & K_s \\ {\scriptstyle \rho_{s,t} \circ \phi_{s,t}} \downarrow & & \downarrow {\scriptstyle i_s} \\ K_t & \xrightarrow{\ i_t\ } & G \end{array} \tag{13.4}$$
are cartesian. Therefore $G$ is generated by the isomorphic images $\hat{I}_t$ of $K_t$. Furthermore, since the evaluation of the inductive system of functors at the morphism $Id_A$ (the slice at $Id_A$) yields the original objective configuration, the generators $\hat{I}_t$ are different iff the original covering elements $I_t$ are. So we have obtained a finite covering $\hat{I}$ of $G$, an atlas $(K_t, @A \times F_t)_{t \in T}$ of functorial local compositions, and a surjection $h : T \to \hat{I}$; this completes the construction.

Exercise 23 Explicate this construction in all the details.

Exercise 24 Establish a theorem for functorial compositions which corresponds to theorem 12.
Remark 3 Observe that, although the chart morphisms are compatible in the sense of fulfilling the conditions of theorem 12, no compatibility of the underlying functor morphisms on the ambient spaces is required. This means that we will not be able to extend the gluing procedures to the ambient spaces; in other words: global objects are really global, no common ambient space is available in general!
13.4
Interpretations and the Vocabulary of Global Concepts
Summary. A large set of (objective) global compositions is constructed by "interpretations" of given (objective) local compositions. This construction is a subtlety that tends to escape common interpretative activities. It is, however, a basic prerequisite for every interpretative activity in music. This is why this type of global construction is called interpretation. We give standard examples of such global objects.
Very often, music analysis and composition explicitly deal with local compositions, but this coarse perspective is not sufficient: implicitly, one specifies local parts of this data and different overlapping relations among such parts. For example, Boulez' series of example 13.1 needs a covering by 15 local parts in order to be fully understood. Many analytical texts (e.g. [243, 393, 466]) also use this kind of local part selection. As already stressed, this type of chart specification is not temporary; it is a substantial attribute of the analytical work. We shall therefore introduce a special construction method of global compositions which takes this important technique into account as a basis of interpretation. Here is the general construction method of such global compositions, which we therefore call interpretations.
Definition 39 Let $(K, A@F)$ be an $A$-addressed objective local composition, and let $I$ be a non-empty covering of the support set $K$. Then the covering $K^I$, together with the obvious atlas $(i, A@F)_{i \in I}$, defines an objective global composition, the interpretation $K^I$ of $K$ associated with the covering $I$. Let $(K, @A \times F)$ be an $A$-addressed functorial local composition, and let $I$ be a non-empty covering¹² of the support functor $K$. Then the functorial covering $K^I$, together with the obvious atlas $(i, @A \times F)_{i \in I}$, defines a functorial global composition, the interpretation $K^I$ of $K$ associated with the functorial covering $I$.

We shall see in chapter 16 that interpretations define a special case: Global compositions may be far from 'interpretable', i.e., they are not 'isomorphic' to any possible interpretation (we shall make these remarks precise later).
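For the objective case, the content of Definition 39 is essentially a covering check, which can be sketched in a few lines. This is only an illustration under stated assumptions: the support $K$ is modeled as a plain finite Python set of points, and the helper names `interpretation`, `silly`, and `indiscrete` are hypothetical, chosen here to mirror the two extremal interpretations discussed in section 13.2.1.

```python
def interpretation(support, covering):
    """Return the charts of the interpretation K^I as a tuple of frozensets.

    Assumption: `support` is a finite set of points and `covering` an
    iterable of subsets; both stand in for zero-addressed objective data.
    """
    support = frozenset(support)
    charts = tuple(frozenset(chart) for chart in covering)
    if not charts:
        raise ValueError("the covering I must be non-empty")
    if any(not chart <= support for chart in charts):
        raise ValueError("every chart must be a subset of the support K")
    if frozenset().union(*charts) != support:
        raise ValueError("the charts must cover the support K")
    return charts


def silly(support):
    """One singleton chart per point (the 'silly' interpretation)."""
    return interpretation(support, [{point} for point in support])


def indiscrete(support):
    """A single chart covering everything (the 'indiscrete' interpretation)."""
    return interpretation(support, [support])
```

A family of subsets that fails to cover the support is rejected, which is the only condition the definition imposes in the objective case.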
Example 22 A second example of functorial global compositions is associated with interpretations of objective compositions. Let $(K, A@F)$ be an objective local composition, and take an interpretation $K^I$ by an atlas $I = (K_\iota)_\iota$ of subcompositions. Then we have an interpretation $\hat{I} = (\hat{K}_\iota)_\iota$ of the local functorial composition $\hat{K}$ since unions and $\hat{\ }$ commute by lemma 2, section 7.4. But observe that the intersection $\hat{K}_1 \cap \hat{K}_2$ is not objective in general. This is however the case for the zero address $A = 0_{\mathbb{Z}}$, and there, the colimit construction and this one coincide.

¹² The functor $K$ is generated by the subfunctors in $I$.
13.4.1
Iterated Interpretations
Summary. Interpretation is not a one-step process, it may be iterated on an infinity of levels. We present the formal framework and examples.
If $(K, A@F)$ is an $A$-addressed objective local composition, we can give an interpretation by a covering set $I$ of subcompositions $(K_\iota, A@F)$ of $K$. We then introduce this sequence of forms:
$$G_0 = F, \qquad G_{i+1} \xrightarrow[\;2^{Fun(G_i)} \rightarrowtail \Omega^{Fun(G_i)}\;]{} \mathbf{Power}(G_i), \qquad i = 0, \ldots$$
Then, the interpretation $I$ is an $A$-addressed objective local composition in $G_1$ whose union is $K$, $\bigcup I = K$. Here, the union operator is defined functorially as a transformation $\cup_i : G_{i+1} \to G_i$, $i = 1, 2, \ldots$, with the initial evaluation $\cup_1(I) = K$. So we have an infinite diagram
$$\mathbb{D} = (G_1 \xleftarrow{\cup_1} G_2 \xleftarrow{\cup_2} G_3 \xleftarrow{\cup_3} \cdots)$$
and we can define a corresponding limit form
$$Limint(F) \xrightarrow[Id]{} \mathbf{Limit}(\mathbb{D}) \tag{13.5}$$
whose denotators are local compositions $K$ as above, together with coverings $I$ of $K$, and coverings $J$ of $I$ etc. This means that we are given an infinite succession of interpretations and interpretations of interpretations etc., in other words, an infinite sequence of interpretations of constantly increasing level of complexity. Of course there are infinitely many such denotators even if we fix an initial sequence of first $k$ positions. A very classical 'infinite interpretation' is the one induced by the nerve of an interpretation. In fact, if we start as above, i.e., taking the covering $I$ of $K$, we get an initial sequence $(K, I, \ldots)$. The simplicial complex $n(I)$ is an element of $G_3$ and can be iterated ad libitum by $n^{t+1}(I) = n(n^t(I))$. This defines a sequence
$$n^\infty(K^I) = (K, I, n(I), n^2(I), \ldots, n^t(I), \ldots)$$
which indeed is in $Limint(F)$. In particular, if $x \in K$, the star of $x$, $st(x) = \{U \in I \text{ with } x \in U\}$, is an element of $n(I)$, and the star $st^2(x) = st(st(x))$ of $st(x)$ is an element of $n^3(I)$, etc.,
$$st^i(x) \in n^{-1+2i}(I).$$
Suppose that each element $x \in K$ is given a weight $w$, for example the constant weight 1 or some weight stemming from an analysis of $K$ as in motivic or metrical theory. Then each covering set $U \in I$ can be given a numerical weight, for example a power $w(U) = (\sum_{x \in U} w(x) - 1)^p$ as in the metrical theory (see section 21.2) or some other musically motivated evaluation. Then, if $x \in K$, we have a weight $W_1(x) = w(st(x)) = \sum_{U \in st(x)} w(U)$, and then inductively $W_{t+1}(x) = \sum_{u \in st^{t+1}(x)} w(u)$, if we take the above formula $w(u) = (\sum_{x \in u} w(x) - 1)^p$ to calculate recursively the weight of elements of $G_i$. This associates a power series
w(x)
The reader may easily generalize this idea to obtain a numerical evaluation of points in infinite interpretations. This means that infinite interpretations can be boiled down to yield numerical 'coordinates' of points in compositions, and therefore quantitative measures for further processing within performance or analytical contexts.
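The first level of this weighting can be sketched in code. The sketch assumes a finite covering given as a list of sets, a weight function on points (constant weight 1 by default), and the power formula $w(U) = (\sum_{x \in U} w(x) - 1)^p$ quoted above; the names `star`, `chart_weight`, and `first_weight` are illustrative only and do not come from the text.

```python
def star(x, covering):
    """st(x): all covering sets U in I that contain the point x."""
    return [frozenset(U) for U in covering if x in U]


def chart_weight(U, w, p):
    """w(U) = (sum of w(x) over x in U, minus 1) raised to the power p."""
    return (sum(w(x) for x in U) - 1) ** p


def first_weight(x, covering, w=lambda _point: 1, p=2):
    """W_1(x): the sum of the chart weights w(U) over all U in st(x)."""
    return sum(chart_weight(U, w, p) for U in star(x, covering))
```

For the toy covering `[{0, 1, 2}, {2, 3}]` with constant weight 1 and $p = 2$, the two charts weigh $(3-1)^2 = 4$ and $(2-1)^2 = 1$, so the shared point 2 receives the weight 5, while the points 0 and 3 receive 4 and 1.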
13.4.2
The Pitch Domain: Chains of Thirds, Ecclesiastical Modes, Triadic and Quaternary Degrees
Summary. Chains of thirds and modes are a classical domain of interpretation. The construction deals with "refinement" of scales qua local compositions. The refinement specifies determined notes or groups of notes which give the scales an interior profile. More elaborate interpretations of scales concern triadic and quaternary degrees for concerns of harmony.
Recall from section 7.2.1 that we may identify an octave-periodic scale $S$ by its class chord $S : A \rightsquigarrow \text{O-Scale}(s_1, \ldots, s_k)$. Many common scales and chords are classified by use of the so-called third chain construction. To understand this construction, consider the two standard cases: 12-tempered and just scales. For the 12-tempered case, we are effectively working in the ambient pitch class space $PiMod_{12}$ with module functor $@\mathbb{Z}_{12}$. Here, we have the major third translation $M_{12} = e^4$ and the minor third translation $m_{12} = e^3$. If the pitch class space is identified with its Sylow decomposition, $@\mathbb{Z}_{12} \xrightarrow{\sim} @(\mathbb{Z}_4 \oplus \mathbb{Z}_3) : x \mapsto (x \bmod 4, x \bmod 3)$, then $M_{12}$ identifies to $M_{12} = e^{(0,1)}$, whereas $m_{12}$ identifies to $e^{(-1,0)}$. See right half of figure 13.8 for the torus representation of $PiMod_{12}$. On the EulerPlane ambient space (see example 5 in section 7.2.1), we have the corresponding translations $M_{just} = e^{(0,1)}$ and $m_{just} = e^{(1,-1)}$, see left half of figure 13.8.
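The identification of the two third translations on the torus can be checked by direct computation. The following sketch assumes only the map $x \mapsto (x \bmod 4, x \bmod 3)$ stated above; the function names are hypothetical.

```python
def sylow(x):
    """Map a pitch class x in Z_12 to the pair (x mod 4, x mod 3)."""
    return (x % 4, x % 3)


def translate(step):
    """Return the translation e^step on Z_12 as a function."""
    return lambda x: (x + step) % 12


MAJOR_THIRD = translate(4)  # M_12 = e^4
MINOR_THIRD = translate(3)  # m_12 = e^3


def torus_step(translation):
    """Express a translation of Z_12 as a step on Z_4 x Z_3."""
    a, b = sylow(translation(0))
    # Write the first coordinate as -1 instead of 3 (equal mod 4),
    # which is the representative used in the text.
    return (a - 4 if a == 3 else a, b)
```

Since the twelve images `sylow(x)` are pairwise distinct, the map is a bijection, and the major and minor third become unit steps along the two circle directions of the torus.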
Figure 13.8: Left: Major and minor third transpositions in just tuning pitch classes (axes log(3) and log(5)). Right: The third transpositions on the third torus representing 12-tempered pitch classes.

With these translations in mind, we can define a third chain:
Definition 40 A chord $S = \{s_1, \ldots, s_k\}$ in $PiMod_{12}$ or in EulerPlane, respectively, of cardinality $k$ is called a third chain iff the elements of $S$ can be ordered in such a way that $s_{i_{j+1}} = T_j(s_{i_j})$ for all $j = 1, \ldots, k-1$, where each $T_j$ is either the major third translation $M_{12}$ (resp. $M_{just}$) or the minor third translation $m_{12}$ (resp. $m_{just}$).
Lemma 16 Every zero-addressed chord in $PiMod_{12}$ is contained in a third chain.
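In the 12-tempered case, Definition 40 can be tested mechanically. The sketch below is an illustration, not the enumeration used in any of the software mentioned in this chapter: it assumes chords are given as sets of integers mod 12, treats a third chain as a set that admits an ordering by steps of 3 or 4, and uses the hypothetical helper names `is_third_chain` and `smallest_third_chains`.

```python
from itertools import combinations

THIRDS = (3, 4)  # minor third m_12 = e^3 and major third M_12 = e^4


def is_third_chain(chord, modulus=12, steps=THIRDS):
    """True iff the chord can be ordered so that consecutive elements
    differ by a minor or major third translation."""
    chord = frozenset(p % modulus for p in chord)
    if not chord:
        return False

    def extend(last, remaining):
        # Depth-first search for an ordering that uses every element once.
        if not remaining:
            return True
        return any(
            extend(nxt, remaining - {nxt})
            for nxt in ((last + s) % modulus for s in steps)
            if nxt in remaining
        )

    return any(extend(start, chord - {start}) for start in chord)


def smallest_third_chains(chord, modulus=12):
    """All third chains of least cardinality that contain the chord."""
    chord = frozenset(p % modulus for p in chord)
    others = [p for p in range(modulus) if p not in chord]
    for extra in range(len(others) + 1):
        found = [
            chord | frozenset(added)
            for added in combinations(others, extra)
            if is_third_chain(chord | frozenset(added), modulus)
        ]
        if found:
            return found
    return []
```

The full set of twelve pitch classes is itself a third chain (for instance 0, 3, 6, 9, 1, 4, 7, 10, 2, 5, 8, 11), so the search in `smallest_third_chains` always terminates with a non-empty result for a non-empty chord.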
If we admit positive and negative translation in the minor third direction mjust, we obtain the concept of weak third chains in just tuning (see also right half of figure 13.8).
Lemma 17 Every zero-addressed chord in EulerPlane is contained in a weak third chain, but not necessarily in a third chain.
Exercise 25 Give a proof of the preceding lemmata.
The set 3Chains of all zero-addressed third chains in $PiMod_{12}$ is known and has been used in the computer software presto® (see section 25.2.1), as well as in RUBATO®'s HarmoRubette (see chapter 41.3). See appendix L.2 for this list. The (objective) interpretation $0@PiMod_{12}^{3Chains}$ of $0@PiMod_{12}$ by the covering 3Chains is called the third chain interpretation of the pitch class space. In the 12-tempered case, the third chains of minimal cardinality which contain a given zero-addressed chord $S$ are called the minimal third chains of $S$; the set of these chains is called the third chain closure of $S$ and is denoted by $3Chain(S)$. This set can grow quite dramatically for nonstandard chords. For example, the chromatic 3-chord $S = \{0, 1, 2\}$ has 23 minimal (7-element) third chains, including third chains which start with three of the classical triads: major, minor, and augmented.

In order to describe ecclesiastical modes, we make use of the 12-tempered space $PiMod_{12}$, although other pitch class spaces would also do the job. The point here is less the tuning than the construction of an interpretation from a given scale (chord). Our modeling of ecclesiastical modes will follow the (zero-addressed) scale $X$ = C-major = $\{0, 2, 4, 5, 7, 9, 11\}$; the other modes are derived by evident transposition. To describe modes, one selects two elements in $X$: the tenor or recitation tone $t$, as well as the finalis or final $f$. The first interpretation is the mode $X_f$, i.e., the enrichment of the scale by a determined tonic $f$. This is an interpretation $X_f = X^{I_f}$ of $X$ by an atlas $I_f$ of two charts: $I_f = \{X, \{f\}\}$, the scale and the tonic singleton. Observe that this is not a full-fledged concept of tonality. To meet this requirement, a mode may and will be enriched by other structural aspects, such as selected degree chords, etc. According to the position of the tonic, the mode
has the names as listed in table 13.4.2.A.
Table 13.4.2.A

Nr.    Mode and plagal variant    Finalis f    Tenor t
1.a    Dorian                     d            a = d + 7
1.p    Hypodorian                 d            f = d + 3
2.a    Phrygian                   e            c = e + 8
2.p    Hypophrygian               e            a = e + 5
3.a    Lydian                     f            c = f + 7
3.p    Hypolydian                 f            a = f + 4
4.a    Mixolydian                 g            d = g + 7
4.p    Hypomixolydian             g            c = g + 5
5.a    Aeolian                    a            e = a + 7
5.p    Hypoaeolian                a            c = a + 3
6.a    Locrian                    b            g = b + 8
6.p    Hypolocrian                b            d = b + 3
7.a    Ionian                     c            g = c + 7
7.p    Hypoionian                 c            e = c + 4
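The tenor column of table 13.4.2.A can be recomputed from the finalis and the semitone offset. The sketch below assumes the usual pitch classes c = 0, d = 2, e = 4, f = 5, g = 7, a = 9, b = 11 for the C-major scale; the data layout and the function name `tenor` are illustrative choices, not part of the text.

```python
PITCH = {"c": 0, "d": 2, "e": 4, "f": 5, "g": 7, "a": 9, "b": 11}
NOTE = {pitch: name for name, pitch in PITCH.items()}

# (number, mode, finalis, offset in semitones, tenor as listed in the table)
MODES = [
    ("1.a", "Dorian", "d", 7, "a"), ("1.p", "Hypodorian", "d", 3, "f"),
    ("2.a", "Phrygian", "e", 8, "c"), ("2.p", "Hypophrygian", "e", 5, "a"),
    ("3.a", "Lydian", "f", 7, "c"), ("3.p", "Hypolydian", "f", 4, "a"),
    ("4.a", "Mixolydian", "g", 7, "d"), ("4.p", "Hypomixolydian", "g", 5, "c"),
    ("5.a", "Aeolian", "a", 7, "e"), ("5.p", "Hypoaeolian", "a", 3, "c"),
    ("6.a", "Locrian", "b", 8, "g"), ("6.p", "Hypolocrian", "b", 3, "d"),
    ("7.a", "Ionian", "c", 7, "g"), ("7.p", "Hypoionian", "c", 4, "e"),
]


def tenor(finalis, offset):
    """Return the note name of finalis + offset (mod 12).

    Raises KeyError if the result is not a tone of the C-major scale.
    """
    return NOTE[(PITCH[finalis] + offset) % 12]
```

Every row of the table passes this check, so all listed tenors are indeed tones of the scale $X$.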
The next step introduces an ecclesiastical mode¹³ as a refined version of the simple mode interpretation: An ecclesiastical mode on scale $X$ is the interpretation $X_{f,t} = X^{I_{f,t}}$ with atlas $I_{f,t} = \{X, \{f\}, \{f, t\}\}$. For every final $f$, there are two modal variants: the authentic (.a in table 13.4.2.A) and the plagal (.p in table 13.4.2.A), the latter being marked by the prefix "Hypo". For the authentic mode we have either $t = f + 7$ or $t = f + 8$ (fifth or minor sixth); for the plagal one, we have either $t = f + 3$ or $t = f + 4$ or $t = f + 5$ (thirds or the fourth). The modal pair Nr. 6.a, 6.p is listed for completeness; however, these modes are scarcely documented. The aeolian and ionian modes were only introduced in the 16th century by Glarean, but they are the basis of the modern major-minor system where the tenors have disappeared (and we only consider the modes $X_f$), the final tone being renamed to tonic, such that we now have aeolian = cantus mollis ($C_a$ for scale $C$), and ionian = cantus durus ($C_c$ for scale $C$).

In harmony, interpretations are important which associate 3-chords with special harmonic functions. Corresponding interpretations of scales are called triadic interpretations; we shall discuss them in the sequel for the 12-tempered and just tuning. We shall see below that the examples to be discussed here have special automorphism properties. This is why we call them triadic degree interpretations. For 12-tempered pitch classes, we again take the major scales $X$, as well as the melodic and harmonic minor scales $X_m$ and $X_h$ (see figure 7.4). Observe, however, that the name of such a scale, $X = C$, for example, does not imply the selection of a tonic. The name is only historically loaded and not in our structural setting. We are going to define the triadic degree interpretations for $X = C$, the others being deduced by transposition.

¹³ The ambitus, i.e., the octave where the modal melody may move, is omitted here because we work in octave classes.
The triadic degree interpretation $C^{(3)}$ is defined by a seven-element atlas
$$(3) = \{I_C, II_C, III_C, IV_C, V_C, VI_C, VII_C\}$$
of three-element charts. The analogous notation works for minor scales, i.e., $C_m^{(3)}$ is defined by seven charts $(3_m) = \{I_{C_m}, II_{C_m}, \ldots, VII_{C_m}\}$, and $C_h^{(3)}$ by $(3_h) = \{I_{C_h}, II_{C_h}, \ldots, VII_{C_h}\}$. If ever the scale is clear, we omit it, writing simply $I$ for the first degree, etc. The precise values for all our charts are shown in table 13.4.2.B. The non-trivial automorphisms of the major and melodic minor scales are shown with the notation $U_x$ = inversion (German "Umkehrung") at pitch $x$, and $U_{x/x+1}$ = inversion between neighboring pitches $x$, $x + 1$. The arrows in row two are for the alteration shifts in the melodic and harmonic minor scales against the major scale.

Table 13.4.2.B
(• = pitch class present, o = absent; the twelve positions run from c to b. In row two, ← marks a tone lowered by a semitone against the major scale.)

          C                       C_m                     C_h
scale     •o•o••o•o•o•            •o••o•o•o•o•            •o••o•o••oo•
shift     ------------            ---←--------            ---←----←---
autom.    U_d                     U_g                     -

deg.      chart         tpe.      chart         tpe.      chart         tpe.
I         •ooo•oo•oooo  maj.      •oo•ooo•oooo  min.      •oo•ooo•oooo  min.
II        oo•oo•ooo•oo  min.      oo•oo•ooo•oo  min.      oo•oo•oo•ooo  dim.
III       oooo•oo•ooo•  min.      ooo•ooo•ooo•  aug.      ooo•ooo•ooo•  aug.
IV        •oooo•ooo•oo  maj.      •oooo•ooo•oo  maj.      •oooo•oo•ooo  min.
V         oo•oooo•ooo•  maj.      oo•oooo•ooo•  maj.      oo•oooo•ooo•  maj.
VI        •ooo•oooo•oo  min.      •oo•ooooo•oo  dim.      •oo•oooo•ooo  maj.
VII       oo•oo•ooooo•  dim.      oo•oo•ooooo•  dim.      oo•oo•ooooo•  dim.
Observe that not all major, minor, diminished, or augmented triads in these scales are automatically degrees. For example, the triad $\{a\flat, b, d\}$ is not a degree in harmonic minor $C_h$. Referring to the automorphism groups of these scales, as listed in the chord classification table in appendix L.1, we observe that the degree atlases $(3)$, $(3_m)$, $(3_h)$ are all invariant under the respective automorphism groups. More precisely, we have these degree orbits in the three scales:
$X$: $\{I \leftrightarrow VI\}, \{II \leftrightarrow V\}, \{III \leftrightarrow IV\}, \{VII\}$
$X_m$: $\{I \leftrightarrow V\}, \{II \leftrightarrow IV\}, \{III\}, \{VI \leftrightarrow VII\}$
$X_h$: $\{I\}, \{II\}, \{III\}, \{IV\}, \{V\}, \{VI\}, \{VII\}$

where the third is so since the automorphism group of harmonic minor is trivial: $X_h$ is "rigid". The geometric nerves $N(X^{(3)})$, $N(X_m^{(3)})$, $N(X_h^{(3)})$ of each of these global objective zero-addressed compositions are Möbius strips, as shown in figure 13.9. Following Schönberg's proposal [478], we call it the harmonic strip. The harmonic strip has seven simplexes of dimensions zero and two, and fourteen simplexes of dimension one. The 1-skeleton is exactly Schönberg's harmonic strip. The order of the degrees on the strip's boundary is the so-called fifth sequence,
$$V \to I \to IV \to VII \to III \to VI \to II \ (\to V),$$
although this is only a diminished fifth (tritone) for the fourth-seventh passage. In other words, the boundary is connected, unlike with a normal strip where we have two connected components. This is due to the lack of orientation on the harmonic strip: One walk around the entire strip changes your upside to downside, see figure 13.10.

Figure 13.9: The nerve $N(X^{(3)})$, $N(X_m^{(3)})$, $N(X_h^{(3)})$ is a Möbius strip. Following Schönberg's proposal, we call it the harmonic strip.
Figure 13.10: The harmonic strip is not orientable. The mutual position of the boundary changes after a round-trip. This property has consequences for Riemann's harmony.
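The simplex counts of the harmonic strip can be recomputed directly from the degree charts, using the nerve construction of section 13.2.1. The following is a minimal sketch under stated assumptions: the scale is C major as the zero-addressed set {0, 2, 4, 5, 7, 9, 11}, degrees are obtained by stacking scale thirds, and the function names `triadic_degrees`, `nerve`, and `boundary_edges` are hypothetical.

```python
from itertools import combinations

C_MAJOR = [0, 2, 4, 5, 7, 9, 11]
NUMERALS = ["I", "II", "III", "IV", "V", "VI", "VII"]


def triadic_degrees(scale):
    """Degree charts: root, third, and fifth counted in scale steps."""
    n = len(scale)
    return {
        NUMERALS[k]: frozenset(scale[(k + j) % n] for j in (0, 2, 4))
        for k in range(n)
    }


def nerve(charts):
    """Return {dimension: simplexes}; a simplex is a tuple of chart names
    whose charts have at least one common element."""
    names = sorted(charts)
    simplexes = {}
    dim = 0
    while dim < len(names):
        level = [
            combo
            for combo in combinations(names, dim + 1)
            if frozenset.intersection(*(frozenset(charts[n]) for n in combo))
        ]
        if not level:
            break
        simplexes[dim] = level
        dim += 1
    return simplexes


def boundary_edges(simplexes):
    """1-simplexes that are a face of exactly one 2-simplex."""
    triangles = simplexes.get(2, [])
    return [
        edge
        for edge in simplexes.get(1, [])
        if sum(1 for tri in triangles if set(edge) <= set(tri)) == 1
    ]
```

Under these assumptions the nerve has 7 vertices, 14 edges, and 7 triangles, so the Euler characteristic 7 - 14 + 7 vanishes, as it must for a Möbius strip, and the seven boundary edges are exactly the neighboring pairs of the fifth sequence.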
13.4.2.1
Orientation in Riemann Function Theory
We should add a remark on the failure of Riemann's attempt to build a global function theory of harmony, as was discussed by Carl Dahlhaus in [100]. The central concept of this theory is the function which is to be attached to every (!) possible chord, not only to the chords of common usage. Before starting the discussion, we observe that the problem here is not so much one of tunings, but rather one attached to the non-orientability of the triadic interpretation
independently of tuning specifications. Therefore we want to stick to the harmonic strip as discussed above (this nerve will reappear for the triadic interpretation in just tuning which we discuss below). Riemann's idea was to define "tonality" by use of a function with three possible values: "Tonical" (T), "Dominant" (D), and "Subdominant" (S), which can be attributed to chords. So we have to deal with a function
$$T : Ch \to TDS$$
defined on the set $Ch$ of all (zero-addressed objective) chords (in $PiMod_{12}$, to make the ideas precise), with values in the three-element set $TDS = \{T, D, S\}$ of harmonic function values. Moreover, we have to distinguish between two sub-categories of tonal functions, major and minor, for each pitch class $x$:
$${}^{+}T_x : Ch \to TDS \quad \text{($x$-major tonality)}$$
$${}^{\circ}T_x : Ch \to TDS \quad \text{($x$-minor tonality)}$$
According to the selected tonality, Riemann's program is to attribute a specific value to given chords. For example, one would like to have
$${}^{+}T_g(\{g, b, d\}) = T,$$
which means "{g, b, d} is tonical in G-major tonality", or
$${}^{\circ}T_c(\{g, b, d\}) = D,$$
which means "{g, b, d} is dominant in C-minor tonality". When they define tonality, special chords of the triadic degree interpretation $X^{(3)}$ are given special values, i.e., the first, fifth, and fourth degree triads receive the values T, D, and S, respectively.
However, it is not true that the function concept is not a mathematical one, as Dahlhaus has criticized in [100] relating to an erroneous proposal. Rather, the problem of function theory is to extend the values of tonality functions from the common first, fifth, and fourth degree triads to any chords in such a way that the harmonic coherence is reflected. This procedure can be termed "musical logic" in the sense of Riemann. This concept is an obscure one, as Dahlhaus has rightly recognized. It is however possible to shed some light on this approach [100, p.96]: "Die Bestimmung der Akkordbedeutungen, der 'harmonischen Logik', ist also mit einer Regel über die Reihenfolge der Stufen verbunden." According to Dahlhaus, this is the fifth sequence, the fundamental sequence along the harmonic strip's boundary¹⁴. This sequence lays the basis of the idea of "different" (German: "differente") degrees: the values at the degrees V, I, and IV are different according to the fifth steps $V \to I \to IV$. In order to proceed, each of the other four triadic degrees must obtain one of the existing values.

¹⁴ Dahlhaus starts the sequence in degree I, but this is equivalent to ours since the sequence is cyclic.
Figure 13.11: The local orientation on the harmonic strip yields the parallel degree πY as the one staying in front and to the right when moving in fifth sequence direction such that the strip stays to our right.
The second musicological idea relates every degree Y to its "parallel degree" πY. On the harmonic strip, the latter is deduced from the fifth sequence in the following way (see figure 13.11): Let bY be the successor of Y in the fifth sequence. Then, πY is the third member of the 2-simplex containing Y and bY. If we move along the harmonic strip in such a way that the strip stays to the right when moving from Y to bY, then πY is to the right in front of us. For example, if we move from Y = V to bY = I, the parallel degree is πY = III. We also have πI = VI, πIV = II. The wording "parallel" is also geometrically correct since we look for the degree which is parallel to the present position. The contradiction in function theory comes out from the requirement that parallel degrees should have equal function values:
$T(\pi Y) = T(Y)$.
In fact, we then must have
$T(bY) = T(\pi\pi Y) = T(\pi Y) = T(Y)$.
This contradiction can also be read as follows: When applying the parallel function to every degree on the harmonic strip, the connectedness of the boundary (= the fifth sequence) leads to the parallelism ππY = bY (the "Gegenklang" relation), and therefore the tonal values cannot be different on the vertices of a 2-simplex. The orientation being only a local one, the musicological requirements can only be fulfilled locally and do not glue to yield a global function! Therefore we cannot follow Dahlhaus [100, p.102] when he says that function theory: "gerade dort versagt, wo auch das Phänomen, das sie erklären soll, ins Vage und Unbestimmte gerät." In reality, the phenomenon is founded in a very precise fact: the non-orientability of the Möbius strip. Any attempt to define a function despite this fact must fail for mathematical reasons.
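The argument above can be checked mechanically on a toy model. The following is a minimal sketch, assuming the degrees are encoded as pitch-class sets of the C-major scale in 12-tempered tuning and that the fixed values are T on I, D on V, and S on IV; the names `successor`, `parallel`, and `consistent_functions` are illustrative and not part of the original text.

```python
from itertools import product

SCALE = [0, 2, 4, 5, 7, 9, 11]          # c d e f g a b as pitch classes
NAMES = ["I", "II", "III", "IV", "V", "VI", "VII"]

def triad(k):
    """Triadic degree on the k-th scale tone (stacked scale thirds)."""
    return frozenset(SCALE[(k + j) % 7] for j in (0, 2, 4))

DEGREES = {NAMES[k]: triad(k) for k in range(7)}

def successor(y):
    """Next degree in the fifth sequence V -> I -> IV -> VII -> ..."""
    return NAMES[(NAMES.index(y) + 3) % 7]

def parallel(y):
    """Third vertex of the 2-simplex spanned by y and its successor."""
    by = successor(y)
    (tone,) = DEGREES[y] & DEGREES[by]   # neighbours share exactly one tone
    (z,) = [n for n in NAMES if n not in (y, by) and tone in DEGREES[n]]
    return z

def consistent_functions():
    """Assignments with T(I)=T, T(V)=D, T(IV)=S that agree on parallel degrees."""
    free = [n for n in NAMES if n not in ("I", "IV", "V")]
    found = []
    for values in product("TDS", repeat=len(free)):
        f = {"I": "T", "V": "D", "IV": "S", **dict(zip(free, values))}
        if all(f[parallel(y)] == f[y] for y in NAMES):
            found.append(f)
    return found

print(parallel("V"), parallel("I"), parallel("IV"))
print(len(consistent_functions()))
```

The exhaustive search finds no admissible assignment, in line with the relation ππY = bY: since the fifth sequence runs through all seven degrees, equality on parallels would force a constant function.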
13.4. INTERPRETATIONS AND THE VOCABULARY OF GLOBAL CONCEPTS
13.4.2.2
Just Triadic Degree Interpretations
For this situation, recall the just C-major scale in the EulerPlane from section 7.2.1.2. The triadic degrees are not completely clear here, see figure 13.12. The problem is that one would like to respect certain pitch relations. Except for degrees II and VII, we obtain major or minor chords. If we want to have a minor chord for degree II which has two tones in common with degree IV, we need to take the tone d* = -2q + t instead of d = 2q, i.e., d* - d = Kt, the third comma (more precisely: its pitch class, see formulas (6.33) for the definitions). In the same sense, the seventh degree must, if we want it to be built from two minor thirds d - b and f - d, refer to the second f-variant f* = f - Kt. We then have the alternative degrees
Figure 13.12: The list of just triadic degrees, together with the variants II*, VII* which meet standard requirements of interval distances.
II* = {d*, f, a}, VII* = {b, d, f*}, and these have an empty intersection, in contrast to the canonical degrees II, VII. In order to harmonize this irritating situation (which stems from pitch-distance requirements on degrees), one proceeds as follows: We no longer work with pitch classes x but with third comma classes
$]x[ = x + \mathbb{Z} \cdot Kt$. The justest scales can then be defined as local compositions by the quotient module $J_{Kt} = \mathbb{Z}^2/\mathbb{Z} \cdot Kt$, i.e., with ambient space $@J_{Kt}$, a space which can be identified with the Pythagorean tuning subspace $@\mathbb{Z}q$ in the EulerPlane. In fact, we have $\mathbb{Z}^2 = \mathbb{Z}q \oplus \mathbb{Z}Kt$, see also figure 13.13. With such a construction, we may transport the triadic degrees of just tuning $C^{(3)}$ to C-major in justest tuning, and thereby obtain the interpretation $\mathfrak{C}^{(3)}$ with degrees $\mathfrak{I}, \mathfrak{II}, \mathfrak{III}, \mathfrak{IV}, \mathfrak{V}, \mathfrak{VI}, \mathfrak{VII}$ with $\mathfrak{I} = ]I[ = \{]c[, ]e[, ]g[\}$, etc. In this setup, ambiguities from just tuning disappear, and the nerve $N(\mathfrak{C}^{(3)})$ is a Möbius strip. The second degree now can be realized by selecting one variant of the third comma equivalent just degrees (tone by tone, not as a whole!). The atlas of degrees of a just triadic degree interpretation also remains invariant under the automorphism group of the just scale. For the C-major scale, this group is generated by
the skew reflection
$A = e^{q} \cdot \begin{pmatrix} -1 & -1 \\ 0 & 1 \end{pmatrix}$
(see also example 11 in section 8.1.1).
Figure 13.13: The third (or Pythagorean) comma classes cover all just pitch classes when starting from the Pythagorean tuning subspace. Third comma classes solve the problem of the second and seventh degrees; one is allowed to select any one of the possible representatives within one comma class.
The orbits of the degrees look as follows:
{I}, {II ↔ VII}, {III ↔ VI}, {IV ↔ V}.
Exercise 26 Verify the invariance and the orbit structure as described above. We shall see however in chapter 27.1.6 that modulation in just tuning is also easily modeled without necessarily building justest tuning constructs.
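As a numerical companion to Exercise 26, here is a minimal sketch. It assumes that points of the EulerPlane modulo octaves are stored as integer pairs (a, b) standing for a·q + b·t, that the canonical degrees II = {d, f, a} and VII = {b, d, f} are used, and that the skew reflection has linear part q ↦ -q, t ↦ t - q followed by translation by q. This form of the reflection is an assumption made for illustration; it is chosen because it reproduces the stated orbits.

```python
# Just C-major scale as (fifths, thirds) coordinates modulo octaves.
SCALE = {"c": (0, 0), "d": (2, 0), "e": (0, 1), "f": (-1, 0),
         "g": (1, 0), "a": (-1, 1), "b": (1, 1)}
ORDER = ["c", "d", "e", "f", "g", "a", "b"]
NAMES = ["I", "II", "III", "IV", "V", "VI", "VII"]

def reflect(x):
    """Assumed skew reflection: q -> -q, t -> t - q, then translation by q."""
    a, b = x
    return (-a - b + 1, b)

def degree(k):
    """Triadic degree on the k-th scale tone, as a set of coordinates."""
    return frozenset(SCALE[ORDER[(k + j) % 7]] for j in (0, 2, 4))

DEGREES = {NAMES[k]: degree(k) for k in range(7)}

def image(name):
    """Name of the degree onto which the reflection maps the given degree."""
    img = frozenset(reflect(x) for x in DEGREES[name])
    (target,) = [n for n, d in DEGREES.items() if d == img]
    return target

ORBITS = {frozenset({n, image(n)}) for n in NAMES}
print(sorted(sorted(o) for o in ORBITS))
```

The check only covers this one generator; invariance of the atlas under the whole group follows because the group is generated by it.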
13.4.3
Interpreting Time: Global Meters and Rhythms
Summary. Global meters and rhythms as interpretation by atlases of local rhythms and meters. We describe macros for rhythmic germs.
In music, different local metrical structures mostly coexist in the same portion of music, as for example illustrated in figure 13.14. In this section, we want to give an account of global time constructions deduced from this type of phenomena. Recall from section 7.2.2 that a local rhythm is defined as a union of translates of a local 'germ' composition in an ambient space Onset × Para. We would like to include formally the case where the rhythm is a local meter (empty parameter set for space Para) on one hand, and where the parameters may vary, on the other.
Figure 13.14: Example of a simultaneous presence of local meters in Beethoven's op.106, Allegro, bars 209-210. The right hand plays eight quavers while the left hand plays six triplets per meter.
The first requirement is met by the form
$Rhythm(Para) \xrightarrow[Id]{} Colimit(Onset, R(Para))$ with $R(Para) \xrightarrow[Id]{} Limit(Onset, Para)$.
This means that an objective A-addressed local composition Rh in ambient space Rhythm(Para) is a subset of $A@Onset \sqcup A@Onset \times A@Para$ (with corresponding functors of these forms). For any period $p : 0 \rightsquigarrow Duration(p)$, we have the evident translation action $e^p$ on Rhythm(Para), and therefore on the local Para-rhythms, in the sense of section 7.2.2.
Definition 41 With the above notation, for a rhythm germ $G \subset A@Rhythm(Para)$ and an interval [a, b] of extended natural numbers, a local (A-addressed, objective) Para-rhythm is an A-addressed local composition R defined by
$R = \bigcup_{t \in [a,b]} e^{tp}G$
and denoted by $e^{[a,b]p}G$.
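The union of translates in this definition can be illustrated at the zero address. The sketch below assumes that events are pairs (onset, parameter) with rational onsets and that the interval [a, b] is finite; the function names are illustrative only.

```python
from fractions import Fraction

def translate(germ, shift):
    """Shift every event of the germ in onset time, leaving parameters fixed."""
    return {(onset + shift, para) for onset, para in germ}

def local_rhythm(germ, period, a, b):
    """Union of the translates of the germ by t * period for integers t in [a, b]."""
    result = set()
    for t in range(a, b + 1):
        result |= translate(germ, t * period)
    return result

# A two-note germ repeated three times with period 1/2.
germ = {(Fraction(0), 60), (Fraction(1, 4), 62)}
rhythm = local_rhythm(germ, Fraction(1, 2), 0, 2)
print(sorted(rhythm))
```

The infinite case (b equal to infinity among the extended natural numbers) is not covered by this finite loop.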
We have an obvious projection endomorphism
$p_{meter} : Rhythm(Para) \to Rhythm(Para)$
which is the identity on cofactor Onset and projects onto the first factor on R(Para). For a local composition L in Rhythm(Para), the projection $p_{meter}(L)$ is called the associated metrical rhythm, and the intersection L ∩ Onset is called the metrical component of L. A local Para-rhythm defined by a singleton germ G = {g} is called a local Para-meter. In conformity with the known terminology, a local Para-meter is again called a local meter if its germ lives in the cofactor Onset and is equal to its metrical component; it is called a local rhythm if its germ lives in the cofactor R(Para). A finite local Para-meter M with germ {g} may be written as $M = e^{[0,b]p}\{g\}$, with nonnegative b; we abbreviate this object by M = [b, p, g]. For a local Para-meter M, g is called the origin onset, while b = l(M) is its length. For length zero, we shall always assume the period to be also zero; the now uniquely defined period of a local Para-meter M is denoted by p(M). The second requirement is met by suitably generalizing the form Para: Suppose that we want to take local rhythms from different parameter spaces
$Para_1, Para_2, \ldots, Para_n$.
Then we may collect these options to
$Para \xrightarrow[Id]{} Colimit(Para_1, Para_2, \ldots, Para_n)$,
and the second problem is also solved.
Lemma 18 The intersection of any two A-addressed local Para-meters R, S is an A-addressed local Para-meter.
Proof. Let R = [b, p, g], S = [c, q, h] be a representation by germs, periods, and origins. Suppose that R ∩ S is not empty (otherwise, the empty germ will do it), and take $x = e^{sp}g = e^{tq}h$. This implies that either g, h are both onsets, or they have the same Para-component. WLOG^15, we may assume the first case, i.e., $g, h \in A@\mathbb{R}$ are A-addressed onsets. So g, h have the same linear parts and if their translation parts are $\gamma, \eta \in \mathbb{R}$, respectively, we have the equation $sp + \gamma = tq + \eta$ of real numbers. If there is no other common point, we are done with length zero. Else, if there is another common value $w = s^*p + \gamma = t^*q + \eta$, we have $\Delta = (s - s^*)p = (t - t^*)q$. We may suppose that Δ is the smallest possible positive number of this type. Then we claim that, for the maximal integer interval [a, b], a ≤ b, such that $e^{[a,b]\Delta}w \subset R \cap S$, this local meter equals R ∩ S. Clearly, $e^{[0,0]\Delta}w$ is in R ∩ S. So, there are such numbers a, b. Take a maximal interval. It is evidently unique. Take any element $y = s^{**}p + \gamma = t^{**}q + \eta$ in $R \cap S - e^{[a,b]\Delta}w$. It cannot lie between two elements of the local meter $e^{[a,b]\Delta}w$ since this would contradict minimality of Δ. So the point lies to the right or left of the set $e^{[a,b]\Delta}w$. WLOG, suppose it lies to the right. Then we evidently can increase b until we reach y, and this is a contradiction. So the intersection is this local meter.
QED.
Lemma 19 For two local A-addressed Para-meters R, S, if R ⊂ S, then l(R) ≤ l(S).
Proof: Exercise.
With this terminology, the interpretation $G^I$ of an objective, A-addressed local composition G in ambient space Rhythm(Para) is said to be (Para-)rhythmical iff every chart is a local (Para-)rhythm; it is said to be (Para-)metrical iff every chart is a local (Para-)meter. These definitions encompass Vuza's concepts of rhythms and of canons developed in his works ~52, 554, 555, 556, 55~.
15 WLOG = without loss of generality.
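Lemma 18 can be explored on small cases. The following sketch assumes zero-addressed local meters with integer onsets and positive integer periods, written [b, p, g] as above; `meter` and `as_meter` are illustrative helper names. The empty intersection, which the proof handles via the empty germ, is reported as `None` here.

```python
def meter(b, p, g):
    """Onsets of the local meter [b, p, g]: g, g + p, ..., g + b * p."""
    return {g + t * p for t in range(b + 1)}

def as_meter(points):
    """Return (b, p, g) if the finite onset set is a local meter, else None."""
    pts = sorted(points)
    if not pts:
        return None
    if len(pts) == 1:
        return (0, 0, pts[0])            # length zero, period zero by convention
    p = pts[1] - pts[0]
    if all(y - x == p for x, y in zip(pts, pts[1:])):
        return (len(pts) - 1, p, pts[0])
    return None

R = meter(11, 2, 0)                      # 0, 2, ..., 22
S = meter(5, 3, 3)                       # 3, 6, ..., 18
print(as_meter(R & S))
```

In this example the intersection is {6, 12, 18}, i.e. the local meter [2, 6, 6], whose period is the least common multiple of the two periods.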
In general, it is not easy to decide how to interpret a local composition by (Para-)rhythms since the decision between large repetition numbers and large germs has no evident rationale. When taking (Para-)meters, the situation is easier. We have several basic examples of (Para-)metric interpretations of an A-addressed objective local composition X in Rhythm(Para).
1. Consider the set Max(X) of all maximal local A-addressed Para-meters contained in X. Observe that by lemma 18, the set MaxMet(X) of all finite intersections of elements of Max(X) consists of local A-addressed Para-meters which form a base of the so-called maximal meter topology on X. We identify it with the corresponding covering and also denote it by the interpretation symbol $X^{MaxMet}$. Its charts are called the canonical (local A-addressed) Para-meters of X.
2. Take the atlas of all local A-addressed Para-meters U ⊂ X such that their lengths l(U) are at most equal to a limit L. This interpretation is denoted by $X^{MetLg[L]}$.
3. Select the set of all local A-addressed Para-meters U ⊂ X such that their periods p(U) are at most equal to a limit P. This interpretation is denoted by $X^{MetPer[P]}$.
4. For any interpretation $K^I$ of a finite local composition K, we may introduce a level function $lev : I \to \mathbb{N}$ as follows. We define the inverse images $lev_i = lev^{-1}(i)$ by recursion:
• $lev_0 = Max(I)$, the set of (set-theoretically) maximal members of I;
• $lev_{i+1} = lev_0(I - \bigcup_{k \le i} lev_k)$.
The charts in $lev_j$ are called charts of level j, and we set $I|j = \bigcup_k$
$\bigcap : n(X^{MaxMet|0}) \to X^{MaxMet} : \sigma \mapsto \bigcap \sigma,$ (13.6)
i.e., $\sigma \subset \tau$ implies $\bigcap \sigma \supset \bigcap \tau$, whence the claim. QED.
The open set $U(x) = \bigcap Sp(x)$ is the minimal neighborhood of x. The dominance relations in this topology read as follows:
Lemma 20 With the above notation, if x, y are points in the local composition X, and if < is the dominance relation^16 on the maximal meter topology, then we have
y < x iff Sp(y) ⊂ Sp(x) iff U(x) ⊂ U(y).
16 See appendix F.2.1.
Proof. We have y < x iff x ∈ U(y) iff U(x) ⊂ U(y), whence Sp(y) ⊂ Sp(x), whence x ∈ U(x) ⊂ U(y) and therefore U(x) ⊂ U(y). QED.
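The equivalence in Lemma 20 can be tested on a small onset set. This is a sketch under two assumptions that are not stated in the passage above: local meters are taken to be finite arithmetic progressions of integer onsets at the zero address, and Sp(x) is taken to be the set of maximal local meters containing x, so that U(x) is their intersection. Inclusions are read as non-strict.

```python
from itertools import combinations

def local_meters(X):
    """All local meters (singletons and arithmetic progressions) contained in X."""
    X = set(X)
    meters = {frozenset({x}) for x in X}
    for g, h in combinations(sorted(X), 2):
        p = h - g
        prog = [g]
        while prog[-1] + p in X:
            prog.append(prog[-1] + p)
            meters.add(frozenset(prog))
    return meters

def maximal(sets):
    """Members that are not properly contained in another member."""
    return {s for s in sets if not any(s < t for t in sets)}

def Sp(x, max_meters):
    """Assumed meaning: the maximal local meters passing through x."""
    return {m for m in max_meters if x in m}

def U(x, max_meters):
    """Minimal neighborhood of x: intersection of the meters in Sp(x)."""
    return frozenset.intersection(*Sp(x, max_meters))

X = {0, 1, 2, 3, 4, 6}
MAX = maximal(local_meters(X))
print(sorted(sorted(m) for m in MAX))
print(sorted(U(2, MAX)))
```

For this X there are four maximal local meters, and the onset 0 lies on three of them, so it is the kind of point that the text calls generic for a maximal simplex.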
Proposition 12 The irreducible closed sets of the maximal meter topology are the closures of their points with maximal simplexes in the nerve.
Proof. Suppose F is an irreducible closed set and x is a point having maximal simplicial metrical weight among the elements of F. If y is any point of F, the minimal neighborhoods U(x), U(y) intersect, so take a z in that intersection. This point dominates x and y. By maximality of the simplicial metrical weight of x, the simplicial metrical weight of z is equal to the weight of x, and lemma 20 implies that Sp(x) = Sp(z), so x dominates y. The rest is clear. QED. For example, the closures of points of maximal simplicial weight are irreducible components, but not necessarily vice versa. The irreducible components correspond to the maximal simplexes of the nerve $n(X^{MaxMet})$, and the generic points are the elements of the images of the maximal simplexes under the map (13.6). Musically speaking, the generic points are those which participate in maximal collections of local Para-meters, and this means that they are present in a maximal collection of regular patterns in onset time. In chapter 21, we shall introduce refined metrical weight functions which generalize the above simplicial weight, and which take care of the lengths of the local Para-meters as well as of the periods and the positions of the points within the maximal local Para-meters.
13.4.3.1
Macros for Rhythmic Germs
We have seen that for general rhythmic germs, there is no unique maximality of germs, and that the metrical special case is built upon the singleton germ set. There are however two approaches in order to circumvent such a problem. The first deals with address change. Suppose we are given a finite germ G which is an A-addressed local composition which is completely contained either in the sub-form Onset or in the sub-form R(Para) of Rhythm(Para), and where $Para \simeq @PARA$ is simple with a module PARA. This means that the germ has coordinates in a module. Then the elements of G are in A@M, where M is one of the modules $\mathbb{R}$, $\mathbb{R} \oplus PARA$, and we suppose that the coefficient ring for A is $\mathbb{R}$. In this case, by the construction of fiber sums of modules (appendix E.3.8), we have^17 $(A@M)^n \simeq A^{\oplus n} \oplus \mathbb{R}^{n-1}@M$ and we can parametrize an n-element germ G by the n-tuple (G) of A-addressed points, and then by the corresponding point $\sqcup(G)$ with address $\sqcup^n A = A^{\oplus n} \oplus \mathbb{R}^{n-1}$. The second solution to ambiguities in germ definitions uses macros and a flattening operation^18, as discussed in section 6.7. We start from the hypothesis of the preceding discussion and therefore may have the germ G living in a simple form Basic of module $M = \mathbb{R} \oplus PARA$ where, for the sake of mathematical simplicity, the first factor parametrizes the Onset coordinates. We need the circular form KnotBasic:
17 The bijection takes the n-tuple $(g_i = e^{t_i} \cdot g_{i,0})_i$ into $e^{t_1} \cdot g$, where $g(a_1, \ldots) = g_1(a_1)$, $g(0, \ldots a_i, \ldots) = g_{i,0}(a_i)$ for 1 < i, and $g(0, \ldots 1_{n+j}, \ldots) = t_{j+1} - t_1$ for j = 1, ... n - 1. The injections $\iota_i : A \rightarrowtail \sqcup^n A$ are these: $\iota_1(a) = (a, 0, \ldots)$, $\iota_i(a) = (0, \ldots a_i = a, 0, \ldots 1_{n+i-1}, \ldots, 0)$ for 1 < i.
18 The present flattening operation is just the infinite one from the previous discussion.
$KnotBasic \xrightarrow[Id]{} Limit(Basic, MakroBasic)$
with $MakroBasic \xrightarrow[f]{} Power(KnotBasic)$, where $f : F \xrightarrow{\sim} 2^{FK} \rightarrowtail \Omega^{FK}$, and F = Fun(MakroBasic), FK = Fun(KnotBasic).
We have the flattening operation on KnotBasic denotators D = (b, m) defined by
1. $Flatten^{\infty}(D) = \{b\}$ if m = ∅,
2. $Flatten^{\infty}(D) = \bigcup_i b + Flatten^{\infty}(Knot_i)$ if $m = \{Knot_1, \ldots Knot_k\}$.
Then the idea is to define a macro germ G* such that the given, 'flat' germ G is the flattened version of the former, $G = Flatten^{\infty}(G^*)$. For example, we may take a germ which contains ornaments as MakroBasic-formed denotators, and/or drum patterns or the like as 'satellites' of the macro. The translation in time is the same as before, only that it operates exclusively on the basic event in the knot macro, i.e., if $e^p$ is a period translation in time, and if the macro germ is G* = (b, m), we define $e^p(G^*) = (e^p(b), m)$. An A-addressed local Para-rhythm can be paraphrased by the local compositions in ambient space KnotBasic; they are local compositions of the shape $e^{[a,b]p}G^*$, built upon macros G*. In this language, a global rhythm is an interpretation of a local composition in ambient space KnotBasic by local Para-rhythms of macro type. So global macro rhythms can be projected to global rhythms by flattening, and they constitute a hierarchical shaping of the flat rhythmical germs. We should stress that these reflections are merely 'germs' of a theory of macro-events, and that we are far from having established any systematic treatment of such objects. But there is an evident relation of this approach to the Schenker idea, a perspective which we shall reconsider in chapter 21 when discussing grouping rules in Jackendoff-Lerdahl's GTTM.
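The flattening recursion and the translation of a macro germ can be sketched as follows. The encoding is an assumption made for illustration: a knot is a pair (b, m), where b is a tuple of numbers with the onset in the first position and m is a tuple of sub-knots; `flatten` and `translate_macro` are hypothetical names.

```python
def add(b, c):
    """Coordinatewise sum of two basic events."""
    return tuple(x + y for x, y in zip(b, c))

def flatten(knot):
    """Flattening: {b} for an empty macro, else the union of b + flatten(sub-knot)."""
    b, m = knot
    if not m:
        return {b}
    result = set()
    for sub in m:
        result |= {add(b, c) for c in flatten(sub)}
    return result

def translate_macro(knot, p):
    """Time translation acting only on the basic event of the knot."""
    b, m = knot
    return ((b[0] + p,) + b[1:], m)

# An anchor event (onset 0, pitch 60) with three satellites given relative to it.
ornament = ((0, 60), (((0, 0), ()), ((1, 2), ()), ((2, 0), ())))
print(sorted(flatten(ornament)))
print(sorted(flatten(translate_macro(ornament, 4))))
```

In this small example, translating the macro and then flattening gives the same events as flattening first and translating every flat event, which is what the projection from macro rhythms to flat rhythms relies on.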
13.4.4
Motivic Interpretations: Melodies and Themes
Summary. Motives are considered as being local ingredients of melodies. We discuss the local/global dichotomy in this subject, and, in particular, the subtle interpretative activity in motivic analysis.
Concepts such as "motif", "melody", "theme" are among the most fuzzy of musicology. There are three main sources of such fuzziness: On the one hand, the very concepts are not clear, neither in the delimitation against each other, nor in the intrinsic attribution. On the other hand, it is not clear how to compare motives, for example, how to decide when two motives are "similar" . Thirdly, the concept of a theme or motive is not only a structural one, but includes a semiotic perspective. A theme is a subcomposition of a composition which plays a certain role in the entire construction of the composition qua expression of meaning. We shall deal with this aspect in chapter 22. We should also stress that a theme is not necessarily related to the motif
concept; a theme may as well be a harmonic or rhythmic object. Its characteristic property is the semiotic role, not the structure. In this section, we briefly focus our attention on the problem of elementary and compound motivic structures. We have already introduced the motif concept in definition 15 (subsection 7.2.3) as an objective local composition with onset and pitch coordinates in its ambient space, and such that the onset projection is bijective. We should therefore define a "motivic" interpretation as being an interpretation of a local composition in a determined motif ambient space (having onset and pitch factors) such that every chart is a motif in this space. In particular:
Definition 42 A melody is defined as a motivic interpretation of a motif. In fact, a melody is a local composition with successive tones in time. But it is more: it is understood as a compound object with "characteristic" melodic ingredients, the melody's motives. Now, these units are not neutral data, they depend upon the interpretative, analytical, or poietic interaction. In other words: they are a covering of the motivic tone material of the melody by specific selected submotives. In traditional musicology (see, e.g., the article "Melodie" in the Riemann Musiklexikon [457, p.554]), a melody is also associated with the Ehrenfels gestalt qualities of super-summativity and transformational invariance. The AST has dealt with this aspect in the contour theory. We shall deal with this ramification in chapter 22. Gestalt is a central invariant of melody, but it is not the intrinsic concept, it is a derived attribute which comes out from a complex and not uniquely defined abstraction process. Every contour or gestalt or shape concept is derived from the motif structure by elaboration of specific motif aspects, such as interval sequences, onset-pitch-interval angles, length ratios, diastematic ambitus, etc. Before the gestalt aspect comes in, the motivic interpretation which defines the melody upon a motif has a more elementary-but nonetheless dramatic-function which we discuss now. If we cover a motif by characteristic motifs, as expressed by the melody's nerve, this action reflects a disintegration of the original underlying melody motif into several relatively small parts which may intersect in a rather loose way. For example, the covering may split into several disconnected (disjoint) motives, or motives may have just one tone in common. The crucial question is whether we should keep in mind the original melodic motif (the entire set of tones) or whether the splitting activity is really meant as a decomposition in henceforth in- or inter-dependent parts. 
So this elementary interpretation activity precedes refined considerations: it deals with the initial statement of which are the parts and which is the whole. But even if we are given two fixed motivic interpretations, i.e., melodies, it is not clear how one should compare two such objects. Gestalt is a concept of equivalence which should be built upon comparison devices for melodies. The only thing which is clear now is that melodies are not neutral objects, they are results of interpretative interventions. We are urged to set up the basic tools for comparison of global compositions, and this is what we shall do in the next chapter.
Chapter 14
Global Perspectives
The woods are lovely, dark, and deep,
But I have promises to keep,
And miles to go before I sleep,
And miles to go before I sleep.
Robert Frost (1874-1963)
Summary. Global perspectives deal with relations among global music objects. We introduce this subject together with its musical motivation and append the formal definition of morphisms among global compositions. This leads to the categories ObGlob of objective and Glob of functorial global compositions. We describe the combinatorial aspects of globality and associated functors, as well as corresponding geometric classification tools: nerves and simplicial weights.
14.1
Musical Motivation
Summary. Since global compositions are of musical and musicological interest, requiring their comparison is quasi-automatic. However, this is not in the tradition of musical analysis. We discuss this issue with regard to comparative analysis of melodic themes in a composition, and in the more general context of comparative analysis of musical works. It turns out that the latter is virtually non-existent, at least on the level of a reliable scientific language.
Mathematically, it is straightforward how to extend the categories of local compositions to categories of global compositions. But musicologically, this is not a common situation. On the contrary, the musicological fuzziness of global structures appearing in melodic analysis, which we have made precise via motivic interpretations, makes it impossible to take a step further in the comparative methodology of global structures. In the typical case of a comparison of two melodies, or in the discussion of the motivic-thematic work in, say, Beethoven's compositions, it is intuitively accepted that one deals with smaller and larger units, and that comparison
333 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
should take into account comparison of parts, but it is not clear how the relations of parts within a first melody (for example) should be taken over to relations among parts of a second melody. Comparison of the gluing of parts is not conceived. So Ehrenfels' requirement of an "added value" to the "sum of parts" against the "whole" is not understood: How much more do we need to get the whole? How can we compare this specific difference? For example, if we have split a local composition into isolated singleton charts (the 'silly' interpretation), the disintegration is extremal: We essentially have to count the tones, and there is no connection left. So the comparison reduces to counting notes, the cardinality of the original local composition. This is the trivial case where the added value is naught. But in general, for example with the harmonic strip, there is a rich connectivity, in fact the Möbius strip, in the nerve. So the added value is complex: it lies in the single charts (the triads) as well as in their intersections (thirds or single notes) and also in the combinatorial configuration. So, how strongly is the harmonic strip related to the original scale? Are there many scales with essentially equal harmonic strips? The same situation holds for motivic analysis: If we take the main theme of Bach's Art of Fugue, is it the 8-tone theme or the 12-tone theme? This still debated question depends on the development of a sophisticated technique of motivic analysis of melodies or themes. And, finally, without answering these questions, a comparison of gestalts is a fortiori impossible. On the other hand, overall comparative analysis of musical works of course transcends the theory of local or global compositions.
In fact, the denotators of musical works, such as European scores, cannot be grasped by compositions (in the technical sense), they deal with more general forms of all possible typologies: limits, colimits, synonymy, and all kinds of simple forms, see the contribution of Mariana Montiel Hernandez [378] concerning a standard form for piano scores. In this chapter we shall introduce only comparative tools (morphisms) for global compositions. The category theory for general denotators is still embryonal.
14.2
Global Morphisms
Summary. This section introduces the technical definition of morphisms between global compositions. Special aspects, e.g., isomorphisms or composition of morphisms, are discussed. The concept of interpretable global compositions is presented and illustrated.
The first definition regards objective global compositions:
Definition 43 Suppose we are given two objective global compositions $G^I$ at address A and $H^J$ at address B. Then a morphism from $G^I$ to $H^J$ is a triple (f, ι, α) where
1. $f : G \to H$ is a set map,
2. $\iota : I \to J$ is a set map such that $f(i) \subset \iota(i)$ for all covering sets i ∈ I,
3. α is a family $\alpha = (\alpha_i : A \to B)_I$ of address changes, such that
4. for any atlases $(K_t, A@E_t)_T$ for G and $(L_s, B@F_s)_S$ for H, if we take the chart isomorphisms $\phi_t : K_t \simeq I_t$ and $\psi_s : L_s \simeq J_s$ for some pair t, s of indexes which correspond under the map ι (i.e., $\iota(I_t) = J_s$), then the induced maps $f_t : K_t \to L_s$ define morphisms $f_t/\alpha_i : K_t \to L_s$ of objective local compositions.
This morphism will be denoted by $f^{\iota}/\alpha : G^I \to H^J$ or, if no misunderstanding is possible, by $f/\alpha : G \to H$.
Observe that this is the precise generalization of the morphism concept from objective local compositions. For functorial global compositions, the definition of morphisms runs correspondingly:
Definition 44 Suppose we are given two functorial global compositions $G^I$ at address A and $H^J$ at address B. Then a morphism from $G^I$ to $H^J$ is a triple (f, ι, α) where
1. $f : G \to H$ is a natural transformation,
2. $\iota : I \to J$ is a set map such that we have subfunctor relations $im(f|_i) \subset \iota(i)$ for all covering functors i ∈ I,
3. α is a family $\alpha = (\alpha_i : A \to B)_I$ of address changes, such that
4. for any atlases $(K_t, @A \times E_t)_T$ for G and $(L_s, @B \times F_s)_S$ for H, if we take the chart isomorphisms $\phi_t : K_t \simeq I_t$ and $\psi_s : L_s \simeq J_s$ for some pair t, s of indexes which correspond under the map ι (i.e., $\iota(I_t) = J_s$), then the induced maps $f_t : K_t \to L_s$ define a morphism $f_t/\alpha_i$ of functorial local compositions.
This morphism will also be denoted by $f^{\iota}/\alpha : G^I \to H^J$ or, if no misunderstanding is possible, by $f/\alpha : G \to H$.
If we have $G^I = H^J$, the identity morphism is the triple $(Id_G, Id_I, Id_A)$. This being the case, defining the composition of morphisms of global compositions is straightforward.
Definition 45 Given three objective or functorial global compositions $G^I$, $H^J$, $K^L$, a morphism $f^{\iota}/\alpha : G^I \to H^J$, and a morphism $g^{\kappa}/\beta : H^J \to K^L$, their composition $h^{\zeta}/\gamma = g^{\kappa}/\beta \cdot f^{\iota}/\alpha : G^I \to K^L$ is defined by the triple $(h = g \cdot f, \zeta = \kappa \cdot \iota, \gamma = \beta \cdot \alpha)$, where $\beta \cdot \alpha$ is the family $(\beta_{\iota(i)} \cdot \alpha_i)_I$.
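The bookkeeping in this composition rule can be made concrete in a small sketch. It models only the combinatorial part of a morphism: set maps as dictionaries, charts as frozensets, and address changes as plain Python callables, which is an assumption made for illustration. Condition 4 of the definitions (the local morphism property on charts) is not modeled, and the names `is_compatible` and `compose` are hypothetical.

```python
def is_compatible(f, iota, charts):
    """Condition 2: the image f(i) lies in iota(i) for every chart i."""
    return all({f[x] for x in i} <= iota[i] for i in charts)

def compose(second, first):
    """Compose (g, kappa, beta) after (f, iota, alpha), chart by chart."""
    f, iota, alpha = first
    g, kappa, beta = second
    h = {x: g[f[x]] for x in f}
    zeta = {i: kappa[iota[i]] for i in iota}
    # gamma_i = beta_{iota(i)} after alpha_i, as in the family (beta . alpha).
    gamma = {i: (lambda a, i=i: beta[iota[i]](alpha[i](a))) for i in iota}
    return h, zeta, gamma

# Toy data: three points covered by two charts, mapped to a single chart twice.
I1, I2 = frozenset({0, 1}), frozenset({1, 2})
J1 = frozenset({"a", "b"})
L1 = frozenset({"z"})
first = ({0: "a", 1: "a", 2: "b"}, {I1: J1, I2: J1},
         {I1: lambda a: 2 * a, I2: lambda a: 3 * a})
second = ({"a": "z", "b": "z"}, {J1: L1}, {J1: lambda a: a + 1})

h, zeta, gamma = compose(second, first)
print(h, gamma[I1](3), gamma[I2](3))
```

The point of the sketch is the indexing of the address changes: the second family is evaluated at the chart ι(i), not at i itself.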
Exercise 27 Show that composition of global morphisms is associative whenever corresponding factors are defined. Show that the identity morphisms are in fact left and right identities in the sense of category theory.
Definition 46 The category of objective global compositions ObGlob has the objective global compositions as its object set ${}_0ObGlob$, as defined in definition 36, whereas the set ${}_1ObGlob$ of morphisms is defined by definition 43. The category of functorial global compositions Glob has the functorial global compositions as its object set ${}_0Glob$, as defined in definition 38, whereas the set ${}_1Glob$ of morphisms is defined by definition 44.
It is useful to view functorial global compositions as a system of objective global compositions as follows:
Lemma 21 Let G I be a functorial global composition at address A. Suppose that ¢" : K" ~ II< and ¢),. : K),. ~ h are any two charts for G I . Fix any address change f : B - t A, and consider the subsets f@KI< C B@K", f@K),. c B@K),.. Then the intersection of their images under B@¢I< and B@¢),., respectively, with B@II< n B@h coincide. Proof. In fact, the intersections are the same since the induced isomorphism ¢",)"/ a on the inverse images of the intersection have denominator a = IdA, and therefore, the parts related to f correspond. Definition 47 For every address change f : B - t A and a functorial global composition G I at address A, we have a B-addressed objective global composition fOOl which is defined on the subset of B@G which is locally covered by the images f@I" of the chart subsets f@KI< of any atlas, and which has the atlas defined by the second factors in the charts f@K" = {f} x K",!, together with the given transition isomorphisms. We denote this covering of foo by f@I, and call the objective global composition the f-slice of G I . The entire construction is well defined by lemma 21. The identity slice IdA 001 is denoted by (G I ) V • Example 23 Given two interpretations GI , G J of an objective local A-addressed composition G, and a refinement map, i.e., a map L : I - t J with i c L(i), all i E I, then the identity Ide
•
and L induce the refinement morphism IdA/IdA: G I - t GJ, also denoted by L. In particular, any interpretation GI yields a unique refinement morphism! : GI - t G. For example, with the notation of 13.4.3, we have a chain of refinements
(14.1) of an A-addressed finite objective local composition K. If the local composition is a major or minor scale scale X as discussed in section 13.4.2, we may consider the different versions of degrees. We have discussed the triadic interpretation X(3) , but this is just one possibility. More generally, set X = {Xl, X2, ... X7} in the order of increasing position on ::£12, i.e., also with indexes mod 7. One starts with the singleton interpretation X(1), where (1) = {{xi}1 i = 1,2, ... 7}; the third interpretation X(2) is defined by charts of thirds, i.e., (2) = {{Xi,Xi+2}1 i = 1,2, ... 7}; the triadic interpretation is defined by the atlas (3) = {{Xi, Xi+2, Xi+4} I i = 1,2, ... 7}; and the tetradic interpretation X(4) which is important in jazz harmonics is defined by the atlas (4) = {{Xi, Xi+2, Xi+4, Xi+6}1 i = 1,2, ... 7}. With this setup, we have 27 refinement maps for each succession of the above atlases. More precisely, if Zi is the degree Z in X(i), we have two embeddings Zi C Wi+1 depending on whether we take Zi as the lower or higher part within Wi+1. The lower embedding is denoted by 0, the higher one by 1, and the refinement map 8 = (81, 82 , ... 87 ) : (i) - t (i + 1) denotes the choice of 8i E BIT = {O, I} for the ith value. So we have this succession of refinements, ending with the unique morphism to the local scale composition:
(14.2) which has, of course, an important musicological meaning: whenever we are given a chart of atlas (i), it may be reinterpreted as being one of two charts of the successive refinements, etc., and
14.2. GLOBAL MORPHISMS
[Figure 14.1 labels: degrees I to VII; singletons; scale as a whole; note; degree]
this documents the vast ambiguity in elementary harmony of degrees! For the corresponding nerves, see figure 14.1. This ambiguity is also dramatic with regard to the geometric configuration of the nerve. The absence of orientation on the harmonic strip vanishes on the tetradic interpretation of jazz. This may not only ease jazz harmony; more radically, it throws the context of jazz harmony one dimension higher: moving in a three-space such as the torus is more complex, there is more freedom of choice, more improvisational flexibility, at the cost of unambiguous harmonic syntagmatics.

Figure 14.1: The four interpretations of a major or minor scale, starting with the singletons, up to the tetradic interpretation for jazz harmony. We have successive refinement morphisms, and associated maps between nerves (see section 14.4 for this association). Every interpretation uniquely maps to the 'uninterpreted' local scale composition shown to the right. Observe that the tetradic interpretation has a nerve which is a full torus, in fact a union of tetrahedra, the intersection configuration around any fixed tone. The harmonic strip of the triadic interpretation is embedded in the tetradic torus as a strip which is 'entwined' along the torus' interior circle. So the absence of orientation of the harmonic strip vanishes on the tetradic jazz harmony.

Example 24 This example refers to the analysis in [328] of the first movement of Beethoven's 'Hammerklavier' sonata Op. 106. The general harmonic principles of this sonata yield a motivic germ $G \subset \mathit{Onset} \times \mathit{Pitch}_{\mathbb{Z}[1/2]}$ with coefficients in the localization ring $\mathbb{Z}[1/2]$ of fractions with denominators that are powers of 2. This germ is essentially an ascending and then descending
CHAPTER 14. GLOBAL PERSPECTIVES
[Figure 14.2 label: G(2)]
chromatic scale of three half-tone steps 'octave' period. The claim of our analysis is that the motivic work (German: "motivisch-thematische Arbeit") relates to interpretations of the Pitch-rhythms of type $G(t) = e^{[0,t]6}G$, see also figure 14.2 for this 'motivic zig-zag'.

Figure 14.2: The germ G of motivic work in Beethoven's 'Hammerklavier' sonata Op. 106 is essentially an ascending and then descending chromatic scale of three half-tone steps 'octave' period. Here we show the Pitch-rhythm with three copies of the germ.

In [328] it is shown
that a large number of motivic charts can in fact be interpreted so as to become isomorphic to interpretations of local subcompositions of G(t). Of course, the problem is not the existence of such interpretations, since the silly interpretations always fit in such a framework. We therefore look for good quality interpretations.

Figure 14.3: The right-hand melody which appears in bars 75-76 and 77-78 of the first movement in op. 106.
Definition 48 Let $K^I$ be an interpretation of a local composition K. The quality of $K^I$ is the pair (s, p), where s = card(I), and p is the number of connected components of $N(K^I)$. Qualities are compared lexicographically: $(s_1, p_1) < (s_2, p_2)$ iff either $s_1 < s_2$, or $s_1 = s_2$ and $p_1 < p_2$. So the best quality is (1,1).

With this in mind, we look for best possible qualities in our interpretations. To begin with, we look at the right-hand melody which appears in bars 75-78 (figure 14.3). The (projected) representation K in Onset × Pitch space is shown to the score's right.
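Definition 48 lends itself to a small computation. The following Python sketch is only an illustration and not part of the original analysis: it assumes that an interpretation is given as a list of charts (finite sets of points), and the helper name `quality` as well as the sample charts are hypothetical. The number p is obtained from the 1-skeleton of the nerve, since two charts lie in the same connected component exactly when they are linked by a chain of non-empty intersections.

```python
from itertools import combinations

def quality(charts):
    """Return (s, p): the number of charts and the number of connected
    components of the nerve of the covering (hypothetical helper)."""
    charts = [frozenset(c) for c in charts]
    s = len(charts)
    parent = list(range(s))          # union-find forest over chart indexes

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Two charts are joined by a 1-simplex iff they intersect.
    for i, j in combinations(range(s), 2):
        if charts[i] & charts[j]:
            parent[find(i)] = find(j)

    p = len({find(i) for i in range(s)})
    return (s, p)

# Made-up sample data in Onset x Pitch coordinates.
one_chart = [{(0, 0), (1, 1), (2, 2)}]
three_disjoint = [{(0, 0), (1, 1)}, {(2, 5)}, {(3, 7), (4, 8)}]

print(quality(one_chart))       # (1, 1)
print(quality(three_disjoint))  # (3, 3)
# Python compares tuples lexicographically, exactly as in the definition.
print(quality(one_chart) < quality(three_disjoint))  # True
```

Representing the quality as a tuple means that the lexicographic comparison of the definition needs no extra code.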
Figure 14.4: The right-hand melody in bars 75-76 and 77-78 is isomorphic to a local subcomposition of the motivic 'zig-zag' under a vertical arpeggio A. In figure 14.4, we see that this melodic part is isomorphic as a local composition (quality (1,1)) to a subcomposition of G(2). The isomorphism is given by the matrix
A=e q
G~)
representing a vertical arpeggio. The second sample needs a proper interpretation of quality (3,3). The sample, a twelve-element motif, is shown to the left in figure 14.5. To the right we see the (projected) representation and interpretation $K^I$ of this motif in Onset × Pitch space. In figure 14.6 we see the subcomposition L of G(2), as well as its interpretation $L^J$ by three disjoint charts $L_1, L_2, L_3$. This interpretation is isomorphic to the interpretation $K^I$ under the following isomorphisms: $L_2$ and $L_3$ are translations of $K_2$ and $K_3$, respectively. The first chart $L_1$ equals $B \cdot K_1$ with a transformation B
= eq .
(2° 9) -1
where the linear part is an onset dilatation by 2, followed by a ninefold horizontal arpeggio, and the pitch inversion, an isomorphism since its determinant -2 is invertible in $\mathbb{Z}[1/2]$.

Example 25 Consider the two following non-isomorphic, zero-addressed three-element motives in $\mathit{OnPiMod}_{12,12}$:
K = {(0,0), (0,1), (1,0)}, isomorphism class Nr. 10,
L = {(0,0), (0,1), (3,11)}, isomorphism class Nr. 13,
and consider the interpretations $K^{(2)}$, $L^{(2)}$ by the three two-element 'interval' subcompositions. Evidently, every interval is isomorphic to {(0,0), (0,1)}, and since the intersections are singletons, we have $K^{(2)} \cong L^{(2)}$. So the isomorphism of the interpretations by submaximal charts is no guarantee for the isomorphism of the original local compositions.
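The two claims of this example can be checked by brute force. The sketch below is an illustration under stated assumptions: isomorphisms of zero-addressed local compositions in $\mathit{OnPiMod}_{12,12}$ are taken to be affine automorphisms $p \mapsto Mp + t$ of $\mathbb{Z}_{12}^2$ with M invertible, and the helper names (`GL2`, `apply`, `isomorphic`) are hypothetical. It only tests the charts and the motives themselves; the compatibility on the singleton intersections is the argument given in the text.

```python
from itertools import combinations, product
from math import gcd

N = 12

# All 2x2 matrices (a, b, c, d) over Z_12 whose determinant is a unit.
GL2 = [
    (a, b, c, d)
    for a, b, c, d in product(range(N), repeat=4)
    if gcd((a * d - b * c) % N, N) == 1
]

def apply(m, p):
    """Apply the matrix m = (a, b, c, d) to the point p = (x, y)."""
    a, b, c, d = m
    x, y = p
    return ((a * x + b * y) % N, (c * x + d * y) % N)

def isomorphic(A, B):
    """True if some affine automorphism p -> M p + t of Z_12^2 maps A onto B."""
    A, B = sorted(A), set(B)
    if len(A) != len(B):
        return False
    for m in GL2:
        image = [apply(m, p) for p in A]
        # The translation is fixed by where the first point is sent.
        for b in B:
            t = ((b[0] - image[0][0]) % N, (b[1] - image[0][1]) % N)
            if {((x + t[0]) % N, (y + t[1]) % N) for x, y in image} == B:
                return True
    return False

K = {(0, 0), (0, 1), (1, 0)}
L = {(0, 0), (0, 1), (3, 11)}
interval = {(0, 0), (0, 1)}

# Every two-element chart of K and of L is isomorphic to the standard interval.
all_intervals_standard = all(
    isomorphic(set(pair), interval)
    for motif in (K, L)
    for pair in combinations(sorted(motif), 2)
)

print(all_intervals_standard)  # True
print(isomorphic(K, L))        # False
```

The negative result can also be seen by hand: the difference vectors of K form a basis of $\mathbb{Z}_{12}^2$, whereas those of L always have determinant ±3, which is not a unit mod 12.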
Figure 14.5: A twelve-element alto-voice motif K in bars 79-80 in the first movement of op. 106.

Exercise 28 Consider the following zero-addressed local compositions in $\mathit{PiMod}_{12}$:

C = {0, 2, 4, 5, 7, 9, 11}, C-major scale, isomorphism class Nr 38"';
X = {0, 2, 3, 4, 7, 10, 11}, isomorphism class Nr 50"';

which are not isomorphic. Take the triadic interpretation $C^{(3)}$ and the following triadic interpretation of X:

$I_X = \{11, 3, 0\}$, $II_X = \{7, 10, 2\}$, $III_X = \{3, 0, 4\}$, $IV_X = \{10, 2, 11\}$, $V_X = \{0, 4, 7\}$, $VI_X = \{2, 11, 3\}$, $VII_X = \{4, 7, 10\}$

and call the associated interpretation $X^{(3)}$. Show that we have an isomorphism

$\psi : X^{(3)} \xrightarrow{\sim} C^{(3)}$

which is defined by

f(0) = 7, f(2) = 9, f(3) = 4, f(4) = 11, f(7) = 2, f(10) = 5, f(11) = 0,
$\psi(Y_X) = Y_C$, Y = I, ... VII.

Show that this example fails if we do not allow isomorphisms with fifth or fourth transformations, i.e., if only inversions and transpositions are allowed, then we have $C \cong X$ as soon as $C^{(3)} \cong X^{(3)}$.
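As a numerical sanity check (not a proof, and not part of the original text), the data of Exercise 28 can be tested directly. The sketch assumes that C is the C-major scale {0, 2, 4, 5, 7, 9, 11}, that the triads of C are numbered by stacked thirds on the scale degrees, and that symmetries are the affine maps $x \mapsto ax + b$ on $\mathbb{Z}_{12}$ with a invertible; all helper names are hypothetical.

```python
UNITS = (1, 5, 7, 11)  # invertible elements of Z_12

# The bijection f : X -> C from the exercise.
f = {0: 7, 2: 9, 3: 4, 4: 11, 7: 2, 10: 5, 11: 0}

X_triads = {
    "I": {11, 3, 0}, "II": {7, 10, 2}, "III": {3, 0, 4}, "IV": {10, 2, 11},
    "V": {0, 4, 7}, "VI": {2, 11, 3}, "VII": {4, 7, 10},
}

C = [0, 2, 4, 5, 7, 9, 11]
DEGREES = ["I", "II", "III", "IV", "V", "VI", "VII"]
# Triad on degree i: scale tones i, i+2, i+4 (indexes mod 7).
C_triads = {
    name: {C[i % 7], C[(i + 2) % 7], C[(i + 4) % 7]}
    for i, name in enumerate(DEGREES)
}

def affine_maps_agreeing_with_f(points):
    """All (a, b) with a invertible such that a*x + b = f(x) on the given points."""
    return [
        (a, b)
        for a in UNITS
        for b in range(12)
        if all((a * x + b) % 12 == f[x] for x in points)
    ]

# f carries each triad Y_X onto the triad Y_C with the same degree name.
charts_ok = all({f[x] for x in X_triads[n]} == C_triads[n] for n in DEGREES)

# On each single chart, f is the restriction of an affine symmetry ...
local_maps = {n: affine_maps_agreeing_with_f(X_triads[n]) for n in DEGREES}

# ... but no single affine symmetry realizes f on all of X.
global_maps = affine_maps_agreeing_with_f(f.keys())

print(charts_ok)                                # True
print(all(local_maps[n] for n in DEGREES))      # True
print(global_maps)                              # []
```

The chartwise solutions use both a = 1 (transposition) and a = 7 (fifth transformation), which is the point of the last part of the exercise.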
Exercise 29 Show that the ecclesiastic modes (section 13.4.2) are rigid, i.e., they have trivial automorphism group: Aut(Xj,t) = 1; it is even true that any two different ecclesiastic modes
14.3. LOCAL DOMAINS
[Figure 14.6 labels: L, K']
on the same scale are not isomorphic. This is false for modes $X_t$; in fact, since $\mathrm{Aut}(C) = \langle U_d \rangle$, we have a permutation of these modes (we take the authentic mode names):

$U_d(C_d) = C_d$, dorian ↔ dorian
$U_d(C_e) = C_c$, phrygian ↔ ionian
$U_d(C_f) = C_b$, lydian ↔ locrian
$U_d(C_g) = C_a$, mixolydian ↔ aeolian

and this means that modes become rigid when transformed into their ecclesiastic castings, a completely traditional effect...

Figure 14.6: The interpretation of the alto voice in bars 79-80 of quality (3,3).
14.3
Local Domains
Summary. The morphisms which are addressed in local compositions separate morphisms of global compositions. We discuss the musical consequences.
The Yoneda lemma (see our discussion in section 9) tells us that we may replace any global composition X in Glob by its functor $@X \in \mathit{Glob}^@$ and we shall not lose any information about its isomorphism class; more precisely, for any two global compositions X, Y, we have a canonical bijection $\mathrm{Hom}(X, Y) \xrightarrow{\sim} \mathrm{Hom}(@X, @Y)$. It is a fundamental question in every category to find out whether some determined subclass of arguments of the functor @X is already rich enough to determine X up to isomorphism. In algebraic geometry, it is well known ([198]) that the restriction $@S|_{\mathit{Aff}}$ of the scheme functor @S of a scheme S to the subcategory Aff of the affine schemes is classifying, i.e., if S, T are two schemes, we have a bijection $\mathrm{Hom}(@S, @T) \xrightarrow{\sim} \mathrm{Hom}(@S|_{\mathit{Aff}}, @T|_{\mathit{Aff}})$. We shall now show that we have a weaker result in the theory of global compositions, i.e., the local arguments are 'separating'.
Definition 49 Let X be a global composition in Glob. By $@_{loc}X$ we denote the restriction of the Yoneda functor @X to the subcategory Loc of local compositions. By definition, we have $Y@_{loc}X = Y@X = \mathrm{Hom}(Y, X)$ for any local (functorial) composition Y.

Since a local composition has just one chart, the map on the atlases for a local domain Y reduces to the indication of one chart $I_i$ or chart index i in the target composition X. Therefore we write $f^i/\alpha : Y \to X$ to indicate elements of $Y@_{loc}X$.
Lemma 22 If $X^I$, $Y^J$ are local compositions at addresses A, B, respectively, the canonical restriction map

$loc : \mathrm{Hom}(@X^I, @Y^J) \to \mathrm{Hom}(@_{loc}X^I, @_{loc}Y^J)$

is injective, i.e., local domains are separating. If $X^I = X$ is a local composition, then loc is a bijection.
Proof. Let $u : @X^I \to @Y^J$ be a natural transformation. We show that u is uniquely determined by its restriction to local arguments. By the Yoneda lemma we know that $u = @f^{\iota}/\alpha$ for a morphism $f^{\iota}/\alpha : X^I \to Y^J$. So let us show that $f^{\iota}/\alpha$ is determined by the restriction loc(u) to local arguments. To this end, consider the canonical embeddings $h_i^{i}/\mathrm{Id}_A : X_i \to X^I$ of the subfunctors $X_i$ of the covering I of X. We have $u(h_i/\mathrm{Id}_A) = f^{\iota}/\alpha \cdot h_i/\mathrm{Id}_A = (f \cdot h_i)^{\iota(i)}/\alpha_i$, where $\alpha_i$ is the address change of the ith covering subfunctor. Therefore, the map $\iota : I \to J$ is determined by the local evaluation at $X_i$. Also, the address change $\alpha = (\alpha_i)_i$ is determined, and the restriction $f \cdot h_i$ on the covering $(X_i)$ of X determines f. Therefore $f^{\iota}/\alpha$ and its functorial counterpart $u = @f^{\iota}/\alpha$ are uniquely determined by the local restrictions.

Let now X be local and let $u : @_{loc}X \to @_{loc}Y^J$ be a natural transformation. We have to extend u to a natural transformation $U : @X \to @Y^J$. Let $W^I$ be a global composition with charts $W_t$. Then the difference kernel $\mathrm{Ker}(\prod_t W_t@X \rightrightarrows \prod_{s,t} W_{s,t}@X)$ of the restriction morphisms $W_{s,t} \to W_t$, $W_{s,t} \to W_s$ identifies (see footnote 1) with $W^I@X$. By u, this kernel is mapped into the kernel $\mathrm{Ker}(\prod_t W_t@Y^J \rightrightarrows \prod_{s,t} W_{s,t}@Y^J)$ which is a subset of $W^I@Y^J$, and we are done. QED.
Lemma 23 If $X^I$, $Y^J$ are local compositions at addresses A, B, respectively, then there is a bijection

$l : \mathrm{Hom}(@_{loc}X^I, @_{loc}Y^J) \xrightarrow{\sim} \coprod_{\iota : I \to J} \prod_{i \in I} X_i@Y_{\iota(i)}$.

Proof. In fact, the contravariant functors $@_{loc}X^I, @_{loc}Y^J : Loc \to Sets$ are isomorphic to $\coprod_{i \in I} @_{loc}X_i$, $\coprod_{j \in J} @_{loc}Y_j$, respectively. Hence,

$\mathrm{Hom}(@_{loc}X^I, @_{loc}Y^J) \xrightarrow{\sim} \prod_{i \in I} \mathrm{Hom}(@_{loc}X_i, @_{loc}Y^J)$
1 For global codomains, the kernel is strictly smaller than the morphism set because of the index map!
343
14.4. NERVES
and by lemma 22 and Yoneda's lemma

$\mathrm{Hom}(@_{loc}X_i, @_{loc}Y^J) \xrightarrow{\sim} X_i@Y^J = \coprod_{j \in J} X_i@Y_j$,

whence our claim follows from the distributivity laws for products and coproducts of set-valued functors. QED.

Lemma 24 With the above notation a natural transformation $u : @_{loc}X^I \to @_{loc}Y^J$ stems from a morphism $X^I \to Y^J$ of global compositions iff its image $l(u) = (f_i)_{i \in I}$ with index map $\iota : I \to J$ has the property that for every couple $i_1, i_2 \in I$, the transformations $f_{i_1}, f_{i_2}$ coincide on the intersection functor $X_{i_1,i_2} = X_{i_1} \cap X_{i_2}$. By $\mathrm{Hom}_{loc}(@_{loc}X^I, @_{loc}Y^J)$, we denote the set of these natural transformations and call them localizable.
Exercise 30 Prove these statements.

So we have this proposition:

Proposition 13 If $X^I$, $Y^J$ are local compositions at addresses A, B, respectively, then the canonical restriction map

$loc : \mathrm{Hom}(X^I, Y^J) \to \mathrm{Hom}_{loc}(@_{loc}X^I, @_{loc}Y^J)$

is well defined and bijective.
In other words, we may replace the study of isomorphism classes of global compositions $X^I$ by the study of the isomorphism classes of their local domain functors $@_{loc}X^I$ under localizable natural transformations (which define a non-full subcategory $locLoc^@$ on the local domain functors). The musicological meaning of these results is that the Yoneda philosophy for global musical structures may be restricted to local perspectives, i.e., morphisms on local compositions, with the restriction that the functors of local perspectives only reflect isomorphisms of global compositions if these isomorphisms are localizable. We are however far from completely understanding the nature of the local domain functors and their category $locLoc^@$. But we know that above the local domain argument, the nature of the patchwork of the local aspects is crucial, and this is why we look at the nerve functor now.
14.4
Nerves
Summary. To each global composition is associated a combinatorial structure, its nerve, in fact the simplicial complex of the covering of the composition by its charts. This functorial association retrieves important information on the global object.
Nerves of objective global compositions were introduced in 13.2.1. For functorial global compositions, we have the same procedure. The abstract nerve $n(G^I)$ of a global functorial composition is the covering's simplicial complex, i.e., the simplicial complex whose vertexes are the covering subfunctors $G_i$, and whose simplexes are the (finite) vertex sets with nonempty intersection; the geometric nerve $N(G^I)$ is defined as the space $|n(G^I)|$ of $n(G^I)$, but see appendix H.2.2, example 103. Since I is finite, the geometric nerve can be realized (see footnote 2) as a polyhedron in $\mathbb{R}^{2m+1}$ if $\dim(n(G^I)) \le m$. Clearly, every morphism $f^{\iota}/\alpha : G^I \to H^J$ of objective or functorial global compositions gives rise to a morphism

$n(f^{\iota}/\alpha) : n(G^I) \to n(H^J)$,

and this one generates the continuous map

$N(f^{\iota}/\alpha) : N(G^I) \to N(H^J)$,

everything in a functorial way, i.e., we have two functors

$N : \mathit{Glob} \to \mathit{Top}$, $N : \mathit{ObGlob} \to \mathit{Top}$

into the category Top of topological spaces and continuous maps (see footnote 3). In particular, the functor N of geometric nerves generates topological invariants for isomorphism classes of global compositions.
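The abstract nerve of a finite covering is easy to compute. The following Python sketch is an illustration under stated assumptions: charts are plain finite sets (the objective, zero-addressed case), the scale is C major in $\mathbb{Z}_{12}$, and the helper names `nerve`, `interpretation` and `f_vector` are hypothetical. It lists the simplexes of the triadic and tetradic interpretations and counts them by dimension.

```python
from itertools import combinations

def nerve(charts):
    """All simplexes of the covering's nerve, as frozensets of chart indexes.

    charts: dict mapping a chart index to a non-empty finite set of points.
    """
    names = list(charts)
    simplexes = []
    for k in range(1, len(names) + 1):
        found = False
        for combo in combinations(names, k):
            # A vertex set is a simplex iff its charts have a common point.
            if set.intersection(*(set(charts[c]) for c in combo)):
                simplexes.append(frozenset(combo))
                found = True
        if not found:
            break  # no simplex with k vertexes, hence none with more
    return simplexes

scale = [0, 2, 4, 5, 7, 9, 11]  # C major in Z_12 (assumed example)

def interpretation(size):
    """Charts {x_i, x_{i+2}, ...} of stacked thirds with `size` tones each."""
    return {i + 1: {scale[(i + 2 * j) % 7] for j in range(size)} for i in range(7)}

def f_vector(simplexes):
    """Number of simplexes in each dimension 0, 1, 2, ..."""
    counts = {}
    for s in simplexes:
        counts[len(s) - 1] = counts.get(len(s) - 1, 0) + 1
    return [counts[d] for d in sorted(counts)]

print(f_vector(nerve(interpretation(3))))  # [7, 14, 7]
print(f_vector(nerve(interpretation(4))))  # [7, 21, 21, 7]
```

Both alternating sums vanish (7 - 14 + 7 = 0 and 7 - 21 + 21 - 7 = 0), as one expects for the harmonic strip and for the full torus of the tetradic interpretation.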
Definition 50 Two morphisms $f^{\iota}/\alpha, g^{\kappa}/\beta : G^I \to H^J$ between global compositions are called mathematically equivalent iff they coincide on the functors, i.e., iff f = g. An equivalence class of mathematically equivalent morphisms, i.e., a natural transformation $f : G \to H$ which stems from a morphism of global compositions, is called a mathematical morphism between global compositions.

We shall come back to mathematical morphisms in section 19.1. Evidently, for two mathematically equivalent morphisms $f^{\iota}/\alpha, g^{\kappa}/\beta : G^I \to H^J$, the corresponding simplicial maps are contiguous (see footnote 4). So, by lemma 96 in appendix H, we have

Proposition 14 Any two mathematically equivalent morphisms $f^{\iota}/\alpha, g^{\kappa}/\beta : G^I \to H^J$ induce homotopic maps on the corresponding geometric nerves.
Exercise 31 Draw a picture of a homotopy between the equivalent morphisms $\delta : X^{(2)} \to X^{(3)}$ from the third interval to the triadic interpretation (yielding the harmonic strip) of a diatonic scale as described in the diagram 14.2.

2 See appendix H.2.2, theorem 74.
3 See appendix H.1.
4 See appendix H.2.3.
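The contiguity behind Proposition 14 can be verified by machine for the refinement maps of Exercise 31. The sketch below is an illustration, not the text's own construction: it assumes the C-major scale, encodes a refinement map $\delta : (2) \to (3)$ as a 7-bit tuple (bit 0 sends the third on degree i into the triad on degree i, bit 1 into the triad two degrees lower), and uses the standard definition that two simplicial maps are contiguous if the union of the two images of every simplex is again a simplex. All names are hypothetical.

```python
from itertools import combinations, product

scale = [0, 2, 4, 5, 7, 9, 11]  # C major in Z_12 (assumed example)

def chart(i, size):
    """Chart of stacked thirds on scale degree i (indexes mod 7)."""
    return frozenset(scale[(i + 2 * j) % 7] for j in range(size))

thirds = [chart(i, 2) for i in range(7)]
triads = [chart(i, 3) for i in range(7)]

def simplexes(charts):
    """Set of all index sets whose charts have a common point."""
    out = set()
    for k in range(1, len(charts) + 1):
        level = [
            c for c in combinations(range(len(charts)), k)
            if frozenset.intersection(*(charts[i] for i in c))
        ]
        if not level:
            break
        out.update(frozenset(c) for c in level)
    return out

n2, n3 = simplexes(thirds), simplexes(triads)

def refinement(delta):
    """Index map (2) -> (3) for a bit tuple delta: lower part (0) or higher part (1)."""
    return [i if bit == 0 else (i - 2) % 7 for i, bit in enumerate(delta)]

maps = [refinement(d) for d in product((0, 1), repeat=7)]

# Each choice really embeds every third in the chosen triad.
assert all(thirds[i] <= triads[m[i]] for m in maps for i in range(7))

def contiguous(m1, m2):
    """Union of both images of every simplex of n2 is a simplex of n3."""
    return all(
        frozenset(m1[i] for i in s) | frozenset(m2[i] for i in s) in n3
        for s in n2
    )

all_contiguous = all(contiguous(a, b) for a, b in combinations(maps, 2))

print(len(maps))       # 128
print(all_contiguous)  # True
```

So all $2^7$ refinement maps between the third and the triadic interpretation are pairwise contiguous, in agreement with the proposition.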
14.5. SIMPLICIAL WEIGHTS
345
Proposition 14 means that the variation of the embedding of a chart (of a refined interpretation, say) in one particular candidate of a larger chart of a coarser interpretation is related to the topological operation of homotopy if we consider the associated geometric nerves. Intuitively, this means that reinterpretation of an embedding is a kind of deformation on the geometric level.

For every address change $f : B \to A$ from the address A of a global functorial composition $G^I$, we have the f-slice $f@G^I$, an objective global composition. In general, its nerve has fewer vertexes and these may collapse. Also some simplexes could vanish. However, for every k-simplex $\sigma = \{G_{i_0}, \ldots, G_{i_k}\}$ of $n(G^I)$, there is an address change $f : B \to A$ such that $f@\bigcap\sigma = \bigcap_j f@G_{i_j} \neq \emptyset$. But observe that $\emptyset \neq f@\bigcap\sigma$ implies $\emptyset \neq f \cdot g@\bigcap\sigma$ for any address change $g : C \to B$. So take address changes $f_s : B_s \to A$, one for each maximal simplex $\sigma_s$, such that all intersections $f_s@\bigcap\sigma_s$ are non-empty. Take the limit $(g_s : L \to B_s)_s$ of this system of address changes over A (it exists according to appendix E.3.8). Then we have one unique address change $h = f_s \cdot g_s : L \to A$ with nonvanishing intersections in every simplex of the functorial composition. This means:
Theorem 13 For every global functorial composition $G^I$, there is an f-slice $f@G^I$ such that the canonical simplicial map $n(G^I) \to n(f@G^I)$ is well defined and surjective. On the geometric nerves, it appears as a specialization morphism by collapse of particular vertexes.

For an interpretation $X^I$ of a local objective composition X, we have seen the construction of the global functorial composition associated with the covering of X by I. The nerve of this functorial composition may be significantly larger than the nerve $n(X^I)$. This can be checked on an adequate slice of the functorial composition. We shall discuss such an example relating to the functorial interpretation associated with the harmonic strip in section 25.3.4, example 61.
14.5
Simplicial Weights
Summary. The structure of a nerve induces weights on the nerve's simplexes with values in isomorphism classes of local compositions. This yields a tool for local and global classification, to be treated in the next chapter. We present the elementary example of simplicial motive weights.
Lemma 25 Let $G^I$ be an A-addressed objective or functorial global composition, and take a k-simplex $\sigma = \{G_{i_0}, \ldots, G_{i_k}\}$ of its nerve. Consider the subobject (set or functor) $\bigcap\sigma \subset G$ and the chart isomorphism

If we view the abstract nerve $n(G^I)$ as a category by the inclusion morphisms $i_{\sigma,\tau} : \sigma \hookrightarrow \tau$ among its simplexes, we have a contravariant functor

$\bigcap : n(G^I) \to Loc : \sigma \mapsto \bigcap\sigma$ (14.3)

(with the usual identification of local and one-chart global compositions), the simplicial weight of $G^I$.
Figure 14.7: The class nerve of the triadic interpretation of a major scale. The numbers are the class numbers from the classification list in appendix L.1.

This functor induces a map

$iso\bigcap : n(G^I) \to Loc/\!\cong$

from the abstract nerve to the set $Loc/\!\cong$ of isomorphism classes of local compositions which we call the class weight (function) of the global composition $G^I$. Summarizing, we have a geometric nerve $N(G^I)$ and a class weight $iso\bigcap$ for each global composition. If we parametrize the isomorphism classes of local compositions by a determined set of symbols, this means that this information can be visualized by a polyhedron with a class weight attached to each simplex; we call this object the class nerve of $G^I$ and denote it by $CN(G^I)$. This is a very intuitive invariant object which we want to illustrate in two examples.

Example 26 We consider the harmonic strip of $X^{(3)}$ of a major scale X. The classes are numbered according to the classification in appendix L.1. For example, this nerve has seven 2-simplexes with weight number 2 for the unique zero-addressed singleton class in $\mathit{PiMod}_{12}$. The class nerve $CN(X^{(3)})$ is shown in figure 14.7. We see that the only automorphism of this interpretation must fix the seventh degree. So we have only the exchange of the boundary around the seventh degree, in fact the induced automorphism from the uninterpreted local scale.

Example 27 Consider the motif situation, more precisely, the zero-addressed local three-element compositions M in $\mathit{OnPiMod}_{12,12}$. We look at the 'face' interpretation $M^{(2)}$ by maximal subcompositions, i.e., by the three 2-element 'interval' charts in the motif. Clearly, if two
14.6. CATEGORIES OF COMMUTATIVE GLOBAL COMPOSITIONS
347
[Figure 14.8 labels: class nr. 22; CN(M(2))]
motives are isomorphic, the class nerves are also. So the class nerve yields invariants of the local classification. Figure 14.8 shows a motif of class number 22 in the classification from appendix M.3, as well as the interpretation, the chart class numbers of the intervals from appendix L.1, and the class weights (2, 2, 6) on the class nerve. From the table in appendix M.3, we see that the class nerve is not classifying, but, together with the motif's volume (see section 11.3.8), the class nerve yields a complete set of class invariants. For motives with larger cardinality this is, however, no longer true. The four-element motives show counterexamples. The classification list in appendix M.4, which was calculated by Hans Straub [513], shows the class weights on the tetrahedra $CN(M^{(3)})$ of the face interpretation of the four-element motives in $\mathit{OnPiMod}_{12,12}$. The motif classes which are not uniquely determined by volume and class nerve are indicated by a star.

Figure 14.8: A three-element motif of class number 22 and its class nerve.
14.6
Categories of Commutative Global Compositions
Summary. The category $ObLoc_A$ of commutative local compositions introduced in section 8.3.5 has a global counterpart which is discussed here.
We fix an address module A over a commutative ring R. Recall from definition 28 in section 8.3.5 that the category of commutative local compositions refers to the fixed address module A; the morphisms are of form $f/\mathrm{Id}_A$ with f being induced by an R-affine morphism of the coordinator R-modules. If we consider global objective A-addressed compositions which are glued together by commutative local charts, this means that the transition morphisms are isomorphisms of commutative local compositions. And it means that morphisms between commutative global compositions are induced by R-affine morphisms on the ambient modules of the local charts. Denote this category of commutative global compositions at address A by $ComGlob_A$.
Chapter 15
Global Classification

We thus see that in music the majority of the so-called "scientific" minds are almost as naive as Monsieur Achras, a character of Jarry's, who assembled a collection of polyhedra. Well, it may remain open whether polyhedra are really of inexhaustible interest in our case. To me, at any rate, the necessity of such pataphysical speculations is hard to see. Pierre Boulez [60, II, p.19]

Summary. Global classification relies on two concepts: affine functions and resolutions of global compositions. These constructs are discussed and exemplified. We derive classifying spaces and compare them to the situation in the Dreiding-Dress-Haegi theory of molecules: The latter are deduced from global compositions by additional structures concerning orientation, distances and angles (bilinear and exterior forms). It is therefore possible to view "molecules" as being global compositions with additional constraints; their musical meaning is discussed.
This chapter deals only with objective (more precisely, commutative) global compositions. This is probably not always necessary from the technical point of view, but presently, no more general theory has been elaborated and we prefer dealing with situations where concrete results are available. From the musicological point of view, this is one of the most difficult chapters since the relation of classification and musicology, in particular esthetics, is quite implicit. But classification is a deep concern since it reveals the a priori extent of a structural framework and therefore its power as an expressive tool of artistic activity, be it in composition, performance, or understanding. So it is a kind of essence of the overall efforts of the art of music.
349 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
CHAPTER 15. GLOBAL CLASSIFICATION
350
15.1
Module Complexes
Summary. Module complexes describe systems of global affine functions in a classical homological language.
For this chapter, we fix a commutative ring R. If we do not stress the contrary, we only consider R-modules and R-affine morphisms between R-modules; in particular, addresses A are R-modules. Given a global composition $G^I$, we recall the category $n(G^I)$ of its abstract nerve, see section 14.5.

Definition 51 An (R-)module complex over $G^I$ is a covariant functor (a coefficient system, see appendix H.3)

$M : n(G^I) \to RMod$ (15.1)

into the category RMod of R-modules and R-affine morphisms, where $M_{\sigma,\tau} : M(\sigma) \to M(\tau)$ are transition morphisms of modules for the simplex inclusions (morphisms) $\sigma \subset \tau$. As usual in sheaf theory, we put $\Gamma M = \lim_{n(G^I)} M(\sigma)$ and call this the set of global sections.

Example 28 Since any morphism of global compositions $f^{\iota}/\alpha : G^I \to H^J$ yields a natural transformation $n(f^{\iota}/\alpha) : n(G^I) \to n(H^J)$, every module complex M over $H^J$ induces a module complex $f^{\iota}/\alpha * M$ over $G^I$, with $f^{\iota}/\alpha * M(\sigma) = M(n(f^{\iota}/\alpha)(\sigma))$.

Example 29 If M is any R-module, the constant module complex of M is the complex with $M(\sigma) = M$ for all simplexes and identity transition. Observe that its global sections are in bijection with the set $M^c$, if $N(G^I)$ has c connected components.
15.1.1
Global Affine Functions
Summary. Global affine functions are patchworks of affine functions on charts of atlases. Their role is that of generalized coordinate functions for musical compositions. This approach makes it possible to separate the core process of composing music from its realization in instrumental parameter spaces.
351
15.1. MODULE COMPLEXES
In the sequel of this chapter, we shall work on categories $ComGlob_A$ of global commutative compositions, except when explicitly mentioning the contrary. Suppose that we are given an A-addressed commutative global composition $G^I$. Let $A@_R R$ be the A-addressed local composition of the full R-module $A@_R R$. For a simplex $\sigma$ of $n(G^I)$, a morphism $f/\mathrm{Id}_A : \bigcap\sigma \to A@_R R$ may be identified with the supporting map $f : \bigcap\sigma \to A@_R R$. We call such a morphism an A-addressed function on the simplex. The set $n\Gamma(\sigma)$ of these functions is provided with the structure of an R-module by the usual addition and scalar multiplication of function values: $(f + g)(x) = f(x) + g(x)$, $(rf)(x) = r f(x)$, $r \in R$.
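The pointwise module structure can be illustrated in the simplest setting. The sketch below is a toy model under explicit assumptions, not the general A-addressed case: the address is zero, $R = \mathbb{Z}_{12}$, a chart is a finite subset of $\mathbb{Z}_{12}$, and an affine function is the restriction of $x \mapsto ax + b$. The class name `AffineFunction` and its methods are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class AffineFunction:
    """x -> (a*x + b) mod n, to be restricted to a finite chart in Z_n."""
    a: int
    b: int
    n: int = 12

    def __call__(self, x):
        return (self.a * x + self.b) % self.n

    def __add__(self, other):
        # Pointwise sum: the linear parts and the constants add up.
        return AffineFunction((self.a + other.a) % self.n,
                              (self.b + other.b) % self.n, self.n)

    def scale(self, r):
        # Pointwise scalar multiple by r.
        return AffineFunction((r * self.a) % self.n,
                              (r * self.b) % self.n, self.n)

chart = {0, 4, 7}                       # a sample chart (assumed)
f, g = AffineFunction(7, 7), AffineFunction(5, 2)

# (f + g)(x) = f(x) + g(x) and (r f)(x) = r f(x) on the chart.
assert all((f + g)(x) == (f(x) + g(x)) % 12 for x in chart)
assert all(f.scale(3)(x) == (3 * f(x)) % 12 for x in chart)
```

In this toy case the sum and the scalar multiple are again given by a pair (a, b), so they are again affine functions on the chart.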
Exercise 32 Show that sums and scalar multiples of A-addressed functions are again A-addressed functions.

Suppose that we have an inclusion of simplexes $\sigma \subset \tau$ of $n(G^I)$. Then the ambient spaces of the charts of these simplexes are the same, i.e., the inclusion of local compositions $\bigcap\tau \subset \bigcap\sigma$ is in bijection with an inclusion of local compositions $K_\tau \subset K_\sigma \subset A@_R N$ for a specific module N. Since an affine function f on $\bigcap\tau$ is the restriction of an affine morphism $A@h : A@_R N \to A@_R R$, f evidently extends to the restriction $A@h|_{K_\sigma}$, and we have proven that the transition morphisms by restriction of affine functions are surjective. The corresponding complex of affine functions is denoted by $n\Gamma(G^I)$. The subcomplex $C = C_{G^I}$ of constant functions is defined by $C(\sigma) = \{f \in n\Gamma(G^I)(\sigma) \mid f \text{ constant on } \bigcap\sigma\}$. The set of global functions of the function complex is denoted by $\Gamma(G^I)$.

For the following construction, we select one ambient space $A@_R F_\sigma$ for every simplex $\sigma$ of $n(G^I)$. Take a chart $n(\sigma)$ of $\bigcap\sigma$ in $A@_R F_\sigma$. Then we have the submodule $R.n(\sigma)$ of $A@_R F_\sigma$ (see definition 17 in section 7.4). If we have a morphism $\sigma \subset \tau$, we deduce an R-linear homomorphism $R.n(\tau) \to R.n(\sigma)$ which induces these homomorphisms

$Lin_R(R.n(\sigma), M) \to Lin_R(R.n(\tau), M)$
on the homomorphism modules for any given module M. This module complex is denoted by $Lin_R(G^I, M)$, although it is only determined up to an isomorphism of module complexes (see footnote 1) on $G^I$; it evaluates to $Lin_R(G^I, M)(\sigma) = Lin_R(R.n(\sigma), M)$. In the particular case $M = A@_R R$, we abbreviate $Lin_R(G^I, A@_R R) = G^{I*}$ and call this the module complex of A-addressed forms on $G^I$. For an affine function $f : \bigcap\sigma \to A@_R R$, we have an R-linear homomorphism $R.f : R.n(\sigma) \to A@_R R$ (see lemma 6 in section 8.3.5), and the map $f \mapsto R.f$ is R-linear and functorial in $\sigma$, i.e., we have a morphism of module complexes on $G^I$:

$R_{G^I} : n\Gamma(G^I) \to G^{I*}$ (15.2)

If we have a subcomplex situation $N \subset M$ on $G^I$, we get the quotient complex M/N with values $M/N(\sigma) = M(\sigma)/N(\sigma)$.

Lemma 26 The kernel of $R_{G^I}$ is the module $C_{G^I}$ of constant functions. Therefore, if N is any module complex with $C \subset N \subset n\Gamma(G^I)$, we have an embedding of module complexes

$N/C \hookrightarrow G^{I*}$ (15.3)

1 By definition of module complexes, this is an isomorphism of functors.
Proof. The function f is in the kernel iff its values $R.f(s_i - s_j) = f(s_i) - f(s_j)$ vanish, and this in turn means that f is constant on $\bigcap\sigma$. QED.

Definition 52 Let P be a property of modules. If M is a module complex over $G^I$, it is said to share property P iff all its values $M(\sigma)$ do so. In particular, M is called injective (projective) iff the module $M(\sigma)$ is so for every simplex $\sigma$.

Lemma 27 If A is the zero address $A = 0_R$, and if $n\Gamma$ is injective (in particular, if R is a field), the morphism $R_{G^I}$ is surjective, and therefore, we have an isomorphism $n\Gamma/C \xrightarrow{\sim} G^{I*}$.

Proof. Since we have the zero address, any $\bigcap\sigma$ is isomorphic to an embedded local composition in the same ambient module, and any linear map $l : R.n(\sigma) \to R$ extends to a map on the ambient module. QED.
Let $f/\mathrm{Id}_A : G^I \to H^J$ be a morphism of A-addressed commutative global compositions. Take a simplex $\sigma$ in $n(G^I)$, and its image $\sigma'$ under the associated simplicial map. Then, each restricted morphism

$f/\mathrm{Id}_A : \bigcap\sigma \to \bigcap\sigma'$

gives rise to a map

$f/\mathrm{Id}_A * n\Gamma(H^J)(\sigma) = n\Gamma(H^J)(\sigma') \to n\Gamma(G^I)(\sigma)$

by right composition with this restricted morphism. Moreover, the map is R-linear. Therefore, if M is any subcomplex of $n\Gamma(H^J)$, its induced complex $f/\mathrm{Id}_A * M$ is mapped R-linearly onto what is called the retracted module complex

$M|f/\mathrm{Id}_A \subset n\Gamma(G^I)$. (15.4)

In particular, if $M = C_{H^J}$, we have $C_{H^J}|f/\mathrm{Id}_A \subset C_{G^I}$. If M is any subcomplex of $n\Gamma(H^J)$ which contains the constant complex $C_{H^J}$, we have the short exact sequence

$0 \to C_{H^J} \to M \to M/C_{H^J} \to 0$

and therefore the corresponding short exact sequence

$0 \to f/\mathrm{Id}_A * C_{H^J} \to f/\mathrm{Id}_A * M \to f/\mathrm{Id}_A * M/C_{H^J} \to 0$
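which projects to an exact sequence on $G^I$ as follows:

[Diagram: the exact top row $0 \to f/\mathrm{Id}_A * C_{H^J} \to f/\mathrm{Id}_A * M \to f/\mathrm{Id}_A * M/C_{H^J} \to 0$ projects vertically onto the middle row $C_{H^J}|f/\mathrm{Id}_A \to M|f/\mathrm{Id}_A \to M/C_{H^J}|f/\mathrm{Id}_A$, which in turn maps vertically into the bottom row $C_{G^I} \to n\Gamma(G^I) \to n\Gamma(G^I)/C_{G^I}$.]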
including the definition of $M/C_{H^J}|f/\mathrm{Id}_A$. In particular, for $M = n\Gamma(H^J)$, we have this surjective R-linear homomorphism

$f/\mathrm{Id}_A * n\Gamma(H^J)/C_{H^J} \to n\Gamma(H^J)/C_{H^J}|f/\mathrm{Id}_A$. (15.5)

Finally, by construction of global sections, there is a canonical projective system of R-linear maps

$\Gamma(H^J) \to f/\mathrm{Id}_A * n\Gamma(H^J)(\sigma) \to n\Gamma(G^I)(\sigma)$

and therefore we have a canonical R-linear map

$\Gamma(f/\mathrm{Id}_A) : \Gamma(H^J) \to \Gamma(G^I)$

which is a contravariant functor

$\Gamma : ObGlob_A \to RMod$. (15.6)

15.1.2
Bilinear and Exterior Forms
Summary. Bilinear and exterior forms capture the language of classical geometry of angles, distances, and orientation in Euclidean three-space. This formalism is built on the systems of global functions.
Until now, no considerations regarding angles and similar properties from Euclidean geometry have been made in the theory of local or global compositions. Here is the formal framework for such perspectives. Suppose that N is a module complex of A-addressed affine functions on $G^I$ which contains the constant complex $C = C_{G^I}$. Consider the module complex $B_N$ defined on simplexes $\sigma$ by the expression

$B_N(\sigma) = Lin_R((N(\sigma)/C)^*, N(\sigma)/C)$ (15.7)
with $X^*$ being the R-linear dual of X, and with the evident transition homomorphisms. With the injection 15.3 in mind we have a canonical R-linear map

$Lin_R((N(\sigma)/C)^*, N(\sigma)/C) \to Lin_R((G^{I*}(\sigma))^*, G^{I*}(\sigma))$

and the right side is

$Lin_R((Lin_R(R.n(\sigma), A@_R R))^*, Lin_R(R.n(\sigma), A@_R R))$

which, according to appendix E.3.2, lemma 81 identifies to

We now assume that $A = R^n$, $0 \le n$ natural. Then appendix E.3.2, proposition 85 yields linear maps

$d : A@_R R.n(\sigma)^* \to (A@_R R.n(\sigma))^*$, $u : A@_R R.n(\sigma) \to (A@_R R.n(\sigma)^*)^*$,

and therefore a map

$Lin_R((G^{I*}(\sigma))^*, G^{I*}(\sigma)) \to Lin_R(A@_R R.n(\sigma), (A@_R R.n(\sigma))^*)$

which gives an R-linear map (15.8)

The right side describes the bilinear forms on the space of A-addressed points in the module $R.n(\sigma)$. We therefore associate a bilinear form $\beta(x)$ with each linear map $x : (N(\sigma)/C)^* \to N(\sigma)/C$, and in this framework, we also speak of the bilinear form x, even when we cannot instantiate an associated form $\beta(x)$ (which is the case for general address A).

Definition 53 With the above notation, the pair $(G^I, \beta)$ with a global section $\beta \in \Gamma B_N$ on $G^I$ is called an N-formed global composition.
•
Let $f/Id_A : G^I \to H^J$ be a morphism of A-addressed commutative global compositions. Let $M, N$ be submodules of $n\Gamma(G^I)$, $n\Gamma(H^J)$, respectively, both containing the constants. Take global sections $\beta \in \Gamma BM$, $\gamma \in \Gamma BN$, respectively. Suppose that we have an inclusion of the restriction of $N$ under $f/Id_A$ in $M$:

$$N|f/Id_A \subset M \qquad (15.9)$$

and therefore an inclusion

$$N/C|f/Id_A \rightarrowtail M/C$$

of quotients modulo constants. Then we have a canonical map

$$Bf : BN \to BM \qquad (15.10)$$

over $f/Id_A$, and therefore one of global sections

$$\Gamma Bf : \Gamma BN \to \Gamma BM. \qquad (15.11)$$
Definition 54 With the preceding notation and hypotheses, we say that the morphism $f/Id_A$ is a morphism of formed compositions $f/Id_A : (G^I, \beta) \to (H^J, \gamma)$ if $\Gamma Bf(\gamma) = \beta$.
Given a submodule complex $N$ of the complex of affine functions on $G^I$, containing the constants, we have an evident complex $\Lambda^n(N/C)$ of exterior n-forms. If $\Omega \in \Gamma\Lambda^n(N/C)$ is a global exterior n-form, we call $(G^I, \Omega)$ an oriented global composition. As with bilinear forms, we have the concept of a morphism $f/Id_A : (G^I, \Omega) \to (H^J, \Delta)$ of oriented global compositions if $\Gamma\Lambda f(\Delta) = \Omega$, with the evident nth exterior power morphism whenever we have the above inclusion (15.9).
15.1.3
Deviation: Compositions vs. "Molecules"
Summary. Molecule structures, as they are considered in the Dreiding-Dress-Haegi theory, are very similar to representable global compositions. They emerge from the latter by adding bilinear and exterior forms. Geometrically this means that compositions may be "deduced" from molecular structures by abstraction from angles, distances, and orientation. We consider the specific difference of such an abstraction in the musical perspective: What does music gain after adding "molecular" information?
For the coefficient field $R = \mathbb{R}$ of real numbers, and for the zero address, special formed and oriented global compositions may represent molecular structures such as have been considered in [129]. We shall see below (16.4) that classification of formed and oriented global compositions, and therefore of molecular structures, is significantly more difficult than classification of global compositions. So this short deviation is not only an application of the general formalism of global compositions, it also shows that enriching the inner structure of a global composition may cause substantial difficulties in the understanding of the corresponding category. For this situation, the (zero-addressed) points of $G^I$ are viewed as being momentary positions of atoms of a molecule in $\mathbb{R}^3$. The atom species are parametrized by natural numbers, and their distribution on $G^I$ is defined by a marking application $a : G \to \mathbb{N}$. Moreover, we ask that
• $\dim(n\Gamma(\sigma)/C) \le 3$ for all charts $\sigma$ of $G^I$,
• $G^I$ is formed by a symmetric bilinear form $\beta$ which is positive definite on all charts,
• we are given an exterior 3-form $\Omega$ which is $\beta$-normed, i.e., the Gram identity $\Omega^2 = \det(\beta)$ is valid².
²The Gram form verifies the identity $\Omega(x_1, \ldots x_n)\,\Omega(y_1, \ldots y_n) = \det((\beta(x_i, y_j))_{i,j})$ for all sequences $x_1, \ldots x_n$, $y_1, \ldots y_n$ of elements in the chart vector space, see also [196, VII,3].
Definition 55 A quadruple $(G^I, a, \beta, \Omega)$ with the above properties, i.e., an $a$-marked, $\beta$-formed, and $\Omega$-oriented (zero-addressed) global composition $G^I$ is called a global molecule. A morphism between global molecules is a morphism of global compositions which respects the additional structures as discussed above. A global molecule which is isomorphic to an interpretation of a local molecule³ is called interpretable. Interpretable molecules are canonically associated with distinguished structures in the theory of Dreiding-Dress-Haegi; the proof may be omitted in this context:
Theorem 14 The molecular structures which are associated with interpretable molecules are (d, x)-defined in the sense of [129].
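The Gram identity required of a global molecule can be checked numerically on a single chart. The following is only a sketch under stated assumptions: the chart vector space is taken to be three-dimensional with integer coordinates, the matrix `B` is a hypothetical positive definite form chosen so that its determinant is a perfect square, and the names `beta`, `omega`, and `gram` are illustrative rather than notation from the text. Under these assumptions the 3-form obtained by scaling the ordinary determinant with the square root of det(B) is β-normed.

```python
import math

def det3(m):
    """Determinant of a 3x3 matrix given as a sequence of three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

# Hypothetical symmetric, positive definite form on a 3-dimensional chart space.
B = [[2, 1, 0], [1, 2, 0], [0, 0, 3]]

def beta(x, y):
    """Bilinear form beta(x, y) = x^T B y."""
    return sum(x[r] * B[r][s] * y[s] for r in range(3) for s in range(3))

# det(B) = 9 is a perfect square, so the normed 3-form stays integral.
SCALE = math.isqrt(det3(B))
assert SCALE * SCALE == det3(B)

def omega(x1, x2, x3):
    """Exterior 3-form: the ordinary determinant scaled by sqrt(det(B))."""
    return SCALE * det3([x1, x2, x3])

def gram(xs, ys):
    """Gram determinant det(beta(x_i, y_j))."""
    return det3([[beta(x, y) for y in ys] for x in xs])

xs = [(1, 0, 2), (0, 1, 1), (3, 1, 0)]
ys = [(1, 1, 0), (2, 0, 1), (0, 1, 4)]
assert omega(*xs) * omega(*ys) == gram(xs, ys)
```

The identity holds for every choice of `xs` and `ys` because det(X B Yᵀ) = det(X) det(B) det(Y); the sample vectors only make this concrete.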
15.2
The Resolution of a Global Composition
Summary. The second device for classification is the global standard composition, canonically associated with the composition's nerve. Special module complexes on such standard objects allow the reconstruction of the original composition.
This section introduces two methods which are used in classification theory: The first is the global standard composition; it is a free object in the category of global compositions and helps standardize the various special objects via corresponding special module complexes of affine functions. The second method is the construction of quotient compositions from given modules of affine functions. Combining these methods, we shall classify global compositions by the classification of special module complexes of affine functions in free objects, and by a theorem which tells us how to rebuild compositions as quotients from such special modules.
15.2.1
Global Standard Compositions
Summary. The standard compositions are objects representing compositions with "notes in general position", i.e., their configuration is as 'free' as possible from 'occasional' coincidences. In fact, the standard composition is a geometric realization of the composition's nerve and thus depends only on combinatorial information. There is a natural projection from the standard object onto the generating composition.
³One with a single chart; on the interpretation's charts, bilinear forms and the orientation are induced from the local data.
We first want to broaden the provisional concept of a standard local composition given in definition 30, section 11.3.2. Given a module $A$ in ${}_R Mod$ and a natural number $0 \le n$, we denote by $A^{\sqcup n}$ the (n+1)-fold coproduct $\sqcup^{n+1} A$ of $A$. Recall from its construction (appendix E.3.8) that there is an isomorphism $A^{\sqcup n} \cong R^n \oplus A^{n+1}$. We denote the canonical basis of $R^n$ by $(e_1, \ldots e_n)$, and for any element $a \in A$ and $0 \le i \le n$, we set $a_i = (0, \ldots a, \ldots 0)$ for the (n+1)-tuple in $A^{n+1}$ having $a$ at its (i+1)-th position and zero else; the zero element is denoted by $e_0$. We have the inclusion morphisms

$$\sigma_i : A \to A^{\sqcup n} \qquad (15.12)$$

for $0 \le i \le n$, with

$$\sigma_i(a) = (e_i, a_i) = \begin{cases} \text{linear} & \text{if } i = 0, \\ \text{affine} & \text{if } i > 0. \end{cases} \qquad (15.13)$$

This defines a local, A-addressed composition $A\Delta_n \subset A@_R A^{\sqcup n}$ which is called the A-addressed local standard composition of dimension n. By construction, it has the following property: If $M$ is any R-module, and if $s_\cdot = (s_0, \ldots s_n)$ is any sequence of A-addressed points in $M$, with associated local composition $S = \{s_0, \ldots s_n\} \subset A@_R M$, then there is exactly one morphism of local compositions

$$(s_\cdot) : A\Delta_n \to S : \sigma_i \mapsto s_i \text{ for } i = 0, \ldots n. \qquad (15.14)$$

This morphism is in fact defined by the universal property of the coproduct and is mediated by the following affine function $f : A^{\sqcup n} \to M$: Write $s_i = e^{t_i}.s_{i,0}$. Then we have

$f(e_0) = t_0$,
$f(e_i) = t_i - t_0$ (linear) for $i > 0$,
$f(a_i) = s_{i,0}(a)$ (linear) for $i \ge 0$,

and the formula $s_i = f \cdot \sigma_i$ is immediate.
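The mediating affine function can be made concrete in the simplest case. The sketch below assumes the zero address, so that the standard composition reduces to the n+1 points e_0 = 0, e_1, ..., e_n of R^n and each s_i is just its translation part t_i; the function names and the sample points (pitch, onset) are hypothetical illustrations, not data from the text.

```python
def standard_points(n):
    """Vertices e_0 = 0, e_1, ..., e_n of the zero-addressed standard composition in R^n."""
    return [tuple(1 if j == i - 1 else 0 for j in range(n)) for i in range(n + 1)]

def universal_affine_map(points):
    """Return the affine map f on R^n with f(e_i) = points[i] for all i.

    The translation part is t_0 = points[0]; the linear part sends e_i to t_i - t_0.
    """
    t0 = points[0]
    diffs = [tuple(p - q for p, q in zip(t, t0)) for t in points[1:]]

    def f(x):
        return tuple(
            t0[k] + sum(x[i] * diffs[i][k] for i in range(len(diffs)))
            for k in range(len(t0))
        )

    return f

# Hypothetical local composition of three points (pitch, onset).
S = [(60, 0), (64, 1), (67, 2)]
f = universal_affine_map(S)
images = [f(e) for e in standard_points(2)]
```

With these definitions `images` reproduces `S`, which is the zero-addressed instance of the formula relating s_i, f, and σ_i.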
Exercise 33 If $A, B$ are two addresses, sitting in ${}_R Mod$, ${}_S Mod$, respectively, we denote by $Id : A\Delta_n \to B\Delta_n$ the "identity" $\sigma_i \mapsto \sigma_i$. Consider base changes $\alpha : A \to B$. Show that the assignment $A \mapsto A\Delta_n$ and $\alpha \mapsto Id/\alpha : A\Delta_n \to B\Delta_n$ is a functor $\Delta_n$: eMod $\to$ ObLoc on the category eMod of modules over commutative rings and diaffine morphisms.
To define global "free" objects among the A-addressed objective compositions with finite charts, we shall represent the nerve $n(G^I)$ by an isomorphic standard representative nerve $n^*$ induced by a covering of the natural interval $[0, m] = \{0, 1, 2, 3, \ldots m = card(G) - 1\}$ of natural numbers. For $n^*$, we define the global standard composition $A\Delta_{n^*}$ at address A by the interpretation of the local standard composition $A\Delta_m$ which is given by the present covering of $[0, m]$. We are also given a standard atlas of $A\Delta_{n^*}$. In fact, for any subset $q = \{t_0, \ldots t_c\} \subset [0, m]$ of $c + 1$ elements, indexed corresponding to increasing values, we have the canonical injection $i_q : A\Delta_c \to A\Delta_m$ via $\sigma_j \mapsto \sigma_{t_j}$. This defines the standard atlas. The universal property of this global standard composition reads as follows. Take the category Covens of coverings of sets⁴, and consider the covariant functor

$$ACov_{n^*} : ObGlob@A \to Sets : G^I \mapsto Hom_{Covens}(n^*, (G, n_0(G^I))). \qquad (15.15)$$

Then we have this result:
Proposition 15 The functor $ACov_{n^*}$ is representable by the standard global composition $A\Delta_{n^*}$, i.e., we have a bijection $Hom_{Covens}(n^*, (G, n_0(G^I))) \xrightarrow{\sim} Hom_{ObGlob@A}(A\Delta_{n^*}, G^I)$ which is functorial in the A-addressed composition $G^I$.
⁴See appendix H.2.1, example 102.
The proof is left as an exercise. In particular, if we take the standard nerve $n^* = n^*(G^I)$ of the nerve of $G^I$ and then the corresponding 'identity' morphism $Id : n^* \to (G, n_0(G^I))$, we obtain a corresponding bijective⁵ morphism

$$res_{G^I}/Id_A : \Delta G^I \to G^I \qquad (15.16)$$

with the notation $\Delta G^I = A\Delta_{n^*(G^I)}$, this object and the morphism $res_{G^I}/Id_A$ being called the resolution of $G^I$. Clearly, the associated simplicial morphism $n(res_{G^I}/Id_A) : n^*(\Delta G^I) \to n^*(G^I)$ is an isomorphism. In particular, due to the universal property of the global standard compositions, every morphism $f/Id_A : G^I \to H^J$ can uniquely be lifted to a corresponding morphism $res_\Delta f/Id_A$ of resolutions to make the diagram

$$\begin{array}{ccc}
\Delta G^I & \xrightarrow{\ res_\Delta f/Id_A\ } & \Delta H^J \\
{\scriptstyle res_{G^I}/Id_A}\downarrow & & \downarrow{\scriptstyle res_{H^J}/Id_A} \\
G^I & \xrightarrow{\ f/Id_A\ } & H^J
\end{array} \qquad (15.17)$$

commute. We therefore have a resolution functor

$$res@A : ObGlob@A \to ObGlob@A \qquad (15.18)$$

and a natural transformation

(15.19)

The following deals with the reconstruction of the category ObGlob@A from its subcategory of free objects.
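The passage from a covering to its standard representative on the interval [0, m], together with the nerve, is purely combinatorial and can be sketched in a few lines. The example below is an illustration under assumptions: points are relabeled in sorted order (any bijection onto [0, m] would serve), the nerve is enumerated by brute force, and the pitch-name data and the function names are hypothetical.

```python
from itertools import combinations

def standard_covering(points, charts):
    """Relabel the points as 0..m and transport the charts to subsets of [0, m]."""
    label = {p: k for k, p in enumerate(sorted(points))}
    return [sorted(label[p] for p in chart) for chart in charts]

def nerve(charts):
    """All non-empty tuples of chart indices whose charts share at least one point."""
    simplexes = []
    for size in range(1, len(charts) + 1):
        for idx in combinations(range(len(charts)), size):
            common = set(charts[idx[0]]).intersection(*(charts[i] for i in idx[1:]))
            if common:
                simplexes.append(idx)
    return simplexes

# Hypothetical covering of five pitch names by three overlapping charts.
points = {"c", "e", "g", "b", "d"}
charts = [{"c", "e", "g"}, {"e", "g", "b"}, {"g", "b", "d"}]
std = standard_covering(points, charts)
```

Since relabeling is a bijection, `nerve(charts)` and `nerve(std)` coincide, which is the combinatorial content of replacing the nerve by its standard representative.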
15.2.2
Compositions from Module Complexes
Summary. The projection of the standard composition onto its generating composition canonically induces a module complex of global affine functions on the standard composition. This complex is used to reconstruct the generating composition from the standard composition.
The resolution functor $res@A$ and its associated natural transformation $\Delta@A$ give rise to a module complex of affine functions $\Delta n\Gamma(G^I) = n\Gamma(G^I)|res_{G^I}/Id_A$ in $n\Gamma(\Delta G^I)$, for each global A-addressed composition $G^I$. Call this complex the resolution complex of the composition $G^I$. Moreover, this assignment commutes with the morphism of the resolution functor, i.e., for a morphism $f/Id_A : G^I \to H^J$ we have a canonical inclusion

$$n\Gamma(H^J)|f/Id_A \subset n\Gamma(G^I) \qquad (15.20)$$

of the retracted resolution complex of $H^J$ in the resolution complex of $G^I$. The next step deals with the reconstruction of $G^I$ from $n\Gamma(G^I)$ and the related question of classification of global compositions by use of the resolution complex which is suggested by the functorial relation (15.20). The generic situation from the preceding constructions is that we are given a module complex $M \subset n\Gamma(G^I)$, containing the constants $C$, and that we would like to construct a kind of "quotient" composition whose affine functions are those of $M$. We first look at the local situation.
⁵It is, however, not an isomorphism in general.
Definition 56 Let $S \subset A@_R U$ be an A-addressed objective local composition in the R-module $U$. For a submodule $L \subset \Gamma(S)$ of affine functions on $S$, the evaluation map $\dot{\ } : S \to A@_R L^*$ into the A-valued points of the dual module $L^* = Lin_R(L, R)$ of $L$ is defined by $\dot{s}(a)(l) = l(s)(a)$.
The problem is that the evaluation is not a morphism of local compositions in general. We have to investigate sufficient conditions for the existence of a morphism. In the special case which is of interest, we have this guarantee: In fact, let $S = A\Delta_n \subset A@_R A^{\sqcup n}$. Then, the dotted points $\dot\sigma_i : A \to L^*$ define the universal map $H_L : A^{\sqcup n} \to L^*$, and we have interpreted $\dot{\ } : A\Delta_n \to A@_R L^*$ as a morphism of local compositions. Next, suppose we are given two local compositions $S \subset A@_R U$, $T \subset A@_R V$ and a morphism $f/Id_A : S \to T$, together with a module $L_T \subset \Gamma(T)$ whose retract $L_T|f/Id_A$ is included in a module $L_S \subset \Gamma(S)$. We then have a commutative diagram

$$\begin{array}{ccc}
S & \to & A@_R L_S^* \\
{\scriptstyle f/Id_A}\downarrow & & \downarrow{\scriptstyle A@l_f^*} \\
T & \to & A@_R L_T^*
\end{array} \qquad (15.21)$$

where $l_f^*$ is the R-dual of the canonical linear map $l_f : L_T \to L_S$.
Exercise 34 Give a proof of the commutativity of diagram (15.21).
This construction yields a morphism $\dot f/Id_A : \dot S \to \dot T$ of local compositions in the ambient spaces $L_S^*$, $L_T^*$, respectively. With this technique, we may associate a global composition with a module complex $N \subset n\Gamma(A\Delta_{n^*})$ of affine functions in the standard composition $A\Delta_{n^*}$ of a standard covering $n^*$. To this end, suppose we are given such a module complex $N$ of affine functions, and suppose that the restriction transition morphisms $N(\tau) \to N(\sigma)$, $\tau \subset \sigma$, are all surjective. This is certainly the case for retracted function modules from resolution morphisms. For a simplex $\sigma$ of the nerve $n(A\Delta_{n^*}) \cong n^*$ we note $c(\sigma) = card(\cap_{q \in \sigma} q) - 1$. In the following discussion of classification, we shall tacitly assume that all module complexes of affine functions have surjective transition morphisms. With this, if we apply the construction from diagram (15.21) to the situation where $S = A\Delta_{c(\sigma)} \subset A@A^{\sqcup c(\sigma)}$, and $T = A\Delta_{c(\tau)} \subset A@A^{\sqcup c(\tau)}$ for simplexes $\tau \subset \sigma$ of $A\Delta_{n^*}$, and with $L_S = N(\sigma)$, $L_T = N(\tau)$, then we have injective vertical arrows in the corresponding commutative diagram

$$\begin{array}{ccc}
A\Delta_{c(\sigma)} & \to & A@_R N(\sigma)^* \\
{\scriptstyle inclusion}\downarrow & & \downarrow{\scriptstyle A@res^*} \\
A\Delta_{c(\tau)} & \to & A@_R N(\tau)^*
\end{array} \qquad (15.22)$$

where res is the restriction map. We write $n\sigma = A\dot\Delta_{c(\sigma)}$, and therefore get a surjective morphism of diagrams of local compositions $A\Delta_{c(\sigma)} \to n\sigma$ over $n^*$. Setting $A\Delta_{n^*}/N = colim_{n^*}\, n\sigma$, we have a commutative diagram of sets

$$\begin{array}{ccc}
A\Delta_{c(\sigma)} & \to & n\sigma \\
\downarrow & & \downarrow \\
A\Delta_{n^*} & \xrightarrow{\ /N\ } & A\Delta_{n^*}/N
\end{array} \qquad (15.23)$$

induced by the dot morphisms of local compositions.
Definition 57 Call a module complex $N \subset n\Gamma(G^I)$ separating, iff for every zero-simplex (chart) $\sigma \in n_0(G^I)$, the dot map $\sigma \to A@_R N(\sigma)^*$ is injective.
Intuitively, this means that for any pair of points on any chart, there is a function of $N$ on this chart which separates these points. So, if in the above situation $N \subset n\Gamma(A\Delta_{n^*})$ is separating, the dot maps are all bijective onto the images since the vertical maps in diagram (15.22) are injective. Therefore, the colimit diagram (15.23) has bijective horizontal arrows, and the images $n\sigma$ are injected into the colimit $A\Delta_{n^*}/N$. So these images cover the colimit and the images of the zero-dimensional simplexes build a canonical atlas of a global A-addressed composition, i.e., the diagram (15.23) becomes a bijective morphism of A-addressed global compositions. So, if $N$ is separating, we have constructed a canonical global composition and a bijective morphism from the free object to a global composition which is defined by the functions of $N$.
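For a finite chart at the zero address the separation property is a plain injectivity test on the dot map. The sketch below assumes that the module is given by a finite list of generating functions (if all generators agree on two points, so does every linear combination, so testing generators suffices) and that the chart lists distinct points; the function names and the sample chart are hypothetical.

```python
def dot_map(point, functions):
    """Evaluate every generating function at the point (the zero-addressed dot map)."""
    return tuple(l(point) for l in functions)

def is_separating(chart, functions):
    """True iff the dot map is injective on the chart."""
    images = [dot_map(p, functions) for p in chart]
    return len(set(images)) == len(images)

# Hypothetical chart of three points in the plane and some affine functions on it.
chart = [(0, 0), (1, 0), (0, 1)]
constant = lambda p: 1
total = lambda p: p[0] + p[1]
first = lambda p: p[0]

not_enough = is_separating(chart, [constant, total])
enough = is_separating(chart, [constant, total, first])
```

Here `not_enough` is false because the constant and the coordinate sum cannot tell (1, 0) from (0, 1), while adding the first coordinate makes the family separating.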
Definition 58 We call this composition $A\Delta_{n^*}/N$ the N-quotient of $A\Delta_{n^*}$. The morphism $A\Delta_{n^*} \to A\Delta_{n^*}/N$ from diagram (15.23) is denoted by $/N$.
In particular, if $res_{G^I} : \Delta G^I \to G^I$ is the resolution of the A-addressed composition $G^I$, we have the resolution complex $\Delta n\Gamma(G^I)$. Clearly, the resolution complex is separating iff the complex of affine functions $n\Gamma(G^I)$ is. So there is a $\Delta n\Gamma(G^I)$-quotient if the complex of affine functions $n\Gamma(G^I)$ is separating, in which case we also say that $G^I$ is separating. Of course, this is a property which is invariant under bijective morphisms among A-addressed compositions.
Proposition 16 If the A-addressed composition $G^I$ admits a projective atlas, i.e., an atlas whose charts have projective R-modules, then $G^I$ is separating.
Proof. In this case, the canonical bidual map $U \to U^{**}$ is injective for each ambient space $U$ of a projective atlas, see appendix E.4.2. Now, if $s \ne t$ are two points in the chart $\sigma \subset A@_R U$, there is $a \in A$ with $s(a) \ne t(a)$, and we may take a linear form $h \in U^*$ such that $h(s(a)) \ne h(t(a))$ which means that the induced affine form $l = A@h|\sigma$ separates $s$ from $t$. QED.
Proposition 17 Suppose that $G^I$ is separating. Then we are given a commutative triangle of covering set isomorphisms

(15.24)

which stem from morphisms of global compositions except, possibly, for $f$.
To look for conditions when $f$ is a morphism, we may concentrate on the local situation. This means that we are given a separating local composition $S = \{s_0, \ldots s_n\} \subset A@_R U$ of cardinality⁶ $n + 1$. For the module $\Gamma(S)$ of affine functions $g : S \to A@_R R$ on $S$, we have the corresponding commutative triangle of set bijections:

(15.25)

which becomes a triangle of morphisms of local compositions if we can derive $f$ from an affine morphism $F : \Gamma(S)^* \to U$ of ambient modules. On the level of ambient modules, we have the following commutative square of affine morphisms:

(15.26)

where $b(u)(t) = t(u)$ is the extension of the bidual, where $q : U@_R R \to \Gamma(S)$ is the canonical surjection, and where $u_S$, $u_{\dot S}$ are the universal morphisms associated with $S$, $\dot S$, respectively.
Exercise 35 Verify the commutativity of diagram (15.26) by use of the universal property of the (n+1)-fold coproduct $A^{\sqcup n}$.
⁶Recall that in this discussion, we are considering global compositions with finite charts.
If $U$ is finitely generated and projective, the bidual $U \to U^{**}$ is an isomorphism (appendix E.4.2), and the dual $r : (U@_R R)^* \to U^{**}$ of the inclusion $U^* \to U@_R R$ defines a right-inverse $i$ of $b$. Therefore, we are given the commutative triangle

$$\Gamma(S)^* \xrightarrow{\ i \cdot q^*\ } U \qquad (15.27)$$

which yields the required morphism for $f$ in diagram (15.25). This means that
Theorem 15 If the composition $G^I$ has a finitely generated, projective atlas⁷, the factorization diagram of proposition 17 is a diagram of global compositions.
Proof. In fact, by proposition 16, $G^I$ is separating, and by the local diagram (15.27), applied to the charts of this situation, the map $f$ in diagram (15.24) is a morphism of global compositions. QED.
The factorization morphism $f$ has a chance to become an isomorphism if we succeed in constructing an ambient space morphism in the other direction. A sufficient condition for such an inverse morphism is that the surjection $q : U@_R R \to \Gamma(S)$ has a section, i.e., a left inverse $p : \Gamma(S) \to U@_R R$. This implies that on the duals, one has $Id_{\Gamma(S)^*} = q^*.p^*$, and we may take $p^*.b$ to go back from $U$ to $\Gamma(S)^*$. This condition is certainly satisfied if $\Gamma(S)$ is projective. We say that the global composition has projective functions iff its complex of affine functions is projective on the zero simplexes. So we have this result:
Theorem 16 If $G^I$ has a finitely generated projective atlas and projective functions, then the horizontal arrow $f$ in the factorization diagram of proposition 17 is an isomorphism.
This condition is evidently satisfied if $R$ is semi-simple, i.e., a finite product of commutative fields. This means that in this case, we are able to reconstruct $G^I$ from its retracted affine functions on the resolution. Moreover, in this case, the retracted module can also be recovered from the quotient composition, i.e.,

(15.28)

so that we are now left with the question of characterizing those module complexes of affine functions in $A\Delta_{n^*}$ which could give rise to compositions having this free object as their resolution. Clearly, the first condition on such a module complex is that it is separating. Secondly, we may suppose that it is finitely generated projective, i.e., its modules on the zero-dimensional simplexes are so. The first requirement is in particular the case for the global compositions in theorem 16 above since the charts have finitely generated projective ambient modules. The second requirement is the one we had in theorem 16. Also, under these conditions, the quotient composition has charts $\Gamma(S)^*$ which are again finitely generated projective and separating. The third and last requirement is obvious: $N$ should contain the constant functions. So we proceed to the analysis of module complexes $N \subset n\Gamma(A\Delta_n)$ which are separating, finitely generated projective, and contain the constant functions; call these complexes representative.
⁷All charts have finitely generated projective modules.
15.3
Orbits of Module Complexes Are Classifying
Summary. By means of the representation of a composition via its module complex of functions on the purely combinatorial standard composition, we obtain a classification frame for global compositions.
Let us first recall the overall situation of resolutions. For a global composition $G^I$, we have the functorially associated standard resolution $\Delta G^I = A\Delta_{n^*}$ with the standard covering $n^*$ of $G^I$, and the resolution bijection $res_{G^I} : \Delta G^I \to G^I$, as described in section 15.2.1. If a global composition $H^J$ is isomorphic to $G^I$ via a morphism $f/Id_A : G^I \to H^J$, which we now abbreviate by $f$ if no confusion is likely, we have a commutative diagram with horizontal isomorphisms:

$$\begin{array}{ccc}
A\Delta_{n^*} & \xrightarrow{\ res_\Delta f\ } & A\Delta_{n^*} \\
{\scriptstyle res_{G^I}}\downarrow & & \downarrow{\scriptstyle res_{H^J}} \\
G^I & \xrightarrow{\ f\ } & H^J
\end{array} \qquad (15.29)$$

By the universal property of the standard compositions, the automorphism group $S_{A,n^*}$ of the standard composition $A\Delta_{n^*}$ identifies to a subgroup of the symmetric group $\mathfrak{S}_{m+1}$ of permutations of $A\Delta_{n^*}$ if the standard covering is defined on the integer interval $[0, m]$ as discussed above. By retraction (see (15.4)), this group acts from the right on the set $Rep_{A,n^*}$ of representative module complexes on $A\Delta_{n^*}$:

$$ret : Rep_{A,n^*} \times S_{A,n^*} \to Rep_{A,n^*} : (N, g) \mapsto N|g. \qquad (15.30)$$

So, if our isomorphic global compositions $G^I$, $H^J$ have a finitely generated projective atlas and projective functions, their retracted function complexes $N_{G^I}$, $N_{H^J}$ on the standard composition $A\Delta_{n^*}$ are representative, and we have $N_{G^I} = N_{H^J}|res_\Delta f$. Conversely, if we are given a representative module complex on $A\Delta_{n^*}$, we have a bijective morphism

$$/N : A\Delta_{n^*} \to A\Delta_{n^*}/N \qquad (15.31)$$

(see definition 58). Since the charts of the quotient have ambient spaces $N(\sigma)^*$, the quotient has a finitely generated projective atlas. What are the functions of the quotient? Since the simplicial complexes are isomorphic, it is sufficient to consider the local situation on a chart. So we are given a representative module⁸ of affine functions $N \subset \Gamma(A\Delta_n)$ in a local standard composition. An affine function $f$ on the image $\dot\Delta \subset A@_R N^*$ is induced by an affine morphism $F : N^* \to R$, and since $N$ is finitely generated projective, we may write $F = e^t.n^*$, with the bidual $n^*$ of an element $n \in N$. So the retraction of the induced function $f$ on $\sigma_i$, and on an argument $a \in A$ evaluates to

(15.32)

and, since the constants are in $N$, we may absorb the constant $t$ in $n$, which means that the retraction of the function module of the image under $/N$ is $N$. Since this retraction is isomorphic to the function module of the quotient, the latter is also representative. Conversely, if we are given two representative module complexes $M, N \in Rep_{A,n^*}$, and an automorphism $g \in S_{A,n^*}$ with $N = M|g$, then we have a commutative diagram with isomorphism on the horizontal arrows

$$\begin{array}{ccc}
A\Delta_{n^*} & \xrightarrow{\ g\ } & A\Delta_{n^*} \\
{\scriptstyle /N}\downarrow & & \downarrow{\scriptstyle /M} \\
A\Delta_{n^*}/N & \to & A\Delta_{n^*}/M
\end{array} \qquad (15.33)$$

by the factorization of the resolution map $g \cdot /M$ through the quotient $A\Delta_{n^*}/N$. So conversely, isomorphic representative module complexes give rise to isomorphic quotients, and we have proven this:
Theorem 17 The orbit space $Rep_{A,n^*}/S_{A,n^*}$ is in bijection with the set of isomorphism classes of A-addressed global compositions with projective functions and finitely generated projective atlases which have a covering complex isomorphic to $n^*$. This bijection is induced by the retraction of the function module complex to the resolution $A\Delta_{n^*}$, in one direction, and by the quotient composition on a given representative module complex on $A\Delta_{n^*}$, in the other.
⁸So $N$ is finitely generated, projective, contains the constants, and is separating.
In particular, this classification result is valid for the global compositions having as their address a module A over a semi-simple commutative ring R, i.e., a finite direct product of commutative fields (see appendix E.2.3, theorem 48).
15.3.1
Combinatorial Group Actions
Summary. We discuss the action of the automorphism group of the standard composition on module complexes since it induces isomorphism classes of global compositions.
By the above classification theorem 17, we are interested in a more explicit description of the action of the automorphism group $S_{A,n^*}$ on the set $Rep_{A,n^*}$ of representative module complexes on the standard composition $A\Delta_{n^*}$. To begin with, the module complex of affine functions on $A\Delta_{n^*}$ reads as follows: We recall from section 15.2.1 that the standard covering $n^*$ may be given in form of a covering of the integer interval $[0, m]$ by subsets (the zero simplexes) $\sigma = \{t_0, \ldots t_{c(\sigma)}\}$ of $c(\sigma) + 1$ elements, and therefore by the standard atlas injections $i_\sigma : A\Delta_{c(\sigma)} \to A\Delta_m : \sigma_j \mapsto \sigma_{t_j}$. On each simplex $\sigma$ of this covering, we have a corresponding local standard composition $A\Delta_{c(\sigma)}$, and its function module is

$$\Gamma(A\Delta_{c(\sigma)}) \cong (A@_R R)^{c(\sigma)+1}$$

whereas the constants correspond to the diagonal submodule

$$\Delta(A@_R R) \subset (A@_R R)^{c(\sigma)+1}$$

and our module complex $N$ must be a module

$$\Delta(A@_R R) \subset N(\sigma) \subset (A@_R R)^{c(\sigma)+1}$$

for each simplex $\sigma$, finitely generated projective as well as separating in dimension zero. What does it mean that it is separating? It means that for any zero-dimensional simplex $\sigma$, and for any pair $i \ne j$ of indices between 0 and $c(\sigma)$, there is an element $n \in N(\sigma)$ such that its coordinates $n_i$ and $n_j$ differ. Moreover, for any two simplexes $\tau \subset \sigma$, we have a commutative diagram

$$\begin{array}{ccccc}
\Delta(A@_R R) & \hookrightarrow & N(\tau) & \hookrightarrow & (A@_R R)^{c(\tau)+1} \\
{\scriptstyle Id}\downarrow & & {\scriptstyle res}\downarrow & & \downarrow{\scriptstyle p} \\
\Delta(A@_R R) & \hookrightarrow & N(\sigma) & \hookrightarrow & (A@_R R)^{c(\sigma)+1}
\end{array}$$

with surjective vertical arrows induced by the projection to the right side. Further, an automorphism of the underlying global standard composition boils down to a permutation of the interval $[0, m]$ which is compatible with the covering. In particular, this is a finite group action which, on every module $N(\sigma)$, acts by induction from the permutation of components on the supporting direct sum module $(A@_R R)^{c(\sigma)+1}$. Recall the diagonal embedding and its factorization from formula (11.12) in section 11.3.2, which is equivariant for the given permutation group action. So we have a direct equivariant decomposition $(A@_R R)^{c+1} = \Delta(A@_R R) \oplus (A@_R R)^c$ which carries over to the submodules:

$$N(\sigma) = A@_R R \oplus N(\sigma)_{red}, \text{ with } N(\sigma)_{red} \subset (A@_R R)^c.$$

The action on the reduced factor has been described in formula (11.14) of section 11.3.2. So the above projection diagram reduces to

$$\begin{array}{ccc}
N(\tau)_{red} & \hookrightarrow & (A@_R R)^{c(\tau)} \\
{\scriptstyle res}\downarrow & & \downarrow{\scriptstyle p} \\
N(\sigma)_{red} & \hookrightarrow & (A@_R R)^{c(\sigma)}
\end{array}$$

It is known that $N(\sigma)$ is projective of finite type iff $A^*$ and $N(\sigma)_{red}$ are so (see appendix E.4.2). So, if we make the general hypothesis that $A^*$ is projective of finite type, we may concentrate on the reduced part of the module complex, i.e., on a finitely generated projective module complex $N_{red} = N/C$ and on the reduced group action.
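The statement that an automorphism of the standard composition boils down to a permutation of [0, m] compatible with the covering can be explored by brute force for small m. The sketch below is an illustration under assumptions: "compatible" is read as "maps every chart onto some chart", the zero address is assumed so that a function on the standard composition is just a coordinate vector indexed by [0, m], and the names and the sample covering are hypothetical.

```python
from itertools import permutations

def covering_automorphisms(m, charts):
    """Permutations of [0, m] (as tuples of images) mapping every chart onto a chart."""
    chart_sets = {frozenset(c) for c in charts}
    result = []
    for perm in permutations(range(m + 1)):
        if all(frozenset(perm[t] for t in c) in chart_sets for c in chart_sets):
            result.append(perm)
    return result

def retract(vector, perm):
    """Retraction of a coordinate vector along a permutation: (n|g)_i = n_{g(i)}."""
    return tuple(vector[perm[i]] for i in range(len(vector)))

# Hypothetical standard covering of [0, 4] by three charts.
charts = [[1, 3, 4], [0, 3, 4], [0, 2, 4]]
group = covering_automorphisms(4, charts)
moved = retract((10, 11, 12, 13, 14), group[-1])
```

For this covering only the identity and the involution exchanging 0 with 3 and 1 with 2 survive, so the orbits of module complexes under the action have at most two elements. The enumeration is factorial in m and is meant only for small illustrative cases.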
15.3.2
Classifying Spaces
Summary. This section is devoted to the classification theorem: There is an algebraic scheme whose rational points represent certain isomorphism classes of global compositions. The scheme is a fine moduli space in the sense of David Mumford.
To begin with, in the local situation of a local standard composition $A\Delta_c$ of dimension $c$, we consider a (covariant) Grassmann functor $X^r_{A,c} : ComRings_R \to Sets$ on all commutative R-algebras. Writing $W_{A,c} = (A@_R R)^c$, if $s : R \to S$ is an R-algebra, we have

$$Grass_{W_{A,c},r}(S) = \{V \subset S \otimes_R W_{A,c} \mid S \otimes_R W_{A,c}/V \text{ locally free of rank } r\}. \qquad (15.34)$$

Since⁹ $S \otimes_R W_{A,c} \cong ((S \otimes_R A)@_S S)^c$, this functor parametrizes S-modules of affine functions on $S \otimes_R A\Delta_c$ which have locally free¹⁰ quotients of rank $r$. Next, we have to deal with the separation property. We are in a similar situation as in the discussion of the local classification in section 11.3.2, where we had to deal with subfunctors of the Grassmannian (formula (11.21)) with respect to a selected point (formula (11.22)). Here, the S-valued ambient module is the scalar extension $S \otimes_R W_{A,c}$. For any pair $1 \le i \le j \le c$, consider the difference projections

$$p_{i,j} = \begin{cases} pr_i & \text{if } i = j, \\ pr_i - pr_j & \text{if } i \ne j, \end{cases} \qquad (15.35)$$

where $pr_i$ is the projection onto the i-th factor. The separation property means that no difference projection $p_{i,j}$ vanishes on the module $V \in X^r_{A,c}(S)$. For two indices $1 \le i \ne j \le c$, let $\Delta_{A,i,j,c}$ be the submodule of $W_{A,c}$ whose $i, j$ entries are equal, while the other entries vanish. Define these submodules of $W_{A,c}$:

$$V_{i,j} = \begin{cases} (A@_R R)^{i-1} \oplus 0 \oplus (A@_R R)^{c-i} & \text{if } i = j, \\ \Delta_{A,i,j,c} \oplus (A@_R R)^{i-1} \oplus 0 \oplus (A@_R R)^{j-i} \oplus 0 \oplus (A@_R R)^{c-j-1} & \text{if } i \ne j. \end{cases}$$

We have these quotients $S \otimes_R W_{A,c}/S \otimes_R V_{i,j} \cong (S \otimes_R A)@_S S$. Let us assume henceforth that $A$ is locally free of rank $m$. Then we have $S \otimes_R V_{i,j} \in Grass_{W_{A,c},m+1}(S)$. The separation condition for $V$ then means that the inclusion $V \subset S \otimes_R V_{i,j}$ holds for no pair $1 \le i \le j \le c$. For a fixed pair, the inclusion condition $V \subset S \otimes_R V_{i,j}$ defines a subfunctor $X^r_{i,j}$ of $X^r_{A,c}$ which is represented by a closed subscheme $Grass^r_{i,j}$ of the scheme $Grass^r_{A,c}$ over $Spec(R)$. In fact, it is the closed subscheme of the flag scheme $Drap^{r,m+1}$ defined by the (m+1)-codimensional flag component being fixed to $S \otimes_R V_{i,j}$. It is a closed subscheme since a section of the structural morphism of the Grassmannian is a closed immersion, see appendix F.5, lemma 86 for details.
⁹In fact, $S \otimes_R (A@_R R) \cong S \otimes_R (A^* \oplus R) \cong (S \otimes_R A)^* \oplus S \cong (S \otimes_R A)@_S S$. See [63] for scalar extensions on duals.
¹⁰The property "locally free" is equivalent to "finitely generated projective", see appendix F.2, theorem 58.
In other words, we have to take $V$ in the (open) complement scheme $O^r_{A,c}$ of all closed subschemes $Grass^r_{i,j}$ of $Grass^r_{A,c}$. So locally, the separation property defines an open subscheme of the Grassmannian which is also invariant under all permutations of the indexes, an action which is induced by the automorphisms of the underlying standard composition. So if the zero-dimensional simplexes of the global standard composition are $A\Delta_{c_i}$, $i = 1, 2, \ldots k$, our module complex is given by an open subscheme $O^{r_\cdot}_{A,c_\cdot} = \prod_{i=1,2,\ldots k} O^{r_i}_{A,c_i}$ of the fiber product of Grassmannians over $Spec(R)$. Since all combinations of coranks are possible, we should also take the coproduct $O_{A,c_\cdot} = \coprod_{r_\cdot} O^{r_\cdot}_{A,c_\cdot}$ of all these open subschemes on which the permutation (automorphism) group of the covering acts. To obtain the effective candidates, we have to consider coincidence on restrictions. So if $\rho$ is any simplex whose vertexes are $\sigma_1, \sigma_2$, we need to know that $N(\sigma_1)|\rho = N(\sigma_2)|\rho$. This evidently is a closed, equivariant condition. We have therefore shown that the representative module complexes over the R-algebra $S$ which are of any rank configuration on the respective simplexes define the S-valued points of a locally closed subscheme $C_{A,n^*}$ of a projective scheme over $Spec(R)$. Since any finite number of points is contained in an open affine subscheme in this situation (projective schemes, see appendix F.6), we know from appendix F.6, theorem 59, that the quotient scheme of orbits of the finite group action of $S_{A,n^*}$, in the sense of the difference cokernel of the group action $p$ and the first projection $pr_1$

$$C_{A,n^*} \times_{Spec(R)} S_{A,n^*} \;\underset{pr_1}{\overset{p}{\rightrightarrows}}\; C_{A,n^*},$$

exists. This means that we have this result:
Theorem 18 For an address $A$ which is locally free of rank $m$ over the commutative ring $R$, there is a subscheme $I_{n^*}$ of a projective $Spec(R)$-scheme of finite type such that its S-valued points $I_{n^*}(Spec(S))$ for an R-algebra $S$ are in bijection with the classifying orbits of module complexes $N$ in $S \otimes_R A\Delta_{n^*}$ which are locally free of defined co-ranks on the zero-simplexes of $n^*$.
In particular, if the ground ring $R$ is semi-simple, this theorem gives the classification of any global composition which is addressed in a finitely generated R-module $A$.
Chapter 16
Classifying Interpretations
Because the mathematical method is the science that currently possesses the most highly developed methodology, I was concerned to take it as a model that can help us to remedy our present weaknesses.
Pierre Boulez [60, II, p.71]
Summary. This chapter exposes criteria for characterizing interpretable compositions in terms of classifying spaces. This is a central issue since interpretability yields access to instrumental parameters for the physical 'rendering' of a compositional structure. In contrast to general classification, interpretable molecular structures are difficult to classify. We also review global enumeration theory as well as global American Set Theory.
We are not going to give further musicological comments and interpretations of the preceding and the following classification results. This will be dealt with in the next chapter. However, it should be kept in mind that this subject is far from pure mathematical exposition; the meaning of the classification techniques for musicology cannot be overestimated. It is the turning point between thinking and making music, between mental construction and physical realization. This is why affine functions are so important: they yield the entire potential of parametrizing music in acoustic realms. The interpretation of isomorphism classes in terms of points of a scheme also leads to a comparative theory of global compositions: We can now deal with the problem of which compositions are more generic than others, so this entire approach from algebraic geometry opens the subject of germinal vs. derived ideas in musical composition. In particular, the question of those compositions which can effectively be played on ensembles of musical instruments is embedded in the perspective of algebro-geometric specialization, and this is the view we shall deal with in this chapter.
369 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
16.1 Characterization of Interpretable Compositions
Summary. This section presents a condition for a composition to be interpretable. This condition regards the restriction behavior of affine functions, which relates to flasque sheaves.
Pursuing the path of the last chapter, we again assume that the address $A$ is a module over a commutative ring $R$, and we consider the category $\mathrm{ComGlob}_A$ of $A$-addressed commutative global compositions. As we were able to classify those commutative global compositions in $\mathrm{ComGlob}_A$ with $A$ locally free of a defined rank, and function modules which are locally free of defined coranks on the zero-simplexes of their resolutions, by the orbits of the retracted function module complexes, it should also be possible to characterize the interpretable compositions by their function module complexes.
Definition 59 A module complex on a global composition is called flasque iff every section on a zero-simplex stems from a global section.
For example, if a global composition $G^I$ is interpretable by the covering $I$ of a local composition $K \subset A@_R M$, for an $R$-module $M$, then we have a bijective morphism $f/Id_A : G^I \to K$, and the (global) functions of $K$ yield, by retraction, all the functions of $G^I$ on its charts (= zero-simplexes). This means:
Lemma 28 Interpretable global compositions have flasque function module complexes.
Moreover, if $G^I$ is interpretable by a projective module $M$, the global functions are separating; this follows from proposition 16 of section 15.2.2. If a local composition has an ambient module $M$ which is not projective, quite pathological things may happen.
Example 30 Let us study such an example. Take the ring $R = \mathbb{Z}$ of integers, a free module $I = R^d$ of defined rank $d$, and take a non-zero prime ideal $m = (p)$ in $R$. Consider the $R$-module $M = I \oplus R/m$, and any address $A$. Clearly, we have $M@_R R \cong e^R.I^*$ since all linear forms on $R/m$ vanish. Consider a local composition $L = \{x, y\} \subset A@_R M$ with $x = e^{\xi}.u$ and $y = e^{\eta}.u$, where $\xi, \eta \in R/m$ are two different elements, and where $u : A \to I$ is a surjection. Then no affine function on $M$ can separate $x$ from $y$ since their difference is annihilated by any affine function. Moreover, if we consider the interpretation of $L$ by the two singleton charts $\{x\}, \{y\}$, we have free function modules $\Gamma(\{x\}), \Gamma(\{y\}) \cong R \oplus I^*$ of rank $d + 1$. We therefore have an interpretation with locally free function modules and charts whereas the global functions are not separating, essentially because the ambient module is not projective.
But we have the following:
Proposition 18 Take $G^I$ in $\mathrm{ComGlob}_A$ with $A$ locally free of rank $m$, and function modules which are locally free of defined co-ranks on the zero-simplexes of their resolutions. Suppose that its function module is flasque, and that the global functions are separating. Then $G^I$ is interpretable.
Proof. We know from the quotient construction of global compositions from function modules which backs the classification theorem 18 in section 15.3.2 that an atlas of $G^I$ is given by the duals $N(\sigma)^*$ of the functions on the given charts $\sigma$ of $G^I$. So on the one hand, $G$ is injected into $A@_R\Gamma(G)^*$ with ambient module $\Gamma(G)^*$. On the other hand, we have a split$^1$ injection $i_\sigma : N(\sigma)^* \to \Gamma(G)^*$ into the dual of the global function module since the latter is flasque, and since by hypothesis, the function module $N(\sigma)$ and therefore its dual is locally free of defined rank. So the two charts $\sigma \subset A@_R\Gamma(G)^*$ and $\sigma \subset A@_R N(\sigma)^*$ are isomorphic, and we have reconstructed $G^I$ as an interpretation. QED.
In particular, if the ring $R$ is semi-simple, we obtain a complete characterization of interpretable global compositions via their function modules in the resolution given in theorem 18 in section 15.3.2. But in the general case, a necessary and sufficient condition for module complex classes to yield interpretable global compositions is not known.
Exercise 36 Let $A$ be the zero address, and $R$ be an infinite field. Take $\mu = \max_\sigma(\dim(R.\sigma))$, the maximum of dimensions of the modules associated with the charts $\sigma$ of $G^I$. Then an interpretation of $G^I$ which is guaranteed in proposition 18 can be constructed in an ambient vector space of dimension $\mu$.
Before terminating this classification discourse, we should add examples of non-interpretable global compositions. We shall discuss a further example of this type in chapter 17, but with a more semantic orientation. Here, we simply give the examples as facts of the classification discourse.
Example 31 We discuss a non-interpretable global composition $G^I$ in the zero address, and over the field of real numbers, see figure 16.1. It has six points,
and three charts for its covering, i.e., $I = \{U_1, U_2, U_3\}$ with
$$U_1 = \{x_1, x_2, x_3, x_4\},\quad U_2 = \{x_1, x_2, x_5, x_6\},\quad U_3 = \{x_3, x_4, x_5, x_6\}.$$
The three covering charts are all in bijection with a 'square vertex' composition $S = \{a = (0,0), b = (1,0), c = (1,1), d = (0,1)\} \subset 0@_{\mathbb{R}}\mathbb{R}^2$ as follows:
$$\begin{aligned}
U_1 \cong S &: x_1 \mapsto a,\ x_2 \mapsto d,\ x_3 \mapsto b,\ x_4 \mapsto c,\\
U_2 \cong S &: x_1 \mapsto a,\ x_2 \mapsto d,\ x_5 \mapsto c,\ x_6 \mapsto b,\\
U_3 \cong S &: x_3 \mapsto b,\ x_4 \mapsto c,\ x_5 \mapsto a,\ x_6 \mapsto d,
\end{aligned}$$
so that the intersections of any two of these charts yield an isomorphism of a two-point composition in $\mathbb{R}^2$. If we visualize this configuration by means of a subdivision of each chart as a union of two triangular surfaces, we obtain a Möbius strip, see figure 16.1. The resolution of $G^I$ in the same surface representation is also shown. Here, the charts are no longer plane compositions but tetrahedra in three-space.
1 The image is a direct summand.
Figure 16.1: A non-interpretable composition and its resolution.
Let us show that $G^I$ is not interpretable. By the above results, it is sufficient to see that the global affine functions on this composition do not separate points. Let $f : G \to \mathbb{R}$ be an affine function (we suppress the zero address and just work in the respective ambient spaces), set $f(x_i) = f_i$ for $i = 1, 2, 3, 4, 5, 6$ and $f_{i,j} = f_i - f_j$. Since $f$ is affine on each chart and $a - d = b - c$ in $S$, we have on chart $U_1$: $f_{1,2} = f_{3,4}$, on chart $U_2$: $f_{1,2} = f_{6,5}$, on chart $U_3$: $f_{3,4} = f_{5,6}$. Therefore $f_{6,5} = f_{5,6}$, and $f_{5,6} + f_{6,5} = 0$ by definition, so $0 = f_{1,2} = f_{3,4} = f_{5,6}$, and none of the pairs $(x_1, x_2), (x_3, x_4), (x_5, x_6)$ can be separated by $f$.
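The non-separation argument of Example 31 can be checked mechanically. The following is a minimal sketch, assuming the chart bijections onto the square as read from the text above; the helper names (`chart_constraint`, `rank`, `difference`) are hypothetical and introduced only for illustration. It uses the fact that a function on the four square vertices is affine iff $f(a) - f(b) + f(c) - f(d) = 0$, so each chart contributes one linear condition on $(f_1, \ldots, f_6)$.

```python
from fractions import Fraction

# Chart bijections onto the square S = {a, b, c, d} (assumed reading of Example 31).
CHARTS = {
    "U1": {1: "a", 2: "d", 3: "b", 4: "c"},
    "U2": {1: "a", 2: "d", 5: "c", 6: "b"},
    "U3": {3: "b", 4: "c", 5: "a", 6: "d"},
}

def chart_constraint(chart):
    """Row of the condition f(a) - f(b) + f(c) - f(d) = 0 in the variables f_1..f_6."""
    sign = {"a": 1, "b": -1, "c": 1, "d": -1}
    row = [Fraction(0)] * 6
    for point, vertex in chart.items():
        row[point - 1] = Fraction(sign[vertex])
    return row

def rank(rows):
    """Rank of a rational matrix by Gauss-Jordan elimination."""
    rows = [list(r) for r in rows]
    rk = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(rk, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for i in range(len(rows)):
            if i != rk and rows[i][col] != 0:
                factor = rows[i][col] / rows[rk][col]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[rk])]
        rk += 1
    return rk

def difference(i, j):
    """Row of the linear form f_i - f_j."""
    row = [Fraction(0)] * 6
    row[i - 1], row[j - 1] = Fraction(1), Fraction(-1)
    return row

constraints = [chart_constraint(c) for c in CHARTS.values()]
dim_global_functions = 6 - rank(constraints)

def separated(i, j):
    """x_i and x_j are separated iff f_i - f_j is not forced to vanish by the constraints."""
    return rank(constraints + [difference(i, j)]) > rank(constraints)

unseparated_pairs = [(i, j) for i in range(1, 7) for j in range(i + 1, 7)
                     if not separated(i, j)]
```

Under these assumptions the space of global affine functions is three-dimensional, which agrees with the value of $H^0$ quoted for this composition in example 33, and exactly the three pairs named in the text remain unseparated.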
16.1.1 Automorphism Groups of Interpretable Compositions
Summary. We show that every finitely generated abelian group can be represented by the automorphism group of an interpretable composition.
Theorem 19 Let $H$ be a finitely generated abelian group. Then there is a zero-addressed interpretation $G^I$ over $R = \mathbb{R}$ with $H \cong \mathrm{Aut}(G^I)$.
The proof idea runs as follows. Suppose that we have $H \cong \mathbb{Z}^r \times \mathbb{Z}_{m_1} \times \ldots \times \mathbb{Z}_{m_k}$. We shall construct a disjoint union $M_r \sqcup M_{m_1} \sqcup \ldots \sqcup M_{m_k}$ of mutually non-isomorphic connected interpretations $M_r, M_{m_i}$ with $\mathrm{Aut}(M_r) \cong \mathbb{Z}^r$ and $\mathrm{Aut}(M_{m_i}) \cong \mathbb{Z}_{m_i}$. The cell of our interpretations is a zero-addressed local composition $L_{\alpha,\beta} = \{0, \alpha, \beta, 1\} \subset 0@_{\mathbb{R}}\mathbb{R}$ for any pair $0 < \alpha < \beta < 1$ of real numbers such that $\alpha + \beta \neq 1$. Clearly, $L_{\alpha,\beta}$ is rigid, i.e., $\mathrm{Aut}(L_{\alpha,\beta}) = Id$. Moreover, any two $L_{\alpha,\beta}, L_{\gamma,\delta}$ are isomorphic iff $\{\alpha,\beta\} = \{\gamma,\delta\}$. Given a directed graph $D$ without zero-loops$^2$, take the global composition $Ex(D, L_{\alpha,\beta})$, the extension of $L_{\alpha,\beta}$ by $D$, which has the following structure: Its charts are all isomorphic to $L_{\alpha,\beta}$, and correspond one-to-one to the arrows of the graph $D$. Map the elements 0 in the copies of $L_{\alpha,\beta}$ to the graph vertexes corresponding to the arrow tails, whereas the elements 1 in the copies of $L_{\alpha,\beta}$ are mapped to the graph vertexes corresponding to the arrow heads. Identify these elements iff their images under this map coincide. Intuitively, this means that we replace each arrow by a copy of $L_{\alpha,\beta}$, with the 0 on the tail and the 1 on the head. The graph $D$ canonically identifies to the global subcomposition of $Ex(D, L_{\alpha,\beta})$ defined by the arrow tails and heads:
$$D \rightarrowtail Ex(D, L_{\alpha,\beta}) \tag{16.1}$$
What are the automorphisms of such an extension? They are related to an underlying automorphism of the arrow set of $D$. And on each chart $L_{\alpha,\beta}$ for a given arrow, the chart morphism must be the identity since the charts are rigid. So the automorphism must also conserve heads and tails of arrows, and therefore we have this fact:
Lemma 29 The automorphism group of $Ex(D, L_{\alpha,\beta})$ is isomorphic to the automorphism group of the directed graph $D$. No two extensions are isomorphic if their cells are not.
Exercise 37 Give a proof of this lemma.
Suppose that the graph $D$ has no multiple arrows (independent of their direction). Then such an extension can be realized as an interpretation in $\mathbb{R}^3$. We are now ready to define the interpretation with the required automorphism group. Take the group $\mathbb{Z}^r \times \mathbb{Z}_{m_1} \times \ldots \times \mathbb{Z}_{m_k}$, and fix $r + k$ mutually different pairs $\alpha_i, \beta_i$ as indexes of mutually non-isomorphic cells $L_i = L_{\alpha_i,\beta_i}$. Consider the directed graph $D_\infty$ with vertexes $v_i, i \in \mathbb{Z}$ and arrows $a_i : v_i \to v_{i+1}, i \in \mathbb{Z}$. We have $\mathrm{Aut}(D_\infty) \cong \mathbb{Z}$, the group of translations of $\mathbb{Z}$. For a positive integer $m$, consider the regular polygon graph $D_m$ with vertexes $v_i, i = 0, \ldots m-1$ and arrows $a_i : v_i \to v_{i+1}, i = 0, \ldots m-1$, where we close the polygon by defining $v_m = v_0$. Clearly, $\mathrm{Aut}(D_m) \cong \mathbb{Z}_m$. We then consider the extensions $E_i = Ex(D_\infty, L_i), i = 1, \ldots r$, and $E_i = Ex(D_{m_{i-r}}, L_i), i = r+1, \ldots r+k$. No two of these interpretable compositions are isomorphic and we have $\mathrm{Aut}(E_i) \cong \mathbb{Z}, i \leq r$, as well as $\mathrm{Aut}(E_i) \cong \mathbb{Z}_{m_{i-r}}, r < i \leq r+k$. It is now immediate that the disjoint union
$$G^I = \coprod_{i=1,\ldots r+k} E_i \tag{16.2}$$
does the job, i.e., $\mathrm{Aut}(G^I) \cong \mathbb{Z}^r \times \mathbb{Z}_{m_1} \times \ldots \times \mathbb{Z}_{m_k}$. QED.
2 No arrows with identical domain and codomain.
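The claim that the regular polygon graph $D_m$ has automorphism group $\mathbb{Z}_m$ can be verified by brute force for small $m$. The sketch below is illustrative only: the function names are hypothetical, and it assumes (as in the text) that there are no multiple arrows, so that an automorphism is determined by its action on the vertexes.

```python
from itertools import permutations

def polygon_arrows(m):
    """Arrows a_i : v_i -> v_{i+1} of the polygon graph D_m, with indices taken mod m."""
    return {(i, (i + 1) % m) for i in range(m)}

def digraph_automorphisms(n_vertices, arrows):
    """All vertex permutations mapping the arrow set onto itself (exhaustive search)."""
    return [perm for perm in permutations(range(n_vertices))
            if {(perm[s], perm[t]) for s, t in arrows} == arrows]

def rotations(m):
    """The m cyclic shifts i -> i + k (mod m), written as permutation tuples."""
    return {tuple((i + k) % m for i in range(m)) for k in range(m)}

automorphisms_of_d5 = digraph_automorphisms(5, polygon_arrows(5))
```

For $m = 5$ the search finds exactly the five rotations. By Lemma 29 the same group is then the automorphism group of the extension $Ex(D_5, L_{\alpha,\beta})$.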
16.1.2 A Cohomological Criterion
Summary. Module complexes define cochain complexes and their cohomology modules. We derive a necessary condition for the vanishing of the first cohomology and for the complex of the module complex of affine functions to be flasque, and hence the interpretability of the composition.
If $G^I$ is a global composition at address $A$, everything over the commutative ring $R$, we may define the cochain complex $C^*(G^I)$ as follows, and in congruence with the general cohomology theory of coefficient systems (see appendix H.3.1). Denote by $S_k n(G^I)$ the set of singular $k$-simplexes of the nerve $n(G^I)$. For each $k \geq 0$, we have the module
$$C^k(G^I) = \bigoplus_{s \in S_k n(G^I)} n\Gamma(s) \tag{16.3}$$
of $k$-cochains of functions. We have a linear differential coboundary map for each $k$:
$$d^k : C^k(G^I) \to C^{k+1}(G^I) \tag{16.4}$$
where for any singular $(k+1)$-simplex $s = (i_0, i_1, \ldots i_{k+1})$, we set
$$d^k f(s) = \sum_{j=0,1,\ldots k+1} (-1)^j f(s_j)|_s \tag{16.5}$$
where $s_j$ is the face of $s$ after omitting vertex $i_j$. It is well known that we have $d^{i+1} \cdot d^i = 0$, and therefore, the usual cohomology modules $H^k(G^I) = Z^k/B^k$ with $Z^k = \mathrm{Ker}(d^k)$, $B^k = \mathrm{Im}(d^{k-1})$ for positive $k$, and $H^0(G^I) = \mathrm{Ker}(d^0)$, which evidently identifies to the module $\Gamma(G^I)$ of global sections.
Proposition 19 Suppose that the $A$-addressed global composition $G^I$ has a nerve $n(G^I)$ which is a finite acyclic graph. Then $H^1(G^I) = 0$, and $n\Gamma(G^I)$ is flasque.
Proof. The second statement is obvious by induction on the cardinality of the nerve: There are leaves, i.e., vertexes which are connected to (at most) one other vertex. Take such a leaf, omit it, and extend a function on the rest to the omitted leaf. For the vanishing of first cohomology, we may suppose that the nerve is connected and take a cochain $f = (f(i,j))_{(i,j) \in S_1 n(G^I)}$ in $Z^1$. The vanishing of $df$ means that we have
$$\big(f(i_1, i_2) - f(i_0, i_2) + f(i_0, i_1)\big)|_{i_0 \cap i_1 \cap i_2} = 0$$
for any singular simplex $(i_0, i_1, i_2)$. In particular, we have $f(i,i) = 0$, and therefore, $f(i,j) + f(j,i) = 0$ for any singular 1-simplex $(i,j)$. We may suppose that there is a tree (a directed acyclic graph) $T$ such that its undirected image $|T|$ is isomorphic to $n(G^I)$. Denote by $A_T$ the arrow set and by $V_T$ the vertex set of $T$. We then conclude the existence of a 0-chain $g = (g(i))_{i \in I} \in C^0(G^I)$ with $dg = f$ by the following lemma (QED).
Lemma 30 Let $f \in \bigoplus_{(i,j) \in A_T} \Gamma(i \cap j)$, then there exists $g \in \bigoplus_{i \in V_T} \Gamma(i)$ $(= C^0(G^I))$ such that $g(j) - g(i)|_{i \cap j} = f(i,j)$ for all $(i,j) \in A_T$.
Proof. Induction on $\mathrm{card}(A_T) \geq 1$. For $\mathrm{card}(A_T) = 1$, suppose that $f$ resides on $i, j$. Extend $f(i,j)$ to $h$ on $j$, and choose $g(j) = h$, $g(i) = 0$. For the general case, take an $i_0 \in V_T$ which is the head of a non-empty set $A'$ of arrows, and where no arrow tails exist. Consider the subgraph $T'$ of $T$ obtained after removing $i_0$ and $A'$. It is a not necessarily connected graph, in fact a forest with possibly several maximal subtrees, i.e., connected components. By induction, there is a cochain $g' \in \bigoplus_{i \in V_{T'}} \Gamma(i)$ such that $g'(j) - g'(i)|_{i \cap j} = f(i,j)$ for all $(i,j) \in A_{T'}$. Since $T$ is a tree, every arrow $(i, i_0) \in A'$ defines exactly one connected component $Z_i$ of $T'$ which contains $i$. We may add any global section $d$ to the system $(g'_j)_{j \in Z_i}$ without altering the differences which yield $g'(i) - g'(j)|_{i \cap j}$. Since $n\Gamma(G^I)$ is flasque, we may therefore choose $g'(i)$ freely; set $g'_i = f(i_0, i)$. The final definition of $g$ runs as follows: Take $g(i) = g'(i)$ for all vertexes except $i_0$, and $g(i_0) = 0$. This does the job. QED.
The following examples (which are also exercises) show that we are far from understanding the connection of cohomology of global compositions to the problem of interpretability.
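The content of Lemma 30 is easiest to see in a simplified setting. The sketch below assumes constant scalar coefficients, i.e., every $\Gamma(i)$ and $\Gamma(i \cap j)$ is replaced by the rationals or reals and all restriction maps are identities; this is an assumption made for illustration and not the general situation of the lemma. It also uses a breadth-first traversal instead of the leaf-removal induction of the proof. The function name `integrate_on_tree` is hypothetical.

```python
from collections import defaultdict, deque

def integrate_on_tree(arrows):
    """Given {(i, j): f_ij} on the arrows of a tree or forest, return g with
    g[j] - g[i] == f_ij for every arrow (i, j)."""
    neighbours = defaultdict(list)
    for (i, j), value in arrows.items():
        neighbours[i].append((j, value))    # along the arrow: g(j) = g(i) + f(i, j)
        neighbours[j].append((i, -value))   # against the arrow: g(i) = g(j) - f(i, j)
    g = {}
    for root in list(neighbours):
        if root in g:
            continue
        g[root] = 0                         # free choice on each connected component
        queue = deque([root])
        while queue:
            i = queue.popleft()
            for j, value in neighbours[i]:
                if j not in g:
                    g[j] = g[i] + value
                    queue.append(j)
    return g

example_arrows = {("a", "b"): 2, ("c", "b"): -1, ("b", "d"): 5}
potential = integrate_on_tree(example_arrows)
```

The traversal never meets a vertex twice along different paths because the graph is acyclic, which is exactly where the hypothesis on the nerve enters. On a graph with a cycle the same procedure can produce a $g$ that violates the condition on the closing arrow.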
Example 32 Consider the interpretation $G^I$ of the zero-addressed local composition $G \subset \mathbb{R}^2$ by three charts $G_1, G_2, G_3$ as shown in figure 16.2. Since every intersection $G_i \cap G_j$ is generating, we have $\Gamma(G^I) \cong \Gamma(G_i) \cong \Gamma(G_i \cap G_j)$. We have $C^0(G^I) \cong (\mathbb{R}^2@\mathbb{R})^3$, $H^0(G^I) \cong \mathbb{R}^3$, therefore $B^1(G^I) \cong \mathbb{R}^6$, and$^3$ $Z^1(G^I) \cong (\mathbb{R}^2@\mathbb{R})^3 \cong \mathbb{R}^9$, whence $H^1(G^I) \cong \mathbb{R}^3$.
Figure 16.2: An interpretation with non-vanishing first cohomology.
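The dimension count behind this example can be written out as follows. It uses only the isomorphisms stated in the example, the fact that the affine functions $\mathbb{R}^2@\mathbb{R}$ form a three-dimensional real vector space, and the rank formula for $d^0$ (so that $B^1 = \mathrm{Im}(d^0) \cong C^0/\mathrm{Ker}(d^0)$).

```latex
\begin{align*}
\dim C^0(G^I) &= 3 \cdot \dim(\mathbb{R}^2@\mathbb{R}) = 3 \cdot 3 = 9,\\
\dim B^1(G^I) &= \dim C^0(G^I) - \dim H^0(G^I) = 9 - 3 = 6,\\
\dim H^1(G^I) &= \dim Z^1(G^I) - \dim B^1(G^I) = 9 - 6 = 3.
\end{align*}
```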
Example 33 This example is the "Möbius strip" composition introduced in example 31 above. It is not interpretable since the functions are not flasque; however, we have the global functions $\Gamma(G^I) \cong H^0(G^I) \cong \mathbb{R}^3$, and $H^1(G^I) = 0$.
3 Please check!
Example 34 Take the zero-addressed global composition over the finite field $R = \mathbb{Z}_3$, defined as a gluing of two copies of $\mathbb{Z}_3 = 0@_{\mathbb{Z}_3}\mathbb{Z}_3$ along two common points, see figure 16.3. The composition cannot be interpreted since the three points in each chart are collinear, but it is not possible to maintain this property in a common ambient $\mathbb{Z}_3$-vector space while distinguishing the two non-common points of the charts!
Figure 16.3: A non-interpretable composition with acyclic nerve and vanishing first cohomology.
16.2 Global Enumeration Theory
Summary. Enumeration of interpretations in special cases, such as tesselations or canons, has a strong root in the American tradition. Contributions by George Halsey, Edwin Hewitt, David Lewin, Dan Tudor Vuza, and Harald Fripertinger are discussed.
Enumeration theory deals with the calculation of isomorphism classes of interpretations of local compositions in finite abelian groups, i.e., in finite $\mathbb{Z}$-modules. Since these are never projective, the techniques of global classification presented in chapter 15 do not apply here. Further, to this date, only very special non-trivial coverings have been classified: Global enumeration theory is mostly restricted to zero-dimensional nerves; Fripertinger's recent classification of canons is an exception.
16.2.1 Tesselation
Summary. We review the work by George Halsey and Edwin Hewitt [204] on the tesselation of finite $\mathbb{Z}$-modules (abelian groups).
A finite $\mathbb{Z}$-module $M$ is a zero-addressed local composition, and we may consider its interpretations $M^I$ with mutually disjoint (non-empty) charts, i.e., with discrete nerves. Recall from sorite 6 that any isomorphism $f : U \xrightarrow{\sim} V$ between two such charts may be extended to an affine automorphism of $M$. Fix a subgroup $G \subset \mathrm{Aut}(M)$. If these charts $(U, M)$ are all isomorphic under $G$, we call $M^I$ a $G$-isotypic tesselation of $M$. So this is an interpretation by mutually isomorphic charts which can even be transformed into each other by a transformation in $G$, in other words, the class nerve $CN(M^I)$ (see section 14.5) is discrete and has constant
weights under $G$. We have the canonical bijective tesselation morphism $M^I \to M$ of zero-addressed compositions. The classification in [204] deals with these tesselation morphisms: A $G$-isomorphism between such tesselation morphisms $M^I \to M$ and $M^J \to M$ is an element $g \in G$ which defines this commutative diagram
$$\begin{array}{ccc} M^I & \longrightarrow & M \\ \downarrow & & \downarrow g \\ M^J & \longrightarrow & M \end{array} \tag{16.6}$$
of global compositions$^4$. In [204] a tesselating chord is defined as an isomorphism class of tesselation morphisms on $M$ under the group $G = e^M$ of translations on $M$. For any natural number $j$, we denote by $f_t(M, j)$ the number of tesselating chords in $M$ which represent tesselations by charts of cardinality $j$. We have two formulas for special cases: one for $j = 2$, and one for $j = 3$ if the 3-Sylow group$^5$ is cyclic.
Theorem 20 [204, Satz 11.10] Let $m(2)$ be the number of direct cyclic summands in the 2-Sylow group of $M$, and let $c$ be the number of elements of odd order in $M$. Then we have
$$f_t(M, 2) = \tfrac{1}{2}\left(\mathrm{card}(M) + 2^{m(2)} - c - 1\right).$$
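For cyclic groups $M = \mathbb{Z}_n$ the formula of Theorem 20 can be compared with an exhaustive count. The sketch below makes two assumptions that should be read as such: a tesselating chord is counted as a translation class of two-element subsets $U$ that tile $\mathbb{Z}_n$ by translates, and for $\mathbb{Z}_n$ we take $m(2) = 1$ if $n$ is even (else 0) and $c$ equal to the odd part of $n$. All function names are hypothetical.

```python
from itertools import combinations

def tiles_by_translation(n, tile):
    """True iff Z_n is a disjoint union of translates of `tile` (backtracking exact cover)."""
    def extend(covered):
        if len(covered) == n:
            return True
        x = min(set(range(n)) - covered)          # smallest element still uncovered
        for u in tile:                            # translate so that u lands on x
            translate = frozenset((x - u + t) % n for t in tile)
            if not (translate & covered) and extend(covered | translate):
                return True
        return False
    return extend(frozenset())

def translation_class(n, tile):
    """Canonical representative of the translation orbit of `tile` in Z_n."""
    return min(tuple(sorted((x + t) % n for x in tile)) for t in range(n))

def count_tesselating_chords(n, size=2):
    """Number of translation classes of `size`-element subsets that tile Z_n."""
    classes = {translation_class(n, c) for c in combinations(range(n), size)}
    return sum(1 for c in classes if tiles_by_translation(n, c))

def theorem_20_cyclic(n):
    """Right-hand side of Theorem 20 for M = Z_n (assumed specialization)."""
    m2 = 1 if n % 2 == 0 else 0
    odd_part = n
    while odd_part % 2 == 0:
        odd_part //= 2
    return (n + 2 ** m2 - odd_part - 1) // 2
```

For example, both functions give 5 for $n = 12$: the tiling two-chords are those of interval 1, 2, 3, 5 and 6, whereas interval 4 has odd order in $\mathbb{Z}_{12}$ and does not tile.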
Theorem 21 [204, Satz 11.15] If the 3-Sylow group $S_3$ of $M$ is cyclic, i.e., $S_3 \cong \mathbb{Z}_{3^\lambda}$, and if $\mathrm{card}(M/S_3) = d$, we have
In principle, the number $f_t(M, j)$ can be calculated by Sands' algorithm for all finite $\mathbb{Z}$-modules $M$ which are Hajós groups (this property is related to Hajós' solution of Minkowski's problem; see [204, §12] for further references). For a local composition $(W, M)$, denote by $Trans(W)$ the translation symmetries of $W$.
Definition 60 A finite abelian group $M$ is called a Hajós group if either $M$ is trivial, or for every tesselation $M = \bigcup_{v \in V} e^v(U)$ by translates of a subset $U$, either $Trans(U)$ or $Trans(V)$ is not trivial.
All finite cyclic Hajós groups have been classified ([204, §12]). They are the groups of order $n = 1$, $n = p^\alpha$, $n = p^\alpha q$ for positive natural $\alpha$, $n = p^2q^2$, and $n = p^2qr$, $n = pqr$, $n = pqrs$, for any distinct primes $p, q, r, s$. In particular, $\mathbb{Z}_{72}$ is the smallest cyclic non-Hajós group. Based on an algorithm similar to Sands' algorithm, the numbers of tesselating chords have been calculated in [204, §12] for all cyclic groups of order $\leq 24$ (they are automatically Hajós by the above). Their list is given in table (12.13) in [204].
4 Instead, we could equivalently consider the interpretation which adds the chart $M$ to the atlas $I$ and then just look for $G$-isomorphisms of such interpretations of $M$.
5 See appendix C.3.4.2.
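The list of orders of cyclic Hajós groups quoted above translates directly into a test on the prime exponents of $n$. The following sketch encodes only that list (it does not verify the Hajós property itself); the function names are hypothetical.

```python
from itertools import count

def prime_exponents(n):
    """Exponents of the prime factorization of n, sorted in decreasing order."""
    exponents = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            e = 0
            while n % p == 0:
                n //= p
                e += 1
            exponents.append(e)
        p += 1
    if n > 1:
        exponents.append(1)
    return sorted(exponents, reverse=True)

def is_cyclic_hajos_order(n):
    """True iff n is 1, p^a, p^a q, p^2 q^2, p^2 q r, p q r or p q r s (distinct primes)."""
    e = prime_exponents(n)
    if len(e) <= 1:
        return True
    if len(e) == 2:
        return e[1] == 1 or e == [2, 2]
    if len(e) == 3:
        return e in ([2, 1, 1], [1, 1, 1])
    if len(e) == 4:
        return e == [1, 1, 1, 1]
    return False

smallest_non_hajos = next(n for n in count(1) if not is_cyclic_hajos_order(n))
```

The first order rejected by this test is $72 = 2^3 \cdot 3^2$, in agreement with the statement that $\mathbb{Z}_{72}$ is the smallest cyclic non-Hajós group, and every order up to 24 passes.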
16.2.2 Mosaics
Summary. We review the work by Harald Fripertinger [172] on the enumeration of mosaics, i.e., zero-addressed discrete interpretations of finite cyclic groups.
This classification deals with discrete interpretations $\mathbb{Z}_n^I$ of $\mathbb{Z}_n$ by $\mathrm{card}(I) = k$ non-empty charts of arbitrary size. So the ambient module is more special than in section 16.2.1, but the partition is not necessarily isotypical. Also, in this case the calculation of isomorphism classes is not restricted to translations. Again, we select a subgroup $G \subset \mathrm{Aut}(\mathbb{Z}_n) \cong e^{\mathbb{Z}_n}.\mathbb{Z}_n^\times$, and we say that a $k$-partition is the canonical morphism $Part(I) = \mathbb{Z}_n^I \to \mathbb{Z}_n$ of a discrete interpretation $\mathbb{Z}_n^I$ with $\mathrm{card}(I) = k$. Isomorphisms among partitions $Part(I), Part(J)$ are defined by commutative diagrams as for tesselations. We ask for $g \in G$ that the diagram
$$\begin{array}{ccc} \mathbb{Z}_n^I & \longrightarrow & \mathbb{Z}_n \\ \downarrow & & \downarrow g \\ \mathbb{Z}_n^J & \longrightarrow & \mathbb{Z}_n \end{array} \tag{16.7}$$
commutes. With this, an orbit of partitions is called a $G$-mosaic. According to the general combinatorial Pólya methodology applied by Fripertinger, the calculation of mosaic numbers requires the action of a finite group on a set of objects which represent the partitions. To this end, a partition $P$ is identified with an orbit of a function $p : \mathbb{Z}_n \to [1, n]$ under the left action of the symmetric group $S_n$, i.e., $P$ corresponds to an orbit $S_n.p$. The reason is that the fibers of the function $p$ define a partition of $\mathbb{Z}_n$, and that we are only interested in the fibers, no matter which values they stem from, whence the action of the symmetric group on the value domain. On the other hand, the isomorphism $g \in G$ acts from the right on $p$, and we may identify the mosaics with the orbits of the action of the direct product $S_{[1,n]} \times G^{opp}$ on the function set $[1, n]^{\mathbb{Z}_n}$. Of course, the set $Mos^G_n$ of $G$-mosaics in $\mathbb{Z}_n$ is partitioned into the subsets $Mos^G_{n,k}$ of $k$-element partition mosaics for $k = 1, 2, \ldots n$. These subsets correspond to the sets of functions $p : \mathbb{Z}_n \to [1, n]$ with $\mathrm{card}(\mathrm{Im}(p)) = k$. As usual in Pólya theory, we need to calculate the cycle index $Z(G) = Z(G, P)$ as defined in section 11.4.1, formula (11.42). The notation is related to the second variable $P = \mathbb{Z}_n$, as in that context, and $G \subset S_P$, which we omitted in the previous notation of $Z$ since $P$ was fixed. This index is a rational polynomial in the indeterminates $x_1, \ldots x_n$, with $n = \mathrm{card}(P)$; if each variable $x_i$ in $Z(G, P)$ is replaced by a value $\xi_i$, we write $Z(G, P \mid x_i = \xi_i)$. Then we have the following enumeration theorem. Write $[n/i]$ for the greatest integer $\leq n/i$, and $s_i = i(x_i + x_{2i} + \ldots + x_{[n/i] \cdot i})$.
Theorem 22 [172, Theorem 1] With this notation, we put
$$M_k = Z\left(G, \mathbb{Z}_n \,\middle|\, x_i = \tfrac{\partial}{\partial x_i}\right) Z\left(S_k, [1,k] \,\middle|\, x_i = e^{s_i}\right)\Big|_{x_i = 0} \tag{16.8}$$
for $1 \leq k \leq n$ (differential operators acting on the second factor, and then being evaluated at zero). Put $M_0 = 0$. Then we have (16.9)
and therefore
(16.10) A calculation by use of the computer program SYMMETRICA in [172] yields this table of mosaics for three important groups:

  k      T_12     TI_12   GL(Z_12)
  1         1         1          1
  2       179       121         87
  3      7254      3838       2155
  4     51075     26148      13730
  5    115100     58400      30121
  6    110462     56079      28867
  7     52376     26696      13835
  8     13299      6907       3667
  9      1873      1014        571
 10       147        96         63
 11         6         6          5
 12         1         1          1
This yields a total of 351773 $T_{12}$-mosaics, 179307 $TI_{12}$-mosaics, and 93103 $GL(\mathbb{Z}_{12})$-mosaics. This method can be refined to yield finer classifications of mosaics. We summarize one such refinement. If we are given a partition $Part(I)$ of $\mathbb{Z}_n$, not only the cardinality $\mathrm{card}(I) = k$ is invariant under isomorphism of the group $G$, but also the block type, i.e., the sequence $\lambda(Part(I)) = \lambda = (\lambda_1, \lambda_2, \ldots \lambda_n)$ where $\lambda_i$ is the number of charts in $I$ with cardinality $i$. So we clearly have two linear conditions $\sum_i \lambda_i = k$, $\sum_i i\lambda_i = n$ for $\lambda$. We again replace sets by functions. Given a type $\lambda$, take the set $Bi(\mathbb{Z}_n, [1,n])$ of bijections $f : \mathbb{Z}_n \to [1,n]$, and fix once and for all a partition $\Lambda$ of $[1,n]$ which is of type $\lambda$. Then the inverse image $f\Lambda = f^{-1}\Lambda$ of $\Lambda$ under a bijection $f$ defines a partition of $\mathbb{Z}_n$, and two such inverse images $f\Lambda, g\Lambda$ define the same partition iff $g \cdot f^{-1}$ stabilizes $\Lambda$. Let $H_\Lambda \subset S_{[1,n]}$ be the stabilizer of $\Lambda$. Then the orbit set $H_\Lambda \backslash Bi(\mathbb{Z}_n, [1,n])$ of the left action
$$H_\Lambda \times Bi(\mathbb{Z}_n, [1,n]) \to Bi(\mathbb{Z}_n, [1,n]) : (g, f) \mapsto g \cdot f$$
identifies to the set of partitions of type $\lambda$. To get the mosaics of this type, we add the well-known right action of $G$, and we obtain the canonical identification
$$Mos^G_{n,\lambda} \cong H_\Lambda \times G^{opp} \backslash Bi(\mathbb{Z}_n, [1,n])$$
of the set $Mos^G_{n,\lambda}$ of $G$-mosaics of type $\lambda$ with the orbits of the two-sided action of $G$ and $H_\Lambda$. Using this identification, we have
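Theorem 23 [172, Theorem 2] The number of $G$-mosaics of type $\lambda$ is given by the formula (due to de Bruijn)
$$\mathrm{card}(Mos^G_{n,\lambda}) = Z\left(G, \mathbb{Z}_n \,\middle|\, x_i = \tfrac{\partial}{\partial x_i}\right) Z\left(H_\Lambda, [1,n] \,\middle|\, x_i = i x_i\right)\Big|_{x_i = 0}. \tag{16.11}$$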
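For small numbers of blocks the mosaic numbers can also be obtained by direct orbit counting, which gives an independent check of individual table entries. The sketch below is a brute-force enumeration, not the Pólya and de Bruijn cycle-index method of Theorems 22 and 23. The group is given by its multipliers $a$ in the maps $x \mapsto ax + b$ on $\mathbb{Z}_n$: `[1]` for the translations, `[1, n - 1]` for translations and inversions. All names are hypothetical.

```python
from collections import Counter

def partitions_with_k_blocks(n, k):
    """Set partitions of {0..n-1} into exactly k blocks, as restricted growth strings."""
    def rec(prefix, used):
        i = len(prefix)
        if i == n:
            if used == k:
                yield tuple(prefix)
            return
        if used + (n - i) < k:            # not enough elements left to open k blocks
            return
        for label in range(min(used + 1, k)):
            prefix.append(label)
            yield from rec(prefix, max(used, label + 1))
            prefix.pop()
    yield from rec([], 0)

def transform(partition, a, b, n):
    """Image of a partition of Z_n under x -> a*x + b."""
    return frozenset(frozenset((a * x + b) % n for x in block) for block in partition)

def mosaics_by_type(n, k, multipliers):
    """Count orbits of k-block partitions of Z_n, grouped by sorted block sizes."""
    group = [(a, b) for a in multipliers for b in range(n)]
    seen = set()
    types = Counter()
    for labels in partitions_with_k_blocks(n, k):
        blocks = [[] for _ in range(k)]
        for x, label in enumerate(labels):
            blocks[label].append(x)
        partition = frozenset(frozenset(block) for block in blocks)
        if partition in seen:
            continue
        seen.update(transform(partition, a, b, n) for a, b in group)
        types[tuple(sorted(len(block) for block in partition))] += 1
    return types

two_block_translation = mosaics_by_type(12, 2, [1])
two_block_inversion = mosaics_by_type(12, 2, [1, 11])
```

With translations only, the two-block count is 179; with translations and inversions it is 121, split by block type as 1, 6, 12, 29, 38, 35 for the types (1,11), (2,10), (3,9), (4,8), (5,7), (6,6). The approach does not scale to the middle rows of the table, where the number of partitions runs into the millions and the cycle-index method is the practical route.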
Since the cycle index of a wreath product$^6$ of groups can be deduced from the indexes of its factors, and since the stabilizer is isomorphic to
$$H_\Lambda \cong \prod_{i=1}^{n} S_i \wr S_{\lambda_i},$$
the formula is controllable. The numbers of $TI_{12}$-mosaics of all types have been calculated by the SYMMETRICA program and yield this table:
6 See appendix C.3.2.
 λ              #      λ              #      λ                #      λ              #
 (12)           1      (1,11)         1      (2,10)           6      (1^2,10)       6
 (3,9)         12      (1,2,9)       30      (1^3,9)         12      (4,8)         29
 (1,3,8)       85      (2^2,8)       84      (1^2,2,8)      140      (1^4,8)       29
 (5,7)         38      (1,4,7)      170      (2,3,7)        340      (1^2,3,7)    340
 (1,2^2,7)    510      (1^3,2,7)    340      (1^5,7)         38      (6^2)         35
 (1,5,6)      236      (2,4,6)      610      (1^2,4,6)      610      (3^2,6)      424
 (1,2,3,6)   2320      (1^3,3,6)    781      (2^3,6)        645      (1^2,2^2,6) 1820
 (1^4,2,6)    610      (1^6,6)       50      (2,5^2)        386      (1^2,5^2)    386
 (3,4,5)     1170      (1,2,4,5)   3480      (1^3,4,5)     1170      (1,3^2,5)   2330
 (2^2,3,5)   3510      (1^2,2,3,5) 6960      (1^4,3,5)     1170      (1,2^3,5)   3500
 (1^3,2^2,5) 3510      (1^5,2,5)    708      (1^7,5)         38      (4^3)        297
 (1,3,4^2)   2915      (2^2,4^2)   2347      (1^2,2,4^2)   4470      (1^4,4^2)    792
 (2,3^2,4)   5890      (1^2,3^2,4) 5890      (1,2^2,3,4)  17370      (1^3,2,3,4) 11580
 (1^5,3,4)   1170      (2^4,4)     2325      (1^2,2^3,4)   8860      (1^4,2^2,4) 4463
 (1^6,2,4)    610      (1^8,4)       29      (3^4)          713      (1,2,3^3)   7740
 (1^3,3^3)   2610      (2^3,3^2)   6005      (1^2,2^2,3^2) 17630     (1^4,2,3^2) 5890
 (1^6,3^2)    424      (1,2^4,3)   8725      (1^3,2^3,3)  11623      (1^5,2^2,3) 3510
 (1^7,2,3)    340      (1^9,3)       12      (2^6)          554      (1^2,2^5)   2792
 (1^4,2^4)   2325      (1^6,2^3)    645      (1^8,2^2)       84      (1^10,2)       6
 (1^12)         1

16.2.3 Classifying Rational Rhythms and Canons
Summary. We give a short comment on the classification of rhythms and canons on rational onsets.
Following Vuza's context [552], we consider rhythms without any further parameters, except the onsets which we also restrict to the rationals. Formally, we work with zero-addressed objective local and global compositions on the space $Onset_{\mathbb{Q}}$ of rational onsets. In Vuza's theory, a (periodic) rhythm is a zero-addressed local rhythm $R = e^{[-\infty,\infty]p}G$ over a finite germ $G \subset 0@Onset_{\mathbb{Q}}$, and positive period $p$. Equivalently, $R$ is a zero-addressed objective local composition in $Onset_{\mathbb{Q}}$ which has a non-zero translation in its automorphism group, and which is locally finite. We call such a local rhythm a Vuza rhythm, and we suppose that it is non-empty since otherwise everything is known. Therefore, the translation automorphisms of $R$ form the group $e^{p\mathbb{Z}}$, $0 < p$ being called the period $Per(R)$ of $R$. The group $Trans$ of rational translations acts on the set $R_{Vuza}$ of Vuza rhythms, and defines the Vuza classes as being the translation isomorphism classes or orbits in $Trans \backslash R_{Vuza}$ of Vuza rhythms. In other words, we are working on the additive group of rationals, i.e., on the $\mathbb{Z}$-module $\mathbb{Q}_{\mathbb{Z}}$. The determination of Vuza classes runs as follows. Given a Vuza rhythm $R$, we write $R_u = \frac{1}{Per(R)} e^{-r}R$ for the unified rhythm with period one which is obtained by contraction after a
shift by the minimal non-negative element $r$ of $R$. We have $Per(R_u) = 1$, and $R_u$ is embedded. Clearly, two Vuza rhythms of fixed period $p$ have the same class iff their unified rhythms do so. Hence we may concentrate on classifying embedded rhythms with period one modulo translations. Since the $\mathbb{Z}$-module $\mathbb{Z}.R \subset \mathbb{Q}_{\mathbb{Z}}$ is finitely generated, it is monogeneous, and as it contains $\mathbb{Z}$, it must be of shape $\frac{1}{n_R}\mathbb{Z}$, with uniquely determined positive integer $n_R$. After a dilatation by the invariant $n_R$, we are left with the classification of zero-addressed objective local compositions $(R, \mathbb{Z})$ which
1. are generating: $\mathbb{Z}.R = \mathbb{Z}$,
2. have period $n_R$ under the group of integer translations.
But this classification is clearly equivalent to that of the objects in $ObLoc_{gen,\mathbb{Z}_n}$ which have trivial translation automorphisms (they are "maximal" in the terminology of Halsey and Hewitt [204]). Since these classes are the classes of $ObLoc_{gen,\mathbb{Z}_n}$, minus those which have non-trivial translation groups $H \subset T_n$, and the latter are recursively determined as the classes stemming from $ObLoc_{gen,\mathbb{Z}_n/H}$, we have a recursive procedure to enumerate the classes in question. In [204, §6], this has been explicated. The classification itself can be performed along the lines of our exposition for the finite case in section 11.3.3. We leave it as an exercise to go through that procedure with the translation group instead of the full automorphism group. Based on this local situation, Vuza [555] considers "unending rhythmic canons" which are defined as finite, non-empty subsets of a Vuza class, i.e., a non-empty finite set of Vuza rhythms of the same translation class. Equivalently, a Vuza canon is the interpretation $C^J$ of a zero-addressed objective local composition $C \subset 0@Onset_{\mathbb{Q}}$
$$\begin{array}{ccc} C^J & \longrightarrow & C \\ \downarrow & & \downarrow \\ D^J & \longrightarrow & D \end{array} \tag{16.12}$$
as above for tesselations. So this situation is a generalization to non-discrete isotypic coverings, and not necessarily covering the entire ambient module, but this module is $\mathbb{Q}_{\mathbb{Z}}$ instead of a finite abelian group, as discussed by Halsey, Hewitt, and Fripertinger. As with the local situation of one Vuza rhythm, the period of a Vuza canon is a well-defined positive rational number given by $Per(C^J) = Per(groundclass(C^J))$. Two Vuza canons $C^J, D^J$ are equivalent iff they have the same period $p$ and the contracted canons $\frac{1}{p}C^J, \frac{1}{p}D^J$ are equivalent canons of period one, so we may restrict our discussion to $C^J, D^J$ having this special period. Moreover, we may also suppose that the resulting classes are embedded, i.e.,
$C \subset \mathbb{Z}.C$, $D \subset \mathbb{Z}.D$. Since these classes are translation-equivalent, we obtain the same module $\mathbb{Z}.C = \mathbb{Z}.D = \frac{1}{n}\mathbb{Z}$, with the invariant resulting divisor $n_C = n_D = n$ as already discussed above in the local case. So, if we dilatate this situation by the resulting divisor, we have the generating local compositions $n.C \subset \mathbb{Z}$, $n.D \subset \mathbb{Z}$ with a period which divides $n$, and which are covered by local chart compositions which are all (strictly) $n$-periodic. So modulo this period, we have to look at interpretations $c^L, d^K$ of local compositions $c, d \subset \mathbb{Z}_n$ by equipollent translation-isotypic atlases $L, K$, which consist of $\mathrm{card}(L) = \mathrm{card}(I)$ (!) not necessarily generating local chart compositions which do not have non-trivial translations in $\mathbb{Z}_n$, i.e., are "translation rigid". So our classification problem reduces to the classification of "canons" in the cyclic residue groups $\mathbb{Z}_n$. By definition, these are interpretations of generating, zero-addressed objective compositions by translation-isotypic translation rigid charts, but the charts are not necessarily generating. And the classification goes by translations on the ambient space $\mathbb{Z}_n$ which carry over to the interpretations. The classification of such interpretations of local compositions in $\mathbb{Z}_n$ cannot be settled by the known resolution theorems from chapter 15 since the involved function modules are not projective. However, we have a number of numerical invariants, such as the (translation) class nerve and the cohomology groups. We do not know how far these invariants are away from being classifying. In a recent work, Fripertinger has also classified such canons, see [173]. A special family of Vuza canons are the regular complementary canons of maximal category (for short: RCMC-canons). In [557], Vuza presents an algorithm which enables the calculation of any tesselation $M = \coprod_{v \in V} e^v(U)$ of a group $M$ by translates of a subset $U$, where both $Trans(U)$ and $Trans(V)$ are trivial (a non-Hajós group, see definition 60).
In particular, he proves that six is the minimal number of voices of an RCMC-canon and nR = 72 is the shortest period. For a detailed discussion of Vuza's algorithm in the perspective of the theory of non-Hajós groups and the Minkowski conjecture, see [16]. The algorithm has been implemented in OpenMusic by Carlos Agon and Moreno Andreatta, see [17] for a complete list of solutions.
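The tiling condition described above can be checked mechanically. The following is a minimal sketch and not Vuza's algorithm: the function names `translation_stabilizer` and `is_tiling`, as well as the small example in $\mathbb{Z}_{12}$, are illustrative assumptions. It tests whether the translates of a subset U by the elements of V partition $\mathbb{Z}_n$, and whether U or V admits a non-trivial translation.

```python
def translate(subset, t, n):
    """Translate a subset of Z_n by t."""
    return frozenset((x + t) % n for x in subset)


def translation_stabilizer(subset, n):
    """All t in Z_n with subset + t = subset (plays the role of Trans)."""
    subset = frozenset(x % n for x in subset)
    return {t for t in range(n) if translate(subset, t, n) == subset}


def is_tiling(u, v, n):
    """True if the translates of u by the elements of v partition Z_n."""
    covered = [x for t in v for x in translate(u, t, n)]
    return len(covered) == n and set(covered) == set(range(n))


# Hypothetical example: a four-onset rhythm entering at onsets 0, 4, 8
# of a 12-beat cycle.
U = {0, 1, 2, 3}
V = {0, 4, 8}
print(is_tiling(U, V, 12))                     # True
print(sorted(translation_stabilizer(U, 12)))   # [0]
print(sorted(translation_stabilizer(V, 12)))   # [0, 4, 8]
```

In this example V is fixed by the translation by 4, so the tiling is periodic in one factor and is not of the translation-rigid type required above; the same two functions can be used to test candidate pairs (U, V) in larger groups such as $\mathbb{Z}_{72}$.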
16.3
Global American Set Theory
Summary. American Set Theory has developed a number of "combinatorial" structures which relate to interpretations of pitch class sets. We give a short account of this sector, as it was developed by Forte, Lewin, Morris, and Rahn.
As mentioned in section 11.5.2.2, part of what we called the American tradition focused on global instead of local musical properties. The local/global dichotomy is already present in Allen Forte's book [159], in which the second part is concerned with the so-called Pitch-Class Set Complexes, i.e., sets of sets associated by virtue of the inclusion relation. Inclusion relations are basically of two types: the K and the Kh relations. By definition, given a pc set class X, called the nexus, a pc set class Y is a member of the set complex about X iff Y can contain X or can be contained in X (or the corresponding for the complement of X), with some preliminary conditions on the cardinality of X and Y, which are:
1. the inequalities $2 < \mathrm{card}(X) < 10$ and $2 < \mathrm{card}(Y) < 10$,
2. the inequalities $\mathrm{card}(Y) \neq \mathrm{card}(X)$ and $\mathrm{card}(Y) \neq \mathrm{card}(-X)$, where $-X$ means the complement of X.
Two sets that belong to the set complex about a given pc set class X are said to be in the K relation. As pointed out by Forte, "the rule of set-complex membership yields aggregates of considerable size" [159, p.96]. A stronger condition, called the Kh relation, drastically reduces the number of sets which are related to a given set. The new family of sets is called the subcomplex of a given pc set class X. By definition it is the collection of all sets Y such that Y can contain or can be contained in X and can contain or can be contained in the complement of X. A preliminary remark concerns the distinction between literal and abstract relations. Literal relations hold among unordered collections of pitch classes, whereas abstract relations hold among collections of pc sets related by some equivalence relation (usually transposition and/or inversion). In the case of abstract inclusion, it may happen for example that a pc set class is included in its complement, a statement that would be absurd in the case of literal inclusion. Forte's K and Kh relations are examples of abstract relations. We agree with Robert Morris that "aside from reviews of Forte's book, there has been scant theoretical elaboration on the K and Kh relation in the literature" [382, p.175]. In order to give a new interpretation of Forte's set complexes, Morris suggests representing K and Kh relations as lists of set classes displayed in complementary pairs which are called SC-comp lists [382, p.284]. Before describing the lattice representation of the K and Kh complexes, we follow Morris's discussion of a preliminary relation that he calls the KI relation. By definition two set classes A, B are said to be in the KI relation iff $A \subseteq B$. The main difference compared to Forte's relations consists of the fact that K relates two pairs of set classes whereas KI simply relates two set classes.
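The definitions of abstract inclusion and of the K and Kh relations given above can be sketched in a few lines. This is an illustration under stated assumptions, not Forte's or Morris's own formulation: set classes are taken modulo transposition and inversion in $\mathbb{Z}_{12}$, each class is represented by one of its pc sets, and all function names are hypothetical.

```python
def images(pcs):
    """All transpositions T_n and inversions T_n I of a pc set in Z_12."""
    pcs = frozenset(pcs)
    result = set()
    for n in range(12):
        result.add(frozenset((p + n) % 12 for p in pcs))
        result.add(frozenset((n - p) % 12 for p in pcs))
    return result


def abstractly_included(a, b):
    """True if some T/I image of a is a literal subset of b."""
    b = frozenset(b)
    return any(image <= b for image in images(a))


def comparable(a, b):
    """a 'can contain or can be contained in' b (abstract inclusion)."""
    return abstractly_included(a, b) or abstractly_included(b, a)


def complement(pcs):
    return frozenset(range(12)) - frozenset(pcs)


def cardinalities_ok(x, y):
    """The two preliminary cardinality conditions on X and Y."""
    return (2 < len(x) < 10 and 2 < len(y) < 10
            and len(y) != len(x) and len(y) != len(complement(x)))


def in_k(x, y):
    """K relation: Y is comparable with X or with the complement of X."""
    return cardinalities_ok(x, y) and (
        comparable(y, x) or comparable(y, complement(x)))


def in_kh(x, y):
    """Kh relation: Y is comparable with X and with the complement of X."""
    return cardinalities_ok(x, y) and (
        comparable(y, x) and comparable(y, complement(x)))


OCTATONIC = {0, 1, 3, 4, 6, 7, 9, 10}   # a representative of 8-28
print(in_kh(OCTATONIC, {0, 3, 6}))      # diminished triad (3-10): True
print(in_k(OCTATONIC, {0, 1, 3}))       # 3-2: True
print(in_kh(OCTATONIC, {0, 1, 3}))      # 3-2: False
```

The trichord {0, 1, 3} is abstractly included in the octatonic collection but is not comparable with its complement, the diminished seventh chord {2, 5, 8, 11}, which shows how the Kh relation is more selective than the K relation.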
The KI relation is a partial ordering which can be displayed in a lattice representation called the "KI-inclusion lattice". The example in figure 16.4 shows the KI-inclusion lattice about the octatonic collection 8-28 considered as the set class of the pc set {0, 1, 3, 4, 6, 7, 9, 10}. Notice that the abstract inclusion relation has been independently theorized by the Rumanian composer Anatol Vieru by means of the concept of "modal structure". By definition, a modal structure is an equivalence class of a pc set under simple transposition, without taking inversion into account. In [545, ch.3] Vieru explains how to generalize the concept of inclusion to the case of modal structures. The greater selectivity of Forte's Kh relation compared to the KI relation is well explained by the concept of SC-comp list mentioned before. In this representation, each class is listed together with its abstract complement (the two set classes are separated by a slash). In the case of a self-complementary hexachord, e.g., the set class 6-5 corresponding to the pc set {0, 1, 2, 3, 6, 7}, this hexachord is listed alone. Forte's Kh relation may now be reformulated in terms of SC-comp lists. A couple of (abstract) complementary set classes A/-A is said to be the Kh nexus of the SC-comp list X if for all couples of complementary set classes Y/-Y of X the couple A/-A is in the Kh relation with Y/-Y. Note that this new definition drops Forte's original condition by assuming that a couple Y/-Y may be equal to the Kh nexus A/-A. Figure 16.5 shows the great power of abstraction of the Kh complex about the octatonic collection 8-28/4-28. For a discussion of some difficulties arising in the analytical application of Morris's K and Kh relations as well as of a possible generalization of the SC-list concept in relation to Forte's concepts, see [382, pp.285-288 and pp.299-304]. As mentioned in section 11.5.2.3, the main
properties concerning set complexes have been implemented in Ircam's visual programming language OpenMusic.

[Figure 16.4: Abstract inclusion lattice for the "octatonic scale". Rows of the lattice from top to bottom: 8-28; 7-31; 6-13, 6-23, 6-27, 6-30, 6-49, 6-50; 5-10, 5-16, 5-19, 5-25, 5-28, 5-31, 5-32; 4-3, 4-9, 4-10, 4-12, 4-13, 4-15, 4-29, 4-17, 4-18, 4-25, 4-26, 4-27, 4-28; 3-2, 3-3, 3-5, 3-7, 3-8, 3-10, 3-11.]

[Figure 16.5: The Kh complex around chord classes 4-28/8-28 in Morris' notation. Nodes of the diagram: 3-10/9-10, 4-28/8-28, 5-31/7-31, 6-27, 6-30.]

We conclude this section by briefly discussing a different aspect of global American theory, which is usually called transformational theory. The theoretical basis of this approach is contained in the second part of David Lewin's book [300] as an alternative approach to the Generalized Interval System (GIS). As suggested by Lewin, "Instead of starting with a GIS and deriving certain characteristic transformations therefrom, it is possible to start with a family of characteristic transformations on a musical space and derive a GIS structure therefrom" ([300, p.xiii]). Transformations produce a global network whose morphology is crucially dependent on the type of transformations which are modeled. Lewin discusses the case of Riemannian networks (i.e., networks involving so-called "Riemann transformations"⁷) and networks making use of serial and inversional transformations. As observed by Vuza ([553, p.277]), from a mathematical point of view, the GIS structure cannot be replaced by the concept of a space together with a simply transitive group of operations on it. Nevertheless, the interest in a transformationally oriented music analysis has become more and more evident, particularly in relation to the concept of 'contextual transformations', i.e., transformations which are sensitive to a particular aspect of a given musical context. This approach has been extensively discussed by Lewin in the four analytical essays of Musical Form and Transformation [301]. For an example of the musicological relevance of the contextuality of the transformational network approach, see Lewin's analysis of Karlheinz Stockhausen's Klavierstück III (pp. 16-67), a piece which appears to be generated via a transformational network around the pc-set {0, 1, 2, 3, 6}. In some sense, as suggested by Robert Morris in his review of Lewin's book, the discovery of various transformational relationships in a piece can be regarded as building an abstract space in which a given piece lives. For a more detailed discussion of the concept of abstract space in composition and improvisation see [381].

⁷ As mentioned by Richard Cohn in his recent survey on neo-Riemannian theory [93], Riemann transformations have been introduced in the music theory discipline by David Lewin in [298] and largely discussed in Lewin [300] as something acting on a consonant triad (i.e., a "Klang" after Riemann) in order to produce another "Klang". Examples of Riemann transformations are Lewin's MED function (i.e., the operation by which the transformed triad becomes the mediant of its MED-transform), the DOM operation (i.e., the transposition by 5 semitones) and some so-called "contextual inversions" as the operations taking any Klang into its relative or its parallel major/minor (see also our discussion on harmony in chapter 25).
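The notion of a contextual inversion can be made concrete for the two operations named in the text, the passage of a triad to its parallel and to its relative major/minor. The sketch below is an illustration only: triads are encoded as a hypothetical pair (root, quality) in $\mathbb{Z}_{12}$, and the function names are assumptions, not Lewin's notation. It checks that each operation is realized by an inversion $x \mapsto n - x$ whose index n depends on the triad it is applied to.

```python
def triad(root, quality):
    """Pitch-class set of a major or minor triad (a 'Klang')."""
    third = 4 if quality == "major" else 3
    return frozenset({root % 12, (root + third) % 12, (root + 7) % 12})


def invert(pcs, n):
    """Inversion x -> n - x in Z_12."""
    return frozenset((n - p) % 12 for p in pcs)


def parallel(root, quality):
    """Parallel major/minor: same root, opposite quality."""
    return root % 12, "minor" if quality == "major" else "major"


def relative(root, quality):
    """Relative minor of a major triad, relative major of a minor triad."""
    if quality == "major":
        return (root + 9) % 12, "minor"
    return (root + 3) % 12, "major"


def parallel_axis(root, quality):
    """Inversion index realizing `parallel`: it exchanges root and fifth."""
    return (2 * root + 7) % 12


def relative_axis(root, quality):
    """Inversion index realizing `relative`: it exchanges the two notes
    of the triad's major third."""
    if quality == "major":
        return (2 * root + 4) % 12
    return (2 * root + 10) % 12


# Verify both claims on all 24 major and minor triads.
for root in range(12):
    for quality in ("major", "minor"):
        klang = triad(root, quality)
        assert invert(klang, parallel_axis(root, quality)) == triad(*parallel(root, quality))
        assert invert(klang, relative_axis(root, quality)) == triad(*relative(root, quality))
```

Since the inversion index changes with the root and quality of the triad, neither operation is a single fixed inversion of $\mathbb{Z}_{12}$; this dependence on the object acted upon is what "contextual" refers to.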
16.4
Interpretable "Molecules"
Summary. The question of interpretability of molecular structures is very difficult. We make the problem clear and explain its musical aspect.
The classification of global molecules in the sense of section 15.1.3 is evidently based on the classification of global compositions over the real numbers. So the important question concerning interpretable molecules resides in the interpretability of the underlying global compositions, a question which is settled by the criteria of section 16.1, using the module of affine functions. In order to interpret a global molecule, we need to know how to construct a global space where an underlying local molecule could subsist. So we know that there is a big space where an underlying local composition exists. And we have a bilinear form and an orientation on each chart. At present, we do not know of criteria to decide whether these local Euclidean structures extend to an overall Euclidean space. For the composer, passing from local Euclidean to global Euclidean structures in his global compositions means that he deals with the question of not only playing a global composition in a space of instrumental parameters, but also of playing the local distances between sound events in this instrumental parameter space. So if the property of interpretability means that an abstract global composition 'idea' can be played at all, the property of interpretability of the molecular Euclidean structures means that the local event configuration shape given by distances and angles (such as pitch or time distances, sound color distances, and similar metric categories) of the abstract 'idea' can also be played in a single big instrumental parameter space. We are far from having settled this problem in theory or in practice.
Chapter 17
Esthetics and Classification
La musique est la corporification de l'intelligence qui est dans les sons. (Music is the embodiment of the intelligence that is in sounds.)
Józef Maria Hoene-Wroński (1778-1853)
Summary. Contrary to the seemingly bookkeeping character of the concept of classification, the subject is deeply tied to esthetics. This is explicated and illustrated with a detailed example.
This final chapter on global classification is a more practical and philosophical one, and it thereby clarifies the theoretical background of a demanding subject which will be dealt with later: performance theory. It stresses the basics of the intriguing problem of performance, viz, the obligation to shape a composition beyond the given score data. Why is this necessary? Isn't the score enough? What is added, why, and how? In this chapter, we shall not answer all these questions but only one: What is added? We shall try to make this clear on a very abstract level: resolutions of global compositions. But this offers a powerful perspective on the deeper structural rationales for shaping performance. We hope that we can convey the essential connection between performance and classification: both deal with understanding, the former more on the communicative level, the latter more on the semiotic level. So they are indebted to each other as communication has to control its contents whereas contents should be put in evidence in the making of music.
17.1
Understanding by Resolution: An Illustrative Example
Summary. Before concluding this chapter on classification with a more programmatic and philosophical outlook, the theory and its esthetic implications are illustrated by the detailed analysis of a small two-part composition.
387 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
Our example deals with two easy two-voice compositions, and we want to analyze what it means to understand these pieces, each in itself and comparing them to each other: What is their commonality, what is the difference? Are they isomorphic? Are they two expressions of one and the same compositional idea? To put everything into a completely familiar context without cumbersome mathematical abstracta, we select the ring $\mathbb{R}$ of real numbers and modules over $\mathbb{R}$, i.e., real vector spaces and their affine morphisms. This eases the geometry and gives us an occasion to visualize the classification ideas in ordinary three-space. For the music this means that we may consider real-valued onsets, pitch, duration, loudness, etc., and that we do not consider pitch classes but pitch as such, including microtonal intervals and infinitely subdivided time. Intuitively this means that very 'elastic' parameter conditions are admitted.
[Figure 17.1: The compositional germ P (top system) gives rise to two compositions X (middle system) and Y (bottom system), by use of contrapuntal replicates: translation and inversion.]

The example (composed by a fictitious composer) from which we start is a two-voice zero-addressed local composition $P \subset 0@OnPi$ from elementary note-against-note counterpoint, see figure 17.1, top system. Duration and other parameters are of no relevance now and will be added later. Suppose that our composer now extends this germ by two procedures, see figure 17.1, second and third system. In the first, he unites $P^0 = P$ with successive translates in time: Consider $P^1 = e^{(3,0)} P$, $P^2 = e^{(3,0)} P^1, \ldots, P^{k+1} = e^{(3,0)} P^k$, and then the union $X = \bigcup_{k=0,1,\ldots,r} P^k$. In the second, he unites $C^0 = P$ with successive translates in time, but every new translate is also inverted in pitch by I, the inversion at middle c: $C^1 = e^{(3,0)} \cdot I C^0$, $C^2 = e^{(3,0)} \cdot I C^1, \ldots, C^{k+1} = e^{(3,0)} \cdot I C^k$, and we get the union $Y = \bigcup_{k=0,1,\ldots,r} C^k$. The composer would like to understand the essential difference between X and Y.
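The two procedures can be sketched as follows. This is a minimal illustration under assumptions: events are (onset, pitch) pairs, pitch is measured in semitones relative to middle c so that the inversion I is pitch negation, and the six-note germ `GERM` is a hypothetical stand-in, since the actual notes of figure 17.1 are not reproduced here.

```python
def translate(points, d_onset, d_pitch):
    """The translation e^(d_onset, d_pitch) applied to a set of events."""
    return frozenset((t + d_onset, p + d_pitch) for t, p in points)


def invert(points, axis=0):
    """Pitch inversion at `axis` (0 stands for middle c)."""
    return frozenset((t, 2 * axis - p) for t, p in points)


def replicate(germ, r, with_inversion):
    """Return the list of parts [P^0, ..., P^r] (or [C^0, ..., C^r])."""
    parts = [frozenset(germ)]
    for _ in range(r):
        previous = invert(parts[-1]) if with_inversion else parts[-1]
        parts.append(translate(previous, 3, 0))
    return parts


# Hypothetical germ: three successive two-note intervals at onsets 0, 1, 2.
GERM = {(0, 0), (0, 4), (1, 2), (1, 5), (2, 4), (2, 7)}
r = 3
x_parts = replicate(GERM, r, with_inversion=False)
y_parts = replicate(GERM, r, with_inversion=True)
X = frozenset().union(*x_parts)
Y = frozenset().union(*y_parts)
print(len(X), len(Y))  # 24 24
```

Both unions have the same number of events, and every second part of Y coincides with the corresponding part of X because two successive inversions cancel; the difference between X and Y therefore only shows up in the odd-numbered translates.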
According to the Yoneda philosophy, we first examine the discrete interpretations of X, Y and cannot find any difference since both just enumerate to $6(r + 1)$ points. The next interpretation by use of vertical interval 'slices' does not show essential differences since the corresponding interval events in X and Y are just either the same or inverted copies, so these interpretations are isomorphic with each other. In the next refinement, $X^C$, $Y^D$, the composer lays the charts of couples of successive contrapuntal intervals, see figure 17.2. So each of the r charts has four events. In $X^C$ as well as in $Y^D$, the first two charts are the same; they have these event numbers: $X_1 = Y_1 = \{1, 2, 3, 4\}$, $X_2 = Y_2 = \{3, 4, 5, 6\}$. But the third charts are different: $X_3 = \{5, 6, 1', 2'\} \neq Y_3 = \{5, 6, 1^* = I(1'), 2^* = I(2')\}$. After that, things get repeated: $Y_4$, $Y_5$ are the inversions of $X_4$, $X_5$, respectively, etc.

[Figure 17.2: The interpretations shown here refer to charts which represent the contrapuntal sequence of two successive intervals per chart.]

In the next step, the composer tries to boil down these compositions to their common 'germ' F. In order to do so, he considers two point sets
$F_X = \{1_X, 2_X, 3_X, 4_X, 5_X, 6_X\}$, $F_Y = \{1_Y, 2_Y, 3_Y, 4_Y, 5_Y, 6_Y\}$ consisting of six points each. According to the above contrapuntal interpretations, the composer constructs two global compositions $F_X$, $F_Y$ on these sets by the following charts for $F_X$ (see left part of figure 17.3):

$X_1 \to F_{X,1} = \{1_X, 2_X, 3_X, 4_X\} : i \mapsto i_X$
$X_2 \to F_{X,2} = \{3_X, 4_X, 5_X, 6_X\} : i \mapsto i_X$
$X_3 \to F_{X,3} = \{5_X, 6_X, 1_X, 2_X\} : 5 \mapsto 5_X,\ 6 \mapsto 6_X,\ 1' \mapsto 1_X,\ 2' \mapsto 2_X$

which are taken from the interpretation $X^C$. So this is a global composition which is isomorphic to the interpretation $F^A$ of F by the atlas

$A = \{\{1, 2, 3, 4\}, \{3, 4, 5, 6\}, \{5, 6, 1', 2'\}\}$.
[Figure 17.3: These two global compositions are abstract models of the composition principles which give rise to the contrapuntal interpretations as shown in figure 17.2. The left one is interpretable whereas the right one is not.]

On the other hand, we construct a global composition $F_Y$ by this atlas (see right part of figure 17.3):

$Y_1 \to F_{Y,1} = \{1_Y, 2_Y, 3_Y, 4_Y\} : i \mapsto i_Y$
$Y_2 \to F_{Y,2} = \{3_Y, 4_Y, 5_Y, 6_Y\} : i \mapsto i_Y$
$Y_3 \to F_{Y,3} = \{5_Y, 6_Y, 1_Y, 2_Y\} : 5 \mapsto 5_Y,\ 6 \mapsto 6_Y,\ 1^* \mapsto 1_Y,\ 2^* \mapsto 2_Y$

where the first two charts are defined as before whereas the third is different: The points $1^*$, $2^*$ are identified via inversion to the points 1, 2, therefore we have set $1^* \mapsto 1_Y$, $2^* \mapsto 2_Y$ on the third chart. These global compositions should be viewed as being formalizations of the compositional ideas in X and in Y. Now, although the class nerves $CN(F_X)$ and $CN(F_Y)$ are equal, these compositions are essentially different. Whereas the first is interpretable, the second is not; it is in fact the composition which we had already discussed in example 31. So these compositions are not isomorphic, and the compositional ideas are essentially different, though subtly differentiated from each other. The first composition can be played in its integrity on one space of musical coordinates, even in two-dimensional space, by construction. The other can never be played in this way. It is a somehow abstract object.
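That the combinatorics of the two coverings cannot tell the compositions apart may be illustrated by computing the plain (unweighted) nerve of each covering. The sketch below is an assumption-laden simplification: it ignores the isomorphism classes of the charts that enter the class nerve CN, it encodes points as strings such as `"1X"`, and the function name `nerve` is hypothetical.

```python
from itertools import combinations


def nerve(charts):
    """Simplices of the nerve of a covering: every non-empty family of
    chart indices whose charts share at least one point."""
    names = sorted(charts)
    simplices = []
    for size in range(1, len(names) + 1):
        for family in combinations(names, size):
            common = set.intersection(*(set(charts[c]) for c in family))
            if common:
                simplices.append(family)
    return simplices


# Coverings of F_X and F_Y by their three charts (indices 1, 2, 3).
F_X = {1: {"1X", "2X", "3X", "4X"},
       2: {"3X", "4X", "5X", "6X"},
       3: {"5X", "6X", "1X", "2X"}}
F_Y = {1: {"1Y", "2Y", "3Y", "4Y"},
       2: {"3Y", "4Y", "5Y", "6Y"},
       3: {"5Y", "6Y", "1Y", "2Y"}}

print(nerve(F_X))
print(nerve(F_X) == nerve(F_Y))  # True
```

Both nerves consist of three vertices and three edges without a 2-simplex, i.e., the boundary of a triangle. The difference between the two compositions therefore lies in the chart maps, in particular the inversion used on the third chart of $F_Y$, and not in the overlapping pattern of the charts.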
[Figure 17.4: By means of differentiation of durations, the non-isomorphic abstract compositions become isomorphic.]

It is a good exercise to ask for the subtle reasons why the second composition cannot be interpreted. The calculation of the function modules which was carried out in example 31 showed that the reason for the impossibility of separating functions is the parallelism of the lines drawn through couples of points in the charts. We have parallelisms $(1_Y, 2_Y) \parallel (3_Y, 4_Y)$, $(3_Y, 4_Y) \parallel (5_Y, 6_Y)$, and $(5_Y, 6_Y) \parallel (1_Y, 2_Y)$. This entails ample dependencies of functional values. The argument to blow up this flat world is to shift points into mutually more independent positions. Geometrically, this means that one should try to displace the four points on a chart to a general position, i.e., such that they span a non-degenerate tetrahedron. But how should our composer accomplish this geometric task? One solution is to add more parameters to the given two-dimensional situation. This means, for example, to add duration, and to view the given compositions as living in a hyperplane of the product space $OnPi \oplus Duration$ with duration as third component, the hyperplane being defined by a constant duration, a whole note, say. This is in fact what we were drawing in the above figures. To obtain a more general position of the chart events, we just have to alter the relative duration of one event: all notes being set to quarters, the first middle c can be given duration 1/2. We show this change in figure 17.4. Now, the three charts are in fact isomorphic to each other: the first to the first, the second to the second, and the third to the third, because any two standard compositions are isomorphic! The trick simply is to add enough parameters in order to accomplish the goal of being in a state to play the piece in a single big parameter space. But clearly, the new compositions are no longer isomorphic to the old ones; they do, however, project onto the given original samples.
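The effect of altering one duration can be checked numerically. In the sketch below, events are hypothetical triples (onset, pitch, duration); the concrete coordinates are assumptions chosen for illustration and are not read off the figures. Four events are in general position exactly when the three edge vectors from one of them have non-zero determinant, i.e., when they span a non-degenerate tetrahedron.

```python
from fractions import Fraction


def det3(a, b, c):
    """Determinant of the 3x3 matrix with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def in_general_position(chart):
    """True if the four events of a chart span a non-degenerate tetrahedron."""
    p0, p1, p2, p3 = chart
    edges = [tuple(q[i] - p0[i] for i in range(3)) for q in (p1, p2, p3)]
    return det3(*edges) != 0


quarter = Fraction(1, 4)
half = Fraction(1, 2)

# Hypothetical first chart: two successive intervals, all notes quarters.
flat_chart = [(0, 0, quarter), (0, 4, quarter), (1, 2, quarter), (1, 5, quarter)]
# The same chart after giving the first middle c the duration 1/2.
lifted_chart = [(0, 0, half)] + flat_chart[1:]

print(in_general_position(flat_chart))    # False
print(in_general_position(lifted_chart))  # True
```

With a constant duration all four events lie in one hyperplane, so the determinant vanishes; changing a single duration suffices here because the remaining three events are not collinear.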
[Figure 17.5: The resolution of the non-interpretable abstract model $F_Y$ is obtained by further differentiation of durations such that each chart becomes a configuration of events in general position: In fact, every chart shows a non-degenerate tetrahedron of points.]

More systematically speaking, one could have put any chart configuration into general position, i.e., the second chart, too, by just making an eighth note from the second middle c; this is shown in figure 17.5. This figure of the resolution $\Delta F_Y$ of $F_Y$ shows that we have an evident bijection of note events $\Delta F_Y \to F_Y$ as given in the general theory of resolutions, but the positions of the events in the different charts project from a general to a special position. But where has the special position been hidden in the resolution structure? It is packed in the retract of the function module of the original composition. This retracted module tells the composer whether and how global coordinate functions for instrumental rendering are at his disposition. We shall shortly come back to this point in the discussion of Varèse's approach to composition.
17.2
Varèse's Program and Yoneda's Lemma
Summary. Implicitly, the part of Yoneda's lemma dealing with variations of perspectives is of primordial importance in compositional concepts. In Edgar Varèse's programmatic writings, a thoroughly geometric approach to the Yoneda philosophy is sketched. It accomplishes the classical variational principle in composition; we give an overview of this central connection between modern mathematics and music.
To begin with, let us recapitulate the impact of the resolution of a (commutative) global composition on the esthetics of music. The resolution $\Delta G^I$ of a global composition $G^I$ yields points which are in general position in every chart of the atlas. Moreover, the resolution is interpretable, and $G^I$ can be reconstructed from the retracted function module in the resolution. Since the resolution's nerve is isomorphic to the original nerve, and since the resolution projects bijectively onto the original composition, no note event and no overlapping relation of charts is destroyed in the resolution. So we essentially have the same set of notes, except that they were enriched by a number of parameters which allow us to place these events in optimal relative position. So the resolution can, in principle, be played by physical instruments, and it can also, in principle, be played such that the old idea can be heard since old parameters can be preserved. However, the freedom of choice for an optimal realization of the resolution (which after all is only determined up to isomorphism) is also mandatory since a good performance on the physical level has to cope with a number of additional conditions of human cognition. In fact, the auditory system, the instrumental skill of an artist, the material possibilities, the time frame at disposition, etc., all these conditions impose serious boundary values on the possible parameter values which can be accepted in a good realization. For example, these parameters have to reflect the syntactical structure of the composition, and not only the resolution's general position context. So time must be given a delicate role in the parametrization of events. And the distinction of events must also be optimized when the unfolding of a performance in time is to be a good communication stream. We shall come back to these subtleties later in the context of performance theory.
Nonetheless, the resolution classification technique yields very important necessary conditions for a comprehensible parameter setting in performance. What could now be, after all, the program of classification? Its core value is that it deals with understanding musical works. And we should stress that our concept of a musical work is not the narrow one which restricts to those individual opera which, at least in Europe, started to emanate in the Renaissance. It includes as well general musical corpora such as scales, systems, everything that can be represented by means of global compositions, and, in the limit, any denotator if we admit the most general topos of this theory. From the precise parametric description of a work and of its ambiguities, this work appears as a point configuration in a more or less complex space (or form, if you prefer the denotator terminology). However, this configuration is already a determinate perspective which shows a multitude of relations among its ingredients. It is the composer's perspective (now including an abstract 'composer' or creator of a general musical structure like a scale). For example, the choice of tonality, instrumentation, tempo, etc., are points of view which may or may not pertain to the composition; this is a question of the epoch of creation. But their character can undoubtedly be subject to variation. Among others, here we do address the question of historical instrumentation for early music. In order to understand the relations among different parts of a composition, and even to simply recognize them, a change of the given perspective is mandatory. If a never seen object must be inspected, what should we do? You walk around it. This is the most common version of Yoneda's lemma. The analogy to cartography is straightforward: The natural perspective of the landscape in which we live does not coincide with the perspective which best meets our need for orientation. To reach this goal, we preferably build maps which show the landscape from an infinitely far vertical point.
The same happens to music. You play a piece in slow motion 'from very near', in a zoomed optics, a complex chord is arpeggiated, i.e., viewed from a skew angle, and so forth. This idea of variation of the perspective has also been integrated in the compositional thinking of the 20th century. We want to illustrate this remarkable fact by a citation from Edgar Varèse's comments on his composition "Intégrales" [542, p.67]:
Intégrales was conceived for a spatial projection. (...) Whereas in our musical system we arrange sounds whose values are fixed, I sought a realization in which the values are continually changed in relation to a constant. (...) To grasp this better, let us transfer these ideas to the visual domain, since the eye is much quicker and more practiced than the ear, and consider the changing projection of a geometric figure onto a surface, where figure and surface both move in space, but each at its own speeds, which are variable and different, shifting and rotating. The instantaneous form of the projection is determined by the relation between figure and surface at that instant. But if one allows the figure and the surface to have their own motions, it becomes possible to obtain an extremely complex and seemingly unpredictable image with the projection. These qualities can be increased further by varying the form of the geometric figure as well as its speeds. (...) I hope to have at my disposal, within a short time, an apparatus that will make it possible to give a spatial relief. If only for the sake of proof, I would be interested in realizing Intégrales once in the way it was originally conceived.
In 1960 Mauricio Kagel transferred these principles to paper strips and discs of the score for "Transición II" for piano, percussion, and two tapes. Varèse's idea basically is a remake of the classical variation principle. Bach's Goldberg Variations (BWV 988), Beethoven's Diabelli Variations (op.120), or Webern's Variationen für Klavier (op.27) are compositions in this spirit. The subject is always an artistically interwoven change of perspectives of a theme: the variation in the parts of the theme and their relations.

Principle 7 Variation as a principle of musical shaping is nothing else than the identification of an idea, such as the theme, as a sum of its perspectives.

Especially for Webern a composition is a cellular organism, a connected manifold (in the naive sense) of transformations, of ever changing perspectives, of metamorphoses of a single cellular germ (in the sense of Goethe), which in fact is Schönberg's dodecaphonic series. In front of this historical background the classification problem of global compositions, together with its central process of resolution, appears as a canonical program. In particular, the nerve of a resolution, a concept related to that of a "cell complex" from algebraic topology, reminds us of the cellular organism alluded to by Webern. And the projections which Varèse describes in a visionary fashion show a surprisingly similar geometry to the projections of a resolution onto the original composition, projections which are distinguished in that they project a general position onto specializations. Finally, the variation of these projections corresponds to the variation of the modules of affine functions, i.e., the variation of the compositions which
are distinguished from each other via their retracted function modules on one and the same resolution. But the variational principle is not only a compositional strategy; it equally, or even more dramatically, applies to the performance level. Performance (we shall discuss the issue in depth later) deals with a transformation from the mental score space to the physical space of the acoustic realization. But this transformation locally is a deformation of the "rigid" parameter values set out on the score. Why should the artist deform a perfect opus? Wouldn't this be blasphemy or at least a tremendous lack of respect? No, the added value of such a deformation is not a destruction of given structure; it is a subtle change of parametric perspectives which lets the listener still recognize the written relations, but on top of that puts configurations into general position such that their generic, or better: resolved, structure becomes "visible" on the auditory level, to restate it in the wording of Varèse.
Principle 8 So the structural rationale of performance is a strategy of small changes of the composer's perspective to make the resolution of the composition audible and thereby ease understanding of the underlying composition class (in the strict sense of classification).
Chapter 18
Predicates
Die Welt ist alles, was der Fall ist. (The world is everything that is the case.)
Ludwig Wittgenstein [580]
Summary. Denotators are purely mathematical structures which do not specify "what is the case" and what is not. This chapter deals with the existence problem of music-related objects in contrast to mathematical 'fiction'. This amounts to loading mathematical constructs with an additional semiotic signification process in order to express "which denotators are the case". These existence specifications instantiate an interface between mental potentiality and historical actuality. It reveals two fundamentally different existentialities, termed "textual" and "paratextual" signification, respectively. The former involves predicates defined by classical extension over specific denotators, whereas the latter transcends pure extensionality and thus points into domains of open semiosis. Both textual and paratextual predicates are essential enrichments of mathematical constructs: The platonic ontology is thereby supplemented by a differentiation which cannot be reduced to pure "mentality". The variety of textual predicates follows certain construction rules of logical and geometric nature and is founded in a triply typed set of "atomic" predicates of (a) mathematical, (b) musical, and (c) deictic types.
18.1
What Is the Case: The Existence Problem
Summary. Music and musicology cannot refrain from distinguishing fiction from facticity. There are three reasons for this constraint: (1) historically, works, interpretations, and performances are in a substantial interaction which deals with what is the case; (2) in view of interactive discourse on music, actual perspectives of involved instances are not contingent but proper to the results; (3) the documentation of facts related to music is part of their accessibility, no less than physical bodies share and (by Einstein's gravitation theory) even shape a site in the universe of space-time.
397 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
In mathematics, once a domain of objects has been consistently defined, there is no question that they are all available or 'exist' without further differentiation. Once prime numbers are defined, their instances are just there; there is no question of explicitly distinguishing those we have already dealt with from the others¹. The fundamental difference between mathematical set or category theory and denotator theory is that the 'existence' of denotators with respect to music or musicology is not equivalent to their purely mathematical existence. Rather, we must consider a specific type of allocation or instantiation, be it in a computer's memory or in an intellectual framework, such as the composer's mind or a given composition. The point is that, within a fixed discourse, we do not permit automatic access to denotators. This is a rigorous discipline about what is given and what is only possibly given, a fundamental feature in musical thought or musicological analysis. For example, in the diachronic evolution of the music system or within a specific material allocation within an information system², this may be relevant. In particular, if one deals with musicological analysis, it is essential to make precise the universe of objects one deals with, be it for classification or for ad hoc reference.

Principle 9 We do not deal with a priori limitation of the available 'material', but with a 'declaration duty' of what we are allowed to refer to.

Denotators share a kind of mathematical existence. Whether we view them in a set-theoretical or category-theoretical perspective, they participate in a layer of abstract existence. In the context of music semiotics, it becomes relevant to couple semiotic specification in the sense of Hjelmslev's glossematic [227] with existentiality: Denotation and connotation are ontologically sensitive concepts. They not only reflect ontology but are possibly responsible for its very production.
This contrasts musicology with physics: The latter deals with a fairly objective subject to be described by mathematics, whereas the subject of musicology is far more human nature and as such does not only exist but is essentially created. Therefore this ontological enrichment has to be dealt with explicitly and in a differentiated way. In order to differentiate denotator ontology from the musicological one, the denotator system is viewed as the denotative layer of a supersystem whose signs are called (musical) predicates. Here, the denotators play the role of significants, whereas the significates are instances of 'specifically' musicological meaning. This is the model to be developed in the sequel³.
18.1.1
Merging Systematic and Historical Musicology
Summary. Predicates prepare the field for a reconciliation of systematic and historical musicology.

¹ However, for computer mathematics, the prime numbers which have been dealt with are definitely more concrete than the others.
² A database, for example.
³ It is based upon the insight of music semiotics [361] that musicological meaning is a multilayered fact to be successively constructed via Hjelmslev chains of denotator/connotator systems.
We cannot refrain from stressing how profoundly the distinction between denotators and predicates affects the very structure of musicology. Following a strong tradition [103], this is commonly divided into systematic and historical musicology. While historical musicology seems to deal with the history and historicity of music, systematic musicology seems to deal with musical systems. However, it is well known that history is also the history of something, of some systems, whereas systems are also historic in nature. The contrast to physics, for example, is that the history of physics is the history of the knowledge of one and the same subject: nature. So the physical systems evolve as knowledge bases about one and the same thing. Their coherence is a priori guaranteed by the identity of nature and by the invariance of physical laws in time. In contrast (and this is the argument which is usually given), music's identity is itself a historic variable since music is a constantly evolving human creation. The argument is however erroneous since music's identity has never been defined or even understood as a time-independent entity. In other words, the diachronic axis of music is essential, but not as a disconnecting instance; on the contrary, coherence of the historical development is part of the overall system. Like language, music is a semiotic system with diachronic and synchronic coordinates and laws which pertain specifically to dia- and synchronic locations. But the study of time-dependent systems is not essentially different from the study of time-independent systems such as physics⁴. In other words: The fact that laws of music are susceptible to be time-dependent is no reason to believe that there are no laws, that there is no all-embracing system beyond a valid chain of stories. There are several methods to turn this pseudo-dichotomy of historic and systematic musicology into science.
One way is to seek time-invariant laws which model historic development as such in specific areas. Such an approach is undertaken in the theory of increasing paradigmatic groups as a significant parameter for historicity, see section 11.7.2. This approach resembles the second law of thermodynamics which, for a determined set of physical systems, defines a time arrow towards the future by postulated increase of entropy. This first approach is somewhat naive since it may happen that the dynamics of knowledge acquisition requires a conceptual reengineering. For example, the concept of reference frame in physics had to be revised because of the relativization of time in special relativity. The relativistic time concept had to be attached as a proper attribute to each reference frame, and was no longer an all-embracing, divine quality in the sense of Newton. Whereas in physics, epistemological events of this size are rather rare, and in fact appear as veritable scientific revolutions, music is in a state of incessant big bang: Creativity constantly adds to the fundamental data and knowledge. In other words, dynamical concept and theory handling is much more vital to musicology than to physics. Therefore a second method consisting of dynamic concept frameworks and explicit handling of the incessant creative impact is required. Dynamic concept frameworks have been developed in the previous theory of forms and denotators. The impact of incessant big bang creativity is handled by the theory of predicates.

More generally speaking, historical sciences are likely to be absorbed by dynamical knowledge management in the following sense. Understanding history means conceptual and data control of the world's diachronic dynamics⁵. Professional data control is no longer feasible without powerful data base management systems, and conceptual control cannot survive if it is not part of a comprehensive concept management system. Otherwise, historic science runs out of self-control or downsizes to well-known variants of ideologies and "weltanschauungen". In this sense, innovative musicology could play a role of a prototype for future historical science.

⁴ Even physical laws do not persist eternally; it is argued that in the very beginning of the big bang, physical laws were rightly established; in fact at physical zero time, all laws break down.
⁵ Wittgenstein's dictum heading this chapter should then be refined diachronically: "Die Welt ist alles, was mit der Zeit der Fall ist."
18.2
Textual and Paratextual Semiosis
Summary. Review of Kofi Agawu's and Roman Jakobson's introversive and extroversive semiosis. Truth values and meaning. The problem of open semiosis. Classifying open semiosis: processuality, synchronic and diachronic pointers of competence and tradition. Introducing the formal system of a predicative semiology.
To get off the ground with the discourse on musicological meaning, we review Agawu's work on music semiology [8] which builds on the tradition of Jakobson's [245, 246] research in modern poetology. Agawu follows Jakobson in distinguishing introversive vs. extroversive semiosis. Introversive semiosis is production of meaning on the basis of intratextual signs. Agawu calls them the universe of structure. Examples: Schenker's "Ursatz" (beginning/middle/ending), Ratner's model of harmonic functions, and, of course, all elementary signs for metric, rhythmical, motivic, harmonical, etc. structures. Introversive semiosis can be said to be production of textual meaning because the text is the relevant reference level for introversive semiosis. Extroversive semiosis involves signs which transcend the system of musical signs in the narrow sense of the word. Agawu calls them "the universe of topics". Topics are signs which have a significate beyond the text. The author gives 27 examples within the frame of his analysis (reaching from 1770 to 1830, i.e., embracing the first Viennese school):

alla breve, alla zoppa, amoroso, aria, bourrée, brilliant style, cadenza, Empfindsamkeit, fanfare, French overture, gavotte, hunt style, learned style, Mannheim rocket, march, minuet, musette, ombra, opera buffa, pastoral, recitative, sarabande, Seufzermotiv, singing style, Sturm und Drang, Turkish music.

These signs have a surface which has a regular textual meaning involving particular groups of sound events with a particular structure. But the deeper meaning, in some sense a connotative significate, reaches beyond the text, and understanding it requires historical and/or music(ologic)al competence. It can only be adequately realized by the competence of the listener/musician and his/her idiomatic expertise. Agawu's access to musical meaning is characterized by a dichotomy between precise textual and denotative semiosis and some kind of black box semiosis referring to an exterior context. This music(ologic)al context is an open system and should be treated as such. With regard to it, there is open-ended semiosis dealing with what we call paratextual meaning. The distinction between textual and paratextual meaning is motivated by the fact that meaning is never formally closed and must therefore be treated in an open way. When building the predicate concept, one has to make the semiosis that takes place on the denotator level more open with respect to
• diachronicity: the set of predicates may change with time (cultural epochs);
• synchronicity: the set of predicates depends on the spatial (cultural) context;
• incompleteness: meaning may be incomplete, provisional or overtly undetermined.

Let us give an illustration: The diachronic extension of the repertoire of compositions determines the experimental material upon which music analysis has to be executed, tested and developed. It is not time-invariant. For instance, the Tristan chord is far more than just one item within an abstract list of chords. The synchronic analysis of ethnomusicological data is heavily dependent upon the cultural region from where they are applied. The extroversive meaning of data (such as "fermata") may be incomplete (from the point of view of performance) and one has to find a way of handling it formally. We do not yet know 'everything' a sign is supposed to convey, but we have to handle it within its context. In the life of musicians and musicologists, incomplete semiosis is the rule, not the exception! To control this variety of sign processes it is necessary to set up an adequate system of signification mechanisms.
18.2.1
Textual and Paratextual Signification
Summary. Truth values and meaning; extension vs. intension.
In order to distinguish potential from actual instances of denotators in the spirit of Agawu's universe of structure, it is necessary to be in a position to tell which instances "are the case" and which are not. This is why we stipulate that predicates are related to sets of denotator instances which do exist. For example, if we describe piano notes by special types of denotators, the predicate "piano notes of concerto XY" would cover all these denotators for the notes of the score XY attributed to the piano. We may say that this predicate creates meaning by extension. Meaning by extension can be distinguished from meaning by intension. This is what Agawu alludes to when introducing topics. It turns out that intensional meaning involves a much richer type of semiosis. The following list enumerates some types of open semiosis (without claim of completeness) :
• Semiosis as a process: To begin with, semiosis is not a state but a process so that richness of meaning increases or decreases as a function of system time. What could be an intensional meaning at a given moment can be transformed into an explicit extensional meaning after additional information was added, see [122].
• Synchronic pointers: competence. Meaning of the intensional type may consist in the reference to another instance which 'knows' more about the music in question. This is a pointer to a competence exterior to the structural data. For example: The expert teacher in piano music knows how a specific articulation sign has to be realized on a grand piano.
• Diachronic pointers: tradition and progress. Another pointer type of intensional character is directed towards historical topoi or towards paradigms of progress. As an example, meaning may be anchored in historical style knowledge.
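The idea of meaning by extension, and of semiosis as a process in which more becomes "the case" over system time, can be sketched in a few lines of Python. This is only an illustrative toy under stated assumptions: the classes `Note` and `Registry`, their fields, and the predicate `piano_note` are hypothetical names introduced here, not part of the theory.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Note:
    """Toy stand-in for a note denotator (hypothetical fields)."""
    onset: float
    pitch: int
    instrument: str


@dataclass
class Registry:
    """The denotators that 'are the case' at the current system time."""
    declared: set = field(default_factory=set)

    def declare(self, d):
        # 'Declaration duty': nothing is available unless it was declared.
        self.declared.add(d)

    def extension(self, test):
        """Extension of a predicate: the declared denotators passing `test`."""
        return {d for d in self.declared if test(d)}


def piano_note(d):
    return d.instrument == "piano"


score_xy = Registry()
score_xy.declare(Note(0.0, 60, "piano"))
score_xy.declare(Note(0.0, 48, "cello"))
before = score_xy.extension(piano_note)

# Semiosis as a process: the extension grows once more is declared.
score_xy.declare(Note(1.0, 64, "piano"))
after = score_xy.extension(piano_note)
```

A mathematically possible note that was never declared, such as `Note(2.0, 67, "piano")`, does not belong to the extension, although it would pass the test.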
18.3
Textuality
Summary. The category of denotators. The textual predicate system. Predicate expressions as significants. Classification of textual predicates by three dichotomies: arbitrary/motivated, punctual/relational, and objective/morphic. Classification of motivation mechanisms: logic and geometry.
18.3.1
The Category of Denotators
Summary. We first have to introduce a category structure on the set of all denotators.
In section 8.2, we briefly alluded to an extension of morphisms between local compositions to denotators of more general types. For the following discussion of textual predicates, we need such a framework which admits morphisms between any denotators. In order to define morphisms for denotators of any type, i.e., for the category $\mathrm{Den}$ of denotators, we start with the following axiom:

Axiom 2 If $x, y$ are denotators of different form type, the set $\mathrm{Den}(x, y)$ of morphisms in $\mathrm{Den}$ is empty.

This means that only typed morphisms are allowed here, in other words: The category of denotators is the coproduct
$$\mathrm{Den} = \mathrm{Den}_{Simple} \sqcup \mathrm{Den}_{Syn} \sqcup \mathrm{Den}_{Power} \sqcup \mathrm{Den}_{Limit} \sqcup \mathrm{Den}_{Colimit} \tag{18.1}$$
of the subcategories of types as notated in the indexes. Let us now look at the different types. The type where we already have a good theory is the category $\mathrm{Den}_{Power} = \mathrm{Loc}$ of local compositions. Observe that here, the morphisms do not presuppose that the form morphisms which induce the natural transformations $f$ of the respective subfunctors restrict to specific form types, i.e., the coordinator forms may be of any type. This means that if we want to associate a denotator $x : A \rightsquigarrow F(\xi)$ of form type $t_x$ with a denotator $y : B \rightsquigarrow G(\eta)$ of form type $t_y$, by a form morphism $h : x \mapsto y.\alpha$, say, then we can do so under their "wrapping" as singleton local compositions, i.e., we have $f/\alpha : \{x\} \to \{y\}$ for $f = h|_{\{x\}}$, as described in section 8.2. So

Principle 10 The only way to change type by morphisms is to wrap form morphisms into local compositions.

Let us now define the subcategory $\mathrm{Den}_{Simple}$. Given two simple denotators $x : A \rightsquigarrow F(\xi)$, $y : B \rightsquigarrow G(\eta)$, with simple forms $F \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(M)$, $G \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(N)$, we set
$$\mathrm{Den}(x, y) = \{(x, y, \alpha),\ \alpha \in A@B \mid \text{there is } h \in M@N \text{ with } h.x = y.\alpha\} \tag{18.2}$$
and we denote such a morphism by $f/\alpha : x \to y$. This definition should be read properly with respect to the underlying identifiers. We have $\xi : A \to \mathrm{Fun}(F)$, $\eta : B \to \mathrm{Fun}(G)$, a notation which we may use in view of Yoneda's lemma, by identification of $A@\mathrm{Fun}$ with $\mathrm{Hom}(@A, \mathrm{Fun})$. The existence of $h$ means that we have a form (i.e., a functor) morphism $h : \mathrm{Fun}(F) \to \mathrm{Fun}(G)$ which is induced by $@h$. Then the equation $h.x = y.\alpha$ means that $h.\xi = \eta.\alpha$. The rest is obvious.
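Axiom 2 and Principle 10 behave much like a type check followed by an explicit wrapper. The following minimal Python sketch illustrates this; it assumes a toy model in which a denotator is just a name, a form type and a hashable coordinate, and every identifier in it (`FormType`, `Denotator`, `hom_may_be_nonempty`, `wrap_as_singleton`) is a hypothetical name chosen for illustration only.

```python
from dataclasses import dataclass
from enum import Enum


class FormType(Enum):
    SIMPLE = "Simple"
    SYN = "Syn"
    POWER = "Power"
    LIMIT = "Limit"
    COLIMIT = "Colimit"


@dataclass(frozen=True)
class Denotator:
    name: str
    form_type: FormType
    coordinate: object  # assumed hashable in this toy model


def hom_may_be_nonempty(x, y):
    """Axiom 2: Den(x, y) is empty unless both form types agree."""
    return x.form_type is y.form_type


def wrap_as_singleton(x):
    """Principle 10: wrap x as the singleton local composition {x}."""
    return Denotator("{" + x.name + "}", FormType.POWER, frozenset({x}))


pitch = Denotator("pitch", FormType.SIMPLE, 60)
event = Denotator("event", FormType.LIMIT, (0.0, 60))

direct = hom_may_be_nonempty(pitch, event)
wrapped = hom_may_be_nonempty(wrap_as_singleton(pitch), wrap_as_singleton(event))
```

The check only says whether morphisms are admitted at all; it does not construct the form morphism $h$ required by (18.2).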
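Proposition 20 The subcategory $\mathrm{Den}_{Simple}$ has arbitrary fiber products.

Proof. Suppose we are given three simple forms
$$F \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(M), \quad G \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(N), \quad H \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(L),$$
three respective denotators $x : A \rightsquigarrow F(\xi)$, $y : B \rightsquigarrow G(\eta)$, $z : C \rightsquigarrow H(\zeta)$ and two morphisms
$$x \xrightarrow{\ f/\alpha\ } z \xleftarrow{\ g/\beta\ } y$$
of simple denotators. Suppose that these morphisms are induced by morphisms $h : M \to L$, $k : N \to L$. Then we have a fiber product form $F \times_H G \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(M \times_L N)$ with identifier equal to the fiber product of the identifiers of $F, G, H$ under the given form morphisms. Since these form morphisms are induced by two module morphisms $h, k$, the projections $F \times_H G \to F$, $F \times_H G \to G$ are also induced by the projections $M \times_L N \to M$, $M \times_L N \to N$. With this in mind, there is a denotator $x \times_z y : A \times_C B \rightsquigarrow F \times_H G(\xi \times_\zeta \eta)$ which is universally defined to make this diagram

[Diagram: the fiber product square of addresses $A \times_C B \to A$, $A \times_C B \to B$, $A \to C$, $B \to C$ (the latter two given by $\alpha$, $\beta$) is mapped by $\xi \times_\zeta \eta$, $\xi$, $\eta$, $\zeta$ onto the fiber product square of functors $\mathrm{Fun}(F \times_H G) \to \mathrm{Fun}(F)$, $\mathrm{Fun}(F \times_H G) \to \mathrm{Fun}(G)$, $\mathrm{Fun}(F) \to \mathrm{Fun}(H)$, $\mathrm{Fun}(G) \to \mathrm{Fun}(H)$ (the latter two given by $h$, $k$).]

commute, and we are done. QED.

For synonymy, define the reference denotator $x^\sim : A \rightsquigarrow F^\sim(\xi^\sim)$ of a denotator $x : A \rightsquigarrow F(\xi)$ with $F \xrightarrow[\mathrm{Id}]{} \mathrm{Syn}(F^\sim)$ and $\xi^\sim = \mathrm{Id}(\xi) \in A@\mathrm{Fun}(F^\sim)$. This is essentially the same as $x$, albeit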
leaving aside the synonymous 'disguise'. Suppose by recursion that for two synonymy denotators $x : A \rightsquigarrow F(\xi)$, $y : B \rightsquigarrow G(\eta)$, we have already defined and symbolized morphisms of referenced denotators by the fractional notation $f/\alpha : x^\sim \to y^\sim$ with address change $\alpha : A \to B$. Then we identify the morphisms $\mathrm{Den}(x, y)$ with the set of triples $\{(x, y, f/\alpha) \mid f/\alpha \in \mathrm{Den}(x^\sim, y^\sim)\}$; and we denote such a morphism by the evident symbol $f/\alpha : x \to y$. The composition of such morphisms is evident, and this (sub)category is defined. This subcategory has fiber products iff its referenced counterpieces have. Observe that this is a problematic requirement for circular denotators; in a mathematical discussion, the situation should receive a special treatment.

For the limit type, give two $x : A \rightsquigarrow F(\xi)$, $y : B \rightsquigarrow G(\eta)$, with forms $F \xrightarrow[\mathrm{Id}]{} \mathrm{Limit}(\mathbb{D})$, $G \xrightarrow[\mathrm{Id}]{} \mathrm{Limit}(\mathbb{E})$. By definition, we have $\xi = (\xi_d)_{d \in |\mathbb{D}|}$, $\eta = (\eta_e)_{e \in |\mathbb{E}|}$ where $|\mathbb{D}|$, $|\mathbb{E}|$ denote the corresponding diagram vertex sets. In this case, a morphism $f/\alpha : x \to y$ means the following:

1. an address change $\alpha \in A@B$,
2. the symbol $f/\alpha$ for a family of morphisms $(f_d/\alpha : x_d \to y_{\phi(d)})$ with a map $\phi : |\mathbb{D}| \to |\mathbb{E}|$, and where the denotators $x_d, y_{\phi(d)}$ are reference denotators as above, i.e., $x_d : A \rightsquigarrow F_d(\xi_d)$ for the $d$-th form $F_d$ of the diagram. In particular, we suppose that the types of the factor domains and codomains are identical.

The composition of two such morphisms is again evident. In this subcategory, fiber products can be constructed as follows: Give a diagram of three limit denotators
$$x \xrightarrow{\ f/\alpha\ } z \xleftarrow{\ g/\beta\ } y$$
with morphisms $f/\alpha : x \to z = (f_d/\alpha : \xi_d \to \zeta_{\phi(d)})$ for a map [...]
$$\begin{array}{ccc} P_{d,e} & \longrightarrow & y_e \\ \downarrow & & \downarrow \\ x_d & \longrightarrow & z_h \end{array} \tag{18.3}$$
and whose existence is supposed by circular or regular recursion. It is easy to verify that this defines a fiber product on $\mathrm{Den}_{Limit}$.

For the colimit type, give two $x : A \rightsquigarrow F(\xi)$, $y : B \rightsquigarrow G(\eta)$, with forms $F \xrightarrow[\mathrm{Id}]{} \mathrm{Colimit}(\mathbb{D})$, $G \xrightarrow[\mathrm{Id}]{} \mathrm{Colimit}(\mathbb{E})$. Here, we have either $x = y$ where we just take the identity as a morphism, or else it is supposed that the diagrams are both discrete, in which case both $x, y$ live in coproduct spaces, $x \sim x_d \in A@d$, $y \sim y_e \in B@e$, say. Then we identify $\mathrm{Den}(x, y)$ with the morphism set $\mathrm{Den}(x_d, y_e)$ as above. In all other cases, no morphisms
are provided in the colimit type. We leave it as an exercise to verify the existence of fiber products for the subcategory $\mathrm{Den}_{Colimit}$. We stress once more that these are recursive definitions which require a special treatment in case the denotators pertain to circular forms.

Observe that, for example, if we have a denotator $x$ in a product form of two simple factors with modules $M, N$, this form is isomorphic to the simple form associated with the direct sum $M \oplus N$. However, the denotator is not isomorphic to its corresponding denotator $x'$ in the simple form of the direct sum! It only becomes isomorphic if wrapped as a singleton, i.e., $\{x\} \xrightarrow{\sim} \{x'\}$. One could also see this technique as a "type casting" procedure, known from (strongly) typed computer programming languages.
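The "type casting" analogy can be made concrete with a small Python sketch. It assumes, purely for illustration, that $M = \mathbb{Z}^2$ and $N = \mathbb{Z}$, that a zero-addressed product coordinate is a pair of tuples, and that a coordinate in $M \oplus N$ is one flat tuple; the function names are hypothetical.

```python
def cast_product_to_direct_sum(coordinate):
    """Flatten a product coordinate ((m...), (n...)) into one tuple in M (+) N."""
    m, n = coordinate
    return tuple(m) + tuple(n)


def cast_direct_sum_to_product(coordinate, dim_m):
    """Inverse cast: split a flat tuple after the first dim_m entries."""
    return (tuple(coordinate[:dim_m]), tuple(coordinate[dim_m:]))


x = ((1, 2), (3,))                       # product form, assumed M = Z^2, N = Z
x_prime = cast_product_to_direct_sum(x)  # simple form on M (+) N = Z^3

# The cast is applied to the wrapped singletons {x} and {x'}.
wrapped_x = frozenset({x})
wrapped_x_prime = frozenset(cast_product_to_direct_sum(c) for c in wrapped_x)
```

As typed data, `x` and `x_prime` differ; only the explicit cast on the wrapped singletons relates them, and the cast is invertible once the dimension of $M$ is known.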
18.3.1.1
Morphisms as Denotators
Summary. For formal programming reasons, it might be useful to transform morphisms into denotators. We look for such techniques.
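If $\alpha \in A@_R B$ is an address change over a fixed ring $R$, and $x : A \rightsquigarrow F(\xi)$, $y : B \rightsquigarrow G(\eta)$ are two denotators, the triple $(x, y, \alpha)$ can be seen as a denotator as follows: We have these sections and projections of addresses: $i_A : A \rightarrowtail A \oplus B$, $p_A : A \oplus B \twoheadrightarrow A$ and $i_B : B \rightarrowtail A \oplus B$, $p_B : A \oplus B \twoheadrightarrow B$. We also have the injection $p_A@B : A@B \rightarrowtail A \oplus B@B$. We refer to these maps in the following discussion. Take the two injections $p_A@F : A@F \rightarrowtail A \oplus B@F$, and $p_B@G : B@G \rightarrowtail A \oplus B@G$. Consider the form $F \times G \times B \xrightarrow[\mathrm{Id}]{} \mathrm{Limit}(F, G, @B)$. Then the triple $(x, y, \alpha)$ is identified with a denotator $(x, y, \alpha) : A \oplus B \rightsquigarrow F \times G \times B(\xi, \eta, \alpha)$. This settles the description of simple denotators. For the local compositions, take the usual morphism situation:
$$\begin{array}{ccc} x & \longrightarrow & @A \times F \\ {\scriptstyle f}\downarrow & & \downarrow{\scriptstyle @\alpha \times h} \\ y & \longrightarrow & @B \times G \end{array} \tag{18.4}$$
which represents the morphism $f/\alpha : x \to y$. We may take the graph $\Gamma_f \subset @A \times F \times @B \times G \cong @A \oplus B \times F \times G$. This is an $A \oplus B$-addressed local composition in the form $F \times G$. Taking the form $\exp(F, G) \xrightarrow[\mathrm{Id}]{} \mathrm{Power}(F \times G)$, we have this denotator $\Gamma_{f/\alpha} : A \oplus B \rightsquigarrow \exp(F, G)(\Gamma_f)$ and we can couple it with the equally addressed denotator $(x, y, \alpha)$ from above to get a denotator $(f/\alpha)$ in form $\exp(F, G) \times F \times G \times B \xrightarrow[\mathrm{Id}]{} \mathrm{Limit}(\exp(F, G), F \times G \times B)$:
$$(f/\alpha) : A \oplus B \rightsquigarrow \exp(F, G) \times F \times G \times B(\Gamma_{f/\alpha}, (x, y, \alpha))$$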
for the given morphism of local compositions. The case of synonymy is trivial since the morphisms are just "synonymy casts" of already given morphisms. The limit and colimit situations essentially reduce to a finite list of morphisms or one selected morphism, respectively, and this is settled by the above techniques; we leave it as an exercise to work out the details. We should add two remarks:

• Morphisms of forms can be restated as new forms. In fact, if $h : F \to G$ is any morphism of forms, look at the form
$$\Gamma_h \xrightarrow[\Gamma_h : \mathrm{Fun}(F) \rightarrowtail \mathrm{Fun}(F) \times \mathrm{Fun}(G)]{} \mathrm{Limit}(F, G).$$
The morphism is represented by the identifier of this form. This can be used to represent general limit data by products: If we have a diagram $\mathbb{D} : D \to \mathrm{Mod}^@$, each diagram arrow $f_{d,e,k} : d \to e$ can be wrapped in its graph form $\Gamma_{f_{d,e,k}}$ of limit type (in fact a product of two forms). Of course, one should not believe that wrapping a morphism in an identifier is an essential simplification of real problems, but it shows that morphisms are controlled on the level of identifiers within forms, and that, conversely, controlling forms qua objects means controlling the whole category of forms.
• If two denotators of same address and form have same coordinate $\xi$ but different names "$d_1$", "$d_2$", we can shift this difference into the coordinate by adding a simple factor of character strings to the given form, and restating the coordinates as $(d_1, \xi), (d_2, \xi)$ such that now, the coordinates are different and not only the total denotator data.

This discussion makes clear that we may either look at denotators or at the category of denotators, the information is essentially the same: Each morphism yields an object, so that the knowledge about all objects is sufficient to describe knowledge about the whole category $\mathrm{Den}$. This is why we shall in fact use the whole category and not only its objects in the following discourse about predicates.
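The statement that each morphism yields an object can be illustrated on finite, zero-addressed toy data. The Python sketch below is an assumption-laden simplification: local compositions are modelled as finite sets of integers, the address change is a mere placeholder string, and the function names `graph`, `morphism_as_denotator` and `shift_name_into_coordinate` are hypothetical.

```python
def graph(f, domain):
    """Graph of f on a finite local composition: the set of pairs (p, f(p))."""
    return frozenset((p, f(p)) for p in domain)


def morphism_as_denotator(f, x, y, alpha):
    """Bundle the graph with the triple (x, y, alpha) into one coordinate."""
    g = graph(f, x)
    if not all(q in y for (_, q) in g):
        raise ValueError("f does not map x into y")
    return (g, (x, y, alpha))


def shift_name_into_coordinate(name, coordinate):
    """Second remark: add a string factor so that names become coordinates."""
    return (name, coordinate)


x = frozenset({0, 4, 7})   # toy local composition (pitch classes)
y = frozenset({2, 6, 9})
coordinate = morphism_as_denotator(lambda p: p + 2, x, y, "id")

d1 = shift_name_into_coordinate("d1", (0, 4, 7))
d2 = shift_name_into_coordinate("d2", (0, 4, 7))
```

After the shift, `d1` and `d2` differ already as coordinates, which is the point of the second remark.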
18.3.2
Textual Semiosis
Summary. We introduce the formal system of a textual semiosis, including the question of switching between predicates and denotators.
The textual predicate system is a formal system of semiosis which manages the difference between mathematically possible denotators and denotators which are the case in some technical or more informal data base. The typical application is the storage of denotators on computer data bases for music research including retrieval, communication, and extension of knowledge. A textual semiosis (over the category Den) is a map
$$sig_{Den} : Tex \to Texig(Den)_I \tag{18.5}$$
with a domain $Tex \subset Ex = \langle UNICODE \rangle$ in the monoid of all UNICODE strings⁶, which we call expressions here. To define its codomain, let $Den^\infty = \coprod$ [...] given a module $I$ of 'truth values' whose role will be discussed in the sequel⁷. This module will only be evoked via its role within a simple form
$$Val(I) \xrightarrow[\mathrm{Id}]{} \mathrm{Simple}(I) \tag{18.6}$$
of truth values⁸. This in turn gives rise to what is really needed, i.e., the form
$$TRUTH(I) \xrightarrow[\mathrm{Id}]{} \mathrm{Power}(Val(I)) \tag{18.7}$$
whose set of $A$-addressed truth denotators (any address admitted) is denoted by $\mathbb{T}_I^A$, whereas the union of all such sets over the totality of addresses is denoted by $\mathbb{T}_I$ and called the set of truth denotators. In other words, we have a 'fibration' $add : \mathbb{T}_I \to \mathrm{Mod}$ whose fiber $add^{-1}(A)$ at address $A$ is $\mathbb{T}_I^A$. The codomain $Texig(Den)_I$ is the set $\mathbb{T}_I^{Den^\infty}$ of all characteristic functions on the set of all finite length tuples of denotators or morphisms of denotators, and with values in $\mathbb{T}_I$. If $\chi \in Texig(Den)_I$, and if $x \in Den^\infty$, the address of $\chi(x)$ is often related to the addresses which are involved in $x$. For example, if all these addresses coincide and are equal to $A$, then $\chi(x)$ should also live in $\mathbb{T}_I^A$, but this is not mandatory and depends upon the construction of $\chi$ from previously defined functions.

To understand the truth denotator set, let us first discuss the completely reduced case of zero address and zero truth module, i.e., $A = I = 0_0 = 0$. This is the standard situation of truth values in topos theory: They are just the set of morphisms $1@\Omega$; in our situation of the presheaves over the module category $\mathrm{Mod}$, the final object is representable: $1 = @0$, and by Yoneda, we have $1@\Omega \cong \Omega(0) = \mathrm{Sub}(@0)$. Warning: This topos $\mathrm{Mod}^@$ is not Boolean. In fact, a topos of presheaves $\mathrm{Sets}^{C^{op}}$ is Boolean iff $C$ is a groupoid, i.e., iff all arrows are isomorphisms (see [314, exercise VI.2, p.343]). In general, we have $A@TRUTH(I) = A@\Omega^{@I} \cong \mathrm{Sub}(@(A \times I))$, therefore for the zero situation, a denotator $D \in \mathbb{T}_0^0$ is $D : 0 \rightsquigarrow TRUTH(0)(d)$ and has coordinate $d \subset @0$, a sieve in the zero module over the zero ring. To describe all sieves in the zero module, consider a category $C$, and the equivalence relation
$$R \sim S \text{ iff } C(R, S) \text{ and } C(S, R) \text{ are both non-empty}$$
on $C$ whose classes are denoted by $[R]$. On $C/{\sim}$, we have a directed tree graph⁹ structure $B_C$ where an arrow $[R] \to [S]$ is defined iff there is a morphism $f : R \to S$ for representatives $R, S$. On any directed tree $T$, we can build the duplication tree $Dup(T)$ as follows: Take two disjoint copies $T_0, T_1$ of $T$ and draw one arrow $d_x : x_0 \to x_1$ for each couple $(x_0, x_1)$ of points corresponding to the original point $x$ of the tree $T$. Call a subset of a directed graph $G$ open iff it contains the domains of all arrows whose codomains it contains. The set $Open(G)$ of all open sets in $G$ defines a topology. In fact, the empty set and $G$ are open, and any union/intersection of opens is open. Therefore, the topological space $Open(G)$ is a Heyting algebra in the usual sense (with intersection, union and implication, see appendix G.5.1 and [314, p.51]). With this in mind we have the following description of all truth denotators modulo names:

⁷ We refrain from introducing non-representable truth value objects here and restrain to modules as there are merely speculative reasons to generalize.
⁸ More generally, a truth form may be any form $F$, and not only a simple one.
⁹ This is a directed graph whose undirected associated graph (forget about arrow directions) has no cycles.
Theorem 24 The Heyting algebra $Open(Dup(B_{Rings}))$ of open sets of the duplicated tree $Dup(B_{Rings})$ and the Heyting algebra $\mathrm{Sub}(@0)$ of (zero) truth coordinates are canonically isomorphic.

Proof. Let $X \subset @0$ be a sieve, and $A$ any address. Then either $A@X = \{!_A\}$ (the unique arrow to the final module) or $A@X = \emptyset$. So such a sieve is defined by the subset $supp(X)$ of addresses where $A@X = \{!_A\}$. Conversely, if any subset $S$ of addresses is given, this corresponds to the support set of a sieve iff it is closed under address changes, i.e., iff, whenever $A \in S$ and there is an address change $B \to A$, then $B \in S$. Therefore, the support set $S$ of a sieve must be a union of module classes in the above sense, i.e., elements of $B_{Mod}$. We therefore view $S$ as a subset of vertexes in $B_{Mod}$ which is open in $B_{Mod}$. Now, every non-empty $R$-module $M$ is equivalent to the zero-module $0_R$ over the same ring. And for two such modules, there is a morphism iff there is an underlying ring homomorphism. Therefore, the restriction of the module class tree to the non-empty modules is isomorphic to the class tree of the ring category $B_{Rings}$. Moreover, the empty modules are also classified according to their coefficient rings, and therefore, the class tree of these modules is also isomorphic to the class tree $B_{Rings}$ of rings. Finally, there is a morphism $\emptyset_R \to 0_R$ whereas there is no morphism in the other direction. Therefore, the tree $B_{Mod}$ is isomorphic to $Dup(B_{Rings})$. Moreover, a sieve support set $S$ of module classes in $B_{Mod}$ must be open, and we are done. QED.

Exercise 38 Observe that there are open sets in $Dup(B_{Rings})$ which are not defined by a discrete set of leaves. For example, take the direct product ring $R = \prod_{i=1,2,3,\ldots} \mathbb{Z}_{p_i}$ over all prime fields for increasing primes $p_1 = 2, p_2 = 3, p_3 = 5, \ldots$. Consider the infinite increasing sequence of ideals $I_i = \prod_{k \le i} \mathbb{Z}_{p_k}$ and the corresponding chain of projections
$$R \to R/I_1 \to R/I_2 \cdots R/I_i \to R/I_{i+1} \cdots$$
which defines non-equivalent rings and whose opening (the smallest open set which contains this sequence) is not defined by a leaf.

Example 35 A first example of a characteristic function is the attribution of one of the two universal values $\top, \bot : 1 \to \Omega$ with $\top = @0$, $\bot = \neg\top = \emptyset$ for each tuple $f$ of denotators. This may and will happen in a completely uncontrollable, mathematically non-foreseeable way, see section 18.3.3.

Proposition 21 The coordinate Heyting algebra $0_{\mathbb{Z}}@\Omega$ of truth denotators for address $A = 0_{\mathbb{Z}}$ and truth module $0_0$ is isomorphic to the Heyting algebra of sieves $S \subset @\mathbb{Z}$ in the category $\mathrm{Rings}$.

The proof is left as an exercise. For any truth value module $I$ and address $A$, and for any local composition $t \subset A@I$, we have the canonical functorialization $\hat{t}$ (see section 6.2.3), in particular, for every value $\tau \in A@I$, we have $\hat{\tau} = \widehat{\{\tau\}}$. So every 'set' $t \subset A@I$ of $A$-addressed 'truth values' in $I$ gives rise to a truth denotator in $\mathbb{T}_I^A$.
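The duplication tree and its open sets, which underlie Theorem 24, can be explored on a small finite example. The Python sketch below assumes a hypothetical two-vertex tree with a single arrow from the class of $\mathbb{Z}$ to the class of $\mathbb{Q}$ (there is a ring homomorphism $\mathbb{Z} \to \mathbb{Q}$ but none back); it is a finite toy, not the full tree $B_{Rings}$, and all function names are introduced here for illustration.

```python
from itertools import combinations


def duplicate(vertices, arrows):
    """Dup(T): two copies T0, T1 of T plus one arrow x0 -> x1 per vertex x."""
    dup_vertices = {(x, i) for x in vertices for i in (0, 1)}
    dup_arrows = {((a, i), (b, i)) for (a, b) in arrows for i in (0, 1)}
    dup_arrows |= {((x, 0), (x, 1)) for x in vertices}
    return dup_vertices, dup_arrows


def is_open(subset, arrows):
    """Open: contains the domain of every arrow whose codomain it contains."""
    return all(a in subset for (a, b) in arrows if b in subset)


def open_sets(vertices, arrows):
    """Enumerate all open subsets (feasible only for tiny graphs)."""
    vs = sorted(vertices)
    return [frozenset(c)
            for r in range(len(vs) + 1)
            for c in combinations(vs, r)
            if is_open(set(c), arrows)]


# Hypothetical toy tree: classes of Z and Q with one arrow Z -> Q.
tree_vertices = {"Z", "Q"}
tree_arrows = {("Z", "Q")}

dup_vertices, dup_arrows = duplicate(tree_vertices, tree_arrows)
opens = open_sets(dup_vertices, dup_arrows)
```

In this toy case the open sets are the empty set, the copy-0 class of $\mathbb{Z}$ alone, and the sets obtained by successively adding vertices whose predecessors are already present; unions and intersections of opens stay open, as the text asserts for $Open(G)$.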
Example 36 Take, for example, the value module $S^1 = \mathbb{R}/\mathbb{Z}$, i.e., the real circle group (a $\mathbb{Z}$-module). For each number $\phi \in [0,1]$, we have the local, zero-addressed truth denotator (coordinate) $\phi^{\wedge} = [0,\phi[^{\wedge} \in \mathbb{T}_{S^1}$. This means that we have traced a (naive) fuzzy logic on the unit interval by the interval truth denotators $\phi^{\wedge}$, included between the special values $0^{\wedge} = \emptyset \subset \phi^{\wedge} \subset 1^{\wedge} = S^1$. The upper limit is the largest objective local composition in $@(0_{\mathbb{Z}} \times S^1)$.
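The chain of interval truth values in Example 36 can be illustrated by a small Python sketch. It is only an illustration under stated assumptions: an interval $[0,\phi[$ is represented by its endpoint, conjunction and disjunction are taken to be intersection and union of intervals, and the implication shown is the Gödel implication of a linearly ordered Heyting algebra. That implication is an assumption about the chain of intervals alone, not a statement about the full sieve algebra. The class name `IntervalTruth` is hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IntervalTruth:
    """Hypothetical stand-in for the interval [0, phi[ with 0 <= phi <= 1."""
    phi: float

    def __post_init__(self):
        if not 0.0 <= self.phi <= 1.0:
            raise ValueError("phi must lie in [0, 1]")

    def __and__(self, other):
        # Intersection of [0, a[ and [0, b[ is [0, min(a, b)[.
        return IntervalTruth(min(self.phi, other.phi))

    def __or__(self, other):
        # Union of [0, a[ and [0, b[ is [0, max(a, b)[.
        return IntervalTruth(max(self.phi, other.phi))

    def implies(self, other):
        # Goedel implication on a chain (an assumption of this sketch).
        return TOP if self.phi <= other.phi else other

    def negate(self):
        # Negation is implication into the bottom value.
        return self.implies(BOTTOM)


BOTTOM = IntervalTruth(0.0)  # the empty interval [0, 0[
TOP = IntervalTruth(1.0)     # [0, 1[, i.e. the whole circle
```

In this sketch every non-empty interval negates to `BOTTOM`, so double negation does not return the original value: the logic on the chain is intuitionistic rather than Boolean.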
Principle 11 Philosophically, this means that fuzzy logic (and also other variants according to the given value module) is interpreted as a logic of intervallic local compositions in the unit group. This is quite the spirit which we shall now evoke: to view logical values as special local compositions, and to rephrase statements about local compositions as if they were generalized truth values. In music, there is no deeper reason to restrict truth to the final power $\Omega^1$ of the subobject classifier instead of extending the discourse to general powers $\Omega^I$.

Exercise 39 For every morphism of truth value modules $h : I \to J$, we have a canonical form morphism $TRUTH(h) : TRUTH(I) \to TRUTH(J)$ defined by images of subfunctors, and a corresponding map
$$\mathbb{T}(h) : \mathbb{T}_I \to \mathbb{T}_J : \quad d : A \rightsquigarrow TRUTH(I)(\delta) \;\mapsto\; h.d : A \rightsquigarrow TRUTH(J)(Im(\delta))$$

on the truth denotators. So we have a corresponding map

$$Texig(Den)(h) : Texig(Den)_I \to Texig(Den)_J$$

which canonically extends to a change of textual semioses, i.e., if

$$sig_{Den} : Tex \to Texig(Den)_I$$

is a given textual semiosis, the composition

$$h.sig_{Den} = Texig(Den)(h) \cdot sig_{Den}$$

defines a new textual semiosis. More generally, given two textual semioses

$$sig^1_{Den} : Tex^1 \to Texig(Den)_I \quad \text{and} \quad sig^2_{Den} : Tex^2 \to Texig(Den)_J$$

a morphism $sig^1_{Den} \to sig^2_{Den}$ is a pair $(u, h)$ with a set map $u : Tex^1 \to Tex^2$ on the expressions and a morphism $h : I \to J$ on the truth modules, such that the diagram

$$\begin{array}{ccc}
Tex^1 & \xrightarrow{\; sig^1_{Den} \;} & Texig(Den)_I \\
{\scriptstyle u} \downarrow & & \downarrow {\scriptstyle Texig(Den)(h)} \\
Tex^2 & \xrightarrow{\; sig^2_{Den} \;} & Texig(Den)_J
\end{array} \qquad (18.8)$$
commutes. This defines the category of textual semioses on Den. Verify the details. In particular, we may now compare different textual semioses according to variations on the expressive as well as on the truth value module levels.
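The commutativity condition of exercise 39 can be checked mechanically on finite toy data. The following Python sketch is a simplification and not the construction of the text: truth values are plain Python values instead of sieves, a characteristic function is a dictionary from (hypothetical) denotator names to values, and `h` is an arbitrary map on values rather than a morphism of truth modules. The function names `push_forward` and `is_morphism` are assumptions made for illustration.

```python
def push_forward(h, chi):
    """Stand-in for Texig(Den)(h): post-compose a characteristic function with h."""
    return {denotator: h(value) for denotator, value in chi.items()}


def is_morphism(sig1, sig2, u, h):
    """Check that the square commutes on every expression of the first semiosis.

    sig1, sig2: dicts mapping expressions to characteristic functions.
    u: map on expressions, h: map on truth values.
    """
    return all(sig2[u(e)] == push_forward(h, chi) for e, chi in sig1.items())


# Toy data: a fuzzy semiosis and a two-valued one (all names hypothetical).
sig_fuzzy = {"Important": {"cd1": 0.8, "cd2": 0.2}}
sig_bool = {"ImportantYesNo": {"cd1": True, "cd2": False}}
rename = {"Important": "ImportantYesNo"}.get


def threshold(value):
    return value >= 0.5
```

With these data, `is_morphism(sig_fuzzy, sig_bool, rename, threshold)` holds, whereas a stricter threshold of 0.9 would break the square because `cd1` would be sent to `False`.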
CHAPTER 18. PREDICATES
The construction of the category of textual semioses in exercise 39 will be used when we have to combine characteristic functions which evaluate to different truth domains. This is essential in the whole construction process of musicologically relevant functions and predicates (see below, definition 61). The problem is not a formal one: As soon as one has opened the truth domains beyond the naive logic of "Yes" and "No", combinations of logical statements must deal with a comparison paradigm; otherwise the choice of logic may terminate in a failure of communication between different logical territories.

Very often, a characteristic function is not really relevant except on a small subset of $Den^\infty$, e.g., the zero-addressed local compositions. For the remaining arguments, the function takes the value 'false'; more precisely, the value is the empty sieve $\bot_A \in A@\Omega^I \cong (A \times I)@\Omega$ at address $A$, also written $\bot$ if the address is clear or irrelevant. Define the support $supp(\chi)$ of a characteristic function $\chi \in Texig(Den)_I$ as the set of tuples $f \in Den^\infty$ such that $\chi(f) \neq \bot$. Whenever the value of a characteristic function is not specified, it is supposed to be 'false' (the address of this $\bot_A$ still being undetermined).
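The convention that unspecified values are 'false' and the definition of the support can be sketched in a few lines of Python. This is a minimal illustration, assuming that tuples of denotators are represented by hashable keys and that `None` stands in for the empty sieve; the helper names are hypothetical.

```python
BOTTOM = None  # stand-in for the empty sieve, i.e. the value 'false'


def make_characteristic_function(specified):
    """Build chi from a dict of explicitly specified values.

    Every argument that is not listed evaluates to BOTTOM.
    """
    def chi(f):
        return specified.get(f, BOTTOM)
    return chi


def support(chi, candidates):
    """Return the candidates on which chi differs from BOTTOM."""
    return {f for f in candidates if chi(f) is not BOTTOM}
```

Note that `support` needs an explicit finite collection of candidates: the sketch can only inspect what is actually stored, which matches the finiteness concerns raised later in the text.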
Definition 61 A predicate for the category Den is a pair $(E, sig_{Den})$ of a textual semiosis $sig_{Den}$ over Den, together with an expression $E$ in $sig_{Den}$. The characteristic function $sig_{Den}(E)$ is called the content of the predicate, whereas the mapping (the functional relation) of $sig_{Den}$ is called its signification. The predicate's extension $ext(E)$ is the content's support $supp(sig_{Den}(E))$.

In a more sloppy language we shall identify a predicate with its expression if the rest (the semiosis in its very meaning!) is clear. Also we will refer to the predicate's textual semiosis when referring to the predicate's truth module etc.

We now introduce the concept of a predicative object (a pre-object), morphism (a pre-morphism), or more generally a tuple of morphisms. The denotators are the fictitious (mathematical) objects and we need those denotators which are supported by predicates. So let $E$ be a predicate of a given semiosis. Then a predicative object, morphism, or tuple (for this predicate) is a pair, denoted by $x/E$, where $x$ is a denotator, a morphism or a tuple of morphisms such that $E(x) = sig_{Den}(E)(x) \neq \bot$, i.e., $x \in ext(E)$. In general, pre-objects and pre-morphisms do not build categories since predicates are completely 'orthogonal' to the underlying mathematical structures. It will however be a special task to delimit classes of predicates which relate to the underlying mathematics. Essentially, pre-objects are introduced to grasp those entities which are the case in the topos-theoretic, more or less fuzzy sense explained above, and to guarantee the most unsystematic approach to facticity as it happens to pervade the mud of the humanities. Summarizing, if we denote the collection of all pre-objects, pre-morphisms and pre-tuples by $Den^\infty/sig$, we have a 'forgetful' map

$$sig/ : Den^\infty/sig \to Den^\infty : x/E \mapsto x$$

whose fiber $sig/^{-1}(x)$ represents the full facticity of $x$ with respect to the given semiosis. Setting $oDen^\infty = \coprod_{1 \le i} (oDen)^i$ we complete the general terminology with this definition:
Definition 62 A predicate $E$ in sig is called
• objective iff $supp(E) \subset oDen^\infty$,
• morphic iff it is not objective,
• punctual iff $supp(E) \subset Den$,
• relational iff it is not punctual.

The above observation that a morphism of denotators may be restated as a denotator means that we may restate morphic and relational predicates as objective and punctual ones.

Although time is not mentioned explicitly in this construction, the concrete situation is that the entire textual semiosis is not time-independent. The point is that the construction of new pre-denotators from old ones is not automatic but has to be performed by someone. This is not a serious drawback for the theory, but we should be aware that the setup changes every time a new item is added or removed! The details of this dynamics will become clearer in the following construction discourse.

A final point concerning reality and fiction: finiteness conditions. The realistic situation is that every predicate should only have a finite number of supporting denotators; a computer hard disk cannot have infinitely many items. This is a delicate condition if we start producing new pre-denotators from given ones, since evident logical or geometric procedures yield infinite support if they are applied carelessly. So the axiom of finite facticity support for predicates is an omnipresent boundary condition to be observed in all the following constructions.

18.3.2.1
Predicates as Denotators
Summary. Predicates should be convertible into denotators for the sake of management in information technology, but also because this is an essential method for generating new denotators from given ones. We discuss such techniques.
We suppose that we are given a predicate $E$, and that it is objective and punctual, a restriction which is not substantial, modulo some restatement acrobatics as described above. By the technique from section 18.3.1.1, we may further suppose that all the denotators of its support have the same address $A$, viz. the direct product of all the addresses of its denotators. And we may suppose that different denotator names also have different coordinates. The set $supp(E)$ is of course not a denotator, but there is a canonical way to make one: Suppose that we have the list $FormSupport(E) = (F_i)_{i=1,\dots n}$ of forms pertaining to denotators of $supp(E)$. Define this operator (see footnote 10) on forms:

$$SupportForm(E) \xrightarrow[Id]{} Limit((P(F_i))_{i=1,\dots n}) \qquad (18.9)$$

with the factor forms

$$P(F_i) \xrightarrow[Id]{} Power(F_i).$$

This defines a canonical restatement of $supp(E)$; in fact, take $supp(E)_i = supp(E) \cap supp(F_i?)$, where we view $F_i?$ as the predicate with support exactly the denotators of form $F_i$. Set $Coord_i$ = set of coordinates of $supp(E)_i$. Then we have the denotator

$$den(E) : A \rightsquigarrow SupportForm(E)((Coord_i)_{i=1,\dots n}). \qquad (18.10)$$

10 The index limit $n$ could be a finite or infinite number, but as supports will be finite in every practical situation, the finite case is the default case.
As we suppose that the coordinates distinguish denotators, we get a one-to-one representation of the predicate $E$ by a product of objective, $A$-addressed local compositions. The usage of this kind of denotators is not harmless! In fact it is important since we want to build denotators by this technique, but we do not want to produce contradictions by circularity, as, for example, by Russell's antinomy! However, since the construction involves already given denotators, the new one is just a further construction which uses given supports. But then, we have to pay attention to avoid conflicts with history! The point is that the construction of $den(E)$ involves the actual status of the predicate $E$. So, as time passes, $E$ may change its signification (sic!) and we should update the associated denotator. For example, if $E$ is already constructed from other predicates, this may be a cascade of updating duties. Although the updating process may even involve circular constructs (referring to $E$), we will not produce contradictions since the referred-to instance of $E$ will be the last one, as with all replacement processes of type "x = x + 1" in programming. We insist on this conflict potential since in the increasingly accelerated cycle of production, documentation, dissemination, and reflection of knowledge, a dramatic interaction between successive cycles may cause such strong intercyclic distortion forces that knowledge crashes, since the known and the new may overlap to the degree where the older 'version' cannot even stabilize its contents before the new 'version' intervenes and puts into question its predecessor.
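The passage from a predicate's support to a single denotator, and the need to update it as the support changes, can be sketched as follows. This Python fragment is only an illustration under assumptions: a supporting denotator is reduced to a hypothetical `(form_name, coordinate)` pair with a common address, and the resulting "denotator" is a tuple with one frozen set of coordinates per form, mirroring the limit of power forms.

```python
from collections import defaultdict


def den(support):
    """Snapshot of a support as one immutable value.

    support: iterable of (form_name, coordinate) pairs.
    Returns a tuple of (form_name, frozenset_of_coordinates), sorted by form name.
    """
    by_form = defaultdict(set)
    for form_name, coordinate in support:
        by_form[form_name].add(coordinate)
    return tuple(
        (form_name, frozenset(coords))
        for form_name, coords in sorted(by_form.items())
    )


# The support evolves over time; each snapshot reflects its last state only.
support_of_e = {("Note", "c4"), ("Chord", "Cmaj")}
snapshot = den(support_of_e)
support_of_e.add(("Note", "e4"))
updated_snapshot = den(support_of_e)
```

The earlier `snapshot` stays unchanged after the update, in the same way as the old value of `x` is simply replaced in an assignment `x = x + 1`.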
18.3.3
Atomic Predicates
Summary. The criteria for atomicity. Classification of atomic textual predicates: mathematics, primavista, and deixis (shifters).
As announced earlier, predicates are not just invented in the air but germinate from a well-delayed arsenal of elementary or "atomic" predicates. The different types of atomicity are first treated before we deal with the combinatorial chemistry of compound predicates in section 18.3.4. Evidently, this concept reflects the overall topography of construction in music and its theory. Basically, we distinguish three approaches to music-related predicates: 1) those which are given by a fundamentally mathematical reasoning without further relation to music or to other rationales, 2) those which are defined by a purely music(ologic)al reasoning, and 3) those which are just the user's decisions beyond any further foundation, i.e., deictic morphemes in the sense of semiotics (see [361, 2.5.1] for a more semiotic discussion of the present topic). They will be discussed in the sequel.
18.3.3.1
Mathematical Predicates
Summary. Why do we need mathematical predicates since denotators already share mathematical existence? Examples of mathematical predicates for the working musician and musicologist.
We insist on distinguishing pure mathematical existence from musical relevance (musical existence) of mathematical properties in the lexical foundation for arbitrary predicates. This may, at first sight, seem unnecessarily restrictive. Why should a mathematical criterion not be unrestrictedly accessible within music and its theory? The reason is that, besides some historical attempts to identify music with mathematics (e.g., the Pythagorean school), mathematical objects are not automatically relevant to music. For example, a Fibonacci sequence does not automatically have musical meaning. Only when it is introduced as a composer's or analyzer's criterion will it gain the status of a predicative instance. Musical meaning consists of predicates which are not free from mathematical criteria, but have to be given a semantic status beyond mathematics. One of the reasons for this duty is that cognitive aspects of music must be examined with respect to their relevance within a given context. For example, the discussion of consonance and dissonance perception should be sensitive towards the mathematical procedures involved in the recognition of sonance classes. As soon as cognitive performance has to be traced in the cortical or subcortical tissue, the question of modules for mathematical tasks becomes primordial.

So mathematical predicates have extensions of denotators defined by mathematical criteria. One looks at mathematically defined properties of denotators, such as onset quantities or Boolean specifications, and then selects those denotators which fulfill these properties. Here fulfillment means attributing to denotators truth values in some set $\mathbb{T}_I$ of truth denotators. Let us start with four elementary examples which may be extended and completed in many ways.
1. Chords. We want to specify denotators which comprehend chords as sets of simultaneous notes. To this end, sets of notes, for piano say, are defined by the form $NoteGroup \xrightarrow[Id]{} Power(Piano\text{-}Note)$, using the well-known coordinator Piano-Note. Then, by definition, the expression "PianoChord" will take truth value $\top \in \mathbb{T}_I$ precisely on the denotators $Ch : A \rightsquigarrow NoteGroup(Note_1, \dots Note_n)$ such that all its substance points $Note_1, \dots Note_n$ have one and the same $E$-formed onset coordinate $E_1 = E_2 = \dots = E_n$. It takes the false value $\bot$ in all other cases (we take $I$ = 00 here). Recall that this pre-denotator is denoted by Ch/PianoChord.

2. Melodic motifs. We again start with the form NoteGroup of the above example, but require now that the onset coordinates $E_i$ be pairwise distinct. We define the content of the expression "Motif" to evaluate to $\top_A \in \mathbb{T}_I$ on those denotators $Mt : A \rightsquigarrow NoteGroup(Note_1, \dots Note_n)$ which verify this condition (giving the pre-denotators Mt/Motif), to $\bot_A \in \mathbb{T}_I$ for all other denotator morphisms with codomain $A$, and to $\bot \in \mathbb{T}_I$ in all other cases. So these values also take care of the address for objects and morphisms in Den.

3. N-element groups of notes. We fix a natural number $N$ and want to select those sets of piano notes which have at most $N$ elements. Denote this predicate by the expression $\le N$, thus defining the corresponding pre-denotators by $D/\le N$, the truth values being again taken from $\mathbb{T}_I$. This last example shows that expressions may contain variables from the mathematical concept framework, such as $N$ in this case.
4. Form positions. Each denotator refers to its form, address, and coordinate. Hence we are also interested in the information which comes from this data. For example, we may look for the form's type and define a characteristic function with values in a set $\mathbb{T}$ of truth denotators over a module restricted to the integers (see footnote 11), with this prescription: We codify the form types by $i(Simple) = 0$, $i(Syn) = 1$, $i(Limit) = 2$, $i(Colimit) = 3$, $i(Power) = 4$, and then attribute the value $t(d)$ defined by the constant sieve $i(type(form(d)))^{\wedge}$ for denotators $d$ or codomains $d$ of morphisms, and the false value $\bot$ else.
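The four elementary examples above can be mimicked on toy data. The following Python sketch makes several simplifying assumptions that are not part of the text: a note is reduced to a hypothetical `Note(onset, pitch)` record, a note group is a `frozenset` of such notes, addresses are ignored, and the truth values are the two Python Booleans instead of sieves.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Note:
    """Hypothetical reduced piano note: onset and pitch only."""
    onset: float
    pitch: int


def piano_chord(notes):
    """True iff all notes share one and the same onset."""
    return len({n.onset for n in notes}) <= 1


def motif(notes):
    """True iff the onsets are pairwise distinct."""
    return len({n.onset for n in notes}) == len(notes)


def at_most(n_max):
    """Family of predicates indexed by the variable N of example 3."""
    def predicate(notes):
        return len(notes) <= n_max
    return predicate


# Codification of form types from example 4.
FORM_TYPE_INDEX = {"Simple": 0, "Syn": 1, "Limit": 2, "Colimit": 3, "Power": 4}

c_major = frozenset({Note(0.0, 60), Note(0.0, 64), Note(0.0, 67)})
scale_start = frozenset({Note(0.0, 60), Note(1.0, 62), Note(2.0, 64)})
```

A group with a single note satisfies both `piano_chord` and `motif` in this sketch, since both conditions are then vacuous or trivially true; whether such borderline cases should count is a decision left to whoever sets up the predicate.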
These four examples suggest that there may be a legitimate musical or musicological interest to introduce certain mathematical properties for denotators. Nevertheless, these properties are of a purely mathematical nature and cannot be deduced from other properties by formal arguments; this is why they are arbitrary to the predicate system.

18.3.3.2
Primavista Predicates
Summary. Primavista (= PV) predicates are the main source for score-related predicates. We give a list and discuss some representative PV predicates for classical European notation; further examples of non-European PV predicates.
PV predicates are predicates related to scores. The concept of score is however not restricted to the classical European music culture, even if we shall only give a rather complete list of primavista predicates for classical European music. From this list, some selected predicates will be discussed in more detail in order to illustrate the general procedure and to present templates for other predicates.
List of typical primavista predicates from classical European music scores: Staves, Braces and Systems, Ledger Lines and Octave Signs, Clefs, Stems, Flags and Beams, Rests and Pauses, Ties, Key Signatures, Time Signatures, Accidents, Bar-lines and Repeat Signs, Slurs, Dynamic Marks, Articulation, Ornamentation Signs, Tempo Indications, Arpeggio, Composer, Name of the Composition, Expression, Instruments, Lyrics, Comments, JazzHarmony, Gestures, Number Sheets (see [435]). For non-European predicates see [103, Vols. 8 & 9].

Absolute Tempo. The absolute tempo indication can be given either by a numerical Mälzel sign or by verbal indications. To grasp this information, we take the forms

$$Maelzel \xrightarrow[Id]{} Limit(Rate, Onset) \qquad (18.11)$$
$$Rate \xrightarrow[Id]{} Simple(\mathbb{Z}) \qquad (18.12)$$
$$VerbAbsTpo \xrightarrow[Id]{} Simple(\mathbb{Z}\langle ASCII \rangle) \qquad (18.13)$$
$$AbsTpo \xrightarrow[Id]{} Colimit(Maelzel, VerbAbsTpo) \qquad (18.14)$$
11 Recall that the notation $A_{[\mathbb{Z}]}$ means restriction of scalars to the integers, i.e., reduction to the underlying abelian group, see appendix E.1.1.
where a denotator $m : 0_{\mathbb{Z}} \rightsquigarrow Maelzel(x, y)$ denotes that the Mälzel sign $m$ is read as "Play x units of duration y!"; the meaning of VerbAbsTpo is clear: it denotes indications such as "Play 'Lento assai'!". The total form AbsTpo codifies the alternative between Mälzel and verbal indications. The next step is the embedding of the absolute tempo information in a named composition and at a precise onset position. We have

$$AbsTpoInComp \xrightarrow[Id]{} Limit(AbsTpoEvt, NComp) \qquad (18.15)$$
$$AbsTpoEvt \xrightarrow[Id]{} Limit(AbsTpo, Onset) \qquad (18.16)$$
$$NComp \xrightarrow[Id]{} Simple(\mathbb{Z}\langle ASCII \rangle) \qquad (18.17)$$
and understand that the absolute tempo event denoted under the form AbsTpoEvt is referred to a composition which is represented by a name. Of course, this could be refined, but the idea is clear from this.

Until now, we have just defined forms which yield the spaces where predicates could play a role. The predicate which is expressed by "Absolute Tempo" then is more than the abstract set of denotators. It should tell us which absolute tempo indications are the case from the point of view of a score reading expert. The trace of such a judgment is the predicate's content. If it is not always very clear from the philological point of view whether there is a determined absolute tempo, we could take the truth value of the characteristic function in $\mathbb{T}_{S^1}$ and thereby model the fuzzy approach as explained above. Therefore the value for a denotator $atp : 0_{\mathbb{Z}} \rightsquigarrow AbsTpoInComp(x)$ is set to the fuzzy value $\phi^{\wedge}$ attributed to the expert judgment that $atp$ is the case with "certainty $0 \le \phi \le 1$", and $\bot$ in all other cases.

Jazz CD Reviews. If a jazz researcher wants to investigate the role of certain jazz directions in the historical line, it may be important to refer to a judgment of CD releases in professional jazz media. Let us therefore introduce a PV predicate about the importance of a CD in a determined review. The idea is that the predicate is set up by a jazz expert who judges the importance: Is the CD review saying that the product is important? Yes? No? "Rather yes, but, well, I changed my mind about the contents of that review.", etc. So we want a fuzzy, time-dependent predicate Importance on denotators which parametrize jazz CD reviews. Here are the forms:
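$$CD \xrightarrow[Id]{} Limit(Label, Number, Year, Title) \qquad (18.18)$$
$$Label \xrightarrow[Id]{} Simple(\mathbb{Z}\langle ASCII \rangle) \qquad (18.19)$$
$$Number \xrightarrow[Id]{} Simple(\mathbb{Z}) \qquad (18.20)$$
$$Year \xrightarrow[Id]{} Simple(\mathbb{Z}) \qquad (18.21)$$
$$Title \xrightarrow[Id]{} Simple(\mathbb{Z}\langle ASCII \rangle) \qquad (18.22)$$
$$Review \xrightarrow[Id]{} Limit(CD, Media, Year, Number) \qquad (18.23)$$
$$Media \xrightarrow[Id]{} Simple(\mathbb{Z}\langle ASCII \rangle) \qquad (18.24)$$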
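For readers who think in terms of data structures, the limit forms CD and Review behave like record types, and a fuzzy, time-dependent Importance predicate can be kept apart from the review data. The following Python sketch is an illustration only: the class `Importance`, its methods, and the rule "use the latest judgment made on or before the queried date" are assumptions of this sketch, not constructions from the text.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class CD:
    """Record mirroring Limit(Label, Number, Year, Title)."""
    label: str
    number: int
    year: int
    title: str


@dataclass(frozen=True)
class Review:
    """Record mirroring Limit(CD, Media, Year, Number)."""
    cd: CD
    media: str
    year: int
    number: int


class Importance:
    """Hypothetical fuzzy, time-stamped predicate content on reviews."""

    def __init__(self):
        self._judgments = {}  # Review -> list of (date, certainty)

    def judge(self, review, when, certainty):
        if not 0.0 <= certainty <= 1.0:
            raise ValueError("certainty must lie in [0, 1]")
        self._judgments.setdefault(review, []).append((when, certainty))

    def value(self, review, when):
        """Latest certainty judged on or before `when`; None stands for 'false'."""
        past = [(d, c) for d, c in self._judgments.get(review, []) if d <= when]
        return max(past)[1] if past else None
```

The date lives in the judgment and not in the `Review` record, so the same review denotator can carry different truth values at different times.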
Under the Importance predicate, a review denotator rev may take truth values in $\mathbb{T}_{\mathbb{Z} \times S^1}$. This means that we attribute the fuzzy value in the module $S^1$ as above, but we also denote the date (on $\mathbb{Z}$) to fix the time-dependence of the judgment, which may change even if we concentrate on an individual expert. The rest is clear. Why is this time parameter inserted in the predicate and not in the denotator? Because it is a quality of the predication and not of the review. Of course, one could also have introduced the time parameter in the review form, but then, the predication would be hidden in the denotators instead of the truth values.

Fushi in Noh's Utai Chant. The Noh tradition (see [269] for the following facts) includes a sophisticated chant formalism which is titled "Utai", a triply articulated structure. We distinguish between melodic, dynamic, and speech articulation in the Utai. In this example, we shall concentrate on the melodic Utai which is described by a combination of melodic units, the "fushi", see figure 18.1.

Figure 18.1: An example of a Noh Utai song text (left column) with the fushi sequence (column to its right).

Each fushi (or combination of such) is a prescription to shape the melodic content of one text syllable in duration, pitch, and color. So fushis are paired with syllables which we codify by the 16-bit UNICODE codification (the Japanese-Chinese-Korean (JCK) character set, in this case). This is encoded in the
form

$$JCK \xrightarrow[Id]{} Simple(\mathbb{Z}\langle JCK \rangle) \qquad (18.25)$$
where we use the free $\mathbb{Z}$-monoid algebra over the JCK part of the UNICODE alphabet to encode text. The proper fushi part is built from the three fushi articulations into basic, special, and pitch-change segments. As it is not guaranteed that the special fushi signs are integrated in UNICODE, we better give a plain visual representation of those signs by a sufficiently fine binary encoding, on a $(k + l)$-bit basis, say, i.e., in the form

$$FushiPic \xrightarrow[Id]{} Simple(\mathbb{Z}_2^k \times \mathbb{Z}_2^l) \qquad (18.26)$$

with $k$ bits for the pixel color encoding and $l$ bits for the pixel number. This enables us to add a name to each fushi sign. However, the fushi part is not just one of these denotators but it has a recursive compound structure since fushis may be enriched by auxiliary sequences (strings or lists) of fushis for pitch and other specifications. For any form $F$ we may consider the circular list form where the entries are of form $F$:
$$List_F \xrightarrow[Id]{} Limit(F, ListEntry_F) \qquad (18.27)$$
$$ListEntry_F \xrightarrow[Id]{} Colimit(List_F, Count) \qquad (18.28)$$
$$Count \xrightarrow[Id]{} Simple(\mathbb{Z}) \qquad (18.29)$$
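The circular list form reads like a recursive data type: a list is a pair of an entry and a rest, and the rest is either another list or a terminating integer. The Python sketch below follows this reading under assumptions: the role of the Count coordinate is not specified here, so the sketch simply stores the list length in it, and lists are taken to have at least one entry. The helper names `from_items` and `to_items` are hypothetical.

```python
from dataclasses import dataclass
from typing import Generic, TypeVar, Union

F = TypeVar("F")


@dataclass(frozen=True)
class Count:
    """Mirrors Simple(Z); used here as the list terminator (an assumption)."""
    value: int


@dataclass(frozen=True)
class ListF(Generic[F]):
    """Mirrors Limit(F, ListEntry_F)."""
    head: F
    tail: "Union[ListF[F], Count]"  # mirrors Colimit(List_F, Count)


def from_items(items):
    """Build a nested ListF value from a non-empty Python sequence."""
    items = list(items)
    if not items:
        raise ValueError("this sketch assumes at least one entry")
    node = Count(len(items))
    for item in reversed(items):
        node = ListF(item, node)
    return node


def to_items(node):
    """Flatten a nested ListF value back into a Python list."""
    out = []
    while isinstance(node, ListF):
        out.append(node.head)
        node = node.tail
    return out
```

The recursion bottoms out because every concrete value is finite, even though the form definition itself is circular.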
Let us then codify the fushi sign strings in form
$$FushiSTRG \xrightarrow[Id]{} Syn(List_{FushiPic})$$

and then construct the fushi concept as a circular form as follows:

$$FSH \xrightarrow[Id]{} Limit(FushiSTRG, FushiOrnament) \qquad (18.30)$$
$$FushiOrnament \xrightarrow[Fin]{} Power(FSH) \qquad (18.31)$$
with the usual finite set identifier. We therefore have a fushi structure which is essentially the same as the MakroEvent structure discussed in 6.7. With this, we may finally complete the fushi structure associated with a syllable:

$$Fushi \xrightarrow[Id]{} Limit(Basic, Special, PitchChange) \qquad (18.32)$$
$$Basic \xrightarrow[Id]{} Syn(FSH) \qquad (18.33)$$
$$Special \xrightarrow[Id]{} Syn(FSH) \qquad (18.34)$$
$$PitchChange \xrightarrow[Id]{} Syn(FSH) \qquad (18.35)$$
where the three synonymous forms refer to the basic fushi (usually named sugu, hiki, mawashi), the special fushi (two types, hashiri, yari, the latter being a class of recursively
ornamented trill fushi), and thirdly: pitch-change fushis. Our construction terminates with the form of combination
$$JCKFU \xrightarrow[Id]{} Limit(JCK, Fushi)$$

which yields the units of the list form $List_{JCKFU}$ for the full fushized Utai text, and we may pose

$$Utai \xrightarrow[Id]{} Syn(List_{JCKFU}) \qquad (18.36)$$

whose denotators are the melodically articulated Utai texts. It is clear how to complete this construction in order to add all the philologically necessary information from the Noh tradition. In this context, we may define what are the existing melodic Utai songs by a PV predicate MelodicUtaiSequence which takes its values in some fuzzy (uncertainty of identification) $\mathbb{T}_{S^1}$ to be $\phi^{\wedge}$ for $0 \le \phi \le 1$ in the same sense as with the above example on absolute tempo.

18.3.3.3
Shifter Predicates
Summary. Organizing the non-lexical resources: the relevance of shifter predicates in production and reception of music and its science.
Until now, atomic predicates were drawn from lexical data, be they of notational primavista nature related to scores or of mathematical nature. There is another major source for the creation of predicates: deictic criteria. For such a predicate the extension may be different, depending on its user. Whereas in language, lexical signs dominate and determine a fixed ontology, the basic means of semiosis in music (and to some degree also in musicology) are shifters: being and becoming a musical object may be a result of individual decisions. Once a predicate has been introduced, it may be used just like the others; however, its mode of coming into existence will remain important. In accordance with communicative coordinates, shifters are differentiated relative to poietic and esthesic perspectives. The neutral level is omitted here since it excludes the deictic dimension by definition.

Poietic shifters typically come up when the composer decides to specify a determined set of denotator instances which will become the objects of the composition. For example, the choice of a composition's motivic germ $MyMotif : A \rightsquigarrow NoteGroup(N_1, \dots N_k)$ by a composer does not favor just any possible predicate loaded with the property of being a motif, i.e., verifying a predicate MyMotif/Motif; on the contrary, it is this particular motif that was chosen objectively by the composer and it is this choice that gives it a special position among the denotators of form NoteGroup and among the predicates encompassed within the predicate expressed by Motif. The expression corresponding to this very individuum could be denoted by ThisMotivicGerm, an expression that is evaluated to $\top$ if and only if we deal with this composer's concrete and unique choice of the germ. The predicate MyMotif/ThisMotivicGerm has exactly one affirmative answer and no possibility to be generated within a genuinely lexical environment.
The esthesic counterpart of creational shifters is the set of extensions considered by an analytical interest. Often, there is no lexical reason to consider a particular composition or a special chord; there is just scientific curiosity, or interest. In this context, a composition which we may identify by a simple character string denotator $S$ for the composition's name is marked by "!", i.e., we write $S!$ for the fact that there is a special interest in $S$. Let us terminate this subject by two remarks and a principle statement:
• Comparison of different truth value domains is eased by the concept of a morphism of textual semioses which is induced by morphisms between corresponding truth value modules. So at least in principle we have a canonical technique for comparing truths as they are modeled in specific and adequate contexts.
• Truth in the topos-theoretic sense is not meant to model some formal languages but to add contents to the poor truth value of absolute logic in order to target towards a possible fusion of logic and beauty, after the marriage of logic and geometry initiated by topos theory.
Principle 12 After all, it would be wonderful if one day, we could say that something is true because it is beautiful. The feeling of such an insight has been around for a long time; let us finally face its substance.
18.3.4
Logical and Geometric Motivation
Summary. Analysis of the motivation mechanism for textual predicates. Besides the classical logical "recombination" of predicates, music preconizes more geometric methods in order to produce new predicates from given ones. Motivation is the technical counterpart of productive navigation in the EncycloSpace.
We announced that predicates are mainly generated by certain constructions from arbitrary basics such as mathematical, PV, and shifter predicates. Recall that semiotically, constructing the signification of signs by a determined mechanism from given significations is called motivation (see 2.3.4). So we now have to deal with motivation from the three named (arbitrary) basics. One must, however, point out a major difference between predicative existence and mathematical fiction in the motivation process. The construction (motivation) of new denotators from given ones is a purely mathematical routine process: it simply has to fit in the generic mathematical framework of soundness. But with predicates, generating new denotators which "are the case" from already instantiated facticity is not only an abstract affair: these objects have to be the case for us, they have to be available, on a storage media, in a determined reservoir of shared information, in other words, predicates are the elements of knowledge in the sense of ordered access to information. So accessibility of the predicates' extensions is a serious requirement. Let us look at this condition in the different arbitrary basics. Start with the arbitrary PV predicates. They mean music(ologic)al facticity. They have extensions somewhere, and this somewhere can be accessed without restrictions, it is not only as if it existed, it is within our reach. So the existing sonatas of Beethoven, the existing Noh
Utai, the existing jazz compositions of Duke Ellington are a knowledge base which is strictly antagonistic to possible, but never conceived music works.

Principle 13 If we are going to extend this level of facticity, we should meet the requirement that the knowledge ontology is conserved and not evaporated into fiction of whatever nature.

The mathematical predicates are a bit more delicate. We contended above that these are introduced to indicate that there is a mathematical predication which is music(ologic)ally relevant. Nonetheless, such a predicate may have an infinite extent. So it is a fact of music but not necessarily a finite one. The point is that this differentiation from purely mathematical existence has to be combined with the requirement of principle 13. More precisely:

Principle 14 If a mathematical predicate is conceived, it must be set up to cope with the accessibility requirement of knowledge. This accessibility is guaranteed by an explicit declaration of its characteristic function, and not necessarily by the predicate's extension which could be infinite or simply too large for any human storage device. If such mathematical predicates are used to generate extensions, they must meet the specific storage conditions of the medium, be it a technological or a human one (memory).

In the case of deictic predicates, the facticity is more a question of declaration discipline: We do not want to deal with uncertainty in the choice of predicate extension instances. Rather, it would be important to say that you have to attribute to a determined denotator a fuzzy truth value instead of saying that you do not know whether you want to consider that denotator or not. So this means
Principle 15 Deictic predicates must be understood as the duty to declare facticity versus the freedom of leaving your decision unsettled. The latter is not part of the predicate system but of the psychology of the decision process in its making. This is not a proper part of the predicate theory which is a precise theory of fuzziness in music and not a properly fuzzy theory.

Summarizing, we want to construct new predicates from already given ones in the sense of a consistent extension of knowledge ontology:

Principle 16 The explicit and unrestricted access to information must be declared on the germinal level of arbitrary predicates and conserved in each motivation process.

After these preliminary reflections on knowledge propagation, we may proceed to the description of motivation mechanisms. Basically, two levels of construction can be distinguished: the logical and the geometric motivation.

Logical motivation means that new predicates are constructed from given ones by use of the codomains of characteristic functions. In fact, the topos-theoretic truth domains $\mathbb{T}_I$ are provided with the logical functionality of the associated Heyting algebras, such as conjunction, disjunction, implication, and negation. This type of logical operation extends to universal and existence quantifiers.

Geometric motivation acts on the domain of characteristic functions. This means that the topos-theoretic constructs such as fiber products, fiber coproducts, power objects and other
universal objects are used to build new denotator objects in $Den^\infty$ on which truth values are applied. Of course, nothing can prevent us from applying these methods in a not so topos-theoretic manner or from introducing still other motivation methods. This is no problem; the scope of these delimitations is rather to put into evidence the richness of the purely topos-theoretic vein of motivation. Before giving further comments on these motivation methods, we should present some easy typical examples.
Logics: conjunction, disjunction, implication, and negation. This type is completely straightforward. We suppose given two (named) predicates $P, Q$ which have their truth values in the same domain $\mathbb{T}_I$ of truth denotators. Then the logical combination $P \wedge Q$ evaluates as follows. Let $P(f) = p : A \rightsquigarrow TRUTH(I)(\pi)$ and $Q(f) = q : A \rightsquigarrow TRUTH(I)(\kappa)$ for an object $f$ of $Den^\infty$. Then we set

$(P \wedge Q)(f) = p \wedge q : A \rightsquigarrow TRUTH(I)(\pi \wedge \kappa)$

where $\pi \wedge \kappa$ is calculated in the Heyting algebra of the functor $A@Fun(TRUTH(I))$. The analogous construction works for the other logical operations. (If we deal with different truth denotator domains, we have to be provided with a common domain, for example by use of colimits such as fiber sums.)

Despite the evident procedure there is a delicate point in these constructions. If we look for the extensions of predicates and suppose that the extensions of the given predicates are traced on some human or man-made data storage or memory devices, it is not clear whether the same may also be the case for the resulting logical combinations. For disjunction and conjunction this works since the extensions are, roughly speaking, unions or intersections of the given extensions. For negation, however, the resulting predicate may obtain an infinite, not controllable extension. In other words, negation of facticity is not necessarily the same type of facticity. In fact, what can be conserved in facticity is not the extension but the operation which takes a decision on every proposed tuple of denotator morphisms.
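The pointwise character of logical motivation can be illustrated by a minimal sketch. It assumes a toy truth domain, the three-element chain 0 < 1/2 < 1, in place of the truth denotator domains of the text, and a finite stand-in universe of integers in place of denotators; all names (`conj`, `disj`, `negation`, `support`, `P`, `Q`) are hypothetical.

```python
from fractions import Fraction

# Toy truth domain (assumption): the chain 0 < 1/2 < 1, a small complete Heyting algebra.
BOTTOM, HALF, TOP = Fraction(0), Fraction(1, 2), Fraction(1)

def meet(a, b):
    return min(a, b)

def join(a, b):
    return max(a, b)

def implies(a, b):
    # Heyting implication on a chain: the largest c with min(a, c) <= b.
    return TOP if a <= b else b

def neg(a):
    return implies(a, BOTTOM)

def conj(P, Q):
    # Logical motivation acts on the codomain: combine truth values pointwise.
    return lambda f: meet(P(f), Q(f))

def disj(P, Q):
    return lambda f: join(P(f), Q(f))

def negation(P):
    return lambda f: neg(P(f))

def support(P, universe):
    """Objects on which the predicate does not evaluate to the bottom value."""
    return {f for f in universe if P(f) != BOTTOM}

# Hypothetical predicates on a finite stand-in universe 0..9.
universe = range(10)
P = lambda f: TOP if f in {1, 2} else BOTTOM
Q = lambda f: HALF if f in {2, 3} else BOTTOM
```

In this toy setting the support of the conjunction is the intersection of the supports and the support of the disjunction is their union, whereas the support of the negation of `P` is everything outside the support of `P`; over an infinite universe it would no longer be a storable extension, which is the delicate point noted above.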
Logics: Existence and universal quantifiers. In the existence and universal quantifier situation, typically involving a predicate $P(x, y)$ with two free variables $x, y$, these two logical operators, $\exists x P(x, y)$ and $\forall x P(x, y)$, invoke a variable, $x$, say, which must be run through and checked for truth of the given predicate. In our situation, this check will not be a dichotomic one between true and false, but an 'integration' over all values of $P(x, y)$ as $x$ varies. The statement $\exists x P(x, y)$ then reads as the maximal possible truth value of all $P(x, y)$ as $x$ varies, so if there is at least one truth value T, then we assert that $\exists x P(x, y)$ has this value. Analogously, the statement $\forall x P(x, y)$ reads as the minimal possible value among all $P(x, y)$ as $x$ varies.

To put it more formally, suppose that a predicate $P$ of a textual semiotic evaluates on every $k$-tuple $f = (f_1, \dots f_k)$ of denotators to a truth denotator $P(f) \in \mathbb{T}_I$. Fix an index $1 \leq i$ of variable at these tuples. Then the quantifiers are defined with respect to this denotator index. To this end, if $X \subset \mathbb{T}_I$, denote by $lub(X)$ its least upper bound, whereas $glb(X)$ denotes the greatest lower bound of $X$ in the complete Heyting algebra $\mathbb{T}_I$. Further, if $d \in Den$, $1 \leq i \leq k + 1$, and if $f = (f_1, \dots f_k) \in Den^k$ is a $k$-tuple, write $f \triangleright_i d = (f_1, \dots f_{i-1}, d, f_i, \dots f_k)$. Then the existence quantifier is the predicate $\exists_i P$ which at the $k$-tuple $f$ evaluates to

$\exists_i P(f) = \begin{cases} lub(\{P(f \triangleright_i d) \mid d \in Den\}) & \text{if } 1 \leq i \leq k + 1, \\ P(f) & \text{if } k + 1 < i, \end{cases}$ (18.37)

whereas the universal quantifier $\forall_i P$ at the $k$-tuple $f$ evaluates to

$\forall_i P(f) = \begin{cases} glb(\{P(f \triangleright_i d) \mid d \in Den\}) & \text{if } 1 \leq i \leq k + 1, \\ P(f) & \text{if } k + 1 < i. \end{cases}$ (18.38)
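The two case definitions (18.37) and (18.38) can be sketched as follows. This is only an illustration under assumptions: truth values are numbers in the chain [0, 1], so that lub and glb over a finite set are `max` and `min`, and the denotator universe is a small hypothetical tuple `DEN`; the predicate `below` is invented for the example.

```python
def insert_at(f, i, d):
    """f |>_i d: insert d as the i-th entry (1-based) of the tuple f."""
    return f[: i - 1] + (d,) + f[i - 1 :]

def exists(P, i, den):
    """Existence quantifier: lub over all insertions at position i, P(f) if i > k + 1."""
    def quantified(f):
        if 1 <= i <= len(f) + 1:
            return max(P(insert_at(f, i, d)) for d in den)
        return P(f)
    return quantified

def forall(P, i, den):
    """Universal quantifier: glb over all insertions at position i, P(f) if i > k + 1."""
    def quantified(f):
        if 1 <= i <= len(f) + 1:
            return min(P(insert_at(f, i, d)) for d in den)
        return P(f)
    return quantified

# Hypothetical finite universe of "denotators" (three pitches).
DEN = (60, 62, 64)

def below(f):
    """Graded truth of 'first entry lies below second entry' on pairs; 0 on other tuples."""
    if len(f) != 2:
        return 0.0
    x, y = f
    return 1.0 if x < y else 0.5 if x == y else 0.0
```

For instance, `exists(below, 1, DEN)((62,))` runs the bound first variable through `DEN` and returns the largest value found, while `forall(below, 1, DEN)((62,))` returns the smallest. Both loops are only feasible because `DEN` is finite, which is the controllability condition discussed next.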
Observe that the calculation of the lub and the glb needs only the set $supp(P)$. So if this one is so small that a computer or human memory can control it, the quantifier predicates can be calculated at a comparably small expense. If this support is not finite, or if it is too large to be stored on a memory device, the range of the bound $i$th variable $d$ must be restricted to a controllable predicate or else the predicate $P$ must be joined to a predicate $Q$ such that $supp(P \wedge Q)$ becomes controllable by the available memory. In this situation, we write $\exists_{i,Q}P$, $\forall_{i,Q}P$. We cannot solve the details of this problem of facticity here, but we feel obliged to insist on the relevance of its investigation because abstract mathematical reality is not sufficient for predicates to be handled by human technology and culture. So one should in any case point out the delicate turning points.

Geometry: fiber products. Geometric motivation implies predicate construction methods which are built on the domain $Den^\infty$ of characteristic functions. We want to illustrate this for fiber products on denotators, a universal construction which, modulo some circularity traps, is known to exist for all five typed denotator subcategories (see section 18.3.1).
We take two mathematical predicates having their support on $_0Den$ (see section 18.3.3.1):

Chords $Ch/PianoChord$, (18.39)

Melodic motifs $Mt/Motif$, (18.40)

and a PV predicate OnBeetSon which takes value T ∈ ']['8 exactly for each zero-addressed local composition $C$ consisting exactly of all onsets of the onset-bearing events of one of the 32 Beethoven sonatas (and ⊥ else), so we are looking for the predicative support objects

$C/OnBeetSon$. (18.41)

To relate these three predicates, we consider a mathematical predicate OnsetProj supported on ']['8 with T by morphisms of local compositions $f : X \to Y$ such that $X$ has form NoteGroup, $Y$ has form Onset, and $f$ is the projection morphism retaining the note events' onsets.
To relate the different truth denotator domains, we have to reduce the motif domain $\mathbb{T}_I$ to ']['8, but this is straightforward by the unique projection $A \to 0$, which yields $\mathbb{T}_I \to$ ']['8, and this is what we carry over from the truth values for motifs. Finally, the geometrically motivated predicate BeetMotChordFiber has its T support on octuples $f = (X_1, X_2, X_3, X_4, f_1, f_2, f_3, f_4)$ where the first four objects are veritable objects in Den (identity morphisms), and the second four are morphisms of Den. The value T in ']['8 is taken on all octuples of cartesian diagrams (in ObLoc)

[cartesian square on $X_1, X_2, X_3, X_4$ with the morphisms $f_1, f_2, f_3, f_4$] (18.42)

such that

$X_2/Motif$, $X_3/PianoChord$, $X_4/OnBeetSon$, (18.43)

$f_3/OnsetProj$, $f_4/OnsetProj$ (18.44)
are all true. This construction can be decomposed as follows: The predicate BeetMotChordFiber on octuples is built as a logical conjunction of several projection predicates. A projection predicate is a geometric motivation as follows: Given an increasing sequence of positive natural numbers $j. = j_1, \dots j_s$, denote by $\pi_{j.}$ the natural transformation $Den^\infty \to Den^\infty$ which on a $k$-tuple $f = (f_1, \dots f_k)$ evaluates to

(18.45)

Then the projection $j.$th predicate $P.\pi_{j.}$ deduced from a given predicate $P$ is defined by

$P.\pi_{j.}(f) = P(\pi_{j.}(f))$.

This is an easy geometric motivation built upon this projection. We may then consider the mathematical predicate Cart of cartesian diagrams which gives T on octuples $f = (X_1, X_2, X_3, X_4, f_1, f_2, f_3, f_4)$ if they yield a cartesian square (18.42). So our above predicate reads as a logical motivation from projection motivation via

$BeetMotChordFiber(f) = Cart \wedge Motif.\pi_2 \wedge PianoChord.\pi_3 \wedge OnBeetSon.\pi_4 \wedge OnsetProj.\pi_7 \wedge OnsetProj.\pi_8 (f)$

which means that we have built this predicate by a (logical) conjunctive motivation and a (geometric) projection motivation from mathematical and PV predicates. Strictly speaking, this second construction of BeetMotChordFiber is a synonymous predicate: It should have another expression, but its content is the same as the first one's content; in fact, synonymy of predicates is a common feature as in every semiotic system.
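The pattern "conjunction of projection predicates" can be sketched with two-valued predicates on plain tuples. The sketch assumes that $\pi_{j.}$ simply keeps the entries with indexes $j_1, \dots j_s$ (equation (18.45) is not legible in this copy, so this reading is an assumption), and all predicates and data below are hypothetical stand-ins, not the musical predicates of the text.

```python
def project(js):
    """Assumed reading of pi_j.: keep the entries f_{j_1}, ..., f_{j_s} (1-based)."""
    def pi(f):
        return tuple(f[j - 1] for j in js)
    return pi

def pull_back(P, js):
    """Projection predicate P.pi_j.: evaluate P on the projected tuple."""
    pi = project(js)
    return lambda f: P(pi(f))

def conjunction(*preds):
    """Logical motivation: two-valued conjunction of predicates on the same tuples."""
    return lambda f: all(P(f) for P in preds)

# Hypothetical stand-in predicates on short tuples of integers.
is_even = lambda t: t[0] % 2 == 0       # predicate on 1-tuples
is_small = lambda t: abs(t[0]) < 10     # predicate on 1-tuples
increasing = lambda t: t[0] < t[1]      # predicate on pairs

combined = conjunction(
    pull_back(is_even, (2,)),
    pull_back(is_small, (3,)),
    pull_back(increasing, (1, 3)),
)

def first_component_predicate(P, candidates):
    """True on X iff some candidate tuple f has first entry X and P(f) is true."""
    return lambda X: any(P(f) for f in candidates if project((1,))(f) == (X,))

candidates = [(1, 4, 7), (1, 3, 7), (8, 4, 7)]
object_predicate = first_component_predicate(combined, candidates)
```

The helper `first_component_predicate` mirrors the step from a predicate on whole tuples to a predicate on their first entry alone; it is only feasible here because the list of candidate tuples is finite and given explicitly.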
Now, the predicate BeetMotChordFiber is not quite what we have in mind: We would like to speak about the fiber product as a local composition, and not about the whole bunch of auxiliary structures. So the final construction would be this:

$BeetMotChordFiberObject(X) = T$ iff there is an octuple $f$ with $\pi_1(f) = X$ and $BeetMotChordFiber(f) = T$.

Exercise 40 Try to unify the above existence condition with the logical motivation via the existence quantifier as two special cases of a more general logical existence construction. (Hint: use more general projections.)

So we finally get a predicate which selects local compositions (in fact chords) consisting of piano notes which share the onset of a note of one motif and of a chord within the onsets of one of Beethoven's sonatas. But we should, once again, be aware that the verification of the cartesian predicate is a mathematical operation of complex nature. It involves mathematical existence and universal quantifiers for the verification of a universal property. The operationalization of such predicates on the basis of computer implementation can be quite tedious since it always must be solved on a constructivist basis.
18.4
Paratextuality
Summary. Beyond textual semiosis, a type of predicate semiosis can occur which is not limited to extensions over denotators.
Besides textual semiosis, expressions of predicates can also (or exclusively) invoke historical and stylistic competence, for example, and thus point at exterior strata of signification. However, this does not enforce physical reality; it can still live on the mental level of signification. Classification of paratextual signification includes reproductive and productive behavior. Reproductive paratextual predicates include performance, lyrics, expressive, choreographic, and musicological aspects. Productive paratextual predicates can be of lexical nature, such as ornaments or improvisational patterns, or else of shifting character, such as creational and unintentional predicates. For a more systematic treatment of this subject, we have to refer to [361, 2.6].

It is however important to give a short account of the germ of paratextuality as it is prefigured by the very nature of truth denotators in the textual semiosis. In fact, we have already learned that the classical logic of extensionality is canonically embedded in the truth denotator domains $\mathbb{T}_I$, which means that extensionality and truth evaluation are only a very 'poor' signification codomain. It is natural to view this situation as a special case of a more general validation operator process $\omega$ where a validation just gives us back a denotator for every given tuple, i.e., we have a signification process

$\omega : Den^\infty \to Den$
which attributes some values to tuples, values which share a proper extratextual semantics, such as 'truth' or 'falsity'. For example, such an operator is given for the evaluation of certain PV predicates, such as "Fermata": An operator $\omega$ would turn this abstract denotator myFermata (which essentially has the coordinates onset and duration) into a tempo curve $\omega$(myFermata) which may be described by a parameter set for a spline, or by a string which represents a mathematical expression for that tempo curve, etc. This operator process does not yet produce physical contents, however, but it gives an explicit representation of such objects in physical terms. So the generalized textuality which evaluates to not necessarily truth-oriented denotators yields a connotation stratification for deeper semantics beyond text of whatever abstraction. This successive absorption of semantics by denotators is not only a methodological mechanism, it is already a tendency in music history where contents of musical signs have always been progressively split into connotational substrata which in turn have been absorbed by conceptualization and music theory [122].
Chapter 19
Topoi of Music

This is probably already the mathematics of the "New Age".
Alexander Grothendieck [200] on "Geometrie der Töne" [340]
Summary. This chapter is a conceptual synthesis of the previous achievements. We show that the overall structure of the category Glob of global compositions carries a Grothendieck pretopology via finite covering families. It is well known that such a pretopology generates a Grothendieck topology $J$ and therefore a Lawvere-Tierney topology $j$ on the presheaf topos Glob@. We discuss the associated instances, such as the subobject classifier sheaf $\Omega$, and the subtopos $Sh(J, Glob)$ of sheaves.

In this chapter we tacitly make use of the topos theory of Grothendieck topoi; see appendix G.4.
19.1
The Grothendieck Topology
Summary. This section introduces the Grothendieck topology on Glob via covering families. Musical motivation is given.

G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002

The finiteness of common local and global compositions is not favorable to intuition from standard topology, but we have seen the application of traditional methods from combinatorial topology and corresponding Čech cohomology for classification purposes of musical compositions in chapter 15. However, this is not the full potential of topological methodology: The latter has been developed with great success by Grothendieck in algebraic geometry, and applied, among others, by Pierre Deligne in his celebrated solution of the Weil conjectures, see also [209]. The idea is now fully absorbed by topos theory in the context of sheaf construction from presheaf categories. Here, the yoga of topology boils down to the gluing catechism which gives insights into the compatibility of local data without having to deal with the original intuitive neighborhood connotations from Euclidean geometry. Therefore it is a natural idea to integrate the Grothendieck topos construction which generalizes the Čech canon for the analysis of musical composition categories, or even of denotator categories provided they admit fiber products, a mandatory condition for the definition of Grothendieck topologies via covering families (see appendix G.4). To begin with, we have the following theorem:
Theorem 25 The category Glob of global compositions is finitely complete.

Proof. We have to show¹ that Glob has fiber products and a final object. The latter is already known from section 8.3.3 in the local case, but for global compositions, the same object does the job. The former claim is known for local compositions, and the fiber product of the underlying functors is known to exist. So the proof reduces to a control of the atlas index maps and the gluing of local fiber products. The fiber product of the index sets and their maps defines a system of subfunctors of the fiber product of the functors which underlie the global compositions. The union of these subfunctors is therefore covered by charts which are isomorphic to fiber products of local compositions and which clearly glue together as desired. QED.

¹ See appendix G, proposition 99.

Exercise 41 Show that the fiber product of interpretable compositions is interpretable.

Despite this positive result, this is not the right perspective to introduce Grothendieck topologies. In fact, consider the fiber product of two charts $U_1, U_2$ in a global composition $G^I$. Since their index singletons inject into different indexes of the covering set, the fiber product $U_1 \times_G U_2$ is empty! The reason for this strange fact is that we had, and still have, serious musicological reasons for specifying the index map $\iota$ as well as the associated family $\alpha$ of address changes of a morphism $f^\iota/\alpha$. So this is a strong dichotomy between the musicological setting of global compositions and the traditional setting with mathematical manifolds, where the atlas is only an existence and not a uniqueness requirement. If we stick to the musicological categories, such as Loc, Glob, we cannot apply those powerful tools of Grothendieck pretopologies as we would like to do: even the intersection of two charts would be intractable qua fiber product. So the approach for this conceptual environment must be slightly weakened: it is reasonable to forget about the index and address change maps and to retain only the transformation $f$ between sets or functors, respectively. In other words, we consider the categories μLoc, μGlob, μObLoc, μObGlob as follows (μ for "mathematical"). The objects are those of the old categories Loc, Glob, ObLoc, ObGlob. The morphisms are just the maps of the underlying functors (or sets in the objective case). So we have, by definition, surjective "forgetful" functors

μ : Glob → μGlob (19.1)
μ : Loc → μLoc (19.2)
μ : ObGlob → μObGlob (19.3)
μ : ObLoc → μObLoc (19.4)

(and the other known subcategories) which are all defined by the action $\mu(f^\iota/\alpha : G^I \to H^J) = (G^I, H^J, f : G \to H)$, which we shall also write as $f : G^I \to H^J$, and which are all the identity on the objects. We shall denote the image of $Hom(G^I, H^J)$ by $G^I@H^J$, the set of mathematical morphisms, as already introduced in definition 50 of section 14.4, to make the reference to the manifold mathematics evident. So intuitively, a mathematical morphism is just a morphism of compositions where we forget about the underlying chart index and address change information; it is just known that they are there, as in manifold theory. If we restrict to the subcategories $Loc_A$ over a fixed address and without address changes, then evidently, we have an isomorphism $\mu : Loc_A \xrightarrow{\sim} \mu Loc_A$, whereas the restriction to the global compositions $Glob_A$ with fixed address $A$ gives us as a fiber of a morphism $f$ all index maps which give rise to $f$. More precisely:
Proposition 22 Let $f^\iota/\alpha : G^I \to H^J$ be a morphism of global compositions. Let $\kappa : I \to J$ be any set map such that the $H$-chart $V_{\kappa(i)}$ contains $im(U_i)$, for every $i \in I$. Then $f^\kappa/\alpha$ is a morphism.

Proof. If $im(U_i) \subset V_{\kappa(i)} \cap V_{\iota(i)}$, then we may apply the transition isomorphism between the intersection of the two charts $V_{\kappa(i)}, V_{\iota(i)}$ to induce a morphism of local compositions $U_i \to V_{\kappa(i)}$ with the same address change. QED.

Observe that the general transformation $\mu$ does not reflect isomorphisms. In fact, if $G^I \to G$ is a refinement map for a local composition $G$, as discussed in example 23, section 14.2, then the identity $Id_G : G^I \to G$ of the underlying functor $G$ is an isomorphism in μGlob, but generically the objects are evidently not isomorphic qua global compositions if $card(I) > 1$. Before introducing a Grothendieck topology on μGlob, we have to guarantee the existence of fiber products. Here is the theorem:

Theorem 26 The category μGlob of global compositions and mathematical morphisms is finitely complete.

Proof. Evidently, the final object of Glob also does the job for μGlob. So we are left with the proof that μGlob has fiber products. We are given a pair of mathematical morphisms $f : G^I \to H^J$, $g : L^K \to H^J$. As these are natural transformations we may consider the fiber product $G \times_H L$ of functors with its canonical projections to $G$, $L$. This functor can be covered by the non-empty fiber products $P_{i,k} = G_i \times_H L_k$ of the chart subfunctors $G_i$, $L_k$ in $I \times K$. Let $(G_i, L_k) \in I \times K$ be such that $G_i \times_H L_k$ is not empty. Take $J$-charts $H_{\iota(i)}$, $H_{\kappa(k)}$ such that $f$ on $G_i$ factorizes through $H_{\iota(i)}$, and such that $g$ on $L_k$ factorizes through $H_{\kappa(k)}$. Then evidently, the image of $P_{i,k}$ in $H$ is contained in $H_{i,k} = H_{\iota(i)} \cap H_{\kappa(k)}$, and we have $G_i \times_H L_k \cong G'_i \times_{H_{i,k}} L'_k$, where $G'_i$, $L'_k$ are the inverse images of the intersection $H_{i,k}$ in $G_i$, $L_k$, respectively. By the existence of fiber products of local compositions, these fiber products are local compositions, and we obtain an evident atlas together with two mathematical morphisms of the fiber product functor (which is a global composition) into the factors.
The details are left as an exercise for the reader. QED.

To make the ideas precise, the present approach to Grothendieck topology will be focused² on the category μGlob. We consider the following system of covering families: For a global composition³ $G$, the covering basis $K(G)$ consists of all surjective finite families $(f_i : G_i \to G)_i$, i.e., their functorial images generate the functor $G$. Clearly, isomorphisms are such families, fiber products of such families are such families, and the compositions of such families are again finite and surjective. Therefore we have a basis (originally called a pretopology) for a Grothendieck topology. Call the Grothendieck topology $J$ of these covering families the finite cover topology on μGlob.

² Generalizations to other denotator categories are possible but we do not have concrete concepts and leave this subject to future developments.
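The two basic properties of such covering families, joint surjectivity and stability under fiber products, can be checked in a deliberately simplified model. The sketch below assumes finite sets in place of global compositions and dictionaries in place of mathematical morphisms; the names `is_covering`, `pullback`, `f1`, `f2`, `g` are hypothetical.

```python
def is_covering(family, target):
    """A finite family of maps (dicts) into `target` covers it if it is jointly surjective."""
    return set().union(*(set(f.values()) for f in family)) == set(target)

def pullback(f, g):
    """Fiber product of two maps into the same set, given by its projection to dom(g)."""
    return {(x, y): y for x in f for y in g if f[x] == g[y]}

# Hypothetical covering family of the three-point set G.
G = {1, 2, 3}
f1 = {"a": 1, "b": 2}
f2 = {"c": 2, "d": 3}

# A further map g : H -> G; the family is pulled back along g.
g = {"u": 1, "v": 3, "w": 3}
pulled = [pullback(f1, g), pullback(f2, g)]
```

In this toy model the family `[f1, f2]` covers `G`, the single map `f1` does not, and the pulled-back family again covers the domain of `g`, which is the stability property required of a basis.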
Example 37 The interpretation $G^I$ of a local composition $G$ by a finite covering $I$ of subcompositions is in $K(G)$.

Example 38 The family of local subcompositions of a defining finite atlas of a global composition is a covering family.

Example 39 More generally, if we add all finite intersection compositions as considered in the theory of affine functions on the composition's nerve, we obtain another covering family.

So musicologically, the finite cover topology is the "least common denominator" of all our previous local-global constructs. In other words, all hitherto known concepts and techniques for gluing together finite assemblies of local data in music and music theory are special cases of the overall approach by the finite cover topology. This also applies to the Čech cohomology of the nerve which we have briefly discussed in section 16.1.2.
Remark 4 The finite cover topology is not subcanonical (this would mean that every representable presheaf $@X$ over μGlob is a sheaf). The point is that the exactness condition for sheaves only yields a natural transformation from a family of compatible mathematical morphisms, but not (in general) a mathematical morphism. Possibly, a narrower definition of covering families would yield subcanonical topologies.
Example 40 Suppose we are given a predicate $P$ with values in $\mathbb{T}_I$ and support on the local compositions, to fix the ideas. Consider this data as a function $t$ on Loc. We would like to be able to extend this function to the global compositions. To this end, suppose that this predicate is invariant under isomorphisms of local compositions. As the truth denotator domain is a complete Heyting algebra, we may attribute to a global composition $G$ with charts $G_i$ the truth values $t(G) = sup_i(t(G_i))$ or else $t(G) = inf_i(t(G_i))$ which are well defined by our invariance assumption and do extend the values on local compositions.
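A minimal sketch of these two extensions follows, under strong simplifying assumptions: charts are modeled as finite sets of pitch classes, truth values as numbers in the chain [0, 1], and the local predicate `t` depends only on the cardinality of a chart, so that it is invariant under bijections of charts. All names and data are hypothetical.

```python
def t(chart):
    """Hypothetical local predicate: graded truth of 'the chart has three elements'."""
    return {3: 1.0, 2: 0.5}.get(len(chart), 0.0)

def extend_sup(local_t, charts):
    """t(G) as the supremum of the local values over the charts of G."""
    return max(local_t(c) for c in charts)

def extend_inf(local_t, charts):
    """t(G) as the infimum of the local values over the charts of G."""
    return min(local_t(c) for c in charts)

# A toy "global composition" given by its finite list of charts.
charts = [frozenset({0, 4, 7}), frozenset({7, 11})]
single = [frozenset({0, 4, 7})]
```

On a composition with a single chart both extensions return the local value, so they do extend `t`; on several charts the supremum reads as "some chart satisfies the predicate" and the infimum as "every chart does".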
19.1.1
Cohomology
Summary. We sketch the topos-theoretic Čech cohomology associated with the finite cover topology $J$ on μGlob.
³ Pay attention to the different morphism set of mathematical morphisms whereas the objects are unchanged, even if we often omit the covering atlas to ease notation!
Following the ideas of Verdier's exposition [22, exposé V], every covering family $f. = (f_i : G_i \to G)_{i \in I}$ induces the following simplicial diagram. Let $n(f.)$ be the simplicial complex of all subsets $S \subset I$ with non-empty fiber products $\prod_{i \in S} G_i$ over $G$. For a natural number $0 \leq k$, denote by $S(f.)_k$ the set of singular $k$-simplexes, i.e., maps $s : \Delta_k \to I$ on the abstract standard $k$-simplex $\Delta_k = \{0, 1, 2, \dots k\}$ such that $im(s) \in n(f.)$. For such a singular $k$-simplex $s$, we write $\prod_s(f.) = \prod_{i = 0, \dots k} G_{s(i)}$ for the corresponding fiber product over $G$, and $\Sigma_k(f.) = \coprod_{s \in S(f.)_k} \prod_s(f.)$. Every set map $q : \Delta_k \to \Delta_l$ induces a universal map

$\Sigma(q) : \Sigma_l(f.) \to \Sigma_k(f.)$ (19.5)

which on a cofactor $\prod_s(f.)$ of $\Sigma_l(f.)$ at the singular $l$-simplex $s$ is the arrow

$\prod_s(f.) \xrightarrow{x} \prod_{s.q}(f.) \to \Sigma_k(f.)$

where $x$ is the universal arrow defined by the respective projections. In particular, we have the arrows $\sigma_i = \Sigma(q_i)$ with $q_i : \Delta_k \to \Delta_{k+1}$ the omission of the $i$th entry for $0 \leq i \leq k + 1$, i.e., $q_i(j) = j$ if $j < i$, and $q_i(j) = j + 1$ else. Finally, we also consider the augmentation map $\sigma_0 : \Sigma_0(f.) \to G$ given by the coproduct of the original family $f.$ of morphisms into $G$. Summarizing, we get the simplicial diagram

$\Sigma(f.) = \quad \cdots \; \Sigma_2(f.) \xrightarrow{\sigma_0, \sigma_1, \sigma_2} \Sigma_1(f.) \xrightarrow{\sigma_0, \sigma_1} \Sigma_0(f.) \xrightarrow{\sigma_0} G$

of global compositions which is the basis of the Čech cohomology if we succeed in applying contravariant functors with values in abelian groups to this diagram. Suppose that we are given such a functor $F$, defined in a category which comprises the objects and arrows of the simplicial diagram $\Sigma(f.)$. Then we have the cochain complex

$C^*(f., F) = \quad F(\Sigma_0(f.)) \xrightarrow{d_0} F(\Sigma_1(f.)) \xrightarrow{d_1} F(\Sigma_2(f.)) \xrightarrow{d_2} \cdots$ (19.6)

with the alternating sums

$d_k = \sum_{i = 0, \dots k+1} (-1)^i F(\sigma_i)$

of abelian group homomorphisms for indexes $0 \leq k$ and $d_{-1} = 0$. This cochain complex has its usual cohomology groups $H^k(f., F) = Ker(d_k)/Im(d_{k-1})$ for $0 \leq k$.
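The construction can be traced in a toy model. The sketch assumes a finite cover of a finite set by subsets (so fiber products are intersections), takes for $F$ the hypothetical functor of integer-valued functions on the points of $\Sigma_k$, and represents a singular $k$-simplex as a tuple of chart indexes; none of this is claimed to be the book's implementation.

```python
from itertools import product

def singular_simplexes(cover, k):
    """Maps s : {0..k} -> I (as tuples) whose charts share a point, with that intersection."""
    out = []
    for s in product(sorted(cover), repeat=k + 1):
        common = set.intersection(*(set(cover[i]) for i in s))
        if common:
            out.append((s, common))
    return out

def sigma_k(cover, k):
    """Points of the coproduct Sigma_k: pairs (s, x) with x in the fiber product at s."""
    return [(s, x) for s, common in singular_simplexes(cover, k) for x in sorted(common)]

def coboundary(cover, k, c):
    """d_k c = sum_i (-1)^i F(sigma_i) c for an integer cochain c (a dict on Sigma_k)."""
    d = {}
    for s, x in sigma_k(cover, k + 1):
        total = 0
        for i in range(k + 2):
            face = s[:i] + s[i + 1:]  # s composed with q_i: omit the i-th entry
            total += (-1) ** i * c[(face, x)]
        d[(s, x)] = total
    return d

# Hypothetical cover of {1, 2, 3} by three charts.
cover = {"a": {1, 2}, "b": {2, 3}, "c": {3}}

# An arbitrary 0-cochain, and the 0-cochain coming from a globally defined function.
c0 = {p: 3 * n + 1 for n, p in enumerate(sigma_k(cover, 0))}
glob = {1: 5, 2: 7, 3: 9}
c_glob = {(s, x): glob[x] for s, x in sigma_k(cover, 0)}
```

Applying `coboundary` twice to `c0` yields the zero cochain, which is the complex property behind the cohomology groups, and `c_glob` lies in the kernel of $d_0$: in this toy model the 0-cocycles are exactly the families of local values that glue to a function on the covered set.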
Example 41 Affine functions. In section 15.1.1, we have dealt with the category $ComGlob_A$ of commutative global compositions where the address $A$ is a module over a commutative ring $R$. We now want to look at affine functions on functorial global compositions. More precisely, we work in the following category ${}_RGlob$: instead of $Mod^@$, we work in the category $Mod_R^@$ of presheaves over $R$-modules over the commutative ring $R$. And ${}_RGlob$ is the category of global compositions over this context. Clearly, ${}_RGlob$ is finitely complete, and so are ${}_RGlob_A$, ${}^\mu_RGlob$, and ${}^\mu_RGlob_A$.
To define affine functions on a global composition $G^I$ in ${}_RGlob_A$, consider the objective composition $A = A@_RR$. We claim that the set $Hom_A(G^I, A)$ of morphisms which leave the address fixed is canonically provided with an $R$-module structure. For any commutative local composition $K \subset A@_RM$ with ambient module $M$, which is an $R$-module, i.e., $K = R.K$, each $f$-slice $f@_RK$ for $f : B \to A$ is an $R$-module when identified with its second factor $K.f$. Now, given two morphisms $u/Id_A, v/Id_A : G^I \to K$, we can add them or apply a scalar multiplication when restricted to each $f$-slice: $f@_RG^I \to f@_RK$. This defines an $R$-module structure on $Hom_A(G^I, K)$. In particular, for $M = R$ and $K = A@_RR$, we get the required module structure. We call the module $Hom_A(G^I, A)$ the module of global affine functions on $G^I$ and denote it by $\Gamma(G^I)$. We therefore have a representable contravariant functor

(15.6)

which associates the $R$-module $\Gamma(G^I)$ of global affine functions with a given global composition $G^I$. It is easily seen that this functor is well defined on the mathematical category ${}^\mu_RGlob_A$ and gives rise to a synonymous functor

$\Gamma : {}^\mu_RGlob_A \to {}_RMod$

entailing cohomology $R$-modules $H^i(f., \Gamma)$ for any covering family $f.$ in $\mu ObLoc_A$.

• For the default covering family $\iota.$ of charts which define $G^I$, this gives back the cohomology modules $H^i(\iota., \Gamma) = H^i(\Gamma)$ discussed in 16.1.2.

• Consider next the (mathematical) resolution morphism

(15.16)
and its restriction

(19.7)

to the resolution's standard composition chart $A\Delta_i$ for chart $i \in I$ of $G^I$. This family $res.$ is covering for $G^I$ and we therefore obtain the resolution cohomology modules $H^i_\Delta(G^I) = H^i(res., \Gamma)$ for the affine function functor. Clearly, we have an epimorphism of simplicial diagrams

$\Sigma_{res} : \Sigma(res.) \twoheadrightarrow \Sigma(\iota.)$ (19.8)

which is induced by the canonical morphisms for the $G^I$-charts $U_{i_j}$. Therefore, the corresponding function complexes are related by an exact sequence

$0 \to C^*(\iota., \Gamma) \to C^*(res., \Gamma) \to C^*(res., \Gamma)/C^*(\iota., \Gamma) \to 0$ (19.9)
of differential $R$-module complex homomorphisms. So by the standard procedures for exact sequences of differential complexes, we have a long exact cohomology sequence

$\cdots \to H^i(G^I) \to H^i_\Delta(G^I) \to H^i(\Delta/G^I) \to H^{i+1}(G^I) \to \cdots$

with the cohomology modules $H^i(\Delta/G^I)$ defined by the quotient differential complex $C^*(res., \Gamma)/C^*(\iota., \Gamma)$.

Example 42 Linear forms. For this example, we work over a fixed $R$-module $A$ as address, $R$ being a commutative ring. Recall from the discussion of module complexes in section 15.1 that for a commutative (objective) global composition $G^I$ in $ComGlob_A$ and an $R$-module $V$ (in particular the already known module $A@_RR$ which yields the classical forms), we have the module complex ${}_VG^I = Hom_R(G^I, V)$ of $V$-forms on $G^I$. We want to make this complex into a contravariant functor $\mu Glob_A \to Mod_R$. On the category $ComLoc_A$, it is the well-known dual functor to the associated module functor $R : ComLoc_A \to Mod_R$. It sends a local composition $K \subset A@M$ to the module ${}_VK = Hom_R(R.K, V)$. For each global composition $G^I$ in $ComGlob_A$, select an atlas $(K_i \subset A@M_i)_{i \in I}$ and restrictions $(K_\sigma \subset A@M_{i(\sigma)})_{\sigma \in n(G^I)}$ where $i(\sigma)$ is a selected vertex index of the simplex $\sigma$. When taking the limit $\Gamma({}_VG^I) = \lim_{\sigma \in n(G^I)} {}_VK_\sigma$ for global sections, this is clearly already the limit on the double index restrictions. So a global section $x \in \Gamma({}_VG^I)$ is just a family $(x_i)_I$, $x_i : R.K_i \to V$ such that the restrictions $x_i|R.K_j$ and $x_j|R.K_i$ coincide.

If we are given a morphism $f/Id_A : G^I \to H^J$, and if we select an atlas $(L_j \subset A@N_j)_{j \in J}$, the global sections are mapped into each other by the universal property of limits via

${}_V\Gamma(H^J) \to {}_VL_{\iota(i)} \to {}_VK_i$

for every $i \in I$. Moreover, this linear map is invariant under a change of the index map $\iota$. In fact, if we have a second index map $\kappa : I \to J$, the map

${}_V\Gamma(H^J) \to {}_VL_{\kappa(i)} \to {}_VK_i$

factorizes through the intersection of charts:

${}_V\Gamma(H^J) \to {}_VL_{\iota(i),\kappa(i)} \to {}_VK_i$

and we are done. Moreover, the functor ${}_V\Gamma(G^I)$ is also natural in the module $V$, i.e., any module homomorphism $u : V \to W$ entails the evident natural homomorphism ${}_u\Gamma : {}_V\Gamma(G^I) \to {}_W\Gamma(G^I)$. Therefore
Proposition 23 Given an $R$-module $V$, we have a contravariant functor of $V$-forms

${}_V\Gamma : \mu Glob_A \to Mod_R$ (19.10)

which associates the $R$-module ${}_V\Gamma(G^I) = \Gamma({}_VG^I)$ of global sections of $V$-valued linear forms to each global composition $G^I$ in the mathematical category over the fixed address $A$. The functor ${}_V\Gamma$ is natural in $V$.

In particular, for the integers as ground ring, and for any covering family $f.$ of a global composition $G^I$ which is addressed in a $\mathbb{Z}$-module $A$, if we look at the projective $\mathbb{Z}$-module system $\mathbb{Z}/\mathbb{Z}l^r$ and its projective limit, the $l$-adic integers⁴ $\mathbb{Z}_l$, we first have cohomology groups $H^*(f., {}_{\mathbb{Z}/\mathbb{Z}l^r}\Gamma)$ and then their limit, the $l$-adic form cohomology

(19.11)

⁴ With the usual unusual confusing index notation...
19.1.2
Marginalia on Presheaves
Summary. We briefly discuss function presheaves, and the subobject classifier.
This section is a sketch of what should be investigated more carefully and systematically.
19.1.2.1
Function Presheaves
Summary. Function sheaves are a basic instance for global music theory; we review the corresponding discussion of the resolution covering.
The resolution covering is a remarkable example for the finite cover topology since it is not just a covering by subcompositions; it is somewhat analogous to the étale topology: the set-theoretic map is even bijective, only the relative position of points has changed. We feel that this presheaf type is crucial for understanding music. Let us make this more explicit, even beyond the resolution covering. We recall that the resolution was a procedure to lift non-interpretable global compositions to interpretable ones built on point configurations in general position. Now, this approach may be interpreted from still another angle. Whereas classification uses resolutions to calculate the 'minimal perspective of a global composition' which tells you everything of its structural contents, it could also happen that a composer does not want to classify a composition but to view it as a germ of a real composition, a concentrated abstract version of something which will be used only on a different level. So the germinal composition could be a global composition $G^I$ which will only be used via some covering morphism family $f_i : G_i^{I_i} \to G^I$ which the composer uses to grasp particular local aspects of the germ. From this construction, she could also deduce function modules to realize her music in determined parameter spaces. A study of such poietic aspects and applications of general covering families of the finite cover topology is still outstanding, but surely will furnish powerful compositional tools.
19.1.2.2
The Subobject Classifier $\Omega$
Summary. The subobject classifier is a key object in a topos. We have a closer look at its structure and function in the topoi $\mu\mathrm{Glob}^@$ and $Sh(J, \mu\mathrm{Glob})$.
Recall the general relation between presheaves and sheaves. Given the finite cover topology J on $\mu\mathrm{Glob}^@$, we have the subcategory $Sh(J, \mu\mathrm{Glob})$ of J-sheaves and its embedding $i : Sh(J, \mu\mathrm{Glob}) \rightarrowtail \mu\mathrm{Glob}^@$. Whereas the subobject classifier $\Omega$ of $\mu\mathrm{Glob}^@$ gives the set
\[ \Omega(G^I) = \{\text{sieves } S \subset @G^I\} \]
of $G^I$-sieves, the subobject classifier $\Omega_{Sh}$ of $Sh(J, \mu\mathrm{Glob})$ evaluates to the subset
\[ \Omega_{Sh}(G^I) = \{\text{closed sieves } S \subset @G^I\} \]
of closed $G^I$-sieves, i.e., the sieves S such that any (mathematical) morphism $f : H^J \to G^I$ covering⁵ S is contained in S. In terms of covering families this means that whenever there is a covering family $f_i : H_i^{J_i} \to H^J$ such that every composite morphism $f \cdot f_i \in S$, then $f \in S$. From a predicative point of view, a closed sieve $S \subset @G^I$ corresponds to a predicate $P_S$ whose support is S, i.e., $P_S(f) = \top$ iff $f \in S$, and $P_S(f) = \bot$ else. The predicate $P_S$ is characterized by
• $P_S(f)$ is invariant among mathematically equivalent arrows,
• invariance under right multiplication, i.e., if $P_S(f) = \top$, then $P_S(f \cdot g) = P_S(f)$ whenever the product makes sense;
• $P_S(f) = \top$ if there exists a covering family $f_\bullet$ such that $P_S(f \cdot f_i) = \top$.
There are different approaches to view the sheaves within the presheaves. First, we have the adjointness property: There is a left adjoint $a : \mu\mathrm{Glob}^@ \to Sh(J, \mu\mathrm{Glob})$ for the embedding i, called the associated sheaf functor, see appendix G.4.1: For any presheaf X and sheaf Y, we have a functorial isomorphism $Hom_{\mu\mathrm{Glob}^@}(X, i(Y)) \cong Hom_{Sh(J, \mu\mathrm{Glob})}(a(X), Y)$. It rests on the sheafification operator $P \mapsto P^+$, see appendix G.4.1, which yields the associated sheaf $aP = (P^+)^+$. To this property corresponds the construction of the Lawvere-Tierney topology $j : \Omega \to \Omega$ which associates with every sieve $S \subset @G^I$ its closure $\bar{S}$ consisting of all morphisms which cover S. In terms of predicates, this amounts to adding to a predicate all morphisms which fulfill the third property above.
19.2
The Topos of Music: An Overview
Summary. This section summarizes the overall categorization of music with regard to the topos-oriented structures worked out from denotators to Grothendieck topologies.
The overall theory of forms and denotators is a concept framework based on the topos of presheaves over the category Mod of modules and of selected subcategories. Special types of denotators, the local compositions (power type), are naturally provided with a local-global paradigm. In fact, they are essentially subobjects of special functors, together with induced morphisms, quite similarly to elementary approaches to affine algebraic varieties and to local differential geometry. This part of the denotator theory, which stresses the conceptual construction rather than the geometric manifold approach, is embedded in the global theory, which can be seen as a passage to inductive limits, i.e., 'patchworks' of local objects. This latter construction is essentially different from algebraic and differential geometry in that the atlas covering is not variable but an intrinsic datum from the musicological point of view. The inductive limit over variable atlases is not the point for musicology; it would be too coarse an approach. However, the patchwork of local structures can be boiled down to a more mathematical manifold theory: the categories of global compositions, together with mathematical morphisms. In this framework, it is possible to introduce natural Grothendieck topologies, for example the finite cover topology considered above. This one is also the Grothendieck topology which encompasses all the local-global approaches which we have performed in interpretation and classification theory. Undoubtedly, larger denotator subcategories should be accessible either by globalization processes or, at least, by Grothendieck topologies which extend the one sketched above. This is a priori feasible since we have seen that fiber products are also given for a large class of denotators as soon as they are not pathologically circular. The cohomology theory for Grothendieck topoi, as sketched by Verdier in the legendary SGA seminar, is naturally given via function presheaves, for example. We do not know whether the predicate sheaves can also reveal cohomological spin-offs since it is not evident how truth denotators should be given structures of abelian groups. They rather live in Heyting algebras, and one should first adapt these structures to become candidates for cohomology theory. However, if truth denotators are just singletons in truth modules, they evidently add up to groups. But this perspective is neither within the reach of our experience with concrete examples nor do we have theoretical results which would enforce such a development. Summarizing, the topos of music first centers around the concept architecture of music objects in the general denotator theory, and therein around the presheaf topos over modules, and secondly, it evolves to a universe of local-global perspectives which are readily described by Alexander Grothendieck (see figure 19.1) with his awe-inspiring topologies and their functors and cohomology theories. The following development of this book will now descend to more concrete investigations, regarding those fields which are more in the tradition of the composers, music theorists, and performers. This does not mean that we can forget about the general framework; on the contrary, it will often not be possible to understand the technical and conceptual procedures without knowing about the topoi which back the more down-to-earth scenery.
Figure 19.1: Alexander Grothendieck.
⁵ i.e., such that $f^*S$ covers $J(H^J)$.
Part V
Topologies for Rhythm and Motives
Chapter 20
Visualization Principles
For they shall see eye to eye, when the Lord shall bring again Zion.
ISAIAH 8
Summary. As a compensation to the abstract nature of general topos theory, some principles for visualizing such abstract objects are mandatory, in particular in view of implementations on computerized knowledge bases. We give an account of such principles as they are being applied in graphical interface design.
20.1
Problems
Visual navigation on general databases is a difficult task for three reasons:
• The data structure is not a priori in a geometric shape.
• The geometric shape, if it occurs, is not a priori adapted to human 3D vision.
• Objects may be composed of other objects which in turn are composed, and so on in a recursive way. Visualization then should take care of a recursive architecture.
The first obstacle may happen if we are given a bunch of textual objects. How should we arrange them as if they were points in a geometric space? Or else, if we are given a collection of chords in a score, is it possible to deal with such objects as if they were nicely distributed as points or spheres or whatever geometric objects in an adequate geometric space?
The second obstacle may occur when we have a high-dimensional geometry, for example a six-dimensional representation of tones (onset, pitch, loudness, duration, glissando, crescendo). How could we grasp all these dimensions without forgetting some of them by plain projection to 2D or 3D partial representations, such as 2D for onset and pitch?
The third obstacle is a fundamental design problem. If we are able to visualize 2D situations, for onset and pitch, say, this only works if the visualization of these coordinates is already settled. But in general, the coordinate references are not elementary decimal or integer numbers, so how is it possible to set up a recursively stable visualization strategy?
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
Apart from this agenda, visualization has to make knowledge accessible, i.e., visualization must provide a distinguished knowledge navigation tool. We have already discussed knowledge navigation in chapter 5. From that discussion, one could draw the following condensed definition of knowledge:
Definition 63 Knowledge is ordered access to information.
Accordingly, navigation is intimately related to knowledge. In fact, knowledge involves the access activity, and it obeys ordering principles: this access is not a random walk, as is the case in present internet surfing environments. So the instantiation of knowledge involves navigation on ordered spaces, not just blind date behavior. For example, in a traditional encyclopedia, the retrieval of a keyword starts with a linear movement along the alphabetic order axis, but then, when one delves into the text, one is presented with a completely chaotic navigation environment: only some rare linkage arrows tell us where to navigate; the rest must be constructed by semantic reading, interpretation, and then rebuilding of new keywords to deepen the understanding of the searched concept. The textual abstraction is produced by an extreme knowledge hiding, whose disclosure requires considerable efforts in supplementary knowledge navigation. Let us put this insight as follows:
Principle 17 Visual navigation must be built on orderings¹.
We may now recall the treatment of denotator orderings in section 6.1.3 and, more technically, in section 6.8. We have shown there that for quite general addresses, there is a generic recursion process to define linear orderings on denotator systems of any form space. Let us rephrase those results in an informal way:
Fact 9 Any system of denotators can be linearly ordered according to a generic recursive procedure which is based on linear orderings of address modules. These module orderings can be constructed from canonical orderings of elementary modules, such as ground rings, and basic quotient modules, in a natural way.
This fact lends itself to visualization: To begin with, in all relevant cases, we only deal with finite, and mostly comprehensible, collections of denotators, of size, say, below $10^6$. This means that we can deal with a number of points on the real line. But this is not sufficient, since the abstract order relation is not ideal for representing the underlying 'distances'. If, for example, a number of pitches is represented by pure order relations, this reduces to an equidistant arrangement of points of the real axis and destroys the pitch distances. Of course, it is not wrong to recur to this equidistributed arrangement which we may call the generic linear visualization. One must be able to invoke this one if no further information is required. But we have to ask for this:
¹ Observe that visualization means using the ordered field $\mathbb{R}$, and this means writing (typically) decimal numbers, which in turn amounts to building on the ordering of the natural numbers 0 < 1 < 2 < ... < 9.
Principle 18 A generic construction method for natural distances among successive items in the generic visualization must be developed to generate intuitive visual distance perception.
Call the visual representation on the basis of principle 18 the metrical linear visualization. The ongoing implementation of denotator visualization² has already realized this principle on the modules which occur in 'real life'. The next problem to be settled is to deal with high-dimensionality, more precisely: more than three dimensions in product type denotators. Now, the principle of metrical linear visualization basically solves this problem, but it is too flat: we are able to visualize in 3D and should make use of this human feature. So we have to respect the following packing principle:
Principle 19 In a product type space, the linear visualization should be distributed among a number of packages of coordinate spaces such that at most three spatial visualization axes are needed.
So, for example, we should pack three real dimensions in the linear visualization of a first axis, another four dimensions in the linear visualization of a second axis. The remaining dimensions should be packed into some different visualization level. Even if we apply the linear visualization principle to these latter dimensions, we cannot, by construction, add more spatial visualization axes. We therefore propose the following object visualization principle:
Principle 20 In the product (more generally speaking: limit) type, the non-spatial (possibly linearly packed groups of) linear dimensions must be attributed to object parameters of multimedia objects.
Such objects can be specified by geometric shapes, such as spheres, cubes, faces, by surface texture, by sonic parameters such as sound, earcons³, and by gestural parameters, such as handling behavior, like drag and drop response, of more or less physical nature (elasticity, inertia, gravity, magnetism). Summarizing, we obtain a distribution of high-dimensional denotator visualization among space and object visualization. With this in mind, how does navigation look? You may specify the space/object distribution and associated linear (metrical) visualization according to the previous principles, including selection of adequate multimedia objects. You may move around within such visual spaces, zoom in and out, and interact with the multimedia objects, like listening to tones, feeling the response of a piano key, etc. But other navigation methods are within our reach; let us just name some evident variants:
1. Pick a specific selection of factors or cofactors in products or coproducts; this is the classical "coordinate projection" or "book selection" method.
2. Greeking⁴ of deeper recursive information. This means that in the recursive denotator space ramification, one decides to collapse all values and to forget about existing differences. This is very important for first approaches to complex databases.
3. Differentiating between more or less generic visualizations. In a coproduct, one may concentrate on the 3D visualization of one cofactor, leaving the adjacent and more distant cofactors in a decreasingly detailed state of visualization.
² PhD work of Stefan Goller at Zurich University.
³ Earcons are icons for the ears.
⁴ An operation known from text applications, meaning that a structure is blurred, recalling the English idiom "that sounds Greek to me".
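The difference between the generic and the metrical linear visualization, and the packing of principles 19 and 20, can be illustrated by a small sketch. It is not the implementation referred to in the text: the function names, the choice of the plain difference of real numbers as 'natural distance', and the pitch values are assumptions made only for this illustration.

```python
from typing import Dict, List, Sequence


def generic_linear(values: Sequence[float]) -> Dict[float, float]:
    """Generic linear visualization: keep only the order, place items equidistantly."""
    ordered = sorted(set(values))
    return {v: float(rank) for rank, v in enumerate(ordered)}


def metrical_linear(values: Sequence[float]) -> Dict[float, float]:
    """Metrical linear visualization (assumed metric: plain difference of reals)."""
    ordered = sorted(set(values))
    return {v: v - ordered[0] for v in ordered}


def pack_dimensions(names: Sequence[str], sizes: Sequence[int]) -> Dict[str, List[str]]:
    """Distribute coordinate names among at most three spatial axes.

    sizes[k] coordinates go to spatial axis k; whatever is left over is
    handed to object parameters, in the spirit of principles 19 and 20.
    """
    if len(sizes) > 3:
        raise ValueError("at most three spatial visualization axes are available")
    if sum(sizes) > len(names):
        raise ValueError("more coordinates requested than available")
    packing, start = {}, 0
    for axis, size in zip(("x", "y", "z"), sizes):
        packing[axis] = list(names[start:start + size])
        start += size
    packing["object"] = list(names[start:])
    return packing


pitches = [60, 62, 67, 72]          # hypothetical pitch values
print(generic_linear(pitches))      # equidistant positions, pitch distances are lost
print(metrical_linear(pitches))     # positions keep the pitch distances

tone = ["onset", "pitch", "loudness", "duration", "glissando", "crescendo"]
print(pack_dimensions(tone, [1, 1, 1]))
```

In the last call, three coordinates of the six-dimensional tone space occupy the spatial axes and the remaining three are left for object parameters such as color or texture.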
20.2
Folding Dimensions
According to the packing principle (principle 19) we need a method to fold dimensions. Taking principles 17 and 18 into account, this method needs to fulfill the following properties:
• order preserving
• injective
Admittedly, it is not a homeomorphism: a map from $\mathbb{R}^n$ to $\mathbb{R}$ with n > 1 cannot be both injective and continuous. Obviously there are many ways to define such a mapping; we choose the following generally valid algorithm. Denote by $p_i : \mathbb{R}^n \to \mathbb{R}$ the ith projection, and by $p_{i \ldots i+k} : \mathbb{R}^n \to \mathbb{R}^{k+1}$ the projection onto the coordinates i through i + k.
20.2.1
$\mathbb{R}^2 \to \mathbb{R}$
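We are given a set P of points in $\mathbb{R}^2$.
1. Partition $P = P_0 \cup P_1 \cup \ldots \cup P_{n-1}$ with $\forall i \in \{0, 1, \ldots, n-1\}\ \forall x \in P_i : p_1(x) = t_i = \text{const.}$, and $t_i \neq t_j$ if $i \neq j$.
2. Subdivide the x-axis by giving every $P_i$ a lower and upper bound $u_i$ and $o_i$:
\[ u_i = \begin{cases} \ldots & (i = 0) \wedge (n > 1), \\ \dfrac{t_i + t_{i-1}}{2} & (i \geq 1) \wedge (n > 1), \\ \ldots & \text{else;} \end{cases} \qquad o_i = \begin{cases} \ldots & (i = n-1) \wedge (n > 1), \\ \ldots & (i \ldots \end{cases} \]
3. On each $P_i$, set $\mu = \mu_i$, where $\mu_i$ is a bijective, strictly increasing mapping of the second coordinate onto the interval $]u_i, o_i[$.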
Lemma 31 If we impose the lexicographic ordering on $\mathbb{R}^2$, $\mu$ is order-preserving and therefore injective.
Proof. There are two possible cases: If $p_1(x_1) < p_1(x_2)$, then $\mu(x_1) \in\, ]u_i, o_i[$ and $\mu(x_2) \in\, ]u_j, o_j[$ with i < j. But then, by construction of the intervals $]u_i, o_i[$, $]u_j, o_j[$, we have $]u_i, o_i[\, <\, ]u_j, o_j[$, hence $\mu(x_1) < \mu(x_2)$. Else, if $p_1(x_1) = p_1(x_2)$ and $p_2(x_1) < p_2(x_2)$, then on these points, $\mu = \mu_i$ for an index i, and $\mu_i(x_1) < \mu_i(x_2)$ since $\mu_i$ is strictly increasing, QED.
Figure 20.1: The two-dimensional folding algorithm $\mu$ (axes x and t; interval bounds $u_1$, $o_1 = u_2$, $o_2 = u_3$, $o_3$).
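The two-dimensional folding and the statement of lemma 31 can be tried out with the following minimal sketch. It follows the three steps above, but several choices are assumptions of this illustration and not taken from the text: the two outermost interval bounds, the bounds in the single-value case, and the use of a rescaled arctan as strictly increasing bijection onto each interval. All function names are hypothetical.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def interval_bounds(ts: List[float]) -> List[Tuple[float, float]]:
    """Open intervals ]u_i, o_i[ around the sorted distinct values t_i.

    Inner bounds are midpoints between neighbours. The two outermost
    bounds and the single-value case are assumptions of this sketch.
    """
    n = len(ts)
    if n == 1:
        return [(ts[0] - 0.5, ts[0] + 0.5)]
    bounds = []
    for i, t in enumerate(ts):
        u = (t + ts[i - 1]) / 2 if i >= 1 else t - (ts[1] - t) / 2
        o = (ts[i + 1] + t) / 2 if i < n - 1 else t + (t - ts[i - 1]) / 2
        bounds.append((u, o))
    return bounds


def squash(y: float, u: float, o: float) -> float:
    """Strictly increasing bijection from the real line onto ]u, o[ (assumed: arctan)."""
    return u + (o - u) * (math.atan(y) / math.pi + 0.5)


def fold2(points: List[Point]) -> Dict[Point, float]:
    """Fold a finite set of points of the plane into the real line."""
    ts = sorted({p[0] for p in points})               # step 1: distinct first coordinates
    bounds = dict(zip(ts, interval_bounds(ts)))       # step 2: one interval per t_i
    return {p: squash(p[1], *bounds[p[0]]) for p in points}   # step 3


pts = [(0.0, 5.0), (0.0, -3.0), (1.0, 100.0), (1.0, 0.0), (4.0, -50.0)]
folded = fold2(pts)
by_lex = sorted(pts)                       # lexicographic order on the plane
by_fold = sorted(pts, key=folded.get)      # order of the folded values
print(by_lex == by_fold)
```

Sorting the points lexicographically and sorting them by their folded value yields the same sequence, which is the order preservation claimed by lemma 31 (up to floating point resolution for extreme second coordinates).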
20.2.2
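$\mathbb{R}^n \to \mathbb{R}$
With $\mu : \mathbb{R}^2 \to \mathbb{R}$ we can define $\tilde\mu : \mathbb{R}^n \to \mathbb{R}$ inductively:
\[ \tilde\mu(x) = \begin{cases} \mu(x) & n = 2, \\ \mu\big(p_1(x), \tilde\mu(p_{2 \ldots n}(x))\big) & n > 2. \end{cases} \]
Lemma 32 $\tilde\mu$ is order-preserving and hence injective.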
Proof. The map $x \mapsto (p_1(x), \tilde\mu(p_{2 \ldots n}(x)))$ is order-preserving by induction, and $\mu$ is so by lemma 31, QED. Note that $p_1(P)$ becomes the coarse factor, whereas $p_{2 \ldots n}(P)$ becomes the fine one.
20.2.3
An Explicit Construction of $\mu$ with Special Values
What remains undefined so far is the bijective strictly increasing mapping from step 3 in 20.2.1. We define this map with two special real values low and high in mind, which will be explained in the following section 20.3. We define the following (partial) maps $\tau$ and $\phi$ on $\mathbb{R}^5$, $\mathbb{R}^{10}$, respectively, see figure 20.2.
Figure 20.2: The maps $\tau$ and $\phi$.
\[ \tau(\alpha, \beta, h, l, x) = \tan^{-1}\Big(\frac{\beta - \alpha}{h - l}(x - l) + \alpha\Big), \]
\[ \phi(o, o', u, u', t, \alpha, \beta, h, l, x) = \begin{cases} \dfrac{u' - u}{\frac{\pi}{2} + \tan^{-1}(\alpha)} \big(x + \frac{\pi}{2}\big) + u & -\frac{\pi}{2} < x \leq \tan^{-1}(\alpha), \\ \dfrac{t - u'}{-\tan^{-1}(\alpha)} \big(x - \tan^{-1}(\alpha)\big) + u' & \tan^{-1}(\alpha) < x \leq 0, \\ \dfrac{o' - t}{\tan^{-1}(\beta)}\, x + t & 0 < x \leq \tan^{-1}(\beta), \\ \dfrac{o - o'}{\frac{\pi}{2} - \tan^{-1}(\beta)} \big(x - \tan^{-1}(\beta)\big) + o' & \tan^{-1}(\beta) < x < \frac{\pi}{2}. \end{cases} \]
The composed map on $x \in \mathbb{R}^2$
\[ \phi\tau(o_i, o_i', u_i, u_i', \alpha, \beta, high, low)(x) = \phi(o_i, o_i', u_i, u_i', p_1(x), \alpha, \beta, high, low, \tau(\alpha, \beta, high, low, p_2(x))) \]
maps
\[ p_2(x) = low \mapsto u_i', \qquad p_2(x) = high \mapsto o_i', \qquad p_2(x) = -\infty \mapsto u_i, \qquad p_2(x) = +\infty \mapsto o_i, \]
which is useful for our visualization. Note that in the variable $p_2(x)$, $\phi\tau$ is continuous at $\tan^{-1}(\alpha)$, 0, $\tan^{-1}(\beta)$ but not differentiable.
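The behaviour of the composed map at the special values low and high can be checked numerically. The following sketch assumes that $\tau$ is the arctan of the line through $(l, \alpha)$ and $(h, \beta)$ and that $\phi$ is piecewise linear with break points at $\tan^{-1}(\alpha)$, 0 and $\tan^{-1}(\beta)$, with $\alpha < 0 < \beta$; the primed bounds are written u1 and o1, the arguments h and l are omitted from the hypothetical function phi because they do not enter its value, and all numerical values are invented for the illustration.

```python
import math


def tau(alpha, beta, h, l, x):
    """arctan of the line through (l, alpha) and (h, beta), evaluated at x."""
    return math.atan((beta - alpha) / (h - l) * (x - l) + alpha)


def phi(o, o1, u, u1, t, alpha, beta, y):
    """Piecewise linear map on ]-pi/2, pi/2[ with break points at
    atan(alpha), 0 and atan(beta); o1 and u1 stand for o' and u'.
    Requires alpha < 0 < beta."""
    a, b, half_pi = math.atan(alpha), math.atan(beta), math.pi / 2
    if y <= a:
        return (u1 - u) / (a + half_pi) * (y + half_pi) + u
    if y <= 0:
        return (t - u1) / (-a) * (y - a) + u1
    if y <= b:
        return (o1 - t) / b * y + t
    return (o - o1) / (half_pi - b) * (y - b) + o1


def phi_tau(o, o1, u, u1, alpha, beta, high, low, point):
    """Composed map on a point (p1, p2) of the plane."""
    p1, p2 = point
    return phi(o, o1, u, u1, p1, alpha, beta, tau(alpha, beta, high, low, p2))


# Hypothetical values: interval ]0, 1[ with inner marks 0.2 and 0.8,
# low = 21 and high = 108 (for example the key range of a piano in MIDI numbers).
args = dict(o=1.0, o1=0.8, u=0.0, u1=0.2, alpha=-1.0, beta=1.0, high=108.0, low=21.0)
for p2 in (-1e9, 21.0, 108.0, 1e9):
    print(p2, phi_tau(point=(0.5, p2), **args))
```

The four printed values lie close to u, u', o' and o in this order, so the values between low and high occupy the inner part of the interval, while everything outside is compressed towards its ends.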
20.3
Folding Denotators
With $\tilde\mu$ defined, we can visualize finite sets in real vector spaces of any dimension in 3D. But what we really want is to visualize denotators. For the time being, we exclusively address zero-addressed denotators. Recall from chapter 6 that a denotator can be either simple, with its coordinate in a module, or compound, such as a limit, a colimit or a powerset of 'lower-level' denotators, to speak in the naive denotator terminology. A typical example is shown in figure 20.3, where A to L are again denotators. If we provide every simple denotator with the ability to produce a real-valued representation of its coordinate, we can map every denotator to $\mathbb{R}$ by applying $\tilde\mu$ all the way up the denotator tree, as shown in figure 20.4. The simple denotator gets this real value from the underlying module; the module also provides us with two more values low and high, representing natural borders such as the lowest and highest note on the piano, which is useful for the subsequent visualization. But how can we apply $\tilde\mu$ to a limit or colimit denotator?
Figure 20.3: A typical compound denotator tree.
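The folding of finite sets in real vector spaces of any dimension, on which this section builds, can be sketched by recursion on the dimension as in section 20.2.2: the tail coordinates are folded first, and the result is folded together with the first coordinate. As before, this is only an illustration under assumptions: the outer interval bounds and the arctan squashing inside the hypothetical helper fold2 are chosen freely.

```python
import math
from typing import Dict, Sequence, Tuple

Vector = Tuple[float, ...]


def fold2(pairs: Sequence[Tuple[float, float]]) -> Dict[Tuple[float, float], float]:
    """Two-dimensional folding: one open interval per distinct first coordinate.
    Outer interval bounds and the arctan squashing are assumptions of this sketch."""
    ts = sorted({t for t, _ in pairs})
    gaps = [b - a for a, b in zip(ts, ts[1:])] or [1.0]
    lows = [ts[0] - gaps[0] / 2] + [(a + b) / 2 for a, b in zip(ts, ts[1:])]
    highs = lows[1:] + [ts[-1] + gaps[-1] / 2]
    box = {t: (u, o) for t, u, o in zip(ts, lows, highs)}
    return {
        (t, y): box[t][0] + (box[t][1] - box[t][0]) * (math.atan(y) / math.pi + 0.5)
        for t, y in pairs
    }


def fold(points: Sequence[Vector]) -> Dict[Vector, float]:
    """Inductive folding of a finite subset of R^n (n >= 2) into R."""
    n = len(points[0])
    if n == 2:
        return fold2(points)
    tails = fold([p[1:] for p in points])            # fold coordinates 2..n first
    pairs = {p: (p[0], tails[p[1:]]) for p in points}
    folded_pairs = fold2(list(pairs.values()))       # then fold with coordinate 1
    return {p: folded_pairs[pairs[p]] for p in points}


points = [(0, 1, 2), (0, 1, 3), (0, 2, -1), (1, 0, 0), (1, 0, 5), (-2, 7, 7)]
folded = fold(points)
print(sorted(points) == sorted(points, key=folded.get))
```

The first coordinate acts as the coarse factor and the folded tail as the fine one, so the lexicographic order of the points is reproduced by the folded real values.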
Figure 20.4: Mapping every denotator to $\mathbb{R}$ by applying $\tilde\mu$ all the way up the denotator tree.
20.3.1
Folding Limits
When we forget about the limit's diagram, a limit is nothing but a product. Assuming that we already know (by induction) how to fold its factors, we just apply $\tilde\mu$ to the limit as if it were of type $\mathbb{R}^n$. Later in the visualization we can add the diagram's information by use of color, transparency, etc. In this way we visualize a denotator as shown in figure 20.5 as a set of points in the ordinary n-space as shown in figure 20.6.
Figure 20.5: Folding limit denotators.
20.3.2
Folding Colimits
The folding of colimits is something different. While the semantics of a limit is some kind of vector space with some additional constraints, the notion of a colimit is that of a library. A point in a colimit space is like a book or a set of books from a library, while the cofactors mean different types of books and the diagram controls which books you may take out together. Therefore we decided to fold a set of colimit denotators (as in figure 20.7) in the following way:
Firstly we choose which cofactors we bundle together; this information goes to the first axis. On the second axis we map the index of the cofactor in question, and the actual data is folded into the third axis. In this way we get different kinds of books or CDs, etc., in rows of shelves as in figure 20.8.
Figure 20.6: Visualization of a denotator as shown in figure 20.5 by a set of points in an ordinary n-space.
Figure 20.7: A set of colimit denotators.
Figure 20.8: Folding a set of colimit denotators (as shown in figure 20.7) yields different kinds of books or CDs, etc., in rows of shelves.
20.3.3
Folding Powersets
The most natural way to deal with a powerset denotator is to view it as the collection or 'container' of its elements and proceed by folding one element after the other recursively with respect to their type. (Note that all elements of the powerset are of the same type.) We call this powerset disclosure. The preceding foldings of limit and colimit denotator collections ('powerset of limit', 'powerset of colimit') were special cases of this general situation. However, the 'powerset of powerset' case is somewhat special: Here we represent the elements in a 2 × n array: we use their position in their canonically ordered arrangement as first and their barycenter value as second coordinate. This arrangement is then folded with the 1 × n folding matrix. Logically there is also an undisclosed way of folding a powerset denotator. In this case the barycenters of the n elements are folded with the standard 1 × n folding matrix. This undisclosed folding is the default case if a powerset denotator is itself a (co-)factor or element of another denotator.
20.3.4
Folding Circular Denotators
In sections 6.5 and 6.7, the problems of circular denotators were already discussed. When actually working with those denotators, one has to take care not to run into infinite loops. Therefore we decided to run a three-level strategy: Down to level $\gamma$ the usual folding takes place, between $\gamma$ and $\tau$ all the real values from the foldings are generalized with the index(x) function⁵, and at level $\tau$ the folding process stops completely.
⁵ See appendix D.5.1.
Figure 20.9: Down to level $\gamma$ the usual folding takes place, between $\gamma$ and $\tau$ all the real values from the foldings are generalized with the index(x) function, and at level $\tau$ the folding process stops completely.
20.4
Compound Parametrized Objects
Even if we are now in a position to transform any denotator into a 3-dimensional vector, we often prefer not to fold all those n dimensions down to 3 but to keep some more and map them onto any graphical, acoustical, haptic, etc. property of our display object. Therefore we need lots of different objects (let's just call them multimedia objects) with many different parameters like color, transparency, pitch, stiffness, etc.
Figure 20.10: Description of a compound multimedia object as a circular denotator.
As it is not recommended to actually program each object again and again, we decided to describe a compound multimedia object as a circular denotator space (form) (see figure 20.10). The Sat form (Satellite) consists of a list of (sub-)satellites and the actual geometric, acoustical and time-critical information in the RTC denotator space, which actually is a limit space according to the above information types. We can for example construct a Pinocchio puppet (see figure 20.11) by defining a denotator for the body₀, one for the neck₁, the head₂, the hat₃, the arm₄, the foot₅, and the leg₆. Reusing these denotators we can define the whole Pinocchio. The f-denotator coordinates in the RTC denotators provide the parameter functions to steer the respective properties. By this construction every part of the Pinocchio can be moved, rotated, scaled, etc. as shown in figure 20.11. The Sat denotator space is recursively stable, which means that the parameters still work if the Pinocchio itself becomes a subpart of a more complex multimedia object.
Figure 20.11: Construction of a Pinocchio puppet by defining a denotator for the body₀, one for the neck₁, the head₂, the hat₃, the arm₄, the foot₅, and the leg₆. Reusing these denotators we can define the whole Pinocchio.
20.5
Examples
The following example shows a denotator like the one in figure 20.5; the corresponding limit is 19-dimensional and was folded to seven dimensions to fit the Pinocchio object, which provides four object parameters in addition to the x-y-z position.
Figure 20.12: In this figure some objects are shown, with relevant coordinates mapped to the leg length and the body color.
Figure 20.13: In this figure some 400 objects are visible with a rather random parameter mapping.
Figure 20.14: This figure shows the same data but with a more sensible parameter mapping. One immediately sees more of the structure; collinear and coplanar subsets arise.
Chapter 21
Metrics and Rhythmics
... wir operieren mit einem zweiten Grundbegriffe, dessen Feststellung wir uns nun zuzuwenden haben, demjenigen des verschiedenen Gewichtes der Zeiten, der metrischen Qualität. [... we operate with a second basic concept, to whose determination we now have to turn: that of the different weight of the times, the metrical quality.]
Hugo Riemann [453, p.8]
Summary. Metrics and rhythmics are an excellent elementary test-case for global structures in music. We shall critically review two commonly known approaches: the Riemann and the Jackendoff-Lerdahl theories. We then develop the concepts of global time structures and their topologies, including associated weight functions.
In this chapter, we refer to the introductory discussions of meter and rhythm in section 7.2.2 (local case) and section 13.4.3 (global case). We shall above all concentrate on the metrical aspect since it yields a simple and formally transparent situation from which the rhythmical complexification can easily be unfolded.
21.1
Review of Riemann and Jackendoff-Lerdahl Theories
Summary. We review these theories and trace back the concepts of metrical weight and metrical hierarchies.
The reader should however be aware that we cannot include a philologically valid analysis of these works here; our scope is a modest one: to present the tie between these historically important outlets and our own approach as it really happened. The interest of such a tie is to start reflecting about the incredibly fuzzy state of conceptualization in music which Riemann also had to suffer ([453, p.VIII]):
Meine musikalische "Dynamik und Agogik" (1884) arbeitet sich noch mühselig durch den Wust falscher Definitionen, die, wie ein undurchdringliches Gewirr der Wurzeln von Sumpfpflanzen unter dem anscheinend klaren mit Blumen geschmückten Wasserspiegel, sich jedem Vordringen in den Weg stellen. [My musical "Dynamik und Agogik" (1884) still laboriously works its way through the jumble of false definitions which, like an impenetrable tangle of the roots of swamp plants beneath the seemingly clear, flower-adorned surface of the water, block every advance.]
21.1.1
Riemann's Weights
Summary. Hugo Riemann's approach in [453] is discussed, stressing his conceptual specifications of meter and rhythm, in particular the weight idea.
Riemann's concept of rhythm deals with a regular division of time ([453, p.2]):
Freilich ist aber doch wohl kein Zweifel, daß in der erkennbaren Gleichmäßigkeit der Zeitteilung wirklich ein besonderes lustgebendes Moment liegt und zwar eben gerade dasjenige, welches man Rhythmus nennt... [There is, to be sure, hardly any doubt that the recognizable regularity of the division of time really contains a special pleasure-giving element, namely precisely the one which is called rhythm...]
The temporal period defining a rhythm is not an abstract one; it is related to the concrete music events of the given composition ([453, p.8]):
Welches in jedem Einzelfalle das Grundmaß [des Rhythmus, G.M.] ist, ergibt sich wie bereits betont, nicht aus dem abstrakten absoluten Mittelmaß, sondern aus dem der konkreten Melodie selbst. Die Zählzeiten (Schlagzeiten, rhythmischen Grundzeiten) gewinnen unter allen Umständen erst reale Existenz durch ihre Inhalte. [Which basic measure [of the rhythm, G.M.] applies in each individual case results, as already stressed, not from the abstract absolute mean measure, but from that of the concrete melody itself. The counting times (beat times, basic rhythmic times) in all circumstances only gain real existence through their contents.]
For Riemann, rhythm is a result of grouping musical events to equitemporally distributed units, and of relating the grouped events to the units as relative instances (Relativität der rhythmischen Qualität [453, p.7]). This is why we have chosen the definition of a rhythm as described in section 13.4.3. This is also stressed in the explication of rhythmical unity ([453, p.9]):
... als Inhalt einer den Pulsschlag des Rhythmus bildenden Einheit ist prinzipiell eine Mehrheit von wahrnehmbaren Erscheinungen (also für die Musik: Tongebungen) anzunehmen. [... as the content of a unit forming the pulse of the rhythm, one has in principle to assume a plurality of perceptible phenomena (for music, that is: soundings of tones).]
However, a detailed study of Riemann's theory is not our scope and should be taken up in the framework of mathematical music theory as it has been taken up in motif theory with Rudolph Reti's approach by Chantal Buteau [73]. In contrast to rhythmical grouping, Riemann views metrical quality as a differentiation among onsets (of notes) which comes from their roles and relations. This quality is stated as a weight ([453, p.8]):
... wir operieren mit einem zweiten Grundbegriffe, dessen Feststellung wir uns nun zuzuwenden haben, demjenigen des verschiedenen Gewichtes der Zeiten, der metrischen Qualität. [... we operate with a second basic concept, to whose determination we now have to turn: that of the different weight of the times, the metrical quality.]
The weight is meant to be a metaphor for the degree of importance:
... die Unterscheidung der einander folgenden Zeiteinheiten in wichtige und minder wichtige oder wie man zu sagen pflegt: schwere und leichte ... [... the distinction of the successive time units into important and less important ones or, as one usually says, heavy and light ones ...]
21.1.2
Jackendoff-Lerdahl: Intrinsic Versus Extrinsic Time Structures
Summary. The time domain is a prototype of a highly ambiguous dimension where intrinsic structures, such as notes, are mixed with extrinsic structures, such as bar-lines. The former are part of the musical material, the latter are part of the abstract time framework. We discuss the difference by the example of Ray Jackendoff's and Fred Lerdahl's approach in [243].
In this section we shall compare the metrical analysis of the Jackendoff-Lerdahl theory (GTTM, Generative Theory of Tonal Music) with what is proposed by the present mathematical music theory and its implementations in the RUBATO® platform's MetroRUBETTE® for metrical analysis. For a more generic discussion of the GTTM method, see [366]. In order to perform a metrical/rhythmical analysis or interpretation of a musical composition, the basis of such an investigation must be questioned. This one is linked to the "neutral" work data on one hand, and to the analytical approach on the other. Every investigation of the neutral data presupposes a choice of relevant aspects. This choice occurs very early in the propaedeutic evaluation. For example, in the first Viennese school, bar-lines, period limits, notes, and pauses are recognized. In a metrical/rhythmical analysis, the question of the bar-lines' relevance would be set forth: Will this type of events strongly influence our analysis? As is shown in the MetroRUBETTE® concept, the answer is not necessarily "Yes" or "No"; a fuzzy decision between these two extremal values is possible and reasonable. In this situation, a normative analysis would imply that we decide to include bar-lines as external formal limits of metrical/rhythmical analysis. This bed of Procrustes is what GTTM prescribes in its metrical structure analysis: Bar-lines are always instances of the metrical structure; this is implicit in the fourth rule¹ of metrical structure². Here are the four MWFRs:
• MWFR 1. Every attack point must be associated with a beat at the smallest metrical level present at that point in the piece.
• MWFR 2. Every beat at a given level must also be a beat at all smaller levels present at that point in the piece.
• MWFR 3. At each metrical level, strong beats are spaced either two or three beats apart. (This rule is recognized as idiom-specific in GTTM.)
• MWFR 4. The tactus and immediately larger metrical levels must consist of beats equally spaced through the piece. At subtactus levels, weak beats must be equally spaced between strong beats.
¹ There are two types of GTTM rules: Well-formedness rules (WFR), and preference rules (PR). The rule system has four parts: G: grouping structure, M: metrical structure, TSR: time-span reduction, and PR: prolongational reduction. Accordingly, the two rule types are symbolized by the part prefix and the type postfix. For example, metrical well-formedness rules are MWFR. The complete index of GTTM rules is given in [243, p.345ff.]
² See also [243, p.71]: Yet metrical intuitions about music clearly include at least one specially designated metrical level, which we are calling the tactus.
CHAPTER 21. METRICS AND RHYTHMICS
458
Worse than that: even non-existent onsets are inserted in determined levels in order to enforce periodicity where the text does not show such. This is a consequence of (for example) the axiom MWFR 3, which asks that successive metrical levels have duration (period) ratios 1:2 or 1:3. All other ratios are forbidden! The analysis of eight bars from Mozart's Jupiter Symphony is a typical example, see figure 21.1.
Figure 21.1: The metrical hierarchy in the finale's eight bar theme of Mozart's Jupiter Symphony, according to the GTTM, [243, p.73]. It shows that the whole notes in the first four bars have to be enriched by half-note points in order to cope with the succeeding onset structure. It also shows that the bar-lines are points even when there is no such note-onset.

Such a procedure is not only rigid, it is also dangerous since the complexity of meter/rhythm is destroyed by secondary instances such as bar-lines, and by a spurious subdivision dogma (1:2, 1:3). And it forces a norm which only cements bad mass taste instead of representing possible compositional intentions. But even in tonal music, also in modern tonal music, rhythm and metrics are much less tied to bar-lines than the formal score notation suggests. For instance, in contemporary jazz³, we recognize extremely complex poly- and microrhythms which escape bar-lines and would be eliminated by the said normative standardization as being non-metrical or non-rhythmical music.

Moreover, the GTTM casting of metrical structure makes evident a remarkable drawback of non-computerized musicology⁴: The effective usage of the following metrical analysis method (section 21.2) would have been fictitious without computers because of its combinatorial complexity. We in fact argue that sometimes a normative approach is only propagated because operational control of analysis would break down without such a "terrible simplification", so the simple and wrong is preferred to reality, which would possibly force one to wait until the power of tools has reached an adequate level. During the design period of the workstation RUBATO® it was therefore mandatory to ask for the following condition:
Axiom 3 We need concrete tools which are adequate to the analyzed work, and which yield an immanent analysis, not merely abstract or brutally normative principles.

Despite our critical review of the GTTM method, the idea of covering the onsets of a given composition by "levels" has been a fruitful input for our global metrical perspective. It gave us the motivation to introduce global meters from the concept of a local meter, which was introduced in [340, pp.32-34], and from the insight that the simultaneous presence of several local meters in a piece of music is the rule and not the exception (see the example [340, Bild 11, p.34]). The crucial difference between the global meter theory and the GTTM metrical structure theory is, besides our non-normative attitude, the construction of level functions from global data (in fact the nerve structure) and not their a priori definition; see the level function defined in section 13.4.3.

³ Listen, e.g., to Steve Coleman's CD The Sonic Language of Myth, BMG (RCA Victor 1998).
⁴ It is well known that the authors of GTTM do not advocate computer-aided music analysis.
21.2 Topologies of Global Meters and Associated Weights
Summary. The hierarchic structure of global meters gives rise to topologies and associated weights, i.e., numerical functions which contain information about the local connection of onsets within their topological neighborhoods. Metrical topologies are the theoretical background for the MetroRUBETTE® module for metric and rhythmic analysis in the RUBATO® software discussed in section 41.1.
To make the discussion more transparent, we only deal with the zero address and meters in the Onset domain and leave it to the reader to add the extensions to proper Para-meters as defined in section 13.4.3. Let us first recall the maximal meter topology on a local composition $X$. This one is associated to the interpretation $X^{MaxMet}$ whose charts are all finite intersections of the maximal local meters in $X$. We know that every closed irreducible set is the closure of any of its maximal points with respect to the dominance relation. So if $x, y$ have the same closure $\overline{\{x\}} = \overline{\{y\}}$, then they are mutually dominating, or equivalently, their simplexes coincide, $Sp(x) = Sp(y)$, or equivalently, their minimal open neighborhoods $U(x), U(y)$ coincide (lemma 20 in section 13.4.3). Recall from appendix F.2.1 the left adjoint construction of the continuous map $i : X \to X^s$ of a topological space $X$ into its associated sober space $X^s$ which sends $x$ to the point $\overline{\{x\}}$. This map is a quasi-homeomorphism (see lemma 85, appendix F.2.1), so the sober topology is coarser than the quotient topology. But $i$ is also surjective, so any open set of the quotient topology lives in the sober topology, i.e., the sober topology and the quotient topology coincide. Let $Sp(X) = \{Sp(x) \in n(X) \mid x \in X\}$. From this information we deduce this result:

Lemma 33 The sober quasi-homeomorphism $i : X \to X^s$ for the maximal meter topology is open, surjective, and (therefore) the topology of $X^s$ is the quotient topology. We have a bijection $i_{SX} : X^s \xrightarrow{\sim} Sp(X) : i(x) \mapsto Sp(x)$ of sets.
We may rephrase the sober topology definition to generate a topology on the nerve $n(X^{MaxMet})$ which extends the topology on $Sp(X)$ which is induced by the bijection $i_{SX}$. By definition, open sets in $X^s$ are given by
$$V^s = \{\overline{\{x\}} \mid x \in X, \text{ there is } y \in V,\ Sp(y) \subset Sp(x)\}$$
for any open set $V$ in $X$. If we use the transition bijection, we have the corresponding open sets
$$i_{SX}(V^s) = \{Sp(x) \mid x \in X, \text{ there is } y \in V,\ Sp(y) \subset Sp(x)\}$$
and this can be rephrased to extend to the entire nerve by
$$V^{Sp} = \{\sigma \mid \sigma \in n(X^{MaxMet}), \text{ there is } y \in V,\ Sp(y) \subset \sigma\}.$$
We have $V^{Sp} \cap Sp(X) = i_{SX}(V^s)$. Let $n(X^{MaxMet})^s$ be the maximal meter nerve topology which is generated by the sets $V^{Sp}$. Observe that we have $\bigcup_i V_i^{Sp} = (\bigcup_i V_i)^{Sp}$ whereas the intersection $V_1^{Sp} \cap V_2^{Sp}$ need not be a generating open set. However, the restriction of the maximal meter nerve topology to the subspace $Sp(X)$ is the induced topology. So we have this:
Proposition 24 With the maximal meter and the maximal meter nerve topologies as defined above, the map $Sp : X \to n(X^{MaxMet})^s$ becomes a continuous map whose image (as a subspace with the relative topology) identifies to the associated sober space $X^s$.

This justifies the following definition:
Definition 64 An $\mathbb{R}$-valued sober weight on the zero-addressed local composition $X \subset Onset$ is a function $W : X \to \mathbb{R}$ which factorizes through the nerve map $Sp$. A factor $nW : n(X^{MaxMet}) \to \mathbb{R}$ is called a nerve weight for $W$. A nerve weight $nW$ is said to be induced iff its values are determined by the values on the vertexes, i.e., by the restriction $n_0W = nW|_{n_0(X^{MaxMet})}$.

Example 43 If $n_0W$ is any $\mathbb{R}$-valued function, we may take the sum $nW(\sigma) = \sum_{M \in \sigma} n_0W(M)$ of vertex values. A typical vertex function will be discussed for the MetroRUBETTE® (section 41.1): Fix a real exponent $prof$, the weight's "profile", and two natural numbers $min \le max$. The function associates with the maximal local meter $M \in n_0(X^{MaxMet})$ the number
$$n_0W(M) = \begin{cases} l(M)^{prof} & \text{if } min \le l(M) \le max, \\ 0 & \text{else.} \end{cases} \tag{21.1}$$
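The induced weight of Example 43 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the MetroRUBETTE® implementation: onsets are taken to be integers, a local meter is assumed to be an arithmetic progression of at least two onsets inside $X$, and its length $l(M)$ is assumed to be the number of its points minus one. The function names `maximal_local_meters` and `metric_weight` are hypothetical.

```python
from itertools import combinations


def maximal_local_meters(onsets):
    """Return the inclusion-maximal arithmetic progressions inside the onset set.

    Assumption: a local meter is a progression s, s+d, ..., s+l*d (l >= 1)
    all of whose points are onsets; integer onsets avoid rounding issues.
    """
    points = sorted(set(onsets))
    present = set(points)
    candidates = set()
    for a, b in combinations(points, 2):
        d = b - a
        if a - d in present:
            continue  # progression can be extended to the left, so skip this start
        meter = [a]
        while meter[-1] + d in present:
            meter.append(meter[-1] + d)
        candidates.add(frozenset(meter))
    # keep only progressions that are not properly contained in another one
    return [m for m in candidates if not any(m < other for other in candidates)]


def metric_weight(onsets, prof=2, min_len=1, max_len=None):
    """W(x) = sum of l(M)**prof over maximal local meters M containing x,
    restricted to min_len <= l(M) <= max_len, following formula (21.1)."""
    weight = {x: 0.0 for x in set(onsets)}
    for meter in maximal_local_meters(onsets):
        length = len(meter) - 1  # assumed length convention
        if length >= min_len and (max_len is None or length <= max_len):
            for x in meter:
                weight[x] += length ** prof
    return weight


if __name__ == "__main__":
    print(metric_weight([0, 2, 4, 5], prof=1, min_len=2))
```

Because the weight of an onset only depends on the set of maximal local meters containing it, i.e., on its simplex, the function factorizes through the nerve map as required by Definition 64.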
So musically, a sober weight is a valuation of onset events in $X$ by their simplexes, i.e., their position within the covering by maximal local meters, which are particular points in the topological space $(X^{MaxMet})^s$. No systematic treatment of the real vector space of sober weights or, more generally, the cohomology of such weights, has been undertaken to date.

Exercise 42 In this context, one may consider the covering $MaxMet$ of $X$ and then restrict everything to the integers as a ground ring. Given a prime number $l$, the functors of $\mathbb{Z}_{l^r}$-forms yield the $l$-adic cohomology $H_l(X^{MaxMet})$ introduced in equation (19.11). We argue that this information should be important for the classification of global meters, and also for defining refined sober weights. Elaborate the form functor on the maximal meters and its cohomology.

The method of induced weights can be generalized to interpretations in the onset domain. Suppose that $X \subset Onset$ is interpreted to yield the global composition $X^I$. This may happen if we generate this interpretation from a knowledge about the origin of $X$. For example, it could happen that $X = pr_{Onset}(Y^J)$, the projection of a given global composition into the onset domain, and that we have the charts $Y_j \subset Y$ of a covering $J$. Then we obtain an interpretation $X^I$ of $X$ by the projections $pr_{Onset}(Y_j)$ of these charts. We shall learn from section 41.1 that this is a very common situation in music research, for instance when collecting different kinds of rhythmically significant objects, such as notes, bar-lines, pauses, etc. We may then consider the maximal meter topology on each chart $X_i$ of $X^I$. Suppose we are given a sober weight $W_i : X_i \to \mathbb{R}$ for each chart. Then we may construct an induced weight from the nerve $n(X^I)$. For $x \in X$, let $Sp_I(x)$ be the simplex of $x$ with respect to the covering $I$. Then we define $W(x) = \sum_{X_i \in Sp_I(x)} W_i(x)$. This enables us to create mixed weights for points in an interpretation and thus build a refined weighting of onsets with respect to different predicative specifications. In particular, we shall see in section 41.1 that the Jackendoff-Lerdahl approach can be simulated (and this in a refined way) by reintroducing the bar-lines with a specific weight to add up to the total weight of an onset.
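A minimal sketch of the mixed weight $W(x) = \sum_{X_i \in Sp_I(x)} W_i(x)$ follows. The chart names (`notes`, `bar_lines`) and the numerical weights are invented for illustration; each chart is represented, as an assumption, by a dictionary whose keys are the onsets of the chart and whose values are the chart's sober weight.

```python
def mixed_weight(charts):
    """Sum the chart weights W_i(x) over all charts X_i that contain the onset x.

    charts: {chart name: {onset: W_i(onset)}}; the keys of each inner
    dictionary play the role of the chart X_i.
    """
    total = {}
    for chart_weights in charts.values():
        for onset, w in chart_weights.items():
            total[onset] = total.get(onset, 0.0) + w
    return total


if __name__ == "__main__":
    # hypothetical data: note onsets with their metric weights, plus bar-lines
    charts = {
        "notes": {0: 4.0, 1: 1.0, 2: 2.0, 3: 1.0},
        "bar_lines": {0: 3.0, 4: 3.0},
    }
    print(mixed_weight(charts))
```

Giving the bar-line chart a constant positive weight, as in the sample data, is one way to read the remark that bar-lines can be reintroduced with a specific weight.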
21.3 Macro-Events in the Time Domain
Summary. This section reviews macro-events in the light of topologies of global meters and rhythms and the associated weights.
The previous discussion of temporal denotators was centered around topological relations, such as dominance, of onset events. It is evident that topological relations yield a tool for grouping onset events according to their metrical role. To this end, we review the definitions of local rhythms from section 13.4.3. In that context, an ($A$-addressed) rhythmic germ is a local composition $G \subset A@Rhythm(Para)$. We shall deal with germ denotators $G$ whose parameter form is
$$Para \xrightarrow{Fin(Rhythm(Para)) \rightarrowtail \Omega^{F}} Power(Rhythm(Para))$$
with $F = Fun(Rhythm(Para))$. So the form $Para$ is circular and reappears as a parameter space associated with the onset coordinator. We know from section 6.5 that $Para$ exists. Such a denotator is a Para-rhythmic germ as well as a macro denotator as defined in formula (6.116). The point is that the germ parameter space coincides with the germ space. Intuitively speaking, such a germ is a local composition of onsets which are parametrized by germs of onsets, etc., recursively, until (in a 'tame' case) the parametrizing germs reduce to the empty set.

This being true, we want to associate such a macro germ with each maximal meter topology on a (zero-addressed) local composition $X \subset Onset$. To this end, consider the Hasse diagram⁵ $Dom(X)$ for the partial ordering of the dominance relation on $X$ under the maximal meter topology. This defines a macro germ as follows. We take the level function⁶ $lev(x)$, $x \in X$, and define by level recursion: $Dom(x) = (x, \emptyset)$ if $lev(x) = 0$, and
$$Dom(x) = (x, \{(Dom(y)_1 - x, Dom(y)_2) \mid y \in pre(x)\}) \tag{21.2}$$
in the other cases, with the two components $Dom(y)_1, Dom(y)_2$ of the macro $Dom(y)$. Then we put $Dom(X) = \{Dom(x) \mid x \in Max(X)\}$.

⁵ See appendix C.2, definition 121.
⁶ See appendix C.2.
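The level recursion (21.2) translates directly into a recursive function. The sketch below assumes numerical onsets and a dictionary `pre` (a hypothetical name and data layout) that maps each point to its immediate predecessors in the Hasse diagram; a point without predecessors has level 0 and receives the empty satellite set.

```python
def dom(x, pre):
    """Macro germ Dom(x) = (x, {(Dom(y)_1 - x, Dom(y)_2) : y in pre(x)}).

    pre: {point: iterable of immediate predecessors}; points that are missing
    from the dictionary are treated as level-0 points, so Dom(x) = (x, {}).
    """
    satellites = set()
    for y in pre.get(x, ()):
        anchor, ramification = dom(y, pre)
        # the satellite anchor is stored relative to the dominating onset x
        satellites.add((anchor - x, ramification))
    return (x, frozenset(satellites))


def dom_of_composition(maximal_points, pre):
    """Dom(X) = {Dom(x) : x in Max(X)}."""
    return {dom(x, pre) for x in maximal_points}


if __name__ == "__main__":
    pre = {0: [2, 4], 2: [3]}  # invented toy hierarchy
    print(dom(0, pre))
```

Frozen sets are used so that macro germs are hashable and can themselves be collected into the set $Dom(X)$.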
Example 44 Let us make an example, see figure 21.2.

[Figure 21.2: An irregular metrical structure $X$ and its nerve. The associated macro germ is described in the text.]

Denoting simplexes by $\{I_1, I_5, I_6\} = \Delta(1,5,6)$, $\{I_1, I_5\} = \Delta(1,5)$, etc., we have these $Sp$ function values:

$Sp(x1) = \Delta(1,2)$, $Sp(x2) = \Delta(1,3)$,
$Sp(x3) = Sp(x4) = Sp(x6) = \Delta(1)$, $Sp(x5) = \Delta(1,5,6)$,
$Sp(x7) = Sp(x9) = \Delta(3)$, $Sp(x8) = \Delta(3,4)$,
$Sp(x10) = Sp(x11) = \Delta(5)$, $Sp(x12) = Sp(x13) = \Delta(6)$,
$Sp(x14) = Sp(x15) = \Delta(2)$, $Sp(x16) = Sp(x17) = \Delta(4)$.
So the points $x1, x2, x5, x8$ are generic, and we have

$Dom(x1) = (x1, \{(x3 - x1, \emptyset), (x4 - x1, \emptyset), (x6 - x1, \emptyset), (x14 - x1, \emptyset), (x15 - x1, \emptyset)\})$,
$Dom(x2) = (x2, \{(x3 - x2, \emptyset), (x4 - x2, \emptyset), (x6 - x2, \emptyset), (x7 - x2, \emptyset), (x9 - x2, \emptyset)\})$,
$Dom(x5) = (x5, \{(x3 - x5, \emptyset), (x4 - x5, \emptyset), (x6 - x5, \emptyset), (x10 - x5, \emptyset), (x11 - x5, \emptyset), (x12 - x5, \emptyset), (x13 - x5, \emptyset)\})$,
$Dom(x8) = (x8, \{(x7 - x8, \emptyset), (x9 - x8, \emptyset), (x16 - x8, \emptyset), (x17 - x8, \emptyset)\})$.

We see that the same point, e.g., $x4$, appears in three different macro ramifications since it is a specialization of three different generic points. But flattening this macro-event eliminates this multiplicity. We then have
$$Dom(X) = \{Dom(x1), Dom(x2), Dom(x5), Dom(x8)\}$$
which gives a complete picture of the dominance hierarchy; make a figure of this dominance hierarchy. If we are given a macro germ, it is possible to disregard its ramification structure which is deeper than a given limit; this operation of "greeking"⁷ can be defined by replacing the macro satellite sets by the empty set from a given depth on.
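The generic points and their specializations in Example 44 can be recomputed from the table of simplexes alone. The sketch below encodes $Sp(x_k)$ as a set of chart indices and assumes, as the listed values suggest, that $x$ dominates $y$ exactly when $Sp(y)$ is a proper subset of $Sp(x)$; the variable names are hypothetical.

```python
# Sp(x_k) for k = 1..17, copied from the example (chart indices only)
sp = {
    1: {1, 2}, 2: {1, 3}, 3: {1}, 4: {1}, 5: {1, 5, 6}, 6: {1},
    7: {3}, 8: {3, 4}, 9: {3}, 10: {5}, 11: {5}, 12: {6}, 13: {6},
    14: {2}, 15: {2}, 16: {4}, 17: {4},
}

# generic points: their simplex is not properly contained in another simplex
generic = [x for x in sp if not any(sp[x] < sp[y] for y in sp)]

# specializations of each generic point: points with a strictly smaller simplex
specializations = {x: [y for y in sp if sp[y] < sp[x]] for x in generic}

# number of generic points that dominate x4
multiplicity_of_x4 = sum(4 in ys for ys in specializations.values())

if __name__ == "__main__":
    print(generic)
    print(specializations)
    print(multiplicity_of_x4)
```

The satellite lists obtained this way agree with the four macro germs written out above, and the count for $x4$ reproduces the multiplicity that flattening removes.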
Exercise 43 Give a rigorous definition of greeking macro germs.
7 An operation known from text applications, meaning that a structure is blurred, recalling the English idiom "that sounds Greek to me".
Chapter 22
Motif Gestalts

In the organic sphere one cell is different from all the others. By a magic interplay between these identical yet different cells, the higher forms of life come into existence. In an astoundingly analogous way one musical motif, one theme releases another as an expression of its own innermost idea, yet the latter is a being entirely different from the first.
Rudolph Reti [444, p.359]

Summary. This chapter is not only a good test-case for the mathematization of elementary music concepts, it is above all a refined study of turning fuzzy concept sketches of the humanities into precise and consistent frameworks, without the expected side-effect of "terrible mathematical simplification". In the present case of motives, our topic is, within the general task of grasping motivic phenomena, the construction of Rudolph Reti's immanent motif analysis.
Understanding the motivic, melodic, and thematic organism of a musical composition is one of the most delicate tasks of musicology. It has three principal components which characterize its problematic status: semantic depth, formal complexity, and ill-defined aspects.

Semantic depth is the phenomenon that motives, melodies, and themes are not only formal constructs, i.e., denotator structures with little or no paratextual and easy textual predication. A composition's motif is a germ of a structural hierarchy, unfolding into its most diversified ramifications, variations, and fragmentations. This hierarchy is an expression of meaning, of multiply layered semantic depth which may transcend pure textuality and point at more philosophical, ethical or at least esthetical programs.

Formal complexity appears when one tries to grasp von Ehrenfels' easy-going definition of a gestalt. One recognizes that its characteristics, super-summativity and transformational invariance, are very complex requirements if rendered in a precise concept framework. We have already discussed some of these issues in chapter 12. This problem makes clear that the construction of semantic layers from motivic, melodic, or thematic surface is a task which involves some structural complexity.

Ill-defined aspects of motivic, melodic, and thematic analysis show up if one asks for abstraction principles when defining gestalt qualities. It is commonly accepted in motif theory (if that name were really adequate) that gestalt is connected to simplification and class building from the given material. It virtually never happens that the given note information is taken literally to build gestalts. But it is completely unclear which aspects of the given material should be retained, and in what form, or even in what variety of particular forms. In what follows, we have attempted to give a number of solutions or to make proposals for such, and to discuss the relations between our approaches and the fuzzy but sensitive ideas from traditional musicology. The application of the following topological methods in motif theory to programming software for motif analysis is important and will be dealt with in section 41.2.

G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
22.1 Motivic Interpretation
Summary. The first step in the present motive analysis framework is the insight that motives cannot be processed without preliminary abstraction. We present musical motivations and case studies.
To begin with, recall the definition of a motif from section 7.2.3: If $A$ is an address, an ($A$-addressed) motif $M$ is an objective local composition $M \subset A@F$ whose ambient space is $F = Onset \oplus Pitch \times Para$, for any form $Para$ of additional "parameters", and such that for any pair of different elements in $M$, their projections to the onset space are different. We shall write $M_{OP}$ for the projection of $M$ onto its OP component, i.e., the local composition $M_{OP} \subset A@(Onset \oplus Pitch)$, and analogously $M_O \subset A@Onset$. We therefore have bijective projection morphisms
$$M \xrightarrow{pr_{OP}} M_{OP} \xrightarrow{pr_O} M_O$$
for motives.

Before entering into the technical discussion, let us have a look at the auxiliary space $Para$. As with meters, we can also use this space to realize macro constructs in the motivic context. More precisely, in this case the motif $M$ is a denotator of form
$$Para \xrightarrow{Fin(F) \rightarrowtail \Omega^{Fun(F)}} Power(F).$$
The flattening and greeking operations discussed in chapter 21 can be applied mutatis mutandis. So the parameter space may be an external space regarding duration, sound colors, etc., or a macro refinement construct which englobes hierarchies of onset plus pitch events. We denote by $A@MOT_F$ the set¹ of $A$-addressed motives of space $F$, and we write $MOT_F$ for the set of zero-addressed motives, and $A@MOT$ ($MOT$) for the set of $A$-addressed (zero-addressed) motives in the space $F = Onset \oplus Pitch$ (the parameter space is trivial). For a positive natural number $n$, we denote by $A@MOT_{F,n}$, $MOT_{F,n}$, $MOT_n$, respectively, the corresponding sets of motives with cardinality $n$; we therefore have $A@MOT_F = \bigcup_n A@MOT_{F,n}$, etc.

¹ Pay attention to the fact that this is not a functor since distinction of points by their onsets is not functorial!
Our first abstraction is a strong one: We shall forget about all parameters from the $Para$ component and concentrate on the OP-projection $M_{OP}$, i.e., on $A@pr_{OP} : A@MOT_F \to A@MOT$. Although our analyses will deal with the real motives, their properties will be 'filtered' through $pr_{OP}$. This is not mandatory; any extension of the forthcoming theory to the other parameters is feasible, for instance inclusion of sound colors, loudness or duration in order to describe similarity of motives. So our exposition is just a prototypical one concentrating on the core parameters. We shall describe a compensatory action to this preliminary reduction below.
Definition 65 A (finite) motivic interpretation of a local composition $X \subset A@F$ is an interpretation by a (finite) subset $\mu \subset A@MOT_F$. The motives of $\mu$ are called the covering motives of the motivic interpretation $X^\mu$, and $\mu$ is called the motif space. We say that a motivic interpretation (or its motif space) satisfies the submotif existence axiom (SEA) iff every submotif of a covering motif is also covering whenever its cardinality equals at least the minimal cardinality of covering motives. The nerve of a motivic interpretation is called the motivic nerve.

This extends the definition of a melody given in definition 42, see section 13.4.4. Motivic interpretations occur for local compositions $X$ in score spaces $F$ with events having onset and pitch coordinates. Typically, the covering motives are selected by upper and lower limits $min$, $max$ of admitted motif cardinality and by the maximal admitted difference $span$ of onsets within a motif. We shall deal with this kind of covering satisfying the SEA in the discussion of the MeloRUBETTE® in section 41.2. A generalization of this selection method is given by a preliminary interpretation $X^I$ of $X$, for example by the charts of the voices in a polyphonic composition. We may then consider all motives within one of these charts, and subjected to the above limits (here the lower limit should be set to 1 in order to guarantee a covering of the given composition). Of course, there is already a strong semantics in the definition of a motivic interpretation. This one may be generated essentially by mathematical rules, by prima vista predicates (such as slurs, beams), by shifter predicates, or by combinations thereof; see section 18.3.3 for this differentiation, where the reader may anchor the corresponding semantic questions.

Evidently, the intersections $\bigcap \sigma$ of the simplexes $\sigma$ of a motivic nerve are also motives, and we obtain a base for a topology on $X$. In contrast to the maximal meter topology, this one is not interesting since often, the intersection $\bigcap \sigma$ is a singleton, and the topology becomes discrete. We call the simplex $Sp_\mu(x)$ of a point $x \in X$ the motivic simplex of $x$ (with respect to $\mu$). As with the metrical theory, the simplex of a point should give information about its role in the zoo of selected motives. The following study of topologies on motif spaces aims at providing us with that information.

This possibility to select motives by use of charts and their predicative background is a certain compensation for the preliminary reduction to onset and pitch. On the other hand, the analytical, cognitive or compositional perspective still does not work with the full information produced by the onset and pitch data. For example, in counterpoint one is interested in the categorization of interval sequences: the cantus firmus and discantus may move in different directions: in parallel, oblique, or contrary motion. And this enforces a very coarse classification of motivic gestalts. In American contour theory [381], pitch is only regarded as a number in an abstract discrete pitch space whereas onset is abstracted to a pure ordering index.
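The selection of covering motives by cardinality limits and onset span, together with a check of the SEA and the motivic simplex of a point, can be sketched as follows. This is only an illustration at the zero address with events given as hypothetical (onset, pitch) pairs; it is not the MeloRUBETTE® algorithm, and the function names are invented.

```python
from itertools import combinations


def covering_motives(notes, min_card, max_card, max_span):
    """All subsets with min_card <= cardinality <= max_card, pairwise different
    onsets, and onset span at most max_span; each motif is a frozenset."""
    notes = sorted(set(notes))
    motives = set()
    for n in range(min_card, max_card + 1):
        for subset in combinations(notes, n):
            onsets = [onset for onset, _ in subset]
            if len(set(onsets)) == n and max(onsets) - min(onsets) <= max_span:
                motives.add(frozenset(subset))
    return motives


def satisfies_sea(motives):
    """Submotif existence axiom: every submotif of a covering motif whose
    cardinality is at least the minimal covering cardinality is covering.
    Expects a non-empty motif space."""
    min_card = min(len(m) for m in motives)
    return all(
        frozenset(sub) in motives
        for m in motives
        for k in range(min_card, len(m))
        for sub in combinations(m, k)
    )


def motivic_simplex(x, motives):
    """Sp_mu(x): the set of covering motives that contain the point x."""
    return {m for m in motives if x in m}


if __name__ == "__main__":
    notes = [(0, 60), (1, 62), (2, 64), (4, 65)]  # invented sample events
    mu = covering_motives(notes, min_card=2, max_card=3, max_span=2)
    print(len(mu), satisfies_sea(mu), len(motivic_simplex((2, 64), mu)))
```

A selection of this kind satisfies the SEA automatically, since a submotif never has a larger onset span than the motif it is taken from.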
Therefore, a general abstraction concept framework is needed to grasp the variety of perspectives which contribute to the motivic gestalt concept. This is what we want to expose in the following section. It will help us in the construction of topological relations on a motif space as described above.

Although the motif concept works for any address, all the abstractions which are known from musicological/cognitive contexts germinated on zero-addressed motives. In this setting, they all make use of the linear ordering on a motif $M$ which is induced by the bijection with its onset projection. Now, this projection can be given a linear ordering if the address $A$ is such that $A@Onset$ is linearly ordered. For example, if we have $\mathbb{Z}$-addressed motives, we know from section 7.2.3 that the onset coordinates of their points $D$ are represented by arrows $ON_D \to OFF_D$. This data bears a canonical linear lexicographic ordering: first order the onset times, and then the offset times. Moreover, since the pitch coordinates are represented by arrows $PIFF_D \to POFF_D$, the same construction works for pitch. This elementary situation can be used to generalize zero-addressed abstractions to $\mathbb{Z}$-addressed generalizations, but there is no intuition or method to do so for general addresses. The following abstraction examples will therefore be of rather selective nature, restricting to zero and integer addresses, but the theory will nonetheless be carried out on a general address level, starting from the general axioms which we can learn from the given examples.
22.2 Shape Types
Summary. Several abstraction methods are in usage in musicology. We establish the formal concept of shape types and present a list of common constructs.
We begin with the formal definition of a shape type and then give representative examples thereof:

Definition 66 Given a parameter space $F$ and an address $A$, a shape type $t$ is a family $(\Gamma_{t,n})_{n \in \mathbb{N}_+}$ of non-empty sets, together with a set map
$$t : A@MOT_F \to \Gamma_t \tag{22.1}$$
with codomain $\Gamma_t = \bigcup_n \Gamma_{t,n}$ such that $t(A@MOT_{F,n}) \subset \Gamma_{t,n}$ for each index $n$. Equivalently, we may give the $n$-components
$$t_n : A@MOT_{F,n} \to \Gamma_{t,n} \tag{22.2}$$
with the direct sum $t = \coprod t_n$. The spaces $\Gamma_{t,n}$ are called the spaces of (abstract $n$-motives of type) $t$ and their elements are called abstract $n$-motives of type $t$; the space $\Gamma_t$ is called the space of $t$-abstract motives. For an abstract motif $m \in \Gamma_t$ we write $abcard_t(m)$ for the minimal index $n$ such that $m \in \Gamma_{t,n}$; this number is called the abstract cardinality of $m$. If $M \in A@MOT_F$, we write $abcard_t(M) = abcard_t(t(M))$ and call this number the $t$-abstract cardinality of $M$. We shall use the following notation: $\Gamma_t|k$ denotes the subset of abstract motives of abstract cardinality $k$, and we set $A@MOT_F|k = t^{-1}(\Gamma_t|k)$, whence $\Gamma_t = \coprod_k \Gamma_t|k$ and $A@MOT_F = \coprod_k A@MOT_F|k$.
Assumption 1 In the sequel we shall always make the assumption that the abstract cardinality is compatible with inclusions, i.e., $M \subset N$ implies $abcard_t(M) \le abcard_t(N)$.

All examples which we shall present satisfy this property. In principle it is possible and reasonable to view the abstract motives as denotators, but we will refrain from this formalism since the denotator theory is not (yet?) primordial here and would only burden the symbolism.
22.2.1 Examples of Shape Types
Summary. We present the rigid, diastematic, elastic, and toroidal shape types.
All the following types will undergo the announced generic abstraction from parameters outside onset and pitch.
22.2.1.1 Rigid Types
This abstraction is a generic one with respect to the others; in fact, it is a one-to-one restatement of the OP-projection $M_{OP}$ of a motif $M$ in terms of the ordering which is induced by the onset projection if this is given at the selected address $A$. For $A = \mathbb{Z}^r$, we have the well-known lexicographic ordering of $\mathbb{Z}^r@Onset$ denotators $o = (o_0, o_1, \dots, o_r)$, $o_i \in \mathbb{R}$. In this case, the onset ordering is transported to the points of $M_{OP}$, i.e., the rigid shape type $t = Rg$ is defined by $Rg(M) = (m_0, m_1, \dots, m_{c-1})$ where the sequence $(m_0, m_1, \dots, m_{c-1}) \in \Gamma_{Rg,c} = \mathbb{R}^{2(r+1) \times c}$ is the sequence of all elements of $M_{OP} = \{m_0, m_1, \dots, m_{c-1}\}$ with $c = card(M)$ in their onset ordering. In this setup, we have $abcard_{Rg}(M) = card(M)$, $\Gamma_{Rg} = \bigcup_n \Gamma_{Rg,n}$, $\Gamma_{Rg}|n = \Gamma_{Rg,n}$, and $\mathbb{Z}^r@MOT_F|n = \mathbb{Z}^r@MOT_{F,n}$. Concretely, $Rg(M) = (m_0, m_1, \dots, m_{c-1})$ is denoted by a $2(r+1) \times c$-matrix
$$Rg(M) = \begin{pmatrix} o_{0,0} & \dots & o_{0,i} & \dots & o_{0,c-1} \\ \vdots & & \vdots & & \vdots \\ o_{r,0} & \dots & o_{r,i} & \dots & o_{r,c-1} \\ p_{0,0} & \dots & p_{0,i} & \dots & p_{0,c-1} \\ \vdots & & \vdots & & \vdots \\ p_{r,0} & \dots & p_{r,i} & \dots & p_{r,c-1} \end{pmatrix} \quad \text{with } m_i = \begin{pmatrix} o_{0,i} \\ \vdots \\ o_{r,i} \\ p_{0,i} \\ \vdots \\ p_{r,i} \end{pmatrix} \tag{22.3}$$
and where the upper half $Rg_O(M)$ refers to the onsets whereas the lower half $Rg_P(M)$ refers to the pitch coordinates.

A second shape type $\Delta Rg$, the rigid difference shape type, is directly derived from $Rg$: We take $\Gamma_{\Delta Rg,c} = \mathbb{R}^{2(r+1) \times (c-1)c/2}$ and define
$$\Delta Rg(M) = (m_j - m_i)_{0 \le i < j < c} \quad \text{if } 1 < c, \tag{22.4}$$
while for $c = 1$ the value $\Delta Rg(M)$ is the unique element of the one-point space $\mathbb{R}^{2(r+1) \times 0}$. We shall usually represent the $i,j$-indexed entities in the rigid difference vector as coefficients $\Delta Rg(M)_{i,j}$ of an upper triangular $c \times c$-matrix (starting at the upper codiagonal).
As a specialization of the rigid difference shape type, we may consider a selected $k$th codiagonal with index difference $j - i = k$. This rigid $k$-difference shape type will be denoted by $\Delta_k Rg$; we have $\Gamma_{\Delta_k Rg,c} = \mathbb{R}^{2(r+1) \times (c-k)}$, and
$$\Delta_k Rg(M) = (m_{i+k} - m_i)_{0 \le i < c-k} \quad \text{if } 1 < c. \tag{22.5}$$
22.2.1.2 Diastematic Types
The diastematic type $t = Dia$ comes from classical musicology where one is often only interested in the direction of a melodic movement. The diastematic shape type is a 'child' of the rigid difference shape type. It resides on the index function² $index(x)$ when applied to the coefficients of the rigid difference shapes. We take $\Gamma_{Dia,c} = \mathbb{Z}^{(r+1) \times (c-1)c/2}$ and define
$$Dia(M)_{i,j} = (index(\Delta Rg(M)_{i,j,s}))_{r+1 < s \le 2(r+1)} \tag{22.6}$$
with the $s$-index being the row number of the matrix. And we may evidently carry over the codiagonal procedure to the diastematic situation in defining $\Gamma_{Dia_k,c} = \mathbb{Z}^{(r+1) \times (c-k)}$, and
$$Dia_k(M) = (Dia(M)_{i,i+k})_{0 \le i < c-k}. \tag{22.7}$$
This is the diastematic $k$-difference shape type; the special case of $k = 1$ is also called diastematic index shape type. For the special case $r = 0$, the diastematic shape type includes the COM matrix used in AST contour concepts, see [74, 380].

As a function of this data, we may shape contrapuntal step types as follows. Take the address $A = \mathbb{Z}$, a case which we have already illustrated in section 7.2.3 as being a common contrapuntal situation. We have the values $Dia_{i,j} = (u, v)$ which means that the cantus firmus movement from interval $i$ to interval $j$ has index $u$ whereas the discant movement has index $v$. Therefore, the product $u.v$ is positive iff we have parallel motion, it is zero iff we have oblique motion, and it is negative iff we have contrary motion. More generally, we may define the contrapuntal motion shape type $t = Contra$ as a 'child' of the diastematic shape type as follows: We take $\Gamma_{Contra,c} = \mathbb{Z}^{r \times (c-1)c/2}$ and put
$$Contra(M)_{i,j} = (Dia(M)_{i,j,s} \cdot Dia(M)_{i,j,r+2})_{r+2 < s \le 2(r+1)}. \tag{22.8}$$
In the case $r = 1$ we get the above contrapuntal example. As above, we may also project to the $k$-th codiagonal and consider the corresponding contrapuntal $k$-motion shape type $k$-Contra; the details are left to the reader. The special case of the 1-Contra shape type really gives us an abstraction which codifies exactly the successive contrapuntal motion type picture.

Exercise 44 Try to define a reduced diastematic shape type by use of the above lexicographic order (end of section 22.1) on the pitch coordinate vectors and the diastematic values $-1, 0, +1$ if pitch decreases, is unchanged, or increases. Describe the shape spaces $\Gamma_{RedIndia,c}$ for the codiagonal projection of this reduced diastematic type.

² See appendix D.5.1.
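The rigid, rigid difference, diastematic, and contrapuntal motion shape types can be illustrated by a short sketch. It assumes that every point of a $\mathbb{Z}^r$-addressed motif is stored as a tuple of $2(r+1)$ numbers (the $r+1$ onset rows first, then the $r+1$ pitch rows) and that the index function of appendix D.5.1 acts as the sign function, in line with the values $-1, 0, +1$ mentioned in Exercise 44. All function names are hypothetical.

```python
def sign(v):
    """Assumed index function: -1, 0 or +1 according to the sign of v."""
    return (v > 0) - (v < 0)


def rigid(motif, r=0):
    """Rg(M): the points ordered lexicographically by their r+1 onset rows."""
    return sorted(motif, key=lambda m: m[: r + 1])


def rigid_difference(motif, r=0):
    """Delta Rg(M): {(i, j): m_j - m_i} for 0 <= i < j < c."""
    pts = rigid(motif, r)
    c = len(pts)
    return {
        (i, j): tuple(b - a for a, b in zip(pts[i], pts[j]))
        for i in range(c)
        for j in range(i + 1, c)
    }


def diastematic(motif, r=0):
    """Dia(M): signs of the pitch rows of the rigid differences."""
    return {
        ij: tuple(sign(v) for v in diff[r + 1:])
        for ij, diff in rigid_difference(motif, r).items()
    }


def contra(motif, r=1):
    """Contra(M): products of each further pitch-row sign with the first one."""
    return {
        ij: tuple(s * d[0] for s in d[1:])
        for ij, d in diastematic(motif, r).items()
    }


if __name__ == "__main__":
    print(diastematic({(0, 60), (2, 62), (1, 64)}))
    # Z-addressed points: (onset, offset, cantus firmus pitch, discant pitch)
    print(contra([(0, 1, 60, 67), (1, 2, 62, 65)]))
```

For $r = 1$ a positive entry of `contra` indicates parallel motion, zero oblique motion, and a negative entry contrary motion, matching the sign rule for the product $u.v$ stated above.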
22.2.1.3 Elastic Type
From the cognitive point of view, it might be reasonable to concentrate on geometric configurations up to dilatations and translations. This type $t = Elast$ is codified as a 'child' of the rigid difference shape type modulo dilatations, i.e., the diagonal action of the multiplicative group $\mathbb{R}^*_+$ on the coefficients of the matrix $\Delta Rg(M)$. The orbits of the matrices can be parametrized by special homogeneous coordinates. We observe that each matrix $\Delta Rg(M)$ has in its onset part (upper half) its positive first coefficient $o_{a(i,j),i,j}$, for each matrix position $(i,j) = (0,1), \dots, (c-2,c-1)$ in (22.4). Take the unique matrix dilatation such that $\sum_{i,j} o_{a(i,j),i,j} = 1$, then we have a unique representation of that $\mathbb{R}^*_+$-orbit. So we first fix a function $a : \{(0,1), \dots, (c-2,c-1)\} \to \{0, 1, \dots, r\}$ and denote $\Sigma a = \sum_{i,j} a(i,j)$. We then define the smallest non-vanishing (positive) coefficients $o_{a(i,j),i,j}$, $(i,j) = (0,1), \dots, (c-2,c-1)$, of the $(i,j)$ position, together with the unity sum condition. The other coefficients are all free. So the abstract motif space looks as follows. For a fixed function $a$, we have the vector of initial values $(o_{a(i,j),i,j})_{i,j} \in |\Delta_{c(c-1)/2-1}|^o$, the interior of the affine $(c(c-1)/2 - 1)$-simplex. The other free values are distributed below the initial values in the rigid difference matrix (22.4). Each position $(i,j)$ has $2(r+1) - (a(i,j)+1) = 2r - a(i,j) + 1$ free values, which add up to $\sum_{i,j} (2r - a(i,j) + 1) = (2r+1)(c-1) - \Sigma a$ and are therefore in $\mathbb{R}^{(2r+1)(c-1) - \Sigma a}$. This means that we have the shape spaces $\Gamma_{Elast,c} = |\Delta_{c(c-1)/2-1}|^o \times \mathbb{R}^{(2r+1)(c-1) - \Sigma a}$.
22.2.1.4 Toroidal Type
For the toroidal shape type $t = Toroid^{\gamma}_{m,l}$, we restrict to the zero address. We are given two positive natural numbers $m, l$, and an affine grid basis $\gamma = (o, x, y) \in (Onset \oplus Pitch)^3$, consisting of an origin $o$ and a linear grid basis $x, y$ which define the grid$^3$ $G(\gamma) = o + \mathbb{Z}.x + \mathbb{Z}.y \subset Onset \oplus Pitch$. With respect to $\gamma$, every point $p \in Onset \oplus Pitch$ has a unique representation $p = \gamma(p) + \xi.x + \eta.y$ with $0 \le \xi, \eta < 1$ and $\gamma(p) \in G(\gamma)$. We then associate with $p$ the grid point $p_\gamma = \gamma(p) + round(\xi).x + round(\eta).y$ (see definition 134 in appendix D.5.1 for the rounding function). Let $p_\gamma = o + a.x + b.y$ be the representation of $p_\gamma$ in the grid. Then we have the element $p_{\gamma,m,l} = (a \bmod m, b \bmod l) \in OnPiMod_{m,l}$ whose coordinates are in the discrete torus $\mathbb{Z}_m \oplus \mathbb{Z}_l$ as discussed in formula (6.43) of section 6.4.1. If $M \in MOT_{F,n}$ is a motif, this defines a local composition $Toroid^{\gamma}_{m,l}(M) = \{p_{\gamma,m,l} \mid p \in M\}$. It is contained in the set $\Gamma_{Toroid,n} = ObLoc^{\le n}_{0,OnPiMod_{m,l}}$ of local, zero-addressed compositions in $OnPiMod_{m,l}$ with cardinality at most $n$, and we have a shape map
$$Toroid^{\gamma}_{m,l} : MOT_{F,n} \to \Gamma_{Toroid,n} \qquad (22.9)$$
on the given cardinality level $n$.
Exercise 45 Observe that in this case, the shape spaces are not disjoint, but the inclusion assumption 1, section 22.2.1, is verified.
By use of the rigid type, we can also define a toroidal sequence shape type $TorSeq$ as follows. Again, we fix an affine grid basis $\gamma$ and a pair $m, l$ of positive integers. Take the (zero-addressed) rigid abstract motif $Rg(M) = (m_0, \ldots m_{c-1})$; then we have the sequence $TorSeq(M) = (m_{0,\gamma,m,l}, \ldots m_{c-1,\gamma,m,l}) \in \Gamma_{TorSeq,c} = 0@(OnPiMod_{m,l})^c \cong (\mathbb{Z}_m \oplus \mathbb{Z}_l)^c$.
3 Recall that the underlying module of $Onset \oplus Pitch$ is $\mathbb{R}^2$.
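The grid quantization behind the toroidal shape type can be sketched in a few lines of Python. This is only an illustration under stated assumptions: points are pairs of real numbers, the rounding function of definition 134 is taken to be "round half up" (the appendix is not reproduced here), and the function names are hypothetical.

```python
import math

def grid_coordinates(p, origin, x, y):
    """Real coordinates (u, v) with p = origin + u*x + v*y (Cramer's rule)."""
    det = x[0] * y[1] - x[1] * y[0]
    if det == 0:
        raise ValueError("x and y must be linearly independent")
    d0, d1 = p[0] - origin[0], p[1] - origin[1]
    u = (d0 * y[1] - d1 * y[0]) / det
    v = (x[0] * d1 - x[1] * d0) / det
    return u, v

def round_half_up(t):
    # Assumption: stands in for the rounding function of definition 134.
    return math.floor(t + 0.5)

def toroid_point(p, origin, x, y, m, l):
    """Grid point nearest to p, reduced to the discrete torus Z_m + Z_l."""
    u, v = grid_coordinates(p, origin, x, y)
    # Rounding u equals floor(u) + round(xi), since floor(u) is an integer.
    a, b = round_half_up(u), round_half_up(v)
    return (a % m, b % l)

def toroid(motif, origin, x, y, m, l):
    """Local composition of torus points; collisions may lower the cardinality."""
    return frozenset(toroid_point(p, origin, x, y, m, l) for p in motif)

motif = [(0.1, 60.2), (1.0, 64.0), (4.2, 67.6)]
shape = toroid(motif, (0, 0), (1, 0), (0, 1), 4, 12)
```

Since different points may quantize to the same torus point, the result has cardinality at most that of the motif, in line with the definition of $\Gamma_{Toroid,n}$.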
CHAPTER 22. MOTIF GESTALTS
22.3
Metrical Similarity
Summary. This section deals with the metrical similarity of abstract motives. This concept is the base of motivic topologies used to understand Reti's thinking.
The abstraction process which yields the shape space $\Gamma_t$ is only the first identification step. In fact, after this shape abstraction, we need to deal with similarity of motives. This relation will be defined on abstract motives and then retracted to the original motives in the following sense:
Definition 67 Given a shape type $t : A@MOTF \to \Gamma_t$, a distance function is a sequence $d = (d_n)_{1 \le n}$ of pseudo-metrics $d_n$ which are defined on $\Gamma_t|_n$. A $t$-distance function is the sequence $d_t = (d_{t,n} = d_n|A@MOTF|_n)_{1 \le n}$ of pseudo-metrics which are induced on the spaces $A@MOTF|_n$ by the distance function $d$, i.e., if $M, N \in A@MOTF|_n$, then $d_{t,n}(M,N) = d_n(t(M), t(N))$. If no confusion is possible, we omit the index $n$ and just write $d_t(M,N)$.
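The retraction in definition 67 is a plain pull-back along the shape map. The following Python sketch illustrates it with a toy shape map (consecutive pitch differences), which is a hypothetical stand-in and not one of the shape types of section 22.2.1.

```python
import math

def pull_back(shape_map, d):
    """t-distance d_t(M, N) = d(t(M), t(N)) induced by a shape map t."""
    def d_t(M, N):
        return d(shape_map(M), shape_map(N))
    return d_t

def euclid(x, y):
    """Euclidean distance between two coordinate sequences of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def pitch_differences(motif):
    # Hypothetical toy shape map: consecutive pitch differences of a motif
    # given as (onset, pitch) pairs, ordered by onset.
    pts = sorted(motif)
    return tuple(q[1] - p[1] for p, q in zip(pts, pts[1:]))

d_t = pull_back(pitch_differences, euclid)

M = [(0, 60), (1, 62), (2, 64)]
N = [(5, 70), (6, 72), (7, 74)]   # a transposed, time-shifted copy of M
```

Here `d_t(M, N)` vanishes although `M` and `N` differ, which is why the induced distance is in general only a pseudo-metric on the motives.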
22.3.1
Examples of Distance Functions
Let us now discuss distance functions for shape types which have been introduced in section 22.2.1.
22.3.1.1
Distances for Rigid Types
The shape spaces for rigid types (subsubsection 22.2.1.1) are real matrix spaces $\Gamma_{Rg,n} = \mathbb{R}^{2(r+1) \times c}$ and $\Gamma_{\Delta Rg,n} = \mathbb{R}^{2(r+1) \times (c-1)c/2}$. On such a space, we have the usual Euclidean metric which defines the Euclidean distance function $Ed$ and the corresponding Euclidean $Rg$- or $\Delta Rg$-distance function $Ed_{Rg}$, $Ed_{\Delta Rg}$. We may also consider the abstract cardinality and define the relative Euclidean distance function: Let $abcard(x) = abcard(y) = n$ for two abstract motives; then we set $REd(x,y) = Ed(x,y)/n$ for rigid type $Rg$, and $REd(x,y) = Ed(x,y)/(n-1)$ for shape type $\Delta Rg$. Such a relative distance takes into account that, on the cognitive level, we would like the distance to be a kind of mean value of all the point distances between the first and first, second and second, etc., last and last points of two motives. This reason also holds for the other relative distances to be defined in the sequel.
22.3.1.2
Distances for Diastematic Types
For the diastematic types (subsubsection 22.2.1.2), the abstract motives also lie in matrix spaces, this time with integer coordinates, and we may also take the Euclidean distance function induced from the surrounding real matrix spaces. A relative Euclidean distance function on $\Gamma_{Dia}$ which relates to the number of components in the triangular matrix is given by $REd(x,y) = 2Ed(x,y)/((r+1)c(c-1))$. For the codiagonal shape types $Dia_k$, we set $REd(x,y) = Ed(x,y)/((r+1)(c-k))$. For the contrapuntal shape type $Contra$, we set $REd(x,y) = 2Ed(x,y)/(rc(c-1))$.
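The relative Euclidean distances for the rigid and diastematic types differ only in their division factors. A minimal Python sketch, assuming abstract motives are passed as flattened coordinate sequences and that the parameters $r$, $c$, $k$, $n$ are supplied by the caller (the function names are hypothetical):

```python
import math

def euclid(x, y):
    """Euclidean distance Ed between two flattened shape matrices."""
    if len(x) != len(y):
        raise ValueError("abstract motives must lie in the same shape space")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def red_rigid(x, y, n):
    """REd for the rigid type Rg: Ed / n."""
    return euclid(x, y) / n

def red_delta_rigid(x, y, n):
    """REd for the rigid difference type: Ed / (n - 1)."""
    return euclid(x, y) / (n - 1)

def red_dia(x, y, r, c):
    """REd for Dia: 2 Ed / ((r + 1) c (c - 1))."""
    return 2 * euclid(x, y) / ((r + 1) * c * (c - 1))

def red_dia_k(x, y, r, c, k):
    """REd for the codiagonal type Dia_k: Ed / ((r + 1)(c - k))."""
    return euclid(x, y) / ((r + 1) * (c - k))

def red_contra(x, y, r, c):
    """REd for Contra: 2 Ed / (r c (c - 1)); requires r >= 1."""
    return 2 * euclid(x, y) / (r * c * (c - 1))

x, y = (0, 0, 0, 0), (3, 4, 0, 0)   # Ed(x, y) = 5
```

Each factor counts the matrix entries that enter the respective shape, so the relative distance behaves like a mean over those entries.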
22.3.1.3
Distances for Elastic Type
As in the previous situations, we can apply the Euclidean distance function to the elastic shape type (subsubsection 22.2.1.3) since its abstract motives are also embedded in a real matrix space. In this case, it may be more adequate to take the metric which is deduced$^4$ from the 1-norm on $n$-space since in this case, the component of initial values in the affine simplex has just length 1. Since it is well-known$^5$ that the topology on $n$-space is independent of the defining norm, this will have no influence on the following topological considerations. A relative Euclidean or 1-norm distance can be defined via a division factor $2(r+1)(c-1)$ which takes into account the number of motif points and the dimension of the address-related columns.
22.3.1.4
Distances for Toroidal Types
On the discrete torus $\mathbb{Z}_m \oplus \mathbb{Z}_l$, one defines a metric $steps(x,y)$ by the minimal number of grid steps on all possible walks from $x$ to $y$, which is a kind of geodesic distance on this discrete torus. On the shape space $\Gamma_{TorSeq,c} \cong (\mathbb{Z}_m \oplus \mathbb{Z}_l)^c$, we may take the 1-norm, Euclidean, or uniform distance construction. Given two sequences $x = (x_i), y = (y_i) \in \Gamma_{TorSeq,c}$, we accordingly set
$$d_{1,c}(x,y) = \sum_i steps(x_i,y_i), \qquad (22.10)$$
$$d_{2,c}(x,y) = \sqrt{\sum_i steps(x_i,y_i)^2}, \qquad (22.11)$$
$$d_{\infty,c}(x,y) = \max\{steps(x_i,y_i) \mid i = 0, \ldots c-1\} \qquad (22.12)$$
and get a corresponding distance on toroidal sequences. This construction is also used in counterpoint theory, see chapter 30. For the Toroid shape type, one can define a distance via the lexicographic ordering among torus points, thereby ordering a local composition lexicographically and then applying the above distance functions for the lexicographic sequences of abstract motives ([73]). Exercise 46 Fill in the details of the Toroid distance function and verify that it is a pseudometric.
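The three distances (22.10) to (22.12) can be sketched as follows. The sketch assumes that a "grid step" is a unit move in exactly one of the two torus coordinates, so that $steps$ is the sum of the two cyclic coordinate distances; this reading is consistent with the fifth counting as 5 steps on $\mathbb{Z}_{12}$, but it remains an assumption.

```python
import math

def steps(x, y, m, l):
    """Minimal number of unit grid steps between x and y on Z_m + Z_l."""
    da = (x[0] - y[0]) % m
    db = (x[1] - y[1]) % l
    return min(da, m - da) + min(db, l - db)

def d1(xs, ys, m, l):
    """1-norm distance (22.10) on toroidal sequences."""
    return sum(steps(x, y, m, l) for x, y in zip(xs, ys))

def d2(xs, ys, m, l):
    """Euclidean distance (22.11) on toroidal sequences."""
    return math.sqrt(sum(steps(x, y, m, l) ** 2 for x, y in zip(xs, ys)))

def dinf(xs, ys, m, l):
    """Uniform distance (22.12) on toroidal sequences."""
    return max(steps(x, y, m, l) for x, y in zip(xs, ys))

xs = [(0, 0), (1, 4)]
ys = [(0, 7), (3, 4)]   # a fifth apart, then two onset steps apart
```

For these two sequences on $\mathbb{Z}_4 \oplus \mathbb{Z}_{12}$ the componentwise step numbers are 5 and 2.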
22.4
Paradigmatic Groups
Summary. Together with abstraction mechanisms, groups of symmetries act on spaces of abstract motives and, in several cases, on the proper motif spaces. These groups are termed paradigmatic groups since they relate to Jean-Jacques Nattiez' paradigmatic theme, as discussed in 11.7.1. An important case is that of actions which are equivariant with respect to the shape maps. The formal definitions of gestalts are associated with specific abstractions and group actions. This demonstrates that the concept of gestalt is the result of a multilayered concept hierarchy involving several non-automatic parameter choices.
4 See appendix I.1.1.
5 See appendix I.1.2.
As a matter of fact, comparison of motives is not restricted to abstraction and similarity. We know from the general discussion of paradigmatic concepts in chapter 10 that the topological similarity which has been induced (though not executed; this is the scope of the topological theory of motives in section 22.6) must be complemented by the transformational similarity. The latter deals with group actions on shape types in the following sense:
Definition 68 A paradigmatic group $P$ for a shape type $t : A@MOTF \to \Gamma_t$ is a left action $P \times \Gamma_t \to \Gamma_t$ on the shape space which leaves the components $\Gamma_{t,n}$ invariant. An equivariant paradigmatic group $P$ for a shape type $t : A@MOTF \to \Gamma_t$ is a pair of group actions
$$\gamma : P \times A@MOTF \to A@MOTF, \qquad \pi : P \times \Gamma_t \to \Gamma_t$$
which is equivariant$^6$ with respect to the shape map $t$ and such that $P$ is a paradigmatic group for $t$. We also write
$$\underset{\circlearrowleft\, P:\gamma}{A@MOTF} \xrightarrow{\ t\ } \underset{\circlearrowleft\, P:\pi}{\Gamma_t}$$
or more concisely (if the actions need not be specified)
$$A@MOTF \xrightarrow{\ t\ } \Gamma_t \circlearrowleft P$$
for this situation.
Lemma 34 If $P$ is a paradigmatic group for type $t$, then the abstract cardinality is an invariant for abstract motives, i.e., the group action leaves invariant the components $\Gamma_t|_n$.
The proof is an easy exercise. We are now ready to define the gestalt concept.
Definition 69 The orbits of a paradigmatic group $P$ action on $\Gamma_t$ are called the abstract gestalts of shape type $t$. The inverse images $t^{-1}G$ of abstract gestalts $G$ of $t$ are called the $t$-gestalts. If $M \in A@MOTF$, then its $t$-gestalt $t^{-1}(P.t(M))$ is denoted by $Ges^P_t(M)$, or $Ges_t(M)$ if $P$ is clear.
Sorite 7 Let $P$ be a paradigmatic group for shape type $t$. Then every $t$-gestalt $Ges_t(M)$ is contained in $\Gamma_t|_{abcard_t(M)}$. If the paradigmatic group is equivariant, then the orbit of a motif $M$ is contained in the gestalt: $P.M \subset Ges_t(M)$, i.e., a gestalt is a disjoint union of orbits. We therefore have the commutative diagram
$$\begin{array}{ccc}
A@MOTF & \xrightarrow{\ t\ } & \Gamma_t \\
\downarrow & & \downarrow \\
P\backslash A@MOTF \longrightarrow A@MOTF/Ges_t & \longrightarrow & P\backslash\Gamma_t
\end{array}$$
This follows immediately from the definitions and lemma 34.
Definition 70 The cardinality of a gestalt $Ges_t(M)$ is defined as $abcard_t(N)$ for any $N \in Ges_t(M)$.
6 See appendix C.3.1, example 70.
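Definition 69 can be illustrated on a finite toy universe of motives. In the sketch below the shape map (melodic contour of a pitch sequence) and the two-element group (identity and sign flip) are hypothetical choices made only for illustration; the group must be passed as the full list of its elements.

```python
def sign(d):
    return (d > 0) - (d < 0)

def contour(pitches):
    # Hypothetical toy shape map: signs of consecutive pitch differences.
    return tuple(sign(b - a) for a, b in zip(pitches, pitches[1:]))

def orbit(x, actions):
    """Abstract gestalt: orbit of the abstract motif x under a finite group."""
    return frozenset(g(x) for g in actions)

def gestalt(M, motifs, shape_map, actions):
    """t-gestalt of M: all motives whose shape lies in the orbit of t(M)."""
    target = orbit(shape_map(M), actions)
    return [N for N in motifs if shape_map(N) in target]

identity = lambda v: v
flip = lambda v: tuple(-a for a in v)      # toy 'inversion' on contours
group = [identity, flip]

universe = [(60, 62, 64), (64, 62, 60), (60, 64, 62), (70, 72, 74)]
ges = gestalt((60, 62, 64), universe, contour, group)
```

All members of a gestalt found this way have the same number of points, in line with lemma 34 and definition 70.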
22.4.1
Examples of Paradigmatic Groups
Here are some group actions which are of practical use. Preliminary remark: For any group $P \subset \overrightarrow{GL}(\mathbb{R}^2)$ of affine transformations on $Onset \oplus Pitch$, one has the canonical induced action $\gamma$ on the onset and pitch coordinates of motives in $A@MOT_{F,n}$. Many paradigmatic actions are defined as equivariant actions which issue from this canonical action on the proper motives, i.e., one first transports the motivic action to a motivic shape action and then tries to generalize the latter to any abstract motif. The most prominent group is the affine counterpoint group $CP = T_{2,\mathbb{R}} \rtimes \langle U, K \rangle$, the semidirect product of the translation group with the four-element group$^7$ generated by $U$ and $K$.
22.4.1.1
Paradigmatic Groups for Rigid Types
For the rigid shape type $Rg : \mathbb{Z}^r@MOTF \to \Gamma_{Rg}$, we have the above canonical action of $p \in CP$ on a motif $M$: $p(M) = \{p(m) \mid m \in M\}$. Evidently, the retrograde motion reverses the time order of elements in $M$. Therefore we can define a $CP$-action $\pi$ on $\Gamma_{Rg,c}$ by the rule
$$p(x_0, \ldots x_{c-1}) = \begin{cases} (p(x_0), p(x_1), \ldots p(x_{c-1})) & \text{if } p \text{ does not have the retrograde component,} \\ (p(x_{c-1}), p(x_{c-2}), \ldots p(x_0)) & \text{(reversed order) else.} \end{cases} \qquad (22.13)$$
7 The four-element group of plane symmetries generated by inversion $U$ (reflection at zero pitch) and retrograde $K$ (reflection at zero onset), see also section 8.1.1.
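Clearly, this defines an equivariant action
$$\underset{\circlearrowleft\, CP:\gamma}{\mathbb{Z}^r@MOTF} \xrightarrow{\ Rg\ } \underset{\circlearrowleft\, CP:\pi}{\Gamma_{Rg}}$$
for rigid shape type. For the rigid difference shape type, the affine action of $CP$ reduces to the linear action, i.e., we have the quotient action $\Delta\pi$ via an element $q \in LCP \cong CP/T_{2,\mathbb{R}}$ on $\Gamma_{\Delta Rg,c}$ by the following rule. If we write $d = (d_o, d_p)$ for the onset-pitch decomposition, and $U(d) = (d_o, -d_p)$, we have
$$q((d_{i,j})_{i,j}) = \begin{cases} (U(d_{i,j}))_{i,j} & \text{if } q = U, \\ (U(d_{c-1-j,c-1-i}))_{i,j} & \text{if } q = K. \end{cases} \qquad (22.14)$$
This gives the equivariant action
$$\underset{\circlearrowleft\, CP:\gamma}{\mathbb{Z}^r@MOTF} \xrightarrow{\ \Delta Rg\ } \underset{\circlearrowleft\, CP:\Delta\pi}{\Gamma_{\Delta Rg}}$$
for rigid difference shape type.
22.4.1.2
Paradigmatic Groups for Diastematic Types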
The canonical action of the affine counterpoint group on the motives also carries over to an equivariant action on the diastematic shape types. Since the latter are children of the rigid difference shape type, we have to see if we can have an equivariant action
$$\underset{\circlearrowleft\, LCP:\Delta\pi}{\Gamma_{\Delta Rg}} \xrightarrow{\ index\ } \underset{\circlearrowleft\, LCP:Dia\pi}{\Gamma_{Dia}}$$
between rigid difference and diastematic shape types (in this direction!). Again, we may define the action $Dia\pi$ on the generators and set
$$q((ind_{i,j})_{i,j}) = \begin{cases} (-ind_{i,j})_{i,j} & \text{if } q = U, \\ (-ind_{c-1-j,c-1-i})_{i,j} & \text{if } q = K \end{cases} \qquad (22.15)$$
for a diastematic index family $(ind_{i,j})_{i,j}$. Evidently, this action makes the index map equivariant. So as above, the quotient $CP \to LCP$ defines the equivariant action
$$\underset{\circlearrowleft\, CP:\gamma}{\mathbb{Z}^r@MOTF} \xrightarrow{\ Dia\ } \underset{\circlearrowleft\, CP:Dia\pi}{\Gamma_{Dia}}$$
of the affine counterpoint group in the diastematic case. This clearly carries over to the different codiagonal projections since the codiagonals are invariant under the retrograde action.
Exercise 47 Define an equivariant action of the affine counterpoint group on the contrapuntal motion shape type by use of the diastematic action.
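The equivariance expressed by rule (22.15) can be checked numerically in a simple case. The sketch below assumes the zero address, motives with pairwise distinct onsets, and a diastematic index $ind_{i,j}$ equal to the sign of the pitch difference between the $j$-th and the $i$-th point in onset order; these are simplifying assumptions, and the function names are hypothetical.

```python
def sign(d):
    return (d > 0) - (d < 0)

def dia(motif):
    """Index family ind[(i, j)], i < j, of a motif of (onset, pitch) pairs."""
    pts = sorted(motif)                      # onset order
    c = len(pts)
    return {(i, j): sign(pts[j][1] - pts[i][1])
            for i in range(c) for j in range(i + 1, c)}

def inversion(motif):                        # U: reflection at zero pitch
    return [(o, -p) for o, p in motif]

def retrograde(motif):                       # K: reflection at zero onset
    return [(-o, p) for o, p in motif]

def act_U(ind):
    """Action of U on an index family, first case of (22.15)."""
    return {ij: -v for ij, v in ind.items()}

def act_K(ind, c):
    """Action of K on an index family, second case of (22.15)."""
    return {(i, j): -ind[(c - 1 - j, c - 1 - i)] for (i, j) in ind}

M = [(0, 60), (1, 64), (2, 62), (3, 67)]
ok_U = dia(inversion(M)) == act_U(dia(M))
ok_K = dia(retrograde(M)) == act_K(dia(M), len(M))
```

Both comparisons succeed for this motif: taking the shape of the transformed motif agrees with transforming the shape.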
22.4.1.3
Paradigmatic Groups for Elastic Type
The paradigmatic counterpoint group action is a bit more delicate for the elastic type since we have to take care of the initial value functions $a$ and the change of these functions under the counterpoint group. As the elastic type is a child of the rigid difference type, we again want to define an equivariant action of the linear counterpoint group $LCP$ for the elastic map
$$\underset{\circlearrowleft\, LCP:\Delta\pi}{\Gamma_{\Delta Rg}} \xrightarrow{\ elast\ } \underset{\circlearrowleft\, LCP:Elast\pi}{\Gamma_{Elast}}$$
To begin with, the inversion $U$ leaves the initial value function $a$ invariant, whereas the retrograde $K$ maps $a$ to $K.a$ with $K.a(i,j) = a(c-1-j, c-1-i)$. Since the multiplicative action of $\mathbb{R}^*_+$ commutes with any linear action, and since the unity sum condition $\sum_{i,j} o_{a(i,j),i,j} = 1$ is invariant under the linear counterpoint group, we can take the rigid difference action on the homogeneous coordinates in the elastic shape spaces $\Gamma_{Elast}$, together with the above initial value function transformation. The $K$-action permutes the summands of the disjoint union $\coprod_a \mathbb{R}^{(2r+1)(c-1)-\Sigma a}$ according to the rigid difference action and the operation $K.a$ on the disjoint union "index" $a$.
Exercise 48 Fill in the details of the previous elastic action.
22.4.1.4
Paradigmatic Groups for Toroidal Types
This case is a bit different. There is no analogous equivariant action of the affine counterpoint group for toroidal types since the rounding function in the affine frame quantization is not compatible with the inversion and retrograde operations (check this!). We however have an action of the affine group (or any subgroup) $G_{m,l} = \overrightarrow{GL}(\mathbb{Z}_m \oplus \mathbb{Z}_l)$ of the torus $\mathbb{Z}_m \oplus \mathbb{Z}_l$ on the abstract motives. In fact, if $Q \in \Gamma_{Toroid}$, the pointwise action $\pi_{m,l}$ of $g \in G_{m,l}$, $g.Q = \{g.q \mid q \in Q\}$, gives the action on abstract toroidal motives. So here the abstract gestalts are the isomorphism classes of local compositions. For the toroidal sequence shape type, we have the usual coordinatewise action $\pi_{m,l}$ of $g \in G_{m,l}$ on a sequence $x_\cdot = (x_i)_i$ via $g.x_\cdot = (g.x_i)_i$.
22.5
Pseudo-metrics on Orbits
Summary. Metrical similarity between abstract motives is extended to gestalts which essentially refer to orbits of abstract motives under paradigmatic groups.
It is not sufficient to have metrical relations among the abstract and real motives; we also want to tell what it means that two gestalts are metrically neighboring. To this end, we need a preliminary knowledge about actions of groups of isometries. We know from appendix I.1.1, lemma 98, that if a pseudo-metric $d$ and an action of a group $G$ by isometries$^8$ are given on a space $V$, then the orbit space $G\backslash V$ is naturally provided with a pseudo-metric $d^*$.
8 See appendix I.1.1, definition 173.
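The formula for $d^*$ is given in the appendix and not in this section. A common construction, assumed here for illustration, is $d^*(G.x, G.y) = \min_{g \in G} d(x, g.y)$ for a finite group $G$ acting by isometries. The Python sketch below uses a hypothetical two-element group acting on the plane.

```python
import math

def euclid(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def orbit_distance(x, y, actions, d):
    """Assumed orbit pseudo-metric: min over the finite group of d(x, g.y).

    `actions` must list every group element, and each must be an isometry
    for d; otherwise the value depends on the chosen representatives.
    """
    return min(d(x, g(y)) for g in actions)

identity = lambda v: v
negate = lambda v: tuple(-a for a in v)    # an isometry of the plane
group = [identity, negate]

x, y = (1.0, 0.0), (-1.0, 0.5)
dist_xy = orbit_distance(x, y, group, euclid)
dist_yx = orbit_distance(y, x, group, euclid)
```

Points in the same orbit are at orbit distance zero, and the value is symmetric because the group elements are isometries.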
So we only have to look for isometric actions in our examples, and we obtain a pseudo-metric on the abstract gestalt spaces. Let us now check where this is the case.
1. Rigid Types. For the rigid shape type, the distance between two matrices is the Euclidean norm of their difference. On the other hand, the inversion and retrograde are isometries on the onset-pitch space; therefore, the counterpoint group $CP$ acts by isometries. The same is valid for the rigid difference shape type which has only its triangular matrix entries permuted and is a priori invariant under translations; the same holds for the codiagonal projections and relative distances.
2. Diastematic Types. The counterpoint group acts via its linear projection $LCP = \langle U, K \rangle$ on the diastematic shape type. Thereby, only signs are altered and indexes of coefficients are permuted (equation (22.15)). This leaves the Euclidean distance invariant. The same result holds for the codiagonal projections and relative distances.
3. Elastic Type. Since the action of $LCP$ on the elastic type is the action of this group on the rigid difference shape type with the corresponding homogeneous coordinates, $LCP$ also acts by isometries on the elastic type.
4. Toroidal Types. Here, we have to observe the step number between two points on the torus $\mathbb{Z}_m \oplus \mathbb{Z}_l$ when an affine automorphism is applied. In general, this is not an invariant. For instance, we have the fourth multiplication on $\mathbb{Z}_4 \oplus \mathbb{Z}_{12}$ (diagonal matrix with 5 in its pitch direction) which sends a fifth (minimal number of 5 steps) to the minor second (one step). However, if the numbers $m, l$ are relatively prime and the Euler functions$^9$ are $\phi(m), \phi(l) \le 2$, then the minimal step number is conserved. This condition holds only in a very small number of cases: $m, l = 2, 3, 4$, and the combinations $(m,l) = (2,3), (3,2), (3,4), (4,3)$. In this shape type, we should therefore concentrate on selected subgroups of the general affine group in order to preserve distances.
One such subgroup is $LCP$, which acts in the canonical way on the factors.
After these case studies we can define what is an orbit pseudo-metric for shape types. We say that a paradigmatic group $P$ for a shape type $t$ with distance $d$ consists of isometries if $P$ acts as a group of isometries on each $d_n$-pseudo-metric shape space $\Gamma_t|_n$. If this is the case, we have a pseudo-metric $d^*_{P,n}$ on each orbit space $P\backslash\Gamma_t|_n$. In a more sloppy language we shall also say that we have a pseudo-metric $d^*_P$ on the abstract gestalt space $P\backslash\Gamma_t$ when we think of the sequence of pseudo-metrics $(d^*_{P,n})_n$. We shall also use this convention for all pseudo-metrics induced on the following spaces: Reconsider the diagram just before section 22.4.1. By retraction of the abstract gestalt pseudo-metric $d^*_P$ to all the codomains, we obtain a system of pseudo-metrics which we denote as indicated in the corresponding diagram, which now automatically turns into a diagram of (sequences of) pseudo-metrics
$$\begin{array}{ccc}
mot.d^*_P & \xrightarrow{\ t\ } & abs.d^*_P \\
\downarrow & & \downarrow \\
orb.d^*_P \longrightarrow ges.d^*_P & \longrightarrow & d^*_P
\end{array}$$
where we have omitted the underlying (real or abstract) motif sets.
9 See appendix C.3.4.1, proposition 72.
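The toroidal case of the preceding case study can be checked by brute force. The sketch below assumes that a grid step is a unit move in one torus coordinate, and it models the canonical action of $U$ and $K$ on the factors as sign changes in the pitch and onset coordinate respectively; both readings are assumptions made for illustration.

```python
from itertools import product

def steps(x, y, m, l):
    """Minimal number of unit grid steps between x and y on Z_m + Z_l."""
    da = (x[0] - y[0]) % m
    db = (x[1] - y[1]) % l
    return min(da, m - da) + min(db, l - db)

def is_isometry(g, m, l):
    """True iff g preserves steps for every pair of torus points."""
    points = list(product(range(m), range(l)))
    return all(steps(g(x), g(y), m, l) == steps(x, y, m, l)
               for x in points for y in points)

M, L = 4, 12
fourth_multiplication = lambda p: (p[0], (5 * p[1]) % L)
inversion = lambda p: (p[0], (-p[1]) % L)       # assumed action of U
retrograde = lambda p: ((-p[0]) % M, p[1])      # assumed action of K

fifth_before = steps((0, 0), (0, 7), M, L)
fifth_after = steps(fourth_multiplication((0, 0)),
                    fourth_multiplication((0, 7)), M, L)
```

The fifth shrinks from 5 steps to 1 step under the fourth multiplication, whereas the two sign changes preserve all step numbers.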
22.6
Topologies on Gestalts
Summary. The overall comparison of gestalts (of different cardinalities) is conceived. It is built upon the inheritance property which essentially captures the psychological fact that recognition of motives often amounts to recognition of their submotives.
We are now ready to introduce topologies on motif and gestalt spaces. Except for the toroidal type, this approach combines the elements of gestalt and metrical similarity while comparing motives. In the toroidal situation, we shall however introduce a more algebro-geometric idea to topology which deals with specialization instead of metrics.
22.6.1
The Inheritance Property
Summary. This section is devoted to the formal discussion of the inheritance property and to the study of different shape types with respect to this property.
Though metrical properties suggest that we may have a common topology on A@MOTF defined by use of pseudo-metrics, it turns out that there is an important restriction to this program, a restriction which is of an evident cognitive semantic. We postpone this rationale and first present the formal aspect.
Definition 71 Given a shape type $t : A@MOTF \to \Gamma_t$, together with a distance function$^{10}$ $d$ on $\Gamma_t$, we say that this data has the inheritance property iff for each motif $M \in A@MOTF|_n$, any submotif $M^* \subset M$, and any real number $\varepsilon > 0$, there is a real number $\delta > 0$ such that if $N \in A@MOTF|_n$ is such that $d_t(M,N) < \delta$, then there is a submotif $N^* \subset N$ such that $abcard(M^*) = abcard(N^*)$ and $d_t(M^*,N^*) < \varepsilon$.
10 Recall from definition 67 that this induces a $t$-distance function $d_t$ on $A@MOTF$.
Intuitively speaking, neighboring motives have neighboring submotives. An equivalent definition states that a particular function on the motives is continuous: Suppose that for two motives $M^*, N$, we have $abcard(M^*) \le abcard(N) = n$. We define
$$Sub_{M^*}(N) = \{N^* \subset N \mid abcard(N^*) = abcard(M^*)\},$$
the set of submotives of $N$ with fixed abstract $t$-cardinality $abcard(M^*)$. Consider the function
$$d_{t,M^*,n} : A@MOTF|_n \to \mathbb{R} \qquad (22.16)$$
defined by
$$d_{t,M^*,n}(N) = Min_{N^* \in Sub_{M^*}(N)}(d_t(M^*,N^*)).$$
Then the inheritance property reads
Definition 72 Given a shape type $t : A@MOTF \to \Gamma_t$, together with a distance function $d$, we say that this data has the inheritance property iff for any motif $M^*$ and any $abcard(M^*) \le n$, the function $d_{t,M^*,n}$ is continuous in every supermotif $M$ of $M^*$ (i.e., $M^* \subset M$) of abstract cardinality $n$, with respect to the pseudo-metric topology of $d_n$.
In fact, suppose that the second version of the inheritance property holds. Then $d_{t,M^*,n}$ is continuous in any supermotif $M$ of $M^*$ of abstract cardinality $n$. But then, $d_{t,M^*,n}(M) = 0$, and for a given positive $\varepsilon$ there is $\delta$ such that $d_t(M,N) < \delta$ implies $|d_{t,M^*,n}(N) - d_{t,M^*,n}(M)| = d_{t,M^*,n}(N) < \varepsilon$, and this is the claim of the first version of the inheritance property. The converse is evident. This property is not automatic. An illustrative 'pathology' is the diastematic $k$-difference shape type $t_{Dia_k}$. Let us take the codiagonal difference $k = 1$, i.e., the diastematic index shape type, see figure 22.1.
Figure 22.1: Two motives $M, N$ which are at diastematic distance zero, but $N$ has no submotif at this distance which corresponds to the submotif $M^*$ drawn with black points.
Here, we work in $MOT_n$, and we look at the motif $M \in MOT_5$ with $Dia_1(M) = (1,-1,1,-1)$, and the submotif $M^* \subset M$ with $abcard(M^*) = 3$ and $Dia_1(M^*) = (1,1)$. Take the motif $N \in MOT_5$ with $Dia_1(N) = Dia_1(M)$. The distance is $d_t(M,N) = 0$, but $N$ has no submotif $N^*$ with $Dia_1(N^*) = Dia_1(M^*)$, and so the inheritance property cannot be fulfilled. For rigid and elastic types we have affirmative results:
Proposition 25 Rigid shape types $Rg$, $\Delta Rg$, and corresponding Euclidean distance functions $Ed_{Rg}$, $Ed_{\Delta Rg}$ have the inheritance property.
Proof. This is evident for the $Rg$-type since the passage to a nearby submotif of $N$ is reflected by a projection onto a submatrix of the rigid image $Rg(N)$ which is defined by selection of those columns which are defined by the submotif $M^*$. And this projection is continuous. On the $\Delta Rg$-type, we have to observe that the matrix columns of the $\Delta Rg$-abstractions of submotives are given by summing up the $Rg$-difference vectors of their supermotives. But this is a continuous operation, and we are done. QED.
Observe that this property and the following are independent of the chosen metric on the rigid or elastic shape spaces, since continuity is only a topological property, and we know that the topology on the real $n$-space is independent of the defining norm.
Proposition 26 Elastic shape type and Euclidean distance have the inheritance property.
Proof. The elastic shape type is derived from the rigid difference type by normalization of the initial value function $a$ to $\Sigma a = 1$. This means that we just divide the rigid difference matrix by $\Sigma a$, and this is a continuous function on the interior $|\Delta^{c(c-1)/2-1}|^o$ of the affine $(c(c-1)/2-1)$-simplex which defines the shape space, see 22.2.1.3. QED.
Proposition 27 Diastematic shape type $Dia$ and Euclidean distance have the inheritance property.
Exercise 49 Give a proof of proposition 27.
Proposition 28 If we have an equivariant paradigmatic group action
$$A@MOTF \xrightarrow{\ t\ } \Gamma_t \circlearrowleft P$$
which consists of isometries with respect to the distance function $d$, and if the shape type $t$ and $d$ have the inheritance property, then the gestalt distance $mot.d^*_P$ and shape type $t$ also have the inheritance property.
Proof. Clearly, we have $(mot.d^*_P)_{t,M^*,n}(N) \le d_{t,M^*,n}(N)$, and then, continuity of $d_{t,M^*,n}$ in supermotives of $M^*$, where the function value is zero, implies continuity of $(mot.d^*_P)_{t,M^*,n}$ in supermotives of $M^*$. QED.
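The failure of the inheritance property for the codiagonal type $Dia_1$, as in the example of figure 22.1, can be reproduced numerically. The sketch below assumes that a motif is given by its pitches in onset order (distinct onsets) and that $Dia_1$ records the signs of consecutive pitch differences; the concrete pitch values are hypothetical and chosen only to match the index vectors of the example.

```python
import math
from itertools import combinations

def sign(d):
    return (d > 0) - (d < 0)

def dia1(pitches):
    """First codiagonal index vector of a motif given in onset order."""
    return tuple(sign(b - a) for a, b in zip(pitches, pitches[1:]))

def euclid(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def sub_distance(m_star, n):
    """Minimum of Ed(Dia_1(M*), Dia_1(N*)) over submotives N* of n
    with as many points as m_star (the function of (22.16) for Dia_1)."""
    target = dia1(m_star)
    # combinations() keeps the onset order of the selected points.
    return min(euclid(target, dia1(sub))
               for sub in combinations(n, len(m_star)))

M = (0, 2, 1, 3, 2)        # Dia_1(M) = (1, -1, 1, -1)
M_star = (0, 1, 2)         # points 0, 2, 4 of M, Dia_1(M*) = (1, 1)
N = (0, 2, 0, 2, 0)        # same Dia_1 as M, but no rising triple

distance_MN = euclid(dia1(M), dia1(N))
```

Although `distance_MN` is zero, `sub_distance(M_star, N)` stays at 1 while `sub_distance(M_star, M)` is 0, so the function fails to be continuous at $M$.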
22.6.2
Cognitive Aspects of Inheritance
Summary. This section makes the inheritance property plausible from the psychological and cognitive point of view.
Before we proceed to the construction of motivic topologies, we should briefly inspect the cognitive relevance of the basic inheritance property. Recognition of similarity of motives is a complex task of gestalt recognition. Little if anything is known about recognition of auditory gestalts, in particular, it is not known which shape type could be a cognitively relevant one,
and whether there is a kind of combination of different shape types introduced above which could represent good cognitive information. A particular problem of auditory gestalt recognition is the transitory nature of auditory gestalts: in the common situation of a piece being presented to a listener, motives pass by very fast and comparison of such transient objects must be an economic process, typically taking a time interval around 0.5 sec. Now, similarity of longer motives, consisting of eight tones, say, is not likely to be perceived as an irreducible entity. Fast processing must rely on elementary gestalt aspects of such motives. Mathematically speaking, it is plausible that gestalt perception breaks down to the perception of an interpretation of such motives, i.e., to the perception of motivic interpretations, i.e., melodies in the technical sense of global motivic compositions. If we assume that we are given melodies $M^I$ which are covered by motives $M_i$ of a limited small cardinality $c$, then similarity of motives $M$ can only be mediated via similarity of their charts $M_i$. So the present hypothesis is that similarity perception of large motives $M$ breaks down into two subtasks: similarity perception of their charts $M_i$, and perception of the nerve $n(M^I)$. In this framework, the chart similarity check is related to the inheritance property. In fact, the latter means that similarity of charts is a necessary condition for similarity of the big motives. So if our cognitive performance is not strong enough to check similarity among big motives, it can at least make the chart check and thereby learn whether a necessary condition for similarity of big motives holds. Now, this necessary condition takes place iff the inheritance property holds: This is the cognitive interpretation of this topological property.
22.6.3
Epsilon Topologies
Summary. Topologies on motif and gestalt spaces are introduced in case we are provided with a pseudo-metric and the inheritance property is fulfilled.
To begin with, we define the 'open discs' which will yield a base for a motivic topology:
Definition 73 If we have an equivariant paradigmatic group action
$$A@MOTF \xrightarrow{\ t\ } \Gamma_t \circlearrowleft P$$
which consists of isometries for the distance function $d$, if $M \in A@MOTF$, and if $\varepsilon$ is a positive real number, the $\varepsilon$-neighborhood of $M$ is the set
$$D_\varepsilon(M) = \{N \in A@MOTF \mid (mot.d^*_P)_{t,M}(N) < \varepsilon\} \qquad (22.17)$$
which implicitly means that the candidates $N$ have abstract cardinality at least equal to $abcard(M)$.
Proposition 29 If we have an equivariant paradigmatic group action
$$A@MOTF \xrightarrow{\ t\ } \Gamma_t \circlearrowleft P$$
which consists of isometries for the distance function $d$ and such that the inheritance property is fulfilled, then the system of the $\varepsilon$-neighborhoods $D_\varepsilon(M)$, all $\varepsilon, M$, is a base of a topology $\mathcal{T}_{t,P,d}$ on $A@MOTF$. This topology is called the epsilon topology (for the data $t, P, d$).
Proof. Let $D_{\varepsilon_1}(M_1), D_{\varepsilon_2}(M_2)$ be two $\varepsilon$-neighborhoods and take a motif $O \in D_{\varepsilon_1}(M_1) \cap D_{\varepsilon_2}(M_2)$. We must look for an $\varepsilon_3 > 0$ such that $D_{\varepsilon_3}(O) \subset D_{\varepsilon_1}(M_1)$. A similar argument yields an $\varepsilon_4 > 0$ which does the job for $M_2$, and the smaller of the two epsilons solves the problem. By construction, we have $(mot.d^*_P)_{t,M_1}(O) = q < \varepsilon_1$. Take a submotif $O_1 \subset O$ with $q = mot.d^*_P(M_1, O_1)$. By the inheritance property, we find a positive $\varepsilon_3$ such that $mot.d^*_P(O,Q) < \varepsilon_3$ implies $(mot.d^*_P)_{t,O_1}(Q) < \varepsilon_1 - q$. Then clearly, $D_{\varepsilon_3}(O) \subset D_{\varepsilon_1}(M_1)$. QED.
The next concepts relate gestalts and their representatives to motif inclusion:
Definition 74 If $P$ is a paradigmatic group for shape type $t$, we say that gestalts behave well for $t$ iff for every submotif pair $M^* \subset M$, and motif $M_1 \in Ges_t(M)$, there is a submotif $M_1^* \subset M_1$ with $M_1^* \in Ges_t(M^*)$.
Definition 75 Let $G^*, G$ be two gestalts for shape type $t$. Then we say that $G^*$ is a small gestalt in $G$ and write $G^* \sqsubset G$ iff there are motives $M^* \in G^*$, $M \in G$ such that $M^* \subset M$.
The proofs of the following two propositions are left as an exercise for the reader:
Proposition 30 [73, Prop.3] If $P$ is a paradigmatic group for shape type $t$ which behaves well, then the small gestalt relation $\sqsubset$ is reflexive, transitive and antisymmetric$^{11}$.
Proposition 31 [73, Prop.8] Suppose that we are given an equivariant paradigmatic group action
$$A@MOTF \xrightarrow{\ t\ } \Gamma_t \circlearrowleft P$$
which consists of isometries for the distance function $d$, such that the inheritance property is fulfilled, and such that the pseudo-metric $ges.d^*_P$ on $A@MOTF/Ges_t$ is a metric. Then gestalts behave well for $t$.
Exercise 50 Show that the action of the affine counterpoint group $CP$, or more generally the equivariant action of any group $P$ with finite orbits in the shape space, defines a metric $ges.d^*_P$ on $A@MOTF/Ges_t$. In particular, the described actions of $CP$ on rigid difference, diastematic, and elastic types have this property.
Corollary 7 [73, Prop.9] Given the epsilon topology $\mathcal{T}_{t,P,d}$ on $A@MOTF$, consider the quotient topology $\mathcal{T}_{t,P,d}/Ges_t$ on the gestalt space $A@MOTF/Ges_t$ for the gestalt mapping $Ges_t : A@MOTF \to A@MOTF/Ges_t$. Suppose that $ges.d^*_P$ is a metric. Then $Ges_t$ is open and the system
$$D_\varepsilon(H) = \{G \in A@MOTF/Ges_t \mid \exists G^* \sqsubset G,\ ges.d^*_P(G^*,H) < \varepsilon\}, \quad H \in A@MOTF/Ges_t,\ 0 < \varepsilon,$$
forms a base for $\mathcal{T}_{t,P,d}/Ges_t$. More precisely, this follows from the formulas
$$Ges_t(D_\varepsilon(M)) = D_\varepsilon(Ges_t(M)), \qquad (22.18)$$
$$Ges_t^{-1}(Ges_t(D_\varepsilon(M))) = D_\varepsilon(M). \qquad (22.19)$$
11 See also appendix C.2.
The topology $\mathcal{T}_{t,P,d}/Ges_t$ is called the epsilon gestalt topology. The following is immediate from the previous constructions:
Corollary 8 For the epsilon topology $\mathcal{T}_{t,P,d}$ on $A@MOTF$ and the epsilon gestalt topology $\mathcal{T}_{t,P,d}/Ges_t$ on $A@MOTF/Ges_t$, and for a positive integer $n$, the relative topology$^{12}$ $\mathcal{T}_{t,P,d}|_n$ on $A@MOTF|_n$ is the pseudo-metric topology of $mot.d^*_P|_n$, whereas the relative topology $\mathcal{T}_{t,P,d}/Ges_t|_n$ on $A@MOTF|_n/Ges_t$ is the pseudo-metric topology of $ges.d^*_P|_n$.
Exercise 51 Use assumption 1, section 22.2.1, to show that $N \in D_\varepsilon(M)$ implies $abcard(M) \le abcard(N)$. It follows that $\bigcup_{n \le i} A@MOTF|_i$ is open for any positive index $n$.
Exercise 52 Show that $N \in D_\varepsilon(M)$ implies $Ges_t(N) \subset D_\varepsilon(M)$.
Exercise 53 Show that in the epsilon topology, if we have a submotif $M^* \subset M$ and $0 < \varepsilon$, then there is $0 < \delta$ such that $D_\delta(M) \subset D_\varepsilon(M^*)$.
22.7
First Properties of the Epsilon Topologies
Summary. For the total motif space $A@MOTF$ as well as for the motif space $\mu$ associated with a motivic interpretation $X^\mu$ of a finite local composition $X$, some elementary properties of motivic topologies are discussed.
Suppose that we are given the motivic topology $\mathcal{T}_{t,P,d}$ as introduced above. Whenever a motivic interpretation $X^\mu$ of a finite local composition $X$ is given according to definition 65, section 22.1, the motif space $\mu \subset A@MOTF$ will be given the relative topology $\mathcal{T}_{t,P,d}|_\mu$ which we also call the epsilon topology. Moreover, the image $\mu/Ges_t$ of $\mu$ under the gestalt map $Ges_t$ is also given the relative topology $\mathcal{T}_{t,P,d}/Ges_t|_{\mu/Ges_t}$. To ease notation, we shall henceforth write $\mathcal{T} = \mathcal{T}_{t,P,d}$, $\mathcal{T}_\mu = \mathcal{T}_{t,P,d}|_\mu$, and $\mathcal{T}/Ges = \mathcal{T}_{t,P,d}/Ges_t$, $\mathcal{T}_\mu/Ges = \mathcal{T}_{t,P,d}/Ges_t|_{\mu/Ges_t}$ if the background assumptions are unambiguous.
Assumption 2 In this section we always assume that we are given a type $t$, a pseudo-metric $d$ on the space of $t$, and an equivariant paradigmatic group $P$ for $t$ which acts by isometries. We further assume that the inheritance property is verified and that the gestalt distance $ges.d^*_P$ is in fact a metric.
Proposition 32 [73, Prop.11] The epsilon topologies $\mathcal{T}/Ges$ and $\mathcal{T}_\mu/Ges$ are $T_0$$^{13}$.
12 See appendix H.1.2.
13 See appendix H.1.4, definition 167.
Corollary 9 Irreducible subsets in the epsilon topologies $\mathcal{T}/Ges$ or $\mathcal{T}_\mu/Ges$ have at most one generic$^{14}$ point.
Lemma 35 If the group $T_{Onset}$ of all onset translations is contained in the paradigmatic group $P$, the topological spaces $(A@MOTF, \mathcal{T})$ and $(A@MOTF/Ges, \mathcal{T}/Ges)$ are irreducible.
However, the motif space of $\mathcal{T}$ is not sober in this case since it is irreducible, but no motif can be the generic point of the space since such a motif is a proper specialization of all its proper supermotives.
Lemma 36 Given an epsilon topology $\mathcal{T}$, if $M, N \in A@MOTF$, the following statements are equivalent:
(i) There is $\varepsilon > 0$ with $N \notin D_\varepsilon(M)$.
(ii) $Ges_t(N) \notin D_\varepsilon(Ges_t(M))$.
(iii) $Ges_t(M) \not\sqsubset Ges_t(N)$.
Proof. If $abcard(N) < abcard(M)$, the above exercise 51 tells us that both $N \notin D_\varepsilon(M)$ and $Ges_t(M) \not\sqsubset Ges_t(N)$ are the case for any $\varepsilon > 0$. Suppose $abcard(M) \le abcard(N)$. If $Ges_t(M) \not\sqsubset Ges_t(N)$, then $\varepsilon = (ges.d^*_P)_N(M) > 0$ because the gestalt distance is a metric, and therefore $N \notin D_\varepsilon(M)$. Conversely, if there is an $\varepsilon > 0$ with $N \notin D_\varepsilon(M)$, then by definition of disk neighborhoods, $\varepsilon = (ges.d^*_P)_N(M) > 0$, so no submotif of $M$ can be in the gestalt of $N$. From corollary 7 we know that $N \notin D_\varepsilon(M)$ iff $Ges_t(N) \notin D_\varepsilon(Ges_t(M))$, and we are done.
Sorite 8 Given an epsilon topology $\mathcal{T}$, if $M, N \in A@MOTF$, the following three statements are equivalent:
(i) $M$ dominates $N$.
(ii) $Ges_t(M)$ dominates $Ges_t(N)$.
(iii) $Ges_t(N) \sqsubset Ges_t(M)$.
Moreover these statements are equivalent:
(i) $\{M\}^- = \{N\}^-$.
(ii) $\{Ges_t(M)\}^- = \{Ges_t(N)\}^-$.
(iii) $Ges_t(M) = Ges_t(N)$.
The following two statements are equivalent:
(i) $\{M\}^- = Ges_t(M)$.
(ii) $Ges_t(M) \in A@MOTF/Ges_t$ is closed.
14 See appendix F.2.1.
CHAPTER 22. MOTIF GESTALTS
Moreover, if $M$ is closed, then $abcard(M) = 1$, and conversely, if the spaces $\Gamma_{t,n}$ are mutually disjoint. We have
and
In particular:
$$Gest(M) = \{N \mid M \text{ dominates } N\} \cap \{N \mid N \text{ dominates } M\}. \qquad (22.20)$$
The only point to observe here is that a motif $M$ with $abcard(M) > 1$ has its singleton subsets as submotives of abstract cardinality 1, and therefore their gestalts are different from $Ges(M)$ and $M$ cannot be closed. The converse is evident. The last property (22.20) is a purely topological characterization of a motif's gestalt. This means that talking about "gestalts" in our technical sense is absorbed by a purely topological fact, i.e.:

Fact 11 The entire "gestalt" concept has been absorbed by a topological approach.

Proposition 33 If $A@MOT_{F,1}/Ges$ is finite, then $(A@MOT_F, \mathcal{T})$ is quasi-compact^15.

Proof. Let $C$ be an open cover of $A@MOT_F$, then
$$A@MOT_{F,1} = \coprod_{i=1,\dots,k} Ges(M_i) = \bigcup_{i=1,\dots,k} \{M_i\}^-$$
is a finite union of its gestalts and of the closures of its member representatives $M_i$. So there is a finite subset $C' = \{U_1, \dots, U_l\}$ of $C$ that covers all representatives $M_i$. Since $MOT_{F,1} \subset A@MOT_{F,1}$, each singleton motif is in one of the open sets $U_j$ of $C'$. Therefore any motif $M$ is in the union of the open sets in $C'$, and $C'$ is a finite subcovering of $C$, QED.
22.7.0.1
Relative Topologies
In this general framework we consider the relative topologies $\mathcal{T}_\mu$, $\mathcal{T}_\mu/Ges$ when $\mu$ is a motif space for an interpretation $X^\mu$ of a finite local composition $X$. For the relative topology on $\mu$, we have these notations:
• $D^\mu_\epsilon(M) = D_\epsilon(M) \cap \mu$.
• $Gest^\mu(M) = Gest(M) \cap \mu$.
We have seen in corollary 9 that irreducible closed sets in the epsilon topologies $\mathcal{T}_\mu$, $\mathcal{T}_\mu/Ges$ have at most one generic point. Now, since $\mu$ is a finite set, an irreducible closed subset $W \subset \mu$ has a maximal index $n(W)$ such that $W \cap (\bigcup_{n(W) \leq i} A@MOT_{F,i}) \neq \emptyset$. But by the metric hypothesis on gestalts, this intersection must consist of one single gestalt since this intersection is irreducible and the metric implies $T_2$ on the abstract cardinality level $n(W)$. So this gestalt is the generic gestalt of $W$, and we have

15 See appendix H.1.4, definition 169.
Proposition 34 On a finite space $X$, the gestalt topology $\mathcal{T}_\mu/Ges$ is sober.

The above procedure also yields [73, prop.21] an algorithm for constructing the irreducible components of $\mu/Ges$. First, start with the gestalts $Ges(M_i)$, $i = 1, \dots, r$, of maximal abstract cardinality in $\mu$. These points define irreducible components. In $\mu/Ges - \bigcup_{i=1,\dots,r} \{Ges(M_i)\}^-$, take the maximal abstract cardinality gestalts $Ges(N_1), \dots, Ges(N_s)$, which define irreducible components $\{Ges(N_j)\}^-$ of $\mu/Ges$, and so on, until the remainder is empty. Now, if any irreducible component $C$ is given, it must be covered by at least two of such irreducible components both of which have proper intersection with $C$. And it easily follows that $C$ must be in fact one of them.
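The peeling procedure just described can be sketched in Python. This is only a toy illustration under loud assumptions: gestalts are modeled as plain finite sets (each motif is its own gestalt), abstract cardinality is set size, and the closure of a point is taken to be the set of all members of the space contained in it, mimicking the fact that submotives are specializations of their supermotives. The names `closure` and `irreducible_components` are hypothetical and do not come from [73].

```python
def closure(point, space):
    """Toy closure of {point}: all members of `space` that are subsets of `point`.

    Assumption: specialization is modeled by set inclusion only.
    """
    return {q for q in space if q <= point}


def irreducible_components(space):
    """Peel off closures of maximal-cardinality points until nothing is left.

    Returns a list of pairs (generic point, closure of that point).
    """
    remainder = set(space)
    components = []
    while remainder:
        top = max(len(p) for p in remainder)
        # Points of maximal cardinality in what is left; sorted for stable output.
        generics = sorted((p for p in remainder if len(p) == top), key=sorted)
        for g in generics:
            components.append((g, closure(g, space)))
        for g in generics:
            remainder -= closure(g, space)
    return components


space = [frozenset(s) for s in ("abc", "ab", "cd", "a", "c", "d")]
for generic, comp in irreducible_components(space):
    print(sorted(generic), "->", sorted(sorted(p) for p in comp))
```

In this toy space the point {c} lies in both components, which illustrates that irreducible components may overlap while each has exactly one generic point.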
Proposition 35 On a finite space $X$, the gestalt space $\mu/Ges$ (with the gestalt topology $\mathcal{T}_\mu/Ges$) is homeomorphic to the associated sober space^16 $\mu_s$.

Proof. Let $W \subset \mu$ be irreducible and closed. We know that it is a union of relative gestalts $Gest(M)$. Since $\mathcal{T}_\mu/Ges$ is the quotient topology, the image $Ges(W)$ is irreducible and closed. Since $\mu/Ges$ is sober, $Ges(W)$ has a unique generic point $Ges(N)$. Then by sorite 8, $W = \{N\}^-$. Therefore, the canonical continuous map $q : \mu \to \mu_s$ is surjective. By sorite 8, it has the same fibers as the gestalt map. So the canonical^17 continuous map $p_\mu : \mu_s \to \mu/Ges$ is a bijection. Let $V \subset \mu_s$ be an open set. By definition of the sober topology, it is associated to an open set $U \subset \mu$, $V = U_s$, and we have $q^{-1}(V) = U$. So the image $p_\mu(V) = Ges(U)$ is open, and we are done. QED.

Exercise 54 If instead, we take the total motif space $A@MOT_F$ with the epsilon topology $\mathcal{T}$, we have a canonical continuous surjection $p : (A@MOT_F)_s \to A@MOT_F/Gest$ which induces a homeomorphism on the image $q(A@MOT_F)$ of the canonical map $q : A@MOT_F \to (A@MOT_F)_s$.

Proposition 36 Let $X^\mu$ satisfy the SEA property, $n_{min}$ being the minimal abstract cardinality of members of $\mu$, and $\mu_{min} = \{M \in \mu \mid abcard(M) = n_{min}\}$. Then the closed points are contained in $\mu_{min}$. If the shape spaces $\Gamma_{t,n}$ are mutually disjoint, then the set of closed points is exactly $\mu_{min}$. Proof: Exercise.
22.7.1
Toroidal Topologies
Summary. The special abstraction type of pitch and onset class formation leads to a similarity concept of gestalts which cannot be dealt with by use of (pseudo-)metrics. It is related to the topological specialization type discussed in section 12.2.2.
16 See appendix F.2.1. 17 See appendix F.2.1, proposition 93.
In this case we shall consider a completely different kind of topology, which stems from algebraic geometry rather than from Euclidean geometry. Omitted proofs in this section can be found in [73, ch.4]. To fix the ideas, we take the toroidal shape type with shape spaces $\Gamma_{Toroid,n}$ of local, zero-addressed compositions in $OnPiMod_{m,z}$ with cardinality at most $n$ and shape map $Toroid = Toroid_{m,z}$ as introduced in formula (22.9). Here, the abstract cardinality is the usual cardinality of a local composition. Also we take the full affine group $P = \overrightarrow{GL}(\mathbb{Z}_m \oplus \mathbb{Z}_z)$ as a paradigmatic group introduced in 22.4.1.4.
22.7.1.1
Dominance Topology
The analogy to the pseudo-metric topologies on the shape spaces $\Gamma_{Toroid,n}$ is played by the dominance topology introduced in formula (12.4) of section 12.2.2. This time, we essentially work in $\mathbb{Z}_m \oplus \mathbb{Z}_z$ and consider abstract motives $x, y \subset \mathbb{Z}_m \oplus \mathbb{Z}_z$ of abstract cardinality $abcard(x) = abcard(y) = n$. We say that $y$ is an abstract specialization of $x$ (in symbols: $x \rightsquigarrow y$) iff there is a bijective morphism of local compositions $s : x \to y$; so until now, no topology is involved, and the terminology seems a bit ambiguous, but the reason for this confusion will be given soon. Given two motives $M, N \in A@MOT_{F,n}$, we also say that $N$ is a specialization of $M$ (in symbols: $M \rightsquigarrow N$) iff $Toroid(N)$ is an abstract specialization of $Toroid(M)$. The following lemma makes clear what it means that two abstract motives are mutually abstract specializations.
Lemma 37 For the toroidal type Toroid, if $x, y$ are two abstract motives, the following statements are equivalent:
(i) The abstract motives $x, y$ are mutually abstract specializations $x \rightsquigarrow y$, $y \rightsquigarrow x$.
(ii) The local compositions $x, y$ are isomorphic.
(iii) We have $Ges_{Toroid}(x) = Ges_{Toroid}(y)$.

Proof. The equivalence of (ii) and (iii) follows from sorite 6, section 8.3.5. Clearly (ii) implies (i); for the converse, suppose we have two bijective morphisms $f : x \to y$, $g : y \to x$. Then their compositions $g \circ f$, $f \circ g$ are bijective, and since $x, y$ are finite there is a positive power $n$ such that $(g \circ f)^n = Id_x$, $(f \circ g)^n = Id_y$, whence $(g \circ f)^{n-1} \circ g$ is the right and left inverse of $f$, and $x, y$ are isomorphic. QED.

This implies that the following settings are well defined.
Definition 76 If $M, N$ and $x, y$ are two motives and abstract motives, respectively, then we say that the gestalt $Ges(N)$ is a specialization of the gestalt $Ges(M)$ (in symbols $Ges(M) \rightsquigarrow Ges(N)$) and that the abstract gestalt $P.y$ is an abstract specialization of the abstract gestalt $P.x$ (in symbols $P.x \rightsquigarrow P.y$), respectively, iff $M \rightsquigarrow N$ and $x \rightsquigarrow y$, respectively.

Corollary 10 Two abstract gestalts (two gestalts) are mutually abstract specializations (mutually specializations) iff they are equal.
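The finiteness argument in the proof of lemma 37, on which corollary 10 rests, can be checked on a small example. The following Python sketch treats the two bijections purely as maps of finite sets, given as dictionaries; that they are morphisms of local compositions is not modeled here. The helper names `compose` and `inverse_via_power` are hypothetical.

```python
def compose(outer, inner):
    """Return the composition outer ∘ inner of two maps given as dicts."""
    return {k: outer[inner[k]] for k in inner}


def inverse_via_power(f, g):
    """Given bijections f: x -> y and g: y -> x, return (n, h) where
    (g∘f)^n = Id_x and h = (g∘f)^(n-1) ∘ g is the two-sided inverse of f."""
    gf = compose(g, f)
    ident = {p: p for p in f}
    prev, power, n = ident, gf, 1      # invariant: power == (g∘f)^n, prev == (g∘f)^(n-1)
    while power != ident:
        prev, power, n = power, compose(gf, power), n + 1
    return n, compose(prev, g)


f = {0: "a", 1: "b", 2: "c"}           # a bijection x -> y
g = {"a": 1, "b": 2, "c": 0}           # a bijection y -> x, not the inverse of f
n, h = inverse_via_power(f, g)
print(n, h)
```

The loop terminates because a bijection of a finite set onto itself is a permutation and therefore has finite order.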
For a fixed abstract cardinality $k$, we have the following Kuratowski closure operator^18 on $A@MOT_{F,k}/Ges$. If $X \subset A@MOT_{F,k}/Ges$, then we set
$$\overline{X} = \{y \in A@MOT_{F,k}/Ges \mid \exists x \in X \text{ such that } x \rightsquigarrow y\}.$$
Clearly, in this dominance topology, the sets $[x) = \{y \mid y \rightsquigarrow x\}$ are the smallest open neighborhoods of $x$ and define a base for the dominance topology on the gestalt space. On the inverse image $A@MOT_{F,k}$ of motives having abstract cardinality $k$, we take the inverse image of the dominance topology, i.e., the base $[M) = \{N \mid Ges(N) \rightsquigarrow Ges(M)\} = \{N \mid N \rightsquigarrow M\}$, the latter equality being the case because by lemma 37, dominance is invariant modulo gestalts. Clearly, the gestalt map $A@MOT_{F,k} \to A@MOT_{F,k}/Ges$ is open and the dominance topology is the quotient topology.
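To make the closure operator and the basic open sets concrete, here is a minimal Python sketch on a three-point toy space. The specialization relation is supplied as an explicit set of pairs and is assumed to be reflexive and transitive; the point names and the functions `closure`, `smallest_open_neighborhood`, and `is_open` are hypothetical illustrations, not notation from [73].

```python
def closure(subset, specializes):
    """Kuratowski closure: all y with x ~> y for some x in `subset`.

    `specializes` is a set of pairs (x, y) meaning x ~> y,
    assumed reflexive and transitive.
    """
    return {y for (x, y) in specializes if x in subset}


def smallest_open_neighborhood(x, specializes):
    """The basic open set [x) = {y | y ~> x}."""
    return {y for (y, z) in specializes if z == x}


def is_open(subset, points, specializes):
    """A set is open iff its complement equals its own closure."""
    complement = points - subset
    return closure(complement, specializes) == complement


points = {"G", "H", "K"}
# Toy chain G ~> H ~> K, written out with reflexive and transitive pairs.
spec = {("G", "G"), ("H", "H"), ("K", "K"), ("G", "H"), ("H", "K"), ("G", "K")}

print(closure({"H"}, spec))                    # the specializations of H
print(smallest_open_neighborhood("H", spec))   # everything that specializes to H
```

Reflexivity gives $X \subset \overline{X}$ and transitivity gives idempotence, which is why the relation must be a preorder for this to be a Kuratowski operator.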
22.7.1.2
Specialization Inheritance and Specialization Topology
In the next step of our construction, we look at inheritance properties for the dominance topology.
Definition 77 If for any triple $M, M^*, N \in A@MOT_F$ with $M^* \subset M$ and $M \rightsquigarrow N$, there is $N^* \subset N$ such that $M^* \rightsquigarrow N^*$, we say that specialization is inherited. If for any triple $M, N^*, N \in A@MOT_F$ with $N^* \subset N$ and $M \rightsquigarrow N$, there is $M^* \subset M$ such that $M^* \rightsquigarrow N^*$, we say that specialization is co-inherited.

Proposition 37 [73, prop.27,cor.5] Every toroidal shape type $Toroid_{m,z}$ is inherited and co-inherited.

This property guarantees that we may deduce another topology in the spirit of the epsilon topologies, but now starting from the dominance topology:

Proposition 38 [73, def.19,prop.28,cor.6] The system
$$El(M) = \{N \in A@MOT_F \mid \exists N^* \subset N \text{ such that } Ges(N^*) \in [Ges(M))\}$$
of elementary neighborhoods of motives in $A@MOT_F$ is a base for a topology on $A@MOT_F$, which is called the specialization topology $\mathcal{T}_{Toroid}$. Elementary neighborhoods are invariant under gestalts. The quotient topology $\mathcal{T}_{Toroid}/Ges$ on the gestalt space $A@MOT_F/Ges_{Toroid}$ is also called specialization topology. Under these topologies, the gestalt map is an open continuous map. The specialization topology $\mathcal{T}_{Toroid}/Ges$ has a base which consists of all elementary neighborhoods (in fact images of the elementary neighborhoods of the motif space) of shape
$$El(G) = \{H \in A@MOT_F/Ges \mid \exists H^* \subset H \text{ such that } H^* \in [G)\}$$
for $G \in A@MOT_F/Ges$.

18 See appendix H.1. Kuratowski closure operators are used to define topologies.
In the specialization topology, we have analogous results as in the epsilon topology, regarding the closure of a point:

Proposition 39 [73, prop.31,prop.39,prop.41,prop.42] If $G \in A@MOT_F/Ges$ for the specialization topology $\mathcal{T}_{Toroid}/Ges$, then
(i) $El(G) = \{H \in A@MOT_F/Ges \mid H \text{ dominates } G\}$.
(ii) $\{G\} = \{G\}^- \cap El(G)$.
For the specialization topology $\mathcal{T}_{Toroid}$, if $M \in A@MOT_F$ is a motif, then
(i) $El(M) = \{N \in A@MOT_F \mid N \text{ dominates } M\}$.
(ii) $Ges(M) = \{M\}^- \cap El(M)$.
The specialization topology $\mathcal{T}_{Toroid}/Ges$ is $T_0$, irreducible, and quasi-compact.

Many of the properties for the specialization topologies $\mathcal{T}_{Toroid,\mu}$ on a motif space $\mu$ of a finite local composition $X$, and $\mathcal{T}_{Toroid,\mu}/Ges$ on $\mu/Ges$, are analogous to those which we have discussed for the epsilon topologies. We refer to Buteau's work [73, ch.4] for the details. In particular, the irreducible components for $\mathcal{T}_{Toroid,\mu}$ can be calculated by a recursive procedure which is similar to the one discussed in section 22.7.0.1.

Exercise 55 Show that the specialization topology $\mathcal{T}_{Toroid,\mu}$ is still homeomorphic to the associated sober space of the specialization topology on gestalts.

Remark. In [73, ch.4], it is shown that specialization relations, together with different subgestalt relations, can be used to define a gestalt specialization category f!5e($A@MOT_F/Ges_{Toroid}$).

Principle 21 As an overall result from the construction of motivic topologies, we can now consider generic points and related dominance structures from the theory of sober spaces. This is the basis for a topological realization of Reti's ideas in the sense that the semantic charge connoted with the very concept of a motif turns out to be a precise instance of the genericity properties of the topological motif space which is associated with a given score. In other words: Cognitive and structural motivic semantics is related to and reduced from the topological dominance configurations in the motivic topologies.

We come back in section 22.9 to the more detailed technique of motif weights to make motivic semantics explicit on a quantitative level.
22.8
Rudolph Reti's Motivic Analysis Revisited
Summary. After the precise construction of immanent motivic "organisms" associated with determined scores, we conclude this chapter with a critical review of Rudolph Reti's classical texts in the light of the presented theory. For more details, see [73, ch.5.2].
22.8.1
Review of Concepts
Complete works about motivic analysis are practically non-existent; therefore Reti's book [444] is still a good reference to the methodology of traditional musicological motif theory. Despite his accurate overview of the evolution of a motif through a composition, Reti does not use rigorous terminology, i.e., he uses the same word for different concepts or different words for the same concept. It seems very difficult to grasp its contents and substance. This vague terminology produces hazy deductions, not to say contradictions. However, behind his words, Reti's conception of a first sketch for a thematic theory of music is considerable. Reti is not even concerned with reliable terminology:

[444, p.12] In general, the author does not believe in the possibility or even desirability of enforcing strict musical definitions.

But it is undeniable that before any attempt to interpret Reti's motivic analysis within our mathematical model, we must take a closer look at his terminology whose nuances are essential for bringing consistency into Reti's motivic score analysis. Let us get off the ground with Reti's definition of a motif:

[444, p.2-3] ... any musical element, be it a melodic phrase or fragment or even only a rhythmical or dynamical feature which, by being constantly repeated and varied throughout a work or a section, assumes a role in the compositional design somewhat similar to that of a motif in the fine arts.

Since Reti asserts that he handles the words "motif" and "theme" without categorical distinction, we are aware that in some of the following quotations or examples, the word theme can be, for our purposes, replaced by motif. All the same, Reti points out a difference between the two concepts by defining a theme as

[444, p.3] ... a fuller (compared to a motif) group or "period" which acquires a "motivic" function in a composition's course.

According to Reti, both a "motif" and a "theme" have the same function and therefore are reduced to almost the same concept.

A first distinction between these two notions is length. Then, Reti adds that a theme is composed of a few motives. Moreover, the difference in length has consequences on how and where motives and themes can occur in the composition. A motif being short, it can be easily repeated throughout the composition. It can also be modified and inserted in parts having different moods. These frequent occurrences create a certain unity within the piece. A theme has a similar function but cannot, as easily as the motif, be modified and/or inserted in the composition. We now state a crucial idea in Reti's book. Reti asserts that abstractions have to be made in order to compare themes and motives:

[444, p.243] Similar liberties were taken in our analysis on several other occasions ... by rearranging the design ... , by exchanging the octaves in which some phrases were notated in the score.
In a more general way, when talking about classical symphonies, Reti claims that
[444, p.13-14] He (the classical composer) strives toward homogeneity in the inner essence... toward variety in the outer appearance. Therefore he changes the surface but maintains the substance of his shapes... Tempo, rhythm, melodic detail, in fact the whole character and mood are altered and adjusted to the form in which the composer conceived them fitting to the new movement.

Reti clearly affirms that we have to make abstraction from some qualities of a motif in order to make the connection with its transformations in other parts of the piece. As a matter of fact, Reti always makes abstraction when comparing sequences of notes. For example, it can be the simple abolition of note loudness or more complex procedures such as the exchange of octave positions inside a given sequence of notes. We should remark that Reti never uses the word "gestalt"^19. Instead, he is concerned with comparisons of "shapes". The word "shape" is not chosen at random and is met throughout the book:

[444, p.205] ... a theme is that shape around which ...

The importance of the shape concept is that it encompasses the idea of abstraction. Nevertheless, Reti does not have a well-defined usage of this concept. In similar situations, such as in examples [444, 328, p.210] and [444, 334, p.213], both words "motif" and "shape" are used with the same meaning. The reason for this lack of rigor is simply that when comparing motives, natural abstractions like the abolition of loudness are somehow unconsciously operated. For Reti, starting a comparison with shapes seems trivial. According to Reti, the shape of a motif is incessantly heard, literally or modified, through the piece. The shape is imitated, varied or transformed:

[444, p.240]
1. imitation, that is literal repetition of shapes, either directly or by inversion, reversion, and so forth;
2. varying, that is, changing of shapes in a slight, well traceable manner;
3. transformation, that is, creating essentially new shapes, though preserving the original substance.

In other words, a motif has a "smooth shape", i.e., its shape can be imitated or smoothly modified and remains the same, or as Reti asserts, it remains "identical"^20. However the accepted transformations of shapes can reach a limit:

[444, p.355] ... his (Beethoven's) most impressive thematic constructions lie on the border line between being matchless master strokes of transformation or utterances wherein the thematic bond has almost dissolved in that very transformation process by which they were created.

19 Nonetheless, the gestalt concept is omnipresent. We shall see that Reti's implicit definition of a gestalt is first the choice of an abstraction perspective, and second all the immalleable transformations (the imitations) of the motif's shape in the chosen perspective.
20 The words "identity" and "identical" are used over and over in the book and their meaning is not the one that one is accustomed to from logic! For example, see [444, pp.13,21,38,102,167,243, ... ].
Reti states that a shape cannot be endlessly transformed and still remain identical with its initial shape: identity has a 'limit of transitivity'. This is by itself a contradiction since any accepted concept of "identity" is transitive. However, there are subtle identification problems which, in case the analyst is not precise enough, i.e., does more identification than the situation requires, cause contradictions while the essential points drop out. Unfortunately, Reti did not realize the implications of his use of the concept of "identity". Reti tries to avoid this contradiction by constant reference to the first appearance of a motif in the score. Reti usually affirms that a shape is a transformation, imitation or variation of a motif by, most of the time, comparing the shape of the first motif appearance with the considered transformation. In a sense, this "identity" is an order relation, the first appearance $A$ being "greater" than all the later appearances $B, C, \dots$: $A > B$, $A > C, \dots$, but transitivity is not relevant here, because, besides $A > B$, $A > C, \dots$, no relation $B > \dots$, $C > \dots$, etc. is considered. So the transitivity question $A > B \,\&\, B > C \Rightarrow A > C$ is of no interest. But again, we are confronted with a nonsensical definition of identity: it is not even symmetric! In any case the common concept of identity is violated. In the next section, we shall clarify how to use "identity of motives" in a consistent way. Although Reti was imprecise, he was not irreversibly wrong. The abuse of the term "identity" is somewhat clarified in the concept of a motif, unfortunately hidden by a fluffy terminology. For Reti, a motif conceives

[73, p.108] the first appearance of a certain group of notes with shape abstraction such that, throughout the whole composition, there are "many" note groups having respective shapes (i.e., shape abstractions) which are imitation, variation or transformation of the first shape.

To distinguish the different concept levels in our discussion, we denote by Reti-motif the above "definition" of a motif.
22.8.2
Reconstruction
We now proceed in reconstructing Reti's concept framework from the point of view of the present motif theory.
22.8.2.1
Choice of Parameters
We first fix the parameter space $F = Onset \oplus Pitch \oplus Loudness \oplus Duration$ since for the context of Reti's book, no other relevant tone parameters are of use. To be clear, this framework does not include pauses or bar-lines as relevant tone objects; only 'sounding' notes are considered. Here are the detailed settings (see also appendix A for these standard values):
• The first note has Onset value 0.
• Duration values are taken by the prescription that 1.0 in the Onset coordinate corresponds to the literal mathematical value of 4/4 duration.
• For the Pitch values, we select the usual gauge with middle c = 0, and the chromatic pitch set being parametrized by the integers, i.e., C♯ = D♭ = 1, D = 2, etc.
• Loudness is gauged by the usual setting where mf ~ 0, f ~ 1, ff ~ 2, etc., and p ~ -1, pp ~ -2, etc.
• By toroidal type we intend Toroid~2X12 with a default grid^21 $o = (0,0)$, $x = (1,0)$, $y = (0,1)$.
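As an illustration of these gauge settings, the following Python sketch encodes a tone event as a point of $F$. It is only a sketch: the dictionary names, the helper functions, and the octave convention (octave 0 starts at middle c) are assumptions made here for illustration; only the numerical gauges themselves are taken from the list above.

```python
from fractions import Fraction

# Loudness gauge from the text: mf ~ 0, f ~ 1, ff ~ 2, p ~ -1, pp ~ -2.
LOUDNESS = {"pp": -2, "p": -1, "mf": 0, "f": 1, "ff": 2}

# Chromatic pitch classes counted in semitones above c (middle c is gauged to 0).
PITCH_CLASS = {"c": 0, "c#": 1, "db": 1, "d": 2, "d#": 3, "eb": 3, "e": 4,
               "f": 5, "f#": 6, "gb": 6, "g": 7, "g#": 8, "ab": 8,
               "a": 9, "a#": 10, "bb": 10, "b": 11}


def pitch(name, octave=0):
    """Integer pitch; octave 0 begins at middle c (hypothetical convention)."""
    return PITCH_CLASS[name] + 12 * octave


def note(onset, name, dynamic, duration, octave=0):
    """A tone event as a point (Onset, Pitch, Loudness, Duration) of F."""
    return (Fraction(onset), pitch(name, octave), LOUDNESS[dynamic], Fraction(duration))


def gauge_onsets(notes):
    """Shift all onsets so that the first note has Onset value 0."""
    start = min(n[0] for n in notes)
    return [(n[0] - start, *n[1:]) for n in notes]


# Two quarter notes (1/4 each, since 1 corresponds to a 4/4 duration).
melody = [note(Fraction(3, 4), "d", "mf", Fraction(1, 4)),
          note(Fraction(1, 1), "c#", "f", Fraction(1, 4))]
print(gauge_onsets(melody))
```

Exact fractions are used for Onset and Duration so that equal note values compare as equal, which matters when motives are later compared up to translation.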
Furthermore, as might be expected, we always deal with the zero-address in Reti's context.
22.8.2.2
Shapes, Imitations and Transformations
Let $X \subset 0@F$ be the local composition consisting of all notes within the given composition, and with respect to the above gauge of the four parameters. Then let $\mathcal{M}^*$ be the set of all motives within $X$. Reti's implicit first step in his motivic analysis of a composition is the choice of a determined set $\mathcal{M} \subset \mathcal{M}^*$ of (mathematical) motives, i.e., "sequences of notes" which he wants to compare with each other in order to find imitations, variations and transformations of the shape of a certain sequence of notes which he will then call a motif (a Reti-motif in our terminology). Call our mathematical motives $M \in MOT_F$ Math-motives in order to distinguish them from Reti's more involved Reti-motives. In fact, it is remarkable that most of the Math-motives in $\mathcal{M}$ are in no way qualified for becoming contributors to Reti-motives. They are just a kind of motivic 'raw material'. Furthermore, Reti naturally allows himself to compare parts (subsequences) of the chosen Math-motives (sequences). He normally does not regress to the single notes; thus he gives himself a lower limit of motif cardinality. This all means that in the mathematical model the SEA property must hold with respect to the lower limit for $\mathcal{M}$. Most of the time Reti applies the limit of the intervals, i.e., $n_{min} = 2$ in SEA. Let us now reconstruct Reti's motif definition

... the first appearance of a certain group of notes with shape abstraction such that, throughout the whole composition, there are "many" note groups having respective shapes (i.e., shape abstractions) which are imitation, variation or transformation of the first shape.

within the mathematical model:

" ... with shape abstraction ... " As seen in the previous part of this section, the choice of an abstraction of the sequences of notes is the starting point for comparison. To 'simulate' Reti's abstraction procedure by our model, we select a shape type $t : MOT_F \to \Gamma_t$, and we model the shape abstraction of a Math-motif $M \in \mathcal{M}$ by its associated abstract motif $t(M) \in \Gamma_t$.

" ... which are imitation ... " We must emphasize that declaring that a shape is an "imitation of another" suggests the choice of a collection of admissible imitation transformations, a choice which we associate with a choice of a paradigmatic group $P$ in the mathematical model. For example, most of the time strict repetitions (translations in time) are admissible, and it is clear that the choice depends on the selected shape abstractions. The implicitly conceived gestalt of a sequence of notes in Reti's book perfectly corresponds to a gestalt of a Math-motif in a motif space.

21 This grid is by no means mandatory. According to the concrete situations, we shall define better adapted quantization data.
According to Reti's analysis, three examples of imitations, [444, example 35, p.33] (first scene of Schumann's Kinderszenen, shape I), [444, example 20b, p.21] (second version of Beethoven's Ode an die Freude shape), and [444, example 1, p.11] (descending third in measure 19 of the first movement of Beethoven's Ninth Symphony), seem to coincide with the counterpoint group. The choice of abstraction (the choice of the types) greatly influences our view on a composition, and this is a strong point of this analysis since it takes into consideration different qualities and characters of the composition.

We now introduce the distance $ges.d_t^P$ (defined in section 22.5) of Math-motives into our model of Reti's motivic analysis. In a sense the distance $ges.d_t^P$ between motives, i.e., two sequences of notes such that their shapes have the same number of "elements" (same abstract cardinality in the selected shape type), gives a measure for their diversity. For Reti, two sequences of notes with shape having the same number of elements are identical (imitated), closely alike (varied or transformed) or different (not related). In the mathematical model, this kind of relation is more subtle. The shapes of two sequences of notes "being identical" means having distance 0, "being closely alike" requires knowing "how close" they are to each other, and being different requires knowing "how far" they are from each other. There are no longer only three rough categories; instead, we are given subtly nuanced relations between sequences of notes.

Remark 5 The $ges.d_t^P$ distance is a priori defined on $MOT_F$ and not on $\mu$. This means that the calculation of $ges.d_t^P$ for two sequences of notes within a given composition $X$ is determined by first looking at their shapes (following a given abstraction), and then, given a paradigmatic group $P$, by comparing not only all their imitations within the composition but also all the a priori imaginable imitations.
According to rigid type, we easily see that a similar distance defined only on $\mu$ would yield absurd results. Moreover, these 'imaginary' shapes are naturally, but probably also unconsciously, used by Reti.

"... variation or transformation ... " Variations and transformations of a sequence of notes are more delicate to translate into the mathematical model. Since the difference between variation and transformation is simply a question regarding the level of change between shapes, we merely consider transformations, i.e., variations are viewed as special cases of transformations. There seem to be no disadvantageous consequences to this identification. We recall that, according to the analyst Reti, there is a limit to a transformation of a shape such that it still relates to the original shape, i.e., such that it remains "closely alike". In mathematical terminology, there is a maximal admissible distance $\epsilon_{max}$ between the Math-motives. In the motivic topology, this limit corresponds to a fixed positive real number $\epsilon_{max}$. The latter is a border line such that if two shapes with the same abstract cardinality are less than $\epsilon_{max}$ distant, then they are considered as being "closely alike". However, transformations do not necessarily maintain the number of elements. In the mathematical model, transformations of shapes can be translated to three different situations corresponding to the change of number of elements of the shapes:

1. If two shapes have the same number of elements and are closely alike, then, in the motivic space, their Math-motives $M$ and $N$ are in the other's $\epsilon_{max}$-neighborhood, respectively, which is $M \in D_{\epsilon_{max}}(N)$ and $N \in D_{\epsilon_{max}}(M)$. Example: [444, examples 85,86, p.62] (Palestrina's Missa Marcelli, comparison of beginnings of the two first sections in Kyrie Eleison and the Christe Eleison).
2. If in the terminology of Reti, a shape $A$ is "closely alike" inside the shape $B$, then, in the mathematical model, for a given shape type, this means that a Math-motif $M$ with shape $t(M) = B$ is in the $\epsilon_{max}$-neighborhood of the Math-motif $N$ with $t(N) = A$: $M \in D^\mu_{\epsilon_{max}}(N)$. Intuitively speaking, this means that the shape of $N$ is closely alike to the shape of a submotif of $M$. Example: [444, example 118, p.80] (first movement of Brahms' Second Symphony).
3. If a part $A^*$ of a shape $A$ is closely alike inside shape $B$ (and conversely in Reti's terminology), then, in the motivic space, there is a submotif $M^*$ of $M$ with shape $A^*$ and $A$, respectively, and such that $N$ with shape $B$ is in the $\epsilon_{max}$-neighborhood of $M^*$: $N \in D^\mu_{\epsilon_{max}}(M^*)$. Example: [444, example 218, p.143] (Brahms' First Rhapsody).

If the relation between a shape and its transformation is musically too hidden inside the transformation, in the sense that it is hardly heard or even not heard at all, then this is what Reti calls an "indirect affinity". This phenomenon is similarly translated into motif space terminology as a transformation with possibly larger $\epsilon_{max}$.
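The three situations can be illustrated by a small Python sketch. Everything in it is a toy stand-in and not the construction of section 22.5: motives are lists of (onset, pitch) pairs, the paradigmatic group is assumed to consist of translations in onset and pitch only, notes are matched in sorted order, and `toy_distance` is a hypothetical replacement for the gestalt distance. The lower limit of two notes for submotives follows the choice $n_{min} = 2$ mentioned earlier.

```python
from itertools import combinations
from math import sqrt


def toy_distance(m, n):
    """Toy gestalt distance for equal-cardinality motives: Euclidean distance
    of the sorted note lists, minimized over all onset/pitch translations
    (the optimal translation is the mean of the coordinate differences)."""
    diffs = [(a[0] - b[0], a[1] - b[1]) for a, b in zip(sorted(m), sorted(n))]
    k = len(diffs)
    mean_on = sum(d[0] for d in diffs) / k
    mean_pi = sum(d[1] for d in diffs) / k
    return sqrt(sum((d[0] - mean_on) ** 2 + (d[1] - mean_pi) ** 2 for d in diffs))


def in_neighborhood(n, m, eps):
    """N lies in D_eps(M): some submotif of N with abcard(M) notes is eps-close to M."""
    return any(toy_distance(sub, m) < eps for sub in combinations(sorted(n), len(m)))


def classify(m, n, eps_max):
    """Return which of the three situations relates M to N, if any."""
    if len(m) == len(n) and toy_distance(m, n) < eps_max:
        return "1: same cardinality, closely alike"
    if in_neighborhood(m, n, eps_max):
        return "2: N closely alike inside M"
    for k in range(2, len(m)):                      # proper submotives, n_min = 2
        for sub in combinations(sorted(m), k):
            if in_neighborhood(n, sub, eps_max):
                return "3: a part of M closely alike inside N"
    return "unrelated"


theme = [(0, 0), (1, 2), (2, 4)]
print(classify(theme, [(5, 7), (6, 9), (7, 11)], 0.5))   # a transposed repetition
print(classify([(3, 5), (4, 7), (6, 0)], [(0, 0), (1, 2)], 0.5))
```

Raising `eps_max` in such a sketch admits more remote relatives, which corresponds to the larger tolerance attributed above to "indirect affinity".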
22.8.2.3
Reti's Identity Relation Revisited
In the previous section we mentioned the inconsistent use by Reti of the concept of "identity" in the sense of imitations, variations and transformations. However in the mathematical model, identity of gestalt can be compared with Reti's use of the concept of identity. In fact, we know from sorite 8 that, under certain conditions, $Gest(M) = Gest(N)$ for two Math-motives $M$ and $N$ iff $M \in D^\mu_\epsilon(N)$ and $N \in D^\mu_\epsilon(M)$ for all $\epsilon > 0$. This gives us the possibility to review the gestalt identity concept under a purely topological perspective. We can see now that Reti's violation of the strict concept of identity is related to violation of one or several parts of the above neighborhood condition for identity of gestalt. It is undeniable that the identity of motives should be limited to imitations, i.e., the identification of motives with their gestalts yields consistent statements. The identification with transformations creates a system where every Math-motif can be identified with any other Math-motif! This is certainly not what Reti wanted to state unrestrictedly. A reconstruction of Reti's theory (and other fuzzy attempts, such as [428]) should be undertaken on the basis of topological concepts as modeled above.
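Why identification with transformations collapses everything can be seen in a deliberately simplified Python sketch. Here shapes are replaced by integers and "closely alike" by a distance below a tolerance; both choices, and the function names, are hypothetical simplifications. The relation is not transitive, yet forcing it to be an identity (taking its transitive closure) merges a whole chain of small steps into a single class.

```python
def closely_alike(a, b, eps):
    """Toy 'closely alike' relation: distance strictly below eps."""
    return abs(a - b) < eps


def identification_classes(items, eps):
    """Classes obtained by forcing 'closely alike' to be transitive."""
    classes = []
    for item in items:
        touching = [c for c in classes
                    if any(closely_alike(item, other, eps) for other in c)]
        merged = {item}.union(*touching)
        classes = [c for c in classes if c not in touching] + [merged]
    return classes


shapes = [0, 4, 8, 12]                      # each step is a small transformation
print(closely_alike(0, 4, 5), closely_alike(4, 8, 5), closely_alike(0, 8, 5))
print(identification_classes(shapes, 5))    # one single class
print(identification_classes(shapes, 1))    # only exact repetitions identified
```

With a tolerance that admits only distance 0, the classes are the analogues of gestalts, which is the consistent notion of identity advocated above.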
22.9
Motivic Weights
Summary. Motivic weight functions are a means for understanding motivic topologies by use of real 'coordinate functions'. We describe the technique and its meaning for semantic purposes, in particular, regarding abstraction from motif parameters.
In the analysis of metrical structure of section 21.2, definition 64, we have introduced sober weights. This can be carried over to motivic structure without any further modification; recall (see section 21.2) that for the motivic interpretation $X^\mu$, we have the nerve map $Sp : X \to n(X^\mu)$. Also recall that an $\mathbb{R}$-valued sober (motif) weight on the $A$-addressed local composition $X \subset A@F$ is a function $W : X \to \mathbb{R}$ which factorizes through the nerve map $Sp$. A factor $nW : n(X^\mu) \to \mathbb{R}$ is called a nerve weight for $W$. A nerve weight $nW$ is said to be induced iff its values are determined by the values on the vertexes, i.e., by the restriction $n_0W = nW|n_0(X^\mu)$. In the implementation of motif theory on the RUBATO®'s MeloRUBETTE®, such induced sober motif weights have been used for performance oriented motivic analysis. We shall discuss this approach in chapter 41.2.

In the present discussion, we want to concentrate on the semantics of such weights. To calculate the weight $W(x)$ means that we give the tone event $x$ a value which stems from the weight values $n_0W(M)$ of motives $M$ with $x \in M$, including some additional configuration functions which take care of the simplex structure $Sp(x)$. Basically, this means that we validate $x$ in its motivic environment: "What is the motivic position of $x$ within the given motif space $\mu$ of $X$?" So we give each tone event $x \in X$ a 'motivic coordinate'. Isn't this too coarse? The point is that for performance, complex coordinates are not useful since the instrumentalist really needs numeric information of how strongly that key should be struck. And that may be ok for performance, but it is not the yoga of motivic analysis. There is a more faithful approach to motivic coordinates if we step over to the nerve weight $nW$, which we suppose to be induced, to make the ideas more transparent. Restating the preceding question, a nerve weight fixes the motivic coordinate $nW(M)$ of any motif $M \in \mu$.
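The factorization of a sober weight through the nerve map can be sketched as follows. This Python fragment is a minimal illustration under stated assumptions: motives are modeled as frozensets of tone events, $Sp(x)$ is taken to be the simplex of all motives of the space that contain $x$, and the "configuration function" that combines vertex weights is simply their sum. The function names and the choice of the sum are hypothetical and are not the MeloRUBETTE®'s actual formula.

```python
def nerve_map(x, motif_space):
    """Sp(x): the simplex spanned by all motives of the space containing x."""
    return frozenset(m for m in motif_space if x in m)


def induced_nerve_weight(simplex, vertex_weight):
    """An induced nerve weight: determined by the vertex values alone.
    The sum is a hypothetical choice of configuration function."""
    return sum(vertex_weight[m] for m in simplex)


def sober_weight(x, motif_space, vertex_weight):
    """W = nW ∘ Sp, i.e. W factorizes through the nerve map."""
    return induced_nerve_weight(nerve_map(x, motif_space), vertex_weight)


# Tone events are (onset, pitch) pairs; three toy motives share some of them.
a, b, c = (0, 0), (1, 2), (2, 4)
m1, m2, m3 = frozenset({a, b}), frozenset({b, c}), frozenset({a, b, c})
space = [m1, m2, m3]
weights = {m1: 1.0, m2: 2.0, m3: 0.5}       # values n_0W(M) on the vertexes

for x in (a, b, c):
    print(x, sober_weight(x, space, weights))
```

Two tone events lying in exactly the same motives receive the same weight, which is what factorization through $Sp$ amounts to.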
Why are we, beyond applications to performance theory, interested in such coordinates? The point is that the motivic epsilon or toroidal topologies are not immediately understandable in terms of classical Euclidean coordinates; in fact, these topologies have more of an algebro-geometric flavor, including generic points, specializations, and related 'far-out' structures. It has been shown in the discussion of Reti's theory that these tools are adequate. But the price was a strong deviation from numerical, quantitative representation. The immediate semantical potential of motivic topologies is not a numeric one. On this basis, the introduction of nerve weights should provide us with a reconstruction of quantitative aspects of topological perspectives, possibly with a bunch of numerical invariants which could in the ideal case approach a complete set of invariants. One approach to this goal has been realized by the MeloRUBETTE®. Let us just have a look at that methodology. Fix a shape type $t$ and an equivariant paradigmatic group $P$ of isometries, the standard situation of our motif theory. Given a positive limit number $\epsilon$, we look at the $\epsilon$-neighborhood configuration on $\mu$ as seen from a fixed motif $M$. What does this object 'know' from its topological position? We could be interested in knowing how many motives are in the disk neighborhood $D_\epsilon(M)$. This would yield a kind of 'presence' $pr_{\mu,\epsilon}(M)$ of $M$ within larger motives, up to some tolerance $\epsilon$ of shape similarity. We could also be interested in knowing how many motives $M$ participate in their 'presence', thereby yielding a kind of 'content' $ct_{\mu,\epsilon}(M)$ of $M$. This data can be used to give the motif a measure for its relevance or weight in the present motif space $\mu$, a weight function
$$nW_\epsilon(M) = w(pr_{\mu,\epsilon}(M), ct_{\mu,\epsilon}(M)) \qquad (22.21)$$

for example with $w(x,y) = x.y$. This is only one approach to grasp topological information via numerical values, but we just want to give a concrete idea of how to proceed in general. Given these weights $nW_\epsilon : \mu \to \mathbb{R}$, $\epsilon > 0$, we obtain a map

$$nW : \mu \to \mathbb{R}^{\mathbb{R}_+} \qquad (22.22)$$

on the vertexes of the nerve $n(X^\mu)$, being extended to the entire nerve $nW : n(X^\mu) \to \mathbb{R}^{\mathbb{R}_+}$ via a given induction function, i.e., $nW(\sigma) = f(nW(M), M \in \sigma)$. Since the motif space is finite, we may restrict to a finite sequence $\epsilon_. = \epsilon_1 < \epsilon_2 < \dots < \epsilon_k$ of limit numbers to grasp all changes within these weight functions. This means that we consider the weight $nW_{\epsilon_.} : \mu \to \mathbb{R}^k$, and its induced extension $nW_{\epsilon_.} : n(X^\mu) \to \mathbb{R}^k$. It is a major task in motif theory to construct weight functions following the scheme of $nW_{\epsilon_.}$ in order to investigate the abstract motivic topologies via their images in real $n$-space.

Remark 6 The AST-ideas on similarity, as developed by Morris, Lewin, Rahn, and others may be looked up in Morris' work [380, p.103ff.].
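The scheme of the weight on the vertexes and its induced extension can be illustrated by a minimal sketch. It assumes that presence and content counts are already available for each motif and for each limit number of a finite sequence; the motif labels, the counts, the helper names `vertex_weights` and `induce`, and the choice of the sum as induction function $f$ are hypothetical and are not taken from the MeloRUBETTE® implementation.

```python
# Toy data (made up for illustration): one presence and one content count
# per motif and per limit number eps_1 < eps_2 < eps_3.
presence = {"M1": [1, 2, 4], "M2": [0, 1, 3]}
content = {"M1": [2, 2, 5], "M2": [1, 3, 3]}


def vertex_weights(presence, content, w=lambda x, y: x * y):
    """Return, for each motif M, the vector (w(pr_eps(M), ct_eps(M))) over all eps."""
    return {
        m: [w(p, c) for p, c in zip(presence[m], content[m])]
        for m in presence
    }


def induce(simplex, nW, f=sum):
    """Extend vertex weights to a simplex: nW(sigma) = f(nW(M), M in sigma).

    The induction function f is applied componentwise; using the sum is an
    assumption made only for this sketch.
    """
    k = len(next(iter(nW.values())))
    return [f(nW[m][i] for m in simplex) for i in range(k)]


nW = vertex_weights(presence, content)
sigma = frozenset({"M1", "M2"})  # a 1-simplex of the nerve
print(nW["M1"])           # [2, 4, 20]
print(induce(sigma, nW))  # [2, 7, 29]
```

Each motif thus receives a point in $\mathbb{R}^k$, and each simplex a point derived from its vertexes, which is the shape of data the text refers to as an image in real $n$-space.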
Part VI
Harmony
Chapter 23
Critical Preliminaries

Aber atonal wird man irgend ein Verhältnis von Tönen sowenig nennen können, als man ein Verhältnis von Farben aspektral oder akomplementär bezeichnen dürfte. Diesen Gegensatz gibt es nicht. [One may no more call any relation of tones atonal than one may call a relation of colors aspectral or acomplementary. This opposition does not exist.]
Arnold Schönberg [478, p.240]

Summary. Harmony, the subject of the present part, is a traditionally dominant and extensive portion of music theory. Therefore, it is adequate to review some of the important approaches to harmony. This chapter is however far from a complete synthesis of harmony and its history. We have selected three representative approaches which are systematically elaborate and theoretically founded: H. Riemann, P. Hindemith, and H. Schenker. The following overview concentrates on the divergence between claim and realization, and it does, once again, lay bare the enormous difficulty of setting up a precise discourse about music without (alas) the power of mathematical language. Also, this critique is not meant as a preliminary to something which in the subsequent chapters of this part will be perfectly solved by mathematical music theory. The discourse simply tries to persuade music theorists that a) the commonly cultivated status quo of the subject is scientifically unacceptable, and b) mathematically sharpened concepts, constructs, and models can show ways to a more in-depth and precise understanding of harmony without banning it to history and "atonal" negation. Generic harmony is a universal perspective of music, and it is unscientific as well as near-sighted, if not anti-musical, to abandon harmonic paradigms instead of embedding them into a diachronically and synchronically open, unified, and universal concept framework.
To be clear, the main question is not to defend or instantiate any ideology of harmony (this is the unhappy business of Pythagorean fundamentalists) but to investigate its possible semiotic functions in musical works and their communicative explication, to develop an adequate language, and to propose consistent and sound models of harmonic processes.
The theory of harmony as a theory of chords and chord progressions comprises a wide range of different approaches, methods and questions. Thereby the historic context is only one factor
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
among others to explain these differences. Harmonic research is situated in the intersection of various fields of interest, which correspond to different approaches and questions. For instance, Paul Hindemith's interest in Unterweisung im Tonsatz [224] is that of a composer, so his focus is to explain to what extent and why diverse combinations of tones as chords differ in their esthetic effects. Arnold Schönberg's Harmonielehre [478] aims at students of composition as well. In contrast to this perspective, Schenker and his adepts analyze the harmonic structure of pieces of music in order to explain the impression of "inner coherence" in a piece, which includes a description of different hierarchic levels. This approach results in a completely distinct definition of what is called a "chord" compared to, for instance, Hindemith's theory. In the following preliminaries to the mathematical theory of harmony, we do not want to give a complete summary of all these different investigations, but to sketch some of the different ways of classifying chords and thereby shed light at least on some typical questions concerning chords and chord progressions.
23.1
Hugo Riemann
Summary. This section briefly reviews Hugo Riemann's concept of triad-based tonality as it was developed in his writings, such as Musikalische Logik [449] and Musikalische Syntaxis [450], from Rameau's early adumbration. Riemann's construction of tonality from a chord progression is described. It is based on the concept of thesis (German "These"); the tonal coherence of a composition is constituted by a succession of theses. To conclude, we discuss Riemann's unfinished program to assign a harmonic function to strictly every possible chord, given a specific tonality.
Rameau was one of the first theorists to deal with the classification of the variety of possible combinations of tones as chords in order to find systematic laws of combination. He classified chords on the basis of thirds, which means to understand the combinations of thirds which build chords. Thus a fourth-sixth-chord can be traced back to a triad by an inversion, i.e., by adding an octave to the lowest tone. The triad is the starting point not only of the horizontal, but also of the vertical organization of the tones in a piece of music. The system of tonal functions is adumbrated, but not fully exposed. This exposition can be found in Riemann's Musikalische Logik [449]. It involves a classification of chords into the three categories tonic, dominant, subdominant and the discussion of the cadence ("Kadenz") as a specific progression of chords when subsumed under these categories. His system includes a classification of chords on the basis of the minor and the major triad. This method classifies every chord by tracing it back to a triad (or a combination of triads) it was derived from (e.g., by altering the chord's single tones). Consequently, it involves the problem that the preimage from which a chord is derived is often far from unequivocal. The classification of all chords into the three categories tonic, dominant, and subdominant, is ambiguous as well. Another important question Riemann deals with in Musikalische Syntaxis [450] is that of tonality. How does the phenomenon of tonality arise within a specific chord progression such that one chord is classified as a tonic? Riemann discusses examples of such chord progressions,
but in some cases there is more than one possible triadic solution for the suspected tonality, a fact which Riemann does not reflect upon. So the question of how to find criteria for deciding upon possible tonalities in non-trivial ambiguous situations remains open.
23.2
Paul Hindemith
Summary. This section briefly reviews Paul Hindemith's harmony as developed in Unterweisung im Tonsatz [224]. In order to explain the tension/relaxation within a chord progression, he constructs a hierarchy of intervals which is driven by their consonance character. The chords are then classified according to the hierarchical position of their constituting intervals.
Hindemith's classification of chords starts with a discussion of the intervals with respect to their stability and consonant or dissonant character. He criticizes the approach of altering all chords into triadically composed chords. Chords are classified according to the intervals they consist of in such a way that stable or consonant intervals generate stable chords. Among all chords, the triads are the most stable ones. Chord progressions are analyzed in terms of different stabilities of the consecutive chords. For instance, if a given chord consists of "more consonant" intervals and the next one of "less consonant" intervals, an effect of tension is produced. By this definition, Hindemith creates curves of tension within long chord successions. The concept of tension is an instrument to explain possible effects or impressions when listening to such chord successions.
23.3
Heinrich Schenker and Friedrich Salzer
Summary. This section reviews Friedrich Salzer's Structural Hearing [470] as an updated version of tonal theory following Heinrich Schenker's ideas, as exposed in [474] and [475]. This theory considers a hierarchy of structural layers from the background ("Ursatz des Hintergrundes"), traversing the transformational middleground ("Mittelgrund der Verwandlungsschichten"), and developing into concrete music on the foreground ("Vordergrund"). Structural chords on a higher hierarchical level are complemented by (relative) prolongational chords on a lower level, expressing harmonic "motion" between structural instances. We discuss the question of ambiguities and indeterminacies in the case-specific definition of such a hierarchy.
Salzer (as a representative of Schenkerian analysis) criticizes those harmonic approaches which structure a given piece of music into a progression of chords and assign functions to them by analyzing the specific chord gestalts. Instead, he proposes the classification of chords into two groups, structural and prolongational chords, in order to explain phenomena like 'musical motion' or 'coherence'. Structural chords support the harmony, while harmonic meaning is not assigned to prolongational chords: they mediate between structural chords and support the 'motion' from one harmonic function to the next one. This approach starts with a grouping of the piece into periods and themes, followed by the specification of the structural chords within
these periods. There are however no general arguments or criteria for the process which decides whether a chord belongs to the structural or the prolongational category. The problem of which tones form a chord remains unsolved as well.
Remark 7 We should mention Sigfrid Karg-Elert's harmony [259] which is based on a strictly polar symmetry between major and (aeolian) minor modes, as well as Arthur von Oettingen's symmetry-based modulation theory [406]. A thorough discussion and understanding of their works is still unsettled.
Chapter 24
Harmonic Topology

Tyger! Tyger! burning bright
In the forests of the night,
What immortal hand or eye
Could frame thy fearful symmetry?
William Blake (1757-1827)

Summary. This chapter introduces a systematic correspondence between chords and symmetries. It lays the morphological foundation for a semantic theory of harmonic functions which will be exposed in the following chapter. Essentially, the idea of such a correspondence is to carry over the structural discourse on harmony to richer objects by an intermediate switch to "richer" addresses.
In sections 11.3.6 and 11.3.7, we have presented a preliminary study of symmetries of chords, and of the classification of self-addressed chords in simple ambient spaces of modules with finite length. In this chapter, the idea of switching between chords and symmetries will be developed a bit more systematically and then applied to the construction of harmonic semantics as suggested by Noll [400]. The basic idea is that to understand the full morphological potential of chords, it is not sufficient to look at their points; rather, one should include the information given by their inner symmetries. Now, there are two ways to deal with this idea. The first is an ontological switch. It resembles the concept from quantum mechanics, stating that the measurable physical reality is a trace (in fact a bunch of eigenvalues) of more abstract Hermitian (linear) operators: the observables. The higher reality of these operators is the semantic source of microscopic physics whose seemingly contradictory phenomenology of wave and particle are just projections of the consistent higher reality of observables [150]. In this vein, chords would be replaced by endomorphisms of the underlying form space. The drawback of this approach is that it enforces a new reality which is not that of addressed points but resides in abstract morphisms between form spaces. But music should not have its reality split into what has meaning and what is audible. It should avoid doing so as
much as physics should. We cannot speak for the physical ontology, but for music, there is an elegant solution which has been prepared via the very approach to musical objects which is based on addressed denotator substance. We contend that the richness of ontological reference which is offered on each address is essential and sufficient to condense the morphic flavor of symmetries on the level of address ontology (see also our discussion in 8.3.4, in particular principle 3), provided that "generic" addresses can be found for a given form space. There is a further advantage of the 'address condensation' of morphic structures: With this tool it becomes possible to consider global compositions on the basis of local compositions on a generic address, a construction which Noll calls "Morphemfeld" in [403], and which he uses to (re)build harmonic semantics in the spirit of Hugo Riemann. We are aware that the following approach has not been evaluated in all its implications; in fact, the harmonic topology is but a first application of a more general topological interpretation of denotator morphology. We nevertheless feel obliged to sketch this general perspective to make the stream of ideas evident and their coherence plausible.
24.1
Chord Perspectives
Summary. This section deals with chords in Euler and pitch class spaces, together with a comparative study while applying the enharmonic projection of self-addressed Euler spaces onto pitch class spaces.
We start our discourse from concrete situations in harmony which arise from chord descriptions in different ambient spaces for pitch and pitch classes. Noll's term "perspective" for pitch denotators at different addresses seems adequate: it emphasizes the ontological perspective offered on each address, as well as the change of perspective induced by a change of address. These examples also will prepare the reader for the semantic discussion of chord morphology in the next chapter.
24.1.1
Euler Perspectives
Summary. This section discusses zero-addressed and self-addressed chords in the context of pitch classes in just tuning.
Recall from section 7.2.1 that the coordinators of EulerModule and EulerPlane are $\mathbb{Q}^3$ and $\mathbb{Q}^2$, with the canonical decompositions $\mathbb{Q}^3 = \mathbb{Q}.o \oplus \mathbb{Q}.q \oplus \mathbb{Q}.t$ and $\mathbb{Q}^2 = \mathbb{Q}.q \oplus \mathbb{Q}.t$, respectively. Within these spaces, one identifies the subspaces $EulerZModule = EulerModule|_{\mathbb{Z}}$ and $EulerZPlane = EulerPlane|_{\mathbb{Z}}$ which are defined by integer coordinates, i.e., the lattices generated by the canonical bases $o, q, t$ and $q, t$, respectively. We shall mainly start from these lattices when talking about harmony in just tuning. For example, the C-chromatic scale of Vogel, from which one octave is shown in the lower half of figure 7.6, is a zero-addressed chord $X$ in the EulerZModule. In order to have a better
representation of this $X$, we transform it under the linear space automorphism

(24.1)

and get the more transparent configuration $X^* = S.X$ as shown in figure 24.1.

Figure 24.1: The transformed chromatic scale makes its inner symmetries evident.

To understand this
chord better, observe that it contains (the heads and tails of) four Z-addressed points g ⇒ e♭, g ⇒ b, g ⇒ f♯, g ⇒ a♭ with common tail g. Then g ⇒ e♭ + g ⇒ b is a zero length arrow, in fact a major third 'coupling', and g ⇒ f♯ + g ⇒ a♭ is a zero length arrow, a minor second coupling. A similar situation holds for exactly one other set of four Z-addressed points, i.e., f ⇒ a, f ⇒ d♭, f ⇒ c, f ⇒ b♭ with common tail f. Again, we have these relations: f ⇒ a + f ⇒ d♭ is a zero length arrow, a major third coupling, and f ⇒ b♭ + f ⇒ c is a zero length arrow, a fourth coupling. These two four-point configurations are each contained in one of two parallel planes. Besides these two four-element Z-addressed chords, there is one Z-addressed point g ⇒ d in the first plane, and one Z-addressed point f ⇒ e in the second plane.
Now, if $U^* : X^* \xrightarrow{\sim} X^*$ is an automorphism, it must either exchange the two Z-addressed chords or leave them invariant, and it must also exchange the two additional points or leave them fixed. It is easily seen that an automorphism which leaves the planes fixed must be the identity since it must leave the above couplings fixed. But there is an automorphism which exchanges the two planes, and by the previous remark, this will be the unique non-trivial automorphism.
The automorphism has this effect:

$$U^* : b \mapsto d\flat, \quad d \mapsto e, \quad e\flat \mapsto a, \quad f\sharp \mapsto c, \quad a\flat \mapsto b\flat, \quad g \mapsto f,$$

and on the original chromatic scale $X$ (!) is given by the affine EulerZModule automorphism $U = e^s.U_0$ with translation $s = (-5, 2, 1)$ and matrix

$$U_0 = \begin{pmatrix} 11 & 18 & 24 \\ -4 & -7 & -8 \\ -2 & -3 & -5 \end{pmatrix} \qquad (24.2)$$

which is automatically an involution since it exchanges the planes of the two chords which are centered around g and f. Also observe that the Z-addressed point g ⇒ f is reversed to f ⇒ g under $U^*$. The interesting thing about this involution is that it yields an involution $e^6.5$ of $OnMod_{12}$ if we transport the tone names as usual, i.e., c ~ 0, d♭ ~ 1, ..., b ~ 11. Moreover, if we consider the translation of $X$ to $\Xi = e^{-h}X$ by the vector $h = (-3, 1, 1)$, the automorphism of this scale chord is the conjugate $V = e^{-h}.U.e^{h}$. But the elements of $\Xi$ are the just interval quantities of the chromatic chord $X$ when viewed from $h$. The above dichotomy, which was given by the two parallel planes, now corresponds to the following six-by-six element dichotomy of $\Xi$:
$$\Xi_K = \{(0,0,0), (0,1,-1), (4,0,-2), (-2,1,0), (6,-1,-2), (2,0,-1)\},$$
$$\Xi_D = \{(7,-2,-2), (1,-1,0), (3,-2,0), (3,-1,-1), (7,-3,-1), (5,-2,-1)\},$$

and its interpretation in terms of differences of numbers in $\mathbb{Z}_{12}$ as above, e.g., b − b ~ 11 − 11 = 0, d − b ~ 2 − 11 = 3, e♭ − b ~ 3 − 11 = 4, etc., yields the consonance-dissonance dichotomy $K = \{0, 3, 4, 7, 8, 9\}$, $D = \{2, 5, 10, 1, 6, 11\}$ of classical counterpoint. The literally transformed automorphism is the involution $e^2.5$ of $OnMod_{12}$, the (unique) autocomplementarity function of the counterpoint model from which the rules of elementary counterpoint will be deduced in part VII. We come back to this phenomenon in section 24.1.3.

Instead of looking at a chord of Z-addressed points with tail in g, one can also take a $\mathbb{Z}^3$-addressed 'dominant seventh' point $D^7$ with tail in g, and three arrowheads in b, d, f. The automorphism $U$ is completely determined by this self-addressed point since it defines an affine base of the ambient space. So $U$ is uniquely determined by $U(D^7)$. The same is true for the 'tonic seventh' point $T^7$ with tail g and arrowheads in c, e, b♭; the same holds for the 'subdominant seventh' point $S^7$ with tail f and arrowheads a, c, e♭ (observe that the major seventh would not work in any of these points!). So the three previous self-addressed seventh points $X^7$ are all sufficient to describe the entire automorphism, i.e., $U(X^7)$ determines $U$. Consider the canonical pitch class projection
$$mod_o : EulerModule \to EulerPlane$$
defined in 7.11. This induces the projection of the corresponding chord spaces $mod_o : EulerChord \to o\text{-}ClassChord$. Since $mod_o$ is a projection with integer matrix, we have a corresponding integer situation $mod_o : EulerZModule \to EulerZPlane$ and

$$mod_o : EulerZChord \to o\text{-}ClassZChord$$

with $EulerZChord = EulerChord|_{\mathbb{Z}}$, $o\text{-}ClassZChord = o\text{-}ClassChord|_{\mathbb{Z}}$, the restrictions to $\mathbb{Z}$-valued points of chords. Two $A$-addressed chords $C_1, C_2$ in EulerModule are called an inversion of each other iff $mod_o(C_1) = mod_o(C_2)$. Inversion equivalence classes of chords, i.e., pitch class chords, are central objects in harmony. The present just pitch class space is spanned by the fifth class $q$ and by the third class $t$. Vogel's zero-addressed scale C-chromatic, which is the projection $mod_o(X)$ of the chromatic chord $X$ discussed before, gives rise to an infinity of zero-addressed scales

$$X\text{-chromatic} = e^X.C\text{-chromatic}$$

in EulerZPlane for any zero-addressed class $X$ in EulerZPlane.
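The statements about the involution $U$ of (24.2) and the dichotomy of $\Xi$ can be checked numerically. The following sketch rebuilds the chromatic chord as $X = e^h.\Xi$ from the twelve vectors listed above and then tests the claims. The function `pc`, which sends Euler coordinates $(o, q, t)$ to $7q + 4t$ modulo 12, is an assumption of this sketch (fifth ~ 7 and major third ~ 4 semitones, octave ignored), chosen so that $h$ lands on b ~ 11 as in the text.

```python
U0 = ((11, 18, 24), (-4, -7, -8), (-2, -3, -5))  # matrix of (24.2)
s = (-5, 2, 1)                                   # translation of (24.2)
h = (-3, 1, 1)                                   # reference vector h
XI_K = [(0, 0, 0), (0, 1, -1), (4, 0, -2), (-2, 1, 0), (6, -1, -2), (2, 0, -1)]
XI_D = [(7, -2, -2), (1, -1, 0), (3, -2, 0), (3, -1, -1), (7, -3, -1), (5, -2, -1)]


def add(u, v):
    return tuple(a + b for a, b in zip(u, v))


def U(v):
    """Affine map U = e^s.U0: apply the matrix U0, then translate by s."""
    return add(tuple(sum(r * x for r, x in zip(row, v)) for row in U0), s)


def pc(v):
    """Assumed pitch class of an Euler point (o, q, t): 7q + 4t mod 12."""
    return (7 * v[1] + 4 * v[2]) % 12


X_g = {add(x, h) for x in XI_K}  # tones in the plane around g
X_f = {add(x, h) for x in XI_D}  # tones in the plane around f

is_involution = all(U(U(x)) == x for x in X_g | X_f)
swaps_planes = {U(x) for x in X_g} == X_f
shadow_ok = all(pc(U(x)) == (5 * pc(x) + 6) % 12 for x in X_g | X_f)
K = sorted(pc(x) for x in XI_K)
D = sorted(pc(x) for x in XI_D)

print(is_involution, swaps_planes, shadow_ok)  # True True True
print(K)  # [0, 3, 4, 7, 8, 9]
print(D)  # [1, 2, 5, 6, 10, 11]
```

Under the stated assumption on `pc`, the check confirms that $U$ exchanges the two planes, squares to the identity on $X$, and induces $e^6.5$ on pitch classes.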
Proposition 40 Consider the lattice $L = \mathbb{Z}.(1, 0, 0) + \mathbb{Z}.(0, -4, 1) + \mathbb{Z}.(0, 0, 3)$. Then the $L$-translates of $X$ define a tiling of EulerZModule.
Proof. The $mod_o(L)$ translates of C-chromatic define a tiling of EulerZPlane, and the octave translates fill up the space in octave direction.

Observe that the vector $(0, -4, 1)$ corresponds to the fifth comma defined in 6.33, whereas the third generator of the lattice is $K_q + 19.o + 3.K_t$, so the lattice is generated by the octave and the two commata. We shall discuss this lattice in a moment.

The zero-addressed major scale C-major and natural minor scale C-minor give rise to corresponding shifted scales
$$X\text{-major} = e^X.C\text{-major}, \quad X\text{-minor} = e^X.C\text{-minor}.$$

In the same way, the just zero-addressed harmonic minor scale X-harm stems from the harmonic minor scale C-harm which differs from C-minor by a non-flattened b, whereas the zero-addressed melodic minor scale X-mel stems from the melodic minor scale C-mel, and this one differs from C-minor by non-flattened a, b. For the time being, we shall use the same tone names for pitch classes as for their original pitches in EulerZModule. As with the EulerZModule, we have prominent $\mathbb{Z}^2$-addressed, i.e., self-addressed points in EulerZPlane which determine any endomorphism $V : EulerZPlane \to EulerZPlane$ because their tail and arrowheads are an affine base. Here are such candidates: the 'major triad' point C5 with tail C5(0) = c and heads C5($e_1$) = g (= q), C5($e_2$) = e (= t), and the 'minor triad' point c5, same as C5, except c5($e_2$) = e♭ (= q − t).
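The tiling claim of Proposition 40 can also be tested by a short computation. This sketch assumes that the twelve tones of $X$ are the vectors $\xi + h$ with $h = (-3, 1, 1)$ and $\xi$ running through the two six-element sets of $\Xi$ given earlier. The helper `coset_label` is hypothetical: it maps $(o, q, t)$ to $q + 4t$ modulo 12, vanishes on the three generators of $L$, and takes all twelve values, so it separates the cosets of a lattice of index 12.

```python
h = (-3, 1, 1)
XI = [
    (0, 0, 0), (0, 1, -1), (4, 0, -2), (-2, 1, 0), (6, -1, -2), (2, 0, -1),
    (7, -2, -2), (1, -1, 0), (3, -2, 0), (3, -1, -1), (7, -3, -1), (5, -2, -1),
]
X = [tuple(a + b for a, b in zip(xi, h)) for xi in XI]  # the chromatic chord
L = [(1, 0, 0), (0, -4, 1), (0, 0, 3)]                  # lattice generators


def det3(m):
    """Determinant of a 3x3 integer matrix given as a list of rows."""
    (a, b, c), (d, e, f), (g, i, j) = m
    return a * (e * j - f * i) - b * (d * j - f * g) + c * (d * i - e * g)


def coset_label(v):
    """Hypothetical invariant of v modulo L: q + 4t mod 12 (octave ignored)."""
    return (v[1] + 4 * v[2]) % 12


index = abs(det3(L))                                  # index of L in Z^3
generators_vanish = all(coset_label(g) == 0 for g in L)
labels = sorted(coset_label(x) for x in X)

print(index)              # 12
print(generators_vanish)  # True
print(labels)             # [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]
```

Since the lattice has index 12 and the twelve tones carry twelve different labels, $X$ meets every coset of $L$ exactly once, which is what the tiling statement requires.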
24.1.1.1
Just Mutation
Summary. On the EulerZModule a Lie algebra structure can be introduced in order to give a "processual" interpretation of the enharmonic projection (see 24.1.3).
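For the following Lie algebra theory, see appendix E.4.4. The EulerZModule is charged with a Lie algebra structure in the following way. We represent the module of EulerZModule in the basis $o, q, r = K_t - q$ ($K_t = t - 4q - 4o$ being the syntonic comma, see 6.33) and induce the Lie ($\mathbb{Z}$-)algebra structure from the module isomorphism

$$sl_2 : \mathbb{Z}.o \oplus \mathbb{Z}.q \oplus \mathbb{Z}.r \xrightarrow{\sim} \mathfrak{sl}(2, \mathbb{Z})$$

(see appendix E.4.4, example 80, for the definition of $\mathfrak{sl}(2, \mathbb{Z})$) defined by

$$sl_2(o) = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \quad sl_2(q) = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad sl_2(r) = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$$

([400]). The Lie bracket $[x, y]$, i.e., Noll's mutation, reads as follows on this basis: $[o, q] = 2q$, $[o, r] = -2r$, $[q, r] = o$; the other values are mandatory by skew symmetry.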
With this Lie bracket, the adjoint endomorphisms $ad(q)$, $ad(r)$ are nilpotent, by definition, and we have the Lie algebra automorphisms

$$Q = \exp(ad(q)) = Id + ad(q) + ad(q)^2/2, \qquad (24.3)$$
$$R = \exp(ad(r)) = Id + ad(r) + ad(r)^2/2. \qquad (24.4)$$
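The bracket relations, the nilpotency of $ad(q)$ and $ad(r)$, and the truncated exponential series of (24.3) and (24.4) can be verified with plain 2x2 integer matrices. The sketch below is only an illustration; the helper names are hypothetical. It also checks the standard fact that $\exp(ad(x))$ acts as conjugation by $\exp(x)$, where $\exp(x) = Id + x$ for the two nilpotent matrices used here.

```python
from fractions import Fraction

H = ((1, 0), (0, -1))  # image of o under sl_2
E = ((0, 1), (0, 0))   # image of q under sl_2
F = ((0, 0), (1, 0))   # image of r under sl_2
ZERO = ((0, 0), (0, 0))


def mul(a, b):
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def bracket(a, b):
    """Lie bracket [a, b] = a.b - b.a."""
    ab, ba = mul(a, b), mul(b, a)
    return tuple(tuple(ab[i][j] - ba[i][j] for j in range(2)) for i in range(2))


def scale(c, a):
    return tuple(tuple(c * x for x in row) for row in a)


def exp_ad(x, y):
    """(Id + ad(x) + ad(x)^2/2)(y), the full series when ad(x)^3 = 0."""
    a1 = bracket(x, y)
    a2 = bracket(x, a1)
    return tuple(
        tuple(y[i][j] + a1[i][j] + Fraction(a2[i][j], 2) for j in range(2))
        for i in range(2)
    )


def conj(g, g_inv, y):
    return mul(mul(g, y), g_inv)


relations_ok = (
    bracket(H, E) == scale(2, E)
    and bracket(H, F) == scale(-2, F)
    and bracket(E, F) == H
)
nilpotent_ok = all(
    bracket(x, bracket(x, bracket(x, y))) == ZERO
    for x in (E, F) for y in (H, E, F)
)
exp_E, exp_E_inv = ((1, 1), (0, 1)), ((1, -1), (0, 1))
exp_F, exp_F_inv = ((1, 0), (1, 1)), ((1, 0), (-1, 1))
Q_is_conjugation = all(exp_ad(E, y) == conj(exp_E, exp_E_inv, y) for y in (H, E, F))
R_is_conjugation = all(exp_ad(F, y) == conj(exp_F, exp_F_inv, y) for y in (H, E, F))

print(relations_ok, nilpotent_ok, Q_is_conjugation, R_is_conjugation)
# True True True True
```

For instance, `exp_ad(E, H)` returns the matrix of $o - 2q$, so $Q$ maps $o$ to $o - 2q$ while fixing $q$.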
It is well known that on the level of $\mathfrak{sl}(2, \mathbb{Z})$, these automorphisms $Q, R$ identify to the conjugation by

$$\exp(sl_2(q)) = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}, \quad \exp(sl_2(r)) = \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix},$$

respectively, a generator set of $SL(2, \mathbb{Z})$. Consider the group

$$Mut = \langle Q, R \rangle \subset Aut(EulerZModule)$$

generated by these operators and the center $Z(SL(2, \mathbb{Z})) = \{\pm Id\}$. Then we have this result:
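Theorem 27 [400, III.4.3, Theorem 1] We have a group diagram

$$\begin{array}{ccccccccc}
 & & 1 & & 1 & & 1 & & \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
1 & \to & 1 & \to & (SL(2,\mathbb{Z}), SL(2,\mathbb{Z})) & \to & (Mut, Mut) & \to & 1 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
1 & \to & Z(SL(2,\mathbb{Z})) & \to & SL(2,\mathbb{Z}) & \to & Mut & \to & 1 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
1 & \to & \mathbb{Z}_2 & \to & \mathbb{Z}_{12} & \to & \mathbb{Z}_6 & \to & 1 \\
 & & \downarrow & & \downarrow & & \downarrow & & \\
 & & 1 & & 1 & & 1 & &
\end{array}$$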
with exact columns and rows and the canonical homomorphisms that are related to the conjugacy operation of the special linear group on $\mathfrak{sl}(2, \mathbb{Z})$.

In fact, in [400, III.4.3], it is proven that $SL(2,\mathbb{Z})/(SL(2,\mathbb{Z}), SL(2,\mathbb{Z})) \cong \mathbb{Z}_{12}$, but the center generator $-Id$ is not in the commutator, and therefore projects to the element 6 of order 2 in $\mathbb{Z}_{12}$, so we are done. Furthermore, we may define a set map

$$Ex : EulerZModule \to SL(2, \mathbb{Z})$$

by $Ex(a.o + b.q + c.r) = \exp(sl_2(q))^b.\exp(sl_2(r))^c$ ($o$ goes to the identity). By the above theorem, this map induces a surjective group homomorphism

$$ENH : EulerZModule \to \mathbb{Z}_{12}.$$

Consider the subform CommaZModule of EulerZModule which is spanned by the Pythagorean and syntonic commata $K_q, K_t$, and by the octave $o$. Then we have

Theorem 28 [400, III.4.3, Theorem 2] The kernel Ker(ENH) equals the subform CommaZModule.

We shall discuss this result in section 24.1.3 in the context of enharmonic identification methods. However, the point of the preceding construction is that it essentially introduces a
group of space automorphisms on EulerZModule which reside on an "exponentiation lifting" from zero-addressed points $o, q, r$ to self-addressed points, i.e., $Id, Q, R$, related to a linear Lie algebra interpretation of the underlying module. So it not only lifts zero-addressed points to constant self-addressed points via address change projection, but gives them a non-trivial lifting. At present, no musicological justification of this Lie algebra construction is known, except that it does give a new interpretation to enharmonic identification. In particular, it is not known whether there is a more general and/or non-arbitrary, viz., canonical construction behind this idea. It is also not clear why the special linear group should be invoked as a lifting of the mutation operator group $Mut$, and a musicological interpretation of the mutation operators $Q, R$ is not known.
24.1.2
12-tempered Perspectives
Summary. This section discusses constant and self-addressed chords in the context of pitch classes in 12-tempered tuning.
This situation stems from the 12-tempered part of the EulerModule, i.e., the subform 12-Temp of EulerModule of the one-dimensional $\mathbb{Z}$-submodule $\mathbb{Z}.\frac{1}{12}.o \subset \mathbb{Q}.o \oplus \mathbb{Q}.q \oplus \mathbb{Q}.t$, together with its projection

$$mod : 12\text{-}Temp \to 12\text{-}PitchClass \xrightarrow{\sim} PiMod_{12}$$
of octave identification. So we have to keep in mind that this construction is completely separated from the just tuning spaces. We are going to discuss the relation between these two approaches in section 24.1.3 below. Beyond the general Yoneda principle, and to our knowledge of mathematical music theory, the historically first passage (see Noll's motivation in [400, I.1.4]) from zero-addressed tones in $PiMod_{12}$ to self-addressed tones, i.e., tones in $\mathbb{Z}_{12}@PiMod_{12}$, was suggested by the question in [340, section 4.2.3.2.2] concerning the deduction of important chords of music theory from basic principles of the underlying pitch spaces. In the foundations of harmony, one tries to deduce the structure of important chords, above all the major and minor triad, from the acoustic phenomenon of partials in Fourier analysis, for example with Rameau [433], Schönberg [478], or Hindemith [224]. A slightly different deduction is offered by Oettingen [406] or Vogel [547] by use of the EulerZPlane where fundamental, third, and fifth define a special generating zero-addressed local composition. "Deduction" means that it is attempted to justify the eminent role of those chords from an ontological point of view, as an a priori condition that precedes their role as structural fundamentals of harmony. In any case, this foundation is based on just tuning and contradicts the practice of well-tempered tuning. Mostly, the argument in favor of this contradiction is that we are living a compromise, a neat refusal of the tempered tuning being the exception, such as with Vogel who even ran a political initiative to banish tempered instruments from public schools. Our concern is however not a dogmatic one; we would rather offer an approach to fundamental harmony which also works for tempered tuning and, at the same time, evokes the
rationales in just tuning ideology. In his approach, Schönberg views the major triad as an "imitation of its fundamental tone by partials". Starting from c, he denotes the sequence of partials c', g', c'', e'', g'', b♭'', c''', ... where the first three octave pitch classes are c, g, e (in this order!), i.e., an inversion equivalent of the major triad. There is no immanent reason to stop at this stage, except that one step further, the catastrophe happens: the "natural seventh" (German: "Naturseptime"), with tonality b♭, where after 2, 3, 5 the new prime 7 appears in the frequency ratio, and we have to extend the EulerModule. So the triad is not really given as a closed unit, but as a first segment of an infinite chain. For the 12-tempered tuning, the 'partial' argument does not work, and we have to look for other mechanisms. Whereas the partial frequencies are positive multiples of the fundamental frequency, the 12-tempered tuning offers endomorphisms on 12-Temp and on $PiMod_{12}$. Let us look at the pitch classes in $PiMod_{12}$. If $k \in \mathbb{Z}_{12}$ is a zero-addressed pitch class, and if $f = e^x.n$ is an endomorphism, its effect on $k$ is $f(k) = e^{k.n}x = x + k.n$, the $k$-fold translation of $n$ from $x$. So the action of $f$ on all pitch classes $k = 0, 1, 2, 3, \dots, 11$ is just the sequence

$$x, \; x + n, \; x + 2.n, \; \dots, \; x + 11.n$$
which in music theory is called the $n$-circle from $x$ (the number $n$ corresponding to multiplication in $\mathbb{Z}_{12}$).

Exercise 56 An instructive example is the "circle of fourths" and the "circle of fifths" (just run through the circle of fourths in the opposite direction). Major scales are usually arranged in this order: ascending: C, F, B♭, E♭, A♭, D♭, descending: C, G, D, A, E, B, F♯, where $F = e^5.C$, etc., and $G = e^{-5}.C$, etc. It is well-known that the flat signs increase one-by-one on the fourth circle, whereas the sharp signs increase one-by-one on the fifth circle when we traverse the scales in this order. Why? Since this question only regards the white and black keys, we may really work in tempered terminology. The musician's common argument is that we may go from C to F by the alteration b ↦ b♭, and then proceed analogously. But the black keys do not shift along the transition from C to F. The exact reason is that we may multiply everything by 5 and then work in the chain of fourths. Thereby, C switches to $5.C = \{b, e, a, d, g, c, f\}$, an uninterrupted sequence of fourths. And the shifted scale $F = e^5.C$ switches to $5.F = 5.e^5.C = e^1.(5.C)$, the shift of the switched scale $5.C$ by one unit. This shifting operation is repeated in every fourth step of scales, and therefore, each new scale adds one black key, i.e., one flattening of a white key. Check the details of this argument. Try to convince musicians that the argument does not work for other white-black-key distributions such as those which would define "white = melodic C minor, black = remaining keys".

The 12-tempered simulation of the partial argument is this: We start from the fundamental tone $c = 0 \in \mathbb{Z}_{12}$ and take an endomorphism $S = e^7.3$ of $PiMod_{12}$. We consider the monoid $\langle S \rangle$ generated by $S$ and let it act on the fundamental. We have this orbit:
$$\langle S\rangle(c) = \{c,\ S(c) = g,\ S^2(c) = e,\ S^3(c) = g, \ldots\} = I_C$$
CHAPTER 24. HARMONIC TOPOLOGY
which is precisely the major C triad. Observe that the order of images is the same as with partials: $c, g, e$, except that the orbit is completed on this triad; no further tones must be cut in order to obtain the triad. Exercise 57 Show that the minor triad is a similar orbit $\langle T\rangle(g)$. Show that there are exactly two endomorphisms $W$ such that $\langle W\rangle(x) = I_C$ for a selected tone $x$. This suggests the question of whether this method produces other prominent chords and whether there are chords which look essentially different from those defined by this orbit method. The second question is easily answered: Yes, take the dominant seventh chord $V_C^7 = \{g, b, d, f\}$. This one cannot be defined as an orbit of a cyclic monoid $\langle S\rangle$ which acts on a single tone. One needs at least two generators for such an orbit, e.g., $R = e^8.3$, $T = e^2.0$. We then have
$$\langle R\rangle(g) = \{g,\ R(g) = f,\ R^2(g) = b,\ R^3(g) = f, \ldots\},\quad \langle T\rangle(g) = \{g,\ T(g) = \mathrm{const} = d, \ldots\},\quad R(d) = d,$$
and therefore $\langle R, T\rangle(g) = V_C^7$.
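The orbit computations above are easy to check mechanically. The following is a minimal sketch, not part of the original text: it assumes pitch classes are modelled as integers mod 12 with $c = 0$, and an endomorphism $e^x.n$ as the map $k \mapsto n.k + x$. The helper names `affine` and `orbit` are illustrative only.

```python
# Sketch: pitch classes are integers mod 12 (c = 0, d = 2, e = 4, f = 5, g = 7, b = 11).
# The helper names below are illustrative, not notation from the text.

def affine(x, n):
    """Return the endomorphism e^x.n of Z_12, i.e. k -> n*k + x (mod 12)."""
    return lambda k: (n * k + x) % 12


def orbit(generators, start):
    """Orbit of `start` under the monoid generated by `generators`.

    The identity belongs to the monoid, so `start` itself is in the orbit.
    """
    seen = {start}
    frontier = [start]
    while frontier:
        k = frontier.pop()
        for f in generators:
            image = f(k)
            if image not in seen:
                seen.add(image)
                frontier.append(image)
    return seen


# Circle chord generated by S = e^7.3 and the fundamental c = 0.
S = affine(7, 3)
c_major = orbit([S], 0)

# Two generators R = e^8.3 and T = e^2.0 acting on g = 7.
R = affine(8, 3)
T = affine(2, 0)
r_orbit = orbit([R], 7)
dominant_seventh = orbit([R, T], 7)

# Exercise 56: multiplying by 5 turns the C major scale into a chromatic run,
# and the transposition by a fourth (e^5) into a shift by one unit (e^1).
C_scale = {0, 2, 4, 5, 7, 9, 11}
F_scale = {(k + 5) % 12 for k in C_scale}
five_C = {(5 * k) % 12 for k in C_scale}
five_F = {(5 * k) % 12 for k in F_scale}

print(sorted(c_major), sorted(dominant_seventh))
print(sorted(five_C), sorted(five_F))
```

Representing the monoid action by a simple graph search avoids enumerating the monoid itself; only the reachable pitch classes matter for the chord.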
Possibly, this bigeneric monoid simulates what Oettingen called "Bissonanz"; however, here it is not given by two unrelated triads (as with Oettingen) but by two symmetries acting on one fundamental tone. Before we set forth a systematic discussion of this generative approach to chord theory, let us answer the first of the above questions. We call a zero-addressed chord of shape $Ch = \langle S\rangle(x)$ a circle chord (generated by $S$ and $x$). Any chord which is isomorphic to a circle chord is also a circle chord. In fact, if $h : PiMod_{12} \xrightarrow{\sim} PiMod_{12}$ is an automorphism inducing an isomorphism $h: \langle S\rangle(x) \xrightarrow{\sim} X$, then we have $X = \langle h.S.h^{-1}\rangle(h(x))$. So we are left with the problem of describing all circle chords for representatives of conjugacy classes of symmetries of $PiMod_{12}$. The table of these sixteen chord classes in figure 24.2 shows that, except for the one class number 33 (see the classification of chords in appendix L.1), all classes are common chords. Fact 12 Circle chords really yield a generative fundament for basic chords in harmony, and one may refrain from evoking just temperament and tuning compromises when laying this basis to harmony.
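The conjugation identity $X = \langle h.S.h^{-1}\rangle(h(x))$ can be confirmed by exhaustive search over $\mathbb{Z}_{12}$. The sketch below is an illustration under the assumption that endomorphisms are the 144 affine maps $e^x.n$, stored as pairs `(x, n)`, and that automorphisms are those with $n \in \{1, 5, 7, 11\}$; all function names are hypothetical.

```python
# Sketch: an affine endomorphism e^x.n of Z_12 is stored as the pair (x, n).
# All names are illustrative assumptions.

UNITS = (1, 5, 7, 11)  # invertible linear factors mod 12


def apply(f, k):
    """Evaluate e^x.n at the pitch class k."""
    x, n = f
    return (n * k + x) % 12


def compose(f, g):
    """Return f after g, i.e. k -> f(g(k))."""
    (x, n), (y, m) = f, g
    return ((n * y + x) % 12, (n * m) % 12)


def inverse(h):
    """Inverse of an automorphism e^y.u; every unit u mod 12 satisfies u*u = 1."""
    y, u = h
    return ((-u * y) % 12, u)


def circle_chord(f, x):
    """The orbit {x, f(x), f(f(x)), ...} of the cyclic monoid generated by f."""
    chord = set()
    k = x
    while k not in chord:
        chord.add(k)
        k = apply(f, k)
    return frozenset(chord)


endomorphisms = [(x, n) for x in range(12) for n in range(12)]
automorphisms = [(y, u) for y in range(12) for u in UNITS]

# h(<S>(x)) must equal <h.S.h^-1>(h(x)) for every choice of h, S and x.
conjugation_ok = all(
    frozenset(apply(h, k) for k in circle_chord(s, x))
    == circle_chord(compose(compose(h, s), inverse(h)), apply(h, x))
    for h in automorphisms
    for s in endomorphisms
    for x in range(12)
)
print(conjugation_ok)
```

Because the check runs over all automorphisms, generators and starting tones, it covers every circle chord and not only the representatives shown in the figure.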
24.1.3
Enharmonic Projection
Summary. There is an enharmonic projection from Euler pitch classes to 12-tempered pitch classes which relates constructions in the two spaces. These relations are discussed.
Just tuning and well-tempered tuning are fundamentally different choices of pitch spaces within the EulerSpace. The latter is a free $\mathbb{Z}$-submodule of rank one generated by $\frac{1}{12}o$, whereas the former is free of rank three, i.e., the EulerZModule, and intersects the well-tempered module
24.1. CHORD PERSPECTIVES
Figure 24.2: The 16 circle chords in $PiMod_{12}$, together with their generating symmetry action (arrows). The numbers refer to the classification of chords. A large arrow from one chord to another means that the target chord is isomorphic to a subchord of the start chord, and that the subchord is generated by a suborbit of the same symmetry action as for the superchord.
only in the octave-submodule $\mathbb{Z}.o$. The usual practical argument for switching between the two tunings is that the pitch differences between the corresponding Euler points are relatively small. The theoretical counterpart of this switch rests on the so-called enharmonic identification under the comma module CommaZModule. Classically, two points in the EulerZModule are called enharmonic (with each other) iff their difference lies in the submodule $\mathbb{Z}.K_t \oplus \mathbb{Z}.K_q \oplus \mathbb{Z}.o$ of the CommaZModule. We shall more generally say that they are enharmonic iff their difference is in the CommaZModule, i.e., they are identified by the enharmonic projection
$$\mathrm{enh}: EulerZModule \to EulerZModule/CommaZModule \xrightarrow{\ \sim\ } \mathbb{Z}_{12}$$
(see section 24.1.1.1, theorem 28) which besides the fifth and third commata annihilates the octave. Enharmonic identification is a construction which rests on the cognitive rationale of commata, i.e., their role for auditory identification, but it has nothing to do with tempered tuning. The enharmonic projection is not a tuning, but an abstraction. So making music theory on this abstraction is one business, whereas realizing this abstraction on the 12-tempered tuning is another. In his processual interpretation of enharmonic identification via theorem 28, Noll charges the enharmonic projection with a cognitive dimension in the following sense. Look at the commutative diagram issued from section 24.1.1.1:
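$$\begin{array}{ccc}
EulerZModule & \xrightarrow{\ Ex\ } & SL(2,\mathbb{Z}) \\
\downarrow{\scriptstyle \mathrm{enh}} & & \downarrow \\
EulerZModule/CommaZModule & \xrightarrow{\ \sim\ } & SL(2,\mathbb{Z})/(SL(2,\mathbb{Z}), SL(2,\mathbb{Z}))
\end{array}$$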
where the upper horizontal map $Ex$ is the exponential map, and the lower isomorphism identifies both quotient groups with $\mathbb{Z}_{12}$. The lifting map $Ex$ interprets the enharmonic homomorphism enh as follows: A point in the EulerZModule is lifted to a product of two-by-two matrices, where the fifth is lifted to the upper triangular generator and the third to the lower triangular generator. The general Euler point is lifted to a product of such matrices. Cognitively, this means that the identification of an Euler point is given by a sequence of steps of either fifth or third generator type. The non-commutativity of the concatenation of fifth and third generators is resolved by identification of commutators $x.y.x^{-1}.y^{-1}$ with the identity. And this precisely yields the group $\mathbb{Z}_{12}$. If instead we stay in the modular group $SL(2, \mathbb{Z})$, the non-commutativity of the fifth and third has been interpreted by Noll [400] as a rationale of the enharmonic identification in the same sense that non-commutativity of operators gives rise to the uncertainty relation in quantum mechanics: enharmonicity resembles uncertainty. As we already stated, the nature of the modular lifting has, up to now, neither formal nor musicological or cognitive justifications; not every mathematical structure yielding $\mathbb{Z}_{12}$ in some way is automatically musicologically justified. However, the enharmonic projection helps understand the above remark concerning the automorphism $U$ of the chromatic scale $X$ (see the linear part $U_0$ of $U$ in formula 24.2) when it is transferred to pitch classes via "literal transformation".
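Definition 78 An endomorphism $\phi$ of the EulerZModule is called enharmonic iff it commutes with the enharmonic projection, i.e., iff there is a (necessarily unique) endomorphism $\bar\phi$ of $PitchMod_{12}$ such that the diagram
$$\begin{array}{ccc}
EulerZModule & \xrightarrow{\ \phi\ } & EulerZModule \\
\downarrow{\scriptstyle \mathrm{enh}} & & \downarrow{\scriptstyle \mathrm{enh}} \\
PitchMod_{12} & \xrightarrow{\ \bar\phi\ } & PitchMod_{12}
\end{array} \tag{24.5}$$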
commutes. Here is a characterization of enharmonic endomorphisms:
Proposition 41
(i) Every translation $\phi = e^s$ is enharmonic.
(ii) An endomorphism $\phi$ of the EulerZModule is enharmonic iff its linear part $\phi_0$ is, i.e., iff $\phi_0(CommaZModule) \subset CommaZModule$.
(iii) The enharmonic endomorphisms define a multiplicative monoid $End_{enh}(EulerZModule) \subset End(EulerZModule)$. The invertible elements among the enharmonic endomorphisms define a subgroup, the enharmonic group $\overrightarrow{GL}_{enh}(EulerZModule)$.
(iv) Rewriting the Euler points in the basis $o, K_t, q$, and [...] in this basis, [...] $\equiv a_{23} \equiv 0 \mod 12$.
Proof: Exercise for the reader, except for the statement concerning the enharmonic group $\overrightarrow{GL}_{enh}(EulerZModule)$: Observe that an invertible linear endomorphism which leaves the CommaZModule invariant automatically induces an automorphism of CommaZModule since its index in EulerZModule is finite. This statement also follows directly from the calculation of the inverse of a matrix and the criterion (iv) of the proposition.
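Criterion (ii) of proposition 41 lends itself to a direct computational test. The following sketch is not taken from the text: it assumes Euler points are written as integer triples (octaves, fifths, major thirds), that the enharmonic projection sends the octave to 0, the fifth to 7 and the third to 4 in $\mathbb{Z}_{12}$, and that the kernel is generated by the octave, the syntonic comma and the Pythagorean comma. These three generators are one convenient choice (their determinant is 12, the index of the kernel) and need not coincide with the basis used in (iv).

```python
# Sketch under stated assumptions: Euler points are integer triples
# (octaves, fifths, thirds); names and coordinates are illustrative.

def enh(point):
    """Assumed enharmonic projection: octave -> 0, fifth -> 7, third -> 4 (mod 12)."""
    _octaves, fifths, thirds = point
    return (7 * fifths + 4 * thirds) % 12


# One generating set of the kernel of enh:
# octave, syntonic comma (4 fifths - 2 octaves - 1 third), Pythagorean comma.
KERNEL_GENERATORS = [(1, 0, 0), (-2, 4, -1), (-7, 12, 0)]


def mat_vec(matrix, v):
    """Apply a 3x3 integer matrix (list of rows) to a column vector."""
    return tuple(sum(row[j] * v[j] for j in range(3)) for row in matrix)


def is_enharmonic_linear(matrix):
    """A linear endomorphism is enharmonic iff it maps the kernel of enh into itself."""
    return all(enh(mat_vec(matrix, g)) == 0 for g in KERNEL_GENERATORS)


def induced_map(matrix):
    """Induced map on Z_12, computed on the section k -> 7k fifths (enh(7k fifths) = k)."""
    return [enh(mat_vec(matrix, (0, 7 * k, 0))) for k in range(12)]


IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
NEGATION = [[-1, 0, 0], [0, -1, 0], [0, 0, -1]]
SWAP_FIFTH_THIRD = [[1, 0, 0], [0, 0, 1], [0, 1, 0]]

print(is_enharmonic_linear(IDENTITY), is_enharmonic_linear(NEGATION))
print(is_enharmonic_linear(SWAP_FIFTH_THIRD))
print(induced_map(NEGATION))
```

Testing only the kernel generators suffices because the composition of enh with a linear map is again a homomorphism; the induced map is meaningful only when the test succeeds.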
Corollary 11 ([400, III.4.1]) The autocomplementarity involution U of the just chromatic chord X and its conjugate V, the autocomplementarity involution of 3, are both enharmonic. In fact, the first follows from the matrix criterion above, and the second follows from the fact that translations, and therefore conjugation with translations, are enharmonic.
Corollary 12 ([400, III.4.1]) The chromatic chord X and its translate 3 define two sections $PiMod_{12} \to EulerZModule$ to the enharmonic projection. The enharmonic autocomplementarity involutions U and V of X and 3, respectively, induce the autocomplementarity involutions $e^6.5$ and $e^2.5$ on X and 3, respectively. This follows from corollary 11 and from proposition 40. So the literal translation is in fact induced by the enharmonic projection, and we easily recognize that the tiling lattice in proposition 40 is the CommaZModule. So the autocomplementarity function U permutes the tiling lattice and therefore also the tiling as such. Conversely, modulo CommaZModule-translations, any affine permutation of the chromatic tiling identifies to the autocomplementarity involution.
Theorem 29 The group of affine automorphisms of the x-chromatic tiling of the Euler Z Module is the semi-direct product eCommaZModule >
The moral of this discussion is that in both cases, 12-tempered and just tuning, certain endomorphisms of the underlying pitch space are crucial structures for understanding zero-addressed harmonic and, as will be shown in the counterpoint theory, contrapuntal phenomena. Intuitively, the above example of the automorphism groups of chromatic interpretations (by Vogel's X or by its 'intervallic shift' 3) suggests that this order two group is composed of two agents: the identity for the consonant, and the autocomplementarity involution for the dissonant 'character'. In the same spirit, the preceding generators of zero-addressed circle chords in 12-tempered pitch class space suggested that these agents should be viewed as if they were tones of a more general nature. In the sequel we shall set up a more systematic context for such agents and their relations.
24.2
Chord Topologies
Summary. Chord topologies are built upon chord sets which are invariant under collections of symmetries. These sets define a topological basis.
Although this section deals with and stems from investigations of harmonic structures, it turns out that some fundamental buildings can be carried over to completely general local compositions. We prepend this setup before specializing to proper harmony.
24.2.1
Extension and Intension
Summary. The general technical setup for intension-extension topologies (to be presented in the next section) in terms of local compositions is presented.
Let $F$ be any form space (which we identify with its functor if no confusion is likely), $A$ an address, $a \subset @A \times F$ an $A$-addressed local composition in $F$, and $f : F \to F$ an endomorphism. We shall say in this context that $f$ is an endomorphism of $a$ if $f/1 : a \to a$ is a morphism, i.e., if the image of $a$ under $f$ is contained in $a$. Denote by $SemiEnd(F)$, $MonEnd(F)$ the set of subsemigroups, submonoids of $End(F)$ (with the identity $Id_F$ as neutral element), respectively. Definition 79 Let $M \subset End(F)$ be a set of endomorphisms of $F$ and $A$ an address. The set
$$Ext_A(M) = \{a \in A@\Omega^F \mid f \text{ is an endomorphism of } a \text{ for all } f \in M\} \tag{24.6}$$
is called a basic extension of $F$ at address $A$. The sets
$$Int(M) = \{m \in SemiEnd(F) \mid M \subset m\}, \tag{24.7}$$
$$Int_e(M) = Int(M) \cap MonEnd(F), \tag{24.8}$$
are called basic (semigroup, monoid, respectively) intensions of $F$. For a set $E \subset A@\Omega^F$, the monoid
$$Int(E) = \{f \mid f \in End(F),\ f \text{ is an endomorphism of every } a \in E\} \tag{24.9}$$
is called the intension$^3$ of $E$. Sorite 9 With the above notation, we have the following facts:
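(i) For $M \subset N \subset End(F)$, we have $Ext_A(N) \subset Ext_A(M)$, $Int(N) \subset Int(M)$, and $Int_e(N) \subset Int_e(M)$.
(ii) For $M \subset End(F)$, $Ext_A(M) = Ext_A(\langle M\rangle) = Ext_A(\langle M\rangle_e)^4$, $Int(M) = Int(\langle M\rangle)$, and $Int_e(M) = Int_e(\langle M\rangle) = Int_e(\langle M\rangle_e)$.
(iii) For $M, N \subset End(F)$,
$$Ext_A(M) \cap Ext_A(N) = Ext_A(M \cup N) = Ext_A(\langle M, N\rangle) = Ext_A(\langle M, N\rangle_e),$$
and
$$Int(M) \cap Int(N) = Int(M \cup N) = Int(\langle M, N\rangle),$$
$$Int_e(M) \cap Int_e(N) = Int_e(M \cup N) = Int_e(\langle M, N\rangle) = Int_e(\langle M, N\rangle_e).$$
(iv) For a set $E \subset A@\Omega^F$, we have $E \subset Ext_A(Int(E))$, and for a set $M \subset End(F)$, we have $M \subset Int(Ext_A(M))$.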
(v) For an $A$-addressed local composition $a \subset @A \times F$, we set $Int(a) = Int(\{a\})$ and call it the intension of $a$. Then we have
$$Ext_A(Int(a)) = \{b \mid f/1 : a \to a \text{ implies } f/1 : b \to b\}.$$
(vi) For a set $M \subset End(F)$ and an address change $\alpha : A \to B$, we have
$$Ext_B(M).\alpha \subset Ext_A(M).$$
If $\alpha$ is a retraction, i.e., there is a right inverse $\beta : B \to A$, $\alpha.\beta = Id_B$, then, for any local composition $a \subset @B \times F$, $Int(a.\alpha) = Int(a)$, whence
$$\alpha^{-1}(Ext_A(M)) = Ext_B(M).$$
(vii) For a set $M \subset End(F)$, and $a, b \in Ext_A(M)$, we have $a \cap b, a \cup b \in Ext_A(M)$.
(viii) If $a \subset A@F$ is an objective local composition, the set of endomorphisms $f/1 : a \to a$ qua objective local composition coincides with the intension of its functorial counterpart $\hat{a}$.
$^3$ Terminology introduced by Noll [403] in the case of self-addressed chords of pitch classes and to be justified below in chapter 25.
$^4$ $\langle M\rangle$ is the semigroup generated by $M$, whereas $\langle M\rangle_e$ is the monoid generated by $M$, see appendix C.2.3.
Proof. All statements except (vi), (vii) and (viii) are straightforward. Statement (viii) follows from the fact (see proposition 3 in chapter 8) that objective local compositions build a full subcategory of the functorial local compositions, and that underlying form endomorphisms of the functorial versions carry over to the objective versions. To prove statement (vi), let $a \in Ext_B(M)$. Then, by definition, we have a cartesian diagram
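$$\begin{array}{ccc}
a.\alpha & \longrightarrow & a \\
\downarrow & & \downarrow \\
@A \times F & \xrightarrow{\ @\alpha \times Id_F\ } & @B \times F
\end{array} \tag{24.10}$$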
and therefore, for any $m \in M$, we have a unique vertical arrow making the diagram
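[Cube diagram: the cartesian square (24.10) forms the top and the bottom face; the vertical arrows are $@Id_A \times m$ on $@A \times F$, $@Id_B \times m$ on $@B \times F$, the given endomorphism $a \to a$, and the induced arrow $a.\alpha \to a.\alpha$.]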
commute. This shows that $a.\alpha \in Ext_A(M)$. To prove the second part of (vi), consider $a.\alpha \in Ext_A(M)$. Since under the hypothesis on $\alpha$, the transformation $@\alpha \times Id_F$ is epi, and since the upper horizontal square is cartesian, the morphism $a.\alpha \to a$ is also epi. Therefore, there is a vertical arrow $a \to a$ making the entire cube commute, and we conclude $a \in Ext_B(M)$ and therefore, $Int(a.\alpha) = Int(a)$. By the first part of (vi), we conclude that $\alpha^{-1}(Ext_A(M)) = Ext_B(M)$. QED.
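Statements (i), (iii), (iv) and (vii) of sorite 9 can be watched at work in the simplest situation. The sketch below is an illustration only: it assumes $F = @\mathbb{Z}_{12}$ at the zero address, so that local compositions are subsets of $\mathbb{Z}_{12}$ (encoded as 12-bit masks) and $End(F)$ is taken to be the 144 affine maps $e^x.n$. The function names `ext` and `intension` are hypothetical stand-ins for $Ext_0$ and $Int$.

```python
# Sketch: zero-addressed chords in Z_12 as 12-bit masks, endomorphisms as pairs (x, n).
# This finite model and all names are illustrative assumptions.

ENDOS = [(x, n) for x in range(12) for n in range(12)]
N_CHORDS = 1 << 12


def image_table(f):
    """Table chord -> image chord under e^x.n, built bit by bit."""
    x, n = f
    table = [0] * N_CHORDS
    for chord in range(1, N_CHORDS):
        low = chord & -chord                 # lowest set bit of the chord
        k = low.bit_length() - 1             # its pitch class
        table[chord] = table[chord ^ low] | (1 << ((n * k + x) % 12))
    return table


IMAGE = {f: image_table(f) for f in ENDOS}


def stabilizes(f, chord):
    """True iff f is an endomorphism of the chord, i.e. f(chord) is inside chord."""
    return (IMAGE[f][chord] & ~chord) == 0


def ext(M):
    """Basic extension: all chords having every f in M as an endomorphism."""
    return {a for a in range(N_CHORDS) if all(stabilizes(f, a) for f in M)}


def intension(E):
    """Intension: all endomorphisms shared by every chord in E."""
    return {f for f in ENDOS if all(stabilizes(f, a) for a in E)}


def mask(pitches):
    return sum(1 << p for p in pitches)


M = {(7, 3)}                      # S = e^7.3
N = {(7, 3), (8, 3)}              # M enlarged by R = e^8.3
E = {mask({0, 4, 7})}             # the C major triad

ext_M, ext_N = ext(M), ext(N)
monotone = ext_N <= ext_M                                   # statement (i)
meet = (ext(M) & ext({(8, 3)})) == ext(M | {(8, 3)})        # statement (iii)
galois_M = M <= intension(ext_M)                            # statement (iv)
galois_E = E <= ext(intension(E))                           # statement (iv)
lattice = all((a & b) in ext_M and (a | b) in ext_M
              for a in ext_M for b in ext_M)                # statement (vii)
print(monotone, meet, galois_M, galois_E, lattice)
```

The two inclusions in (iv) are the unit and counit of a Galois connection between sets of chords and sets of endomorphisms, which is what makes the basic extensions usable as a topological base.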
24.2.2
Extension and Intension Topologies
Summary. Intension and extension topologies on local compositions are presented.
With the above technicalities, we may now introduce different topological structures.
Definition 80 The collection of basic semigroup, monoid, respectively, intensions of $F$ define bases for two topologies on $SemiEnd(F)$, $MonEnd(F)$, respectively, the (semigroup, monoid, respectively) intension topologies $SemInTop(F)$, $InTop(F)$ of $F$, respectively.
Lemma 38 The projection
$$?_e : SemiEnd(F) \to MonEnd(F)$$
(see appendix C.2.3, exercise 84) is continuous and open onto the (basic) open subspace $MonEnd(F) = Int(Id_F)$ of $SemiEnd(F)$. Proof. Clearly, $MonEnd(F) = Int(Id_F) \subset SemiEnd(F)$. To see that $?_e$ is continuous, take a basic monoid intension $Int_e(M)$. Its inverse image under $?_e$ is $\{s \mid M \subset s_e\}$. Suppose first that $Id_F \notin M$. Then $M \subset s_e = s \cup \{Id_F\}$ implies $M \subset s$, and conversely. So $?_e^{-1}(Int_e(M)) = Int(M)$. If $Id_F \in M$, then $s \in\ ?_e^{-1}(Int_e(M))$ iff $M \setminus \{Id_F\} \subset s_e \setminus \{Id_F\}$. This implies that $?_e^{-1}(Int_e(M)) = Int(M \setminus \{Id_F\})$, so the map is continuous. It is open since we have $?_e(Int(M)) = Int_e(M)$, and we are done.
Sorite 10 The intension topologies have these properties:
(i) With the preceding notation, if $s \in SemiEnd(F)$, $m \in MonEnd(F)$, respectively, then $Int(s) = \{t \in SemiEnd(F),\ s \subset t\}$, $Int_e(m) = \{n \in MonEnd(F),\ m \subset n\}$ are the smallest open neighborhoods of $s$, $m$ in $SemInTop(F)$, $InTop(F)$, respectively.
(ii) A subset $X \subset SemiEnd(F)$, $X \subset MonEnd(F)$, respectively, is closed iff for a subsemigroup $t$, submonoid $n$, respectively, $t \subset s$, $s \in X$ or $n \subset m$, $m \in X$ implies $t \in X$, $n \in X$, respectively.
(iii) The closure of a point $s \in SemiEnd(F)$, $m \in MonEnd(F)$, respectively, is $\bar{s} = \{t \in SemiEnd(F) \mid t \subset s\}$, $\bar{m} = \{n \in MonEnd(F) \mid n \subset m\}$, respectively.
(iv) A closed subset $X \subset SemiEnd(F)$, $X \subset MonEnd(F)$, respectively, is irreducible iff for $m, n \in X$, $\langle m, n\rangle \in X$, $\langle m, n\rangle_e \in X$, respectively.
(v) If every monoid in $MonEnd(F)$ is finitely generated (for example, if $End(F)$ is finite), then $SemInTop(F)$ and $InTop(F)$ are sober.
Proof. We prove the monoid case, the other is completely analogous. The first three statements are straightforward. As to the fourth, the irreducibility means that for any two $m, n \in X$, their smallest neighborhoods $Int(m)$, $Int(n)$ intersect in $X$, i.e., there is $r \in X$ with $n, m \subset r$; therefore the monoid $\langle m, n\rangle_e$ is in $X$ since $X$ is closed. As to the last statement, if $X$ is closed and irreducible, there is at most one generic point. In fact, $\bar{x} = X = \bar{y}$ means $x \subset y \subset x$. To find the generic point, consider the monoid $U = \langle m, m \in X\rangle_e$. Since $U$ is finitely generated, there are finitely many $m_1, \ldots, m_k$ such that $U = \langle m_1, \ldots, m_k\rangle_e$. But by the last but one statement, this is a member of $X$, and we have the generic point. QED.
Definition 81 The collection of basic extensions of $F$ at address $A$ defines a base for a topology on the set $A@\Omega^F$ of $A$-addressed local compositions in $F$, the extension topology $ExTop_A(F)$ of $F$ at address $A$.
Corollary 13 For a local composition $a \in ExTop_A(F)$, the set
$$\{b \mid f/1 : a \to a \text{ implies } f/1 : b \to b\} = \{b \mid Int(a) \subset Int(b)\}$$
is the smallest neighborhood of $a$, whereas the topological closure of $a$ is
$$\bar{a} = \{b \mid f/1 : b \to b \text{ implies } f/1 : a \to a\} = \{b \mid Int(b) \subset Int(a)\}.$$
This follows from sorite 9, statement (v). Corollary 14 The three topologies $ExTop_A(F)$, $SemInTop(F)$, $InTop(F)$ are irreducible with generic points $A@F^{\wedge}$, $End(F)$, $End(F)$, respectively (non-unique). The topology $ExTop_A(F)$ is the inverse image topology of $InTop(F)$ under the map
$$Int_A : ExTop_A(F) \to InTop(F) : a \mapsto Int(a) \tag{24.11}$$
which therefore is automatically continuous. The fibers $Int_A^{-1}Int_A(a)$ consist of the $A$-addressed local compositions $b$ in $F$ with $\bar{b} = \bar{a}$. Proof. The irreducibility claims are clear. For a set $M \subset End(F)$, we have $a \in Ext_A(M)$ iff $M \subset Int(a)$ iff $Int(a) \in Int_e(M)$, in other words, the inverse image $Int_A^{-1}Int_e(M)$ equals $Ext_A(M)$. Corollary 15 For an address change $\alpha : B \to A$ which is a retraction, the natural map $.\alpha : ExTop_A(F) \to ExTop_B(F)$ is continuous, and the diagram
$$\begin{array}{ccc}
ExTop_A(F) & \xrightarrow{\ .\alpha\ } & ExTop_B(F) \\
& {\scriptstyle Int_A}\searrow \quad \swarrow{\scriptstyle Int_B} & \\
& InTop(F) &
\end{array} \tag{24.12}$$
commutes. This follows from sorite 9, statement (vi), in particular, we have $Int(a.\alpha) = Int(a)$ in this case.
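Corollary 13 describes the extension topology entirely through inclusions of intensions, so neighborhoods and closures can be computed by comparison of endomorphism sets. The sketch below assumes the same finite model as before (zero-addressed chords in $\mathbb{Z}_{12}$ as 12-bit masks, $End(F)$ as the 144 affine maps); it is an illustration, and the function names are hypothetical.

```python
# Sketch: specialization order of the extension topology for zero-addressed
# chords in Z_12. int_mask[chord] has bit i set iff ENDOS[i] stabilizes the chord.

ENDOS = [(x, n) for x in range(12) for n in range(12)]
N_CHORDS = 1 << 12
FULL = N_CHORDS - 1

int_mask = [0] * N_CHORDS
for i, (x, n) in enumerate(ENDOS):
    image = [0] * N_CHORDS
    for chord in range(1, N_CHORDS):
        low = chord & -chord
        k = low.bit_length() - 1
        image[chord] = image[chord ^ low] | (1 << ((n * k + x) % 12))
    for chord in range(N_CHORDS):
        if (image[chord] & ~chord) == 0:      # image of the chord lies in the chord
            int_mask[chord] |= 1 << i


def smallest_neighborhood(a):
    """All chords b with Int(a) contained in Int(b)."""
    return [b for b in range(N_CHORDS) if (int_mask[a] & ~int_mask[b]) == 0]


def closure(a):
    """All chords b with Int(b) contained in Int(a)."""
    return [b for b in range(N_CHORDS) if (int_mask[b] & ~int_mask[a]) == 0]


c_major = (1 << 0) | (1 << 4) | (1 << 7)
neighborhood = smallest_neighborhood(c_major)
closed_hull = closure(c_major)
print(len(neighborhood), len(closed_hull), len(closure(FULL)))
```

In this model the full chord and the empty chord both have intension $End(F)$, so each of them is a generic point, in line with the non-uniqueness noted in corollary 14.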
Proposition 42 The induced topology on the image
$$Int_A(ExTop_A(F)) \subset InTop(F)$$
is homeomorphic to the induced topology of the canonical image
$$i(ExTop_A(F)) \subset ExTop_A(F)^s$$
in the sober topology of $ExTop_A(F)$. If all submonoids in $MonEnd(F)$ are finitely generated, the map $Int_A$ factorizes through $ExTop_A(F)^s$.
Proof. We already know that the fibers of $Int_A$ and of $i$ coincide. A base for the topology of $Int_A(ExTop_A(F))$ is the collection of sets $I(M) = \{Int(a) \mid M \subset Int(a)\}$, $M \subset End(F)$. A base for the induced topology on $i(ExTop_A(F))$ in the sober topology is given by $E(M) = \{\bar{a} \mid Ext_A(M) \cap \bar{a} \neq \emptyset\}$. But this set, by corollaries 13, 14, equals the set $\{\bar{a} \mid \exists b,\ Int(b) \subset Int(a), \text{ and } M \subset Int(b)\}$, and this equals $I(M)$. If all submonoids are finitely generated, we know from sorite 10, statement (v), that the topology $InTop(F)$ is sober, and the statement follows from the universal property of the associated sober space. QED. Remark 8 The fact from proposition 42 which relates the closure of an $A$-addressed local composition $a$ in the extension topology, or, equivalently, the associated point in the sober topology, with its intension $Int(a)$ justifies the terminology of Noll in [400] where $Int(a)$ is called "Hüllakkord", in English "chord closure" or "closure chord". The difference to our point of view is, of course, that the intension of a chord is a set of endomorphisms, and not a chord. But this will be clarified in the following section.
24.2.3
Faithful Addresses
Summary. This section discusses the relation between endomorphisms and their "trace" on local compositions which live at addresses that capture enough information to produce a faithful image of endomorphisms. We deduce topological properties of such addresses.
Definition 82 Let $A$ be an address, $F$ a form space, and $z \in A@F$. The point $z$ is called faithful, full, fully faithful, respectively, if the map $.z : End(F) \to A@F : f \mapsto f.z$ is injective, surjective, bijective, respectively. We also say that the address $A$ is faithful, etc., if there is a point $z$ at that address which has the corresponding property. The classical example of a fully faithful point is the case of a representable space $F = @M$ for a module $M$, and $z = Id_M$. Definition 83 Let $A$ be an address, $F$ a form space, and $z \in A@F$. Then we have a map
$$ex_z : SemInTop(F) \to ExTop_A : m \mapsto (m.z)^{\wedge}$$
which associates with a semigroup $m$ the functorial local composition of the $m$-orbit of $z$; we set $monex_z = ex_z|_{InTop(F)}$ for the restriction of $ex_z$ to the monoids. Proposition 43 If the point $z \in A@F$ is faithful, then the map $monex_z$ is a continuous section of $Int_A$, i.e.,
$$Int_A . monex_z = Id_{InTop},$$
and the induced topology on $Im(monex_z)$ identifies to $InTop(F)$. Proof. If $M \subset End(F)$, then the inverse image of $Ext(M)$ under $monex_z$ is the set of monoids $m$ such that $M \subset End(ex_z(m))$. But the latter is $m$. In fact, by sorite 9, statement (viii), the endomorphisms of $ex_z(m)$ coincide with those of the objective trace $m.z$. But if $f$ is such an endomorphism, we have $f.Id_F.z = \mu.z$, $\mu \in m$. Since $z$ is faithful, $f = \mu$. Since trivially, $m \subset End(ex_z(m))$, the claim follows, and therefore, $M \subset End(ex_z(m))$ means $M \subset m$, i.e., the inverse image of $Ext(M)$ is $Int(M)$. Moreover, by the above, we also have $Int_A . ex_z(m) = End(ex_z(m)) = m$. QED. This means that the topological space $InTop(F)$ of monoids of $F$ can be viewed as a retraction of the topological space $ExTop_A$ of local compositions at a faithful address $A$ (as a function of the chosen faithful point). In particular, if $B$ is any address, and if $z$ is a faithful point at address $A$, we have the composed continuous map
$$intex_{z,B} = ex_z . Int_B : ExTop_B \to ExTop_A \tag{24.13}$$
which associates with any $B$-addressed local composition the "objective trace" of its endomorphism monoid at the faithful address $A$. This means that the closure of a local composition $a$ can be reinterpreted in terms of the objective local composition $intex_{z,B}(a)$, the $z$-objective (or, less precisely, $A$-objective) closure of $a$. This is what, in the elementary case of $F = @\mathbb{Z}_{12}$, $B = 0$, $z = Id_{\mathbb{Z}_{12}}$, was studied by Noll in [400]. Here, we have a fully faithful point and therefore a complete identification of endomorphisms and points. We should, however, keep in mind the ontological difference of endomorphisms and points. But mathematically, this very special case of pitch class theory is indifferent to ontological or more general mathematical perspectives. So far, if $z \in A@F$ is faithful and $\alpha : A \to B$ is a retraction, we have a diagram with these continuous maps
$$ExTop_B \xrightarrow{\ .\alpha\ } ExTop_A, \qquad ExTop_B \xrightarrow{\ Int_B\ } InTop(F) \hookrightarrow SemInTop(F) \xrightarrow{\ ex_z\ } ExTop_A,$$
where a local composition $a$ in $ExTop_B$ is mapped to its natural image $a.\alpha$ and to its objective closure $intex_{z,B}(a)$. In this general setup, the two images have nothing in common. We shall now discuss a context where these images are intimately related. To this end, we suppose that first, $z$ is fully faithful. This means that $.z$ is bijective, call $/z$ its inverse. We further suppose that the image $B@F.\alpha$ of the maximal $B$-addressed objective local composition in $F$ corresponds to a set of mutually right-absorbing endomorphisms of $F$ under $/z$, i.e., for all $u, v \in B@F$,
$$u.\alpha/z \cdot v.\alpha/z = u.\alpha/z.$$
Again, the classical example with $F = @M$, $A = M$, $B = 0$, $z = Id_M$, and $\alpha : M \to 0$ the zero homomorphism, yields the embedding of zero-addressed local compositions as self-addressed local compositions with points $m \in M$ corresponding to the constant points $e^m.0$. These are evidently mutually right-absorbing.
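In the classical example $F = @\mathbb{Z}_{12}$, $A = \mathbb{Z}_{12}$, $z = Id$, both the right-absorbing property of constants and the identity $Int_A . ex_z(m) = m$ of proposition 43 can be verified by enumeration. The following is a sketch under the assumption that self-addressed points are the affine maps $e^x.n$, stored as pairs `(x, n)`, and that an endomorphism acts on them by composition from the left; the helper names are illustrative.

```python
# Sketch: self-addressed points of Z_12 are affine maps (x, n) = e^x.n;
# with z = Id the orbit m.z of a monoid m is m itself, read as a set of points.

ENDOS = [(x, n) for x in range(12) for n in range(12)]
IDENTITY = (0, 1)


def compose(f, g):
    """Return f after g, i.e. k -> f(g(k))."""
    (x, n), (y, m) = f, g
    return ((n * y + x) % 12, (n * m) % 12)


def generated_monoid(generators):
    """Submonoid of End(Z_12) generated by the given affine maps."""
    monoid = {IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        f = frontier.pop()
        for g in generators:
            h = compose(g, f)
            if h not in monoid:
                monoid.add(h)
                frontier.append(h)
    return frozenset(monoid)


def trace_endomorphisms(points):
    """Endomorphisms f mapping the self-addressed chord `points` into itself."""
    return frozenset(f for f in ENDOS
                     if all(compose(f, p) in points for p in points))


# Constant points e^u.0 are mutually right-absorbing.
absorbing = all(compose((u, 0), (v, 0)) == (u, 0)
                for u in range(12) for v in range(12))

# Proposition 43 in this model: the intension of the orbit m.z is m again.
m_cyclic = generated_monoid([(7, 3)])
m_bigeneric = generated_monoid([(8, 3), (2, 0)])
section_ok = (trace_endomorphisms(m_cyclic) == m_cyclic
              and trace_endomorphisms(m_bigeneric) == m_bigeneric)
print(absorbing, section_ok, sorted(m_cyclic))
```

The second check works because the identity lies in every monoid: an endomorphism that keeps the orbit invariant must send $z$ to an orbit point, and faithfulness then identifies it with an element of the monoid.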
A point $z$ that is fully faithful and induces a set of mutually right-absorbing endomorphisms of $F$ on $B$-addressed points of $F$ is called "$B$-absorbing"$^5$. In the following calculations, we shall use the Yoneda notation $x : X \to F$ for the point $x \in X@F$ corresponding to $x : @X \to F$.
Lemma 39 Let $z$ be a $B$-absorbing point and $\alpha : A \to B$ a retraction. If $g \in B@F$, then $g_\alpha := g.\alpha/z$ is constant on $B$-valued points of $F$, i.e., $g_\alpha.h = g$ for all $h : B \to F$. Proof. Since $\alpha$ is an epimorphism, it suffices to show $g_\alpha.h.\alpha = g.\alpha$. But $g_\alpha.h.\alpha = g_\alpha.h_\alpha.z = g_\alpha.z = g.\alpha$, QED.
Corollary 16 Let $z$ be a $B$-absorbing point, $\alpha : A \to B$ a retraction, $b \subset @B \times F$ a non-empty objective local composition at $B$, and $x : B \to F$. Then $x_\alpha$ is an endomorphism of $b$ (or, equivalently, of $b^{\vee}$) iff $x \in b^{\vee}$. Corollary 17 Let $z$ be a $B$-absorbing point, $\alpha : A \to B$ a retraction, and $m \in SemInTop(F)$ a semigroup. Denote by $|m|_\alpha$ the inverse image $(\alpha^{-1}(ex_z(m)^{\vee}))^{\wedge}$ in $B@F$, and call it the $B$-constant part of $m$ (to be precise, we would have to specify the absorbing point and the retraction). Then we have
$$m \subset Int(|m|_\alpha).$$
Proof. Let $g \in m$, $x \in |m|_\alpha$, and denote $y = g.x \in B@F$. Taking over the notation of lemma 39,
$$y.\alpha = g.x.\alpha = g.x_\alpha.z = g_x.z, \quad g_x \in F@F.$$
By hypothesis, $x_\alpha \in m$, and so is $g_x$, i.e., $y.\alpha = g_x.z$, whence $y \in |m|_\alpha$. QED.
Definition 84 For an address change $\alpha : A \to B$, form space $F$, and a local composition $a \subset @A \times F$, we denote
$$a|_\alpha = a \cap B@F.\alpha$$
and call it the $\alpha$-restriction of $a$. Observe that even for objective local compositions, their restriction need not be objective!
Proposition 44 Let $z$ be a $B$-absorbing point, $\alpha : A \to B$ a retraction, and $b \subset @B \times F$ a non-empty objective local composition at $B$. Then we have
$$intex_{z,B}(b)|_\alpha = b.\alpha. \tag{24.14}$$
In particular, for two objective, non-empty $B$-addressed local compositions $b_1, b_2$, we have
$^5$ More precisely: "$\alpha$-absorbing".
Proof. We have
$$(intex_{z,B}(b))^{\vee} \cap B@F.\alpha = End(b).z \cap B@F.\alpha,$$
and the claim follows from corollary 16, QED. Therefore, if $z$ is an absorbing point and $\alpha : A \to B$ a retraction, any objective $B$-addressed local composition $b$ in $F$ fulfills $b.\alpha \subset intex_{z,B}(b)$. This turns the terminology of an "objective closure" into an operator that adds new elements to the given original local composition. The closure gives us back the original local composition by restriction to the "$B$-constant" points, i.e., the natural image of $B@F$ under the address change $\alpha$. Compared to the original approach [400], this presentation is independent of the specific properties of the basic form space $F$. If restricted to the objective local compositions, we have this fact: Fact 14 Let $ObExTop_B(F)$ be the subspace of $ExTop_B$ consisting of objective local compositions. Under the conditions of proposition 44, we have a continuous injection
$$obintex_{z,B} : ObExTop_B(F) \to ObExTop_A(F), \tag{24.15}$$
and we get the original objects back via $\alpha$-restriction.
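For zero-addressed chords in $\mathbb{Z}_{12}$ the objective closure and its restriction to constants can be computed explicitly. The sketch below is an illustration, not the author's construction: it assumes $B = 0$, $A = \mathbb{Z}_{12}$, $z = Id$, so that the closure of a chord $b$ is its set of stabilizing affine maps, and the "$B$-constant" points are the maps $e^x.0$. The names `stabilizer` and `constant_part` are hypothetical.

```python
# Sketch: objective closure of a zero-addressed chord in Z_12 and its
# restriction to constant points; endomorphisms are pairs (x, n) = e^x.n.

from itertools import combinations


def stabilizer(chord):
    """All affine maps e^x.n that map the chord into itself."""
    return {(x, n) for x in range(12) for n in range(12)
            if all((n * k + x) % 12 in chord for k in chord)}


def constant_part(endos):
    """Values x of the constant maps e^x.0 contained in a set of endomorphisms."""
    return frozenset(x for (x, n) in endos if n == 0)


c_major = frozenset({0, 4, 7})
closure_chord = stabilizer(c_major)       # self-addressed "closure chord"
recovered = constant_part(closure_chord)  # restriction to the constants

# The same recovery for every three-element chord.
all_triples_ok = all(
    constant_part(stabilizer(frozenset(t))) == frozenset(t)
    for t in combinations(range(12), 3)
)
print(sorted(closure_chord), sorted(recovered), all_triples_ok)
```

The closure is strictly larger than the image of the chord, since it always contains the identity in addition to the constants $e^x.0$ with $x$ in the chord; restricting to constants discards exactly these extra elements.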
24.2.4
The Saturation Sheaf
Summary. Sheaves of monoids and monoid algebras are associated with each topological space $ExTop_A(F)$.
For an address $A$ and space form $F$, if $M \subset End(F)$, the basic open set $Ext_A(M)$ identifies to $Ext_A(\overline{M})$ if we define
$$\overline{M} = \bigcap_{a \in Ext_A(M)} Int_A(a), \tag{24.16}$$
the $A$-saturation of $M$. In fact, $a \in Ext_A(M)$ iff $M \subset Int_A(a)$, and therefore, $M \subset \overline{M}$, whence $Ext_A(\overline{M}) \subset Ext_A(M)$; the converse is clear. The saturation of $M$ is the largest monoid defining the given basic open set. This gives rise to a sheaf of monoids as follows. Proposition 45 Select an address $B$ and space form $F$. On an open set $U \subset ExTop_B(F)$ the presheaf $Sat_B^F$ defined by
$$Sat_B^F(U) = \begin{cases} \bigcap_{u \in U} Int(u) & \text{if } U \neq \emptyset, \\ 1 & \text{if } U = \emptyset, \end{cases}$$
together with the transition inclusion maps $i : Sat_B^F(U) \hookrightarrow Sat_B^F(V)$ for open sets $V \subset U$, is a sheaf of monoids, the saturation sheaf of $F$ at address $B$. If $\alpha : A \to B$ is a retraction, the continuous map $.\alpha : ExTop_B(F) \to ExTop_A(F)$ canonically extends to a map of sheaves $.\alpha : Sat_A^F \to Sat_B^F$.
Proof. This presheaf is a sheaf because the restrictions are inclusions (everything within the big monoid $End(F)$), and because any two open sets intersect by the irreducibility of $ExTop_B(F)$. Moreover, in the case of a retraction $\alpha$, we know from sorite 9, statement (vi), that for any local composition $a \in ExTop_B(F)$, we have $Int(a) = Int(a.\alpha)$. To define the sheaf morphism, we can restrict to basic open sets $Ext_A(M)$. We have to construct a morphism of monoids $Sat_A^F(Ext_A(M)) \to Sat_B^F(\alpha^{-1}Ext_A(M))$. By sorite 9, statement (vi), we also know that $\alpha^{-1}Ext_A(M) = Ext_B(M)$. So we need $Sat_A^F(Ext_A(M)) \to Sat_B^F(Ext_B(M))$. But we have
$$Sat_A^F(Ext_A(M)) = \bigcap_{a \in Ext_A(M)} Int(a) \subset \bigcap_{b \in Ext_B(M)} Int(b.\alpha)$$
and
$$\bigcap_{b \in Ext_B(M)} Int(b.\alpha) = \bigcap_{b \in Ext_B(M)} Int(b) = Sat_B^F(Ext_B(M)),$$
so the inclusion map of monoids $Sat_A^F(Ext_A(M)) \hookrightarrow Sat_B^F(Ext_B(M))$ does the job. QED.
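The saturation (24.16) is a closure operator on sets of endomorphisms, and its defining properties can be tested in the finite model used before. The sketch below assumes zero-addressed chords in $\mathbb{Z}_{12}$ (as frozensets of pitch classes) and the 144 affine maps as $End(F)$; it is an illustration, and all names are hypothetical.

```python
# Sketch: saturation of a set M of affine endomorphisms of Z_12, computed as
# the intersection of the intensions of all chords in the basic open set Ext(M).

ENDOS = [(x, n) for x in range(12) for n in range(12)]
ALL_CHORDS = [frozenset(k for k in range(12) if (bits >> k) & 1)
              for bits in range(1 << 12)]


def stabilizes(f, chord):
    """True iff e^x.n maps the chord into itself."""
    x, n = f
    return all((n * k + x) % 12 in chord for k in chord)


def ext(M):
    """Basic open set: chords having every f in M as an endomorphism."""
    return [a for a in ALL_CHORDS if all(stabilizes(f, a) for f in M)]


def saturation(M):
    """Endomorphisms shared by all chords of Ext(M)."""
    open_set = ext(M)
    return {f for f in ENDOS if all(stabilizes(f, a) for a in open_set)}


def compose(f, g):
    """Return f after g, i.e. k -> f(g(k))."""
    (x, n), (y, m) = f, g
    return ((n * y + x) % 12, (n * m) % 12)


M = {(7, 3)}                       # the generator S = e^7.3
sat = saturation(M)

contains_M = M <= sat
same_open_set = set(ext(sat)) == set(ext(M))
idempotent = saturation(sat) == sat
is_monoid = (0, 1) in sat and all(compose(f, g) in sat for f in sat for g in sat)
print(sorted(sat), contains_M, same_open_set, idempotent, is_monoid)
```

For a basic open set $U = Ext(M)$ the value of the saturation sheaf on $U$ is exactly this saturated monoid, which is why the sheaf is determined by its values on the base.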
Remark 9 Given any commutative ring $R$, the above constructions canonically extend to sheaves $RSat_A^F$ and morphisms of $R$-algebras via the monoid algebras $RSat_A^F(U) = R\langle Sat_A^F(U)\rangle$. We leave the details to the reader. The saturation sheaf will be used in the discussion in chapter 25 of the tonal function concept as it was developed in [400, 404]. Exercise 58 The stalk of the saturation sheaf $Sat_B^F$ at a point $a \in ExTop_B(F)$ is the monoid $Int(a)$.
Chapter 25
Harmonic Semantics
"Logic" is a fundamental but obscure concept in function theory. (German original: "Logik" ist in der Funktionstheorie ein fundamentaler, aber dunkler Begriff.)
Carl Dahlhaus [100, p.95]
Summary. This chapter is about "understanding" aggregates of pitches in their combination on the syntagmatic/paradigmatic axes. This requires constructions of targets of such an understanding, i.e., harmonic semantics. The present theory succeeds in a (re)construction of function-theoretic semantics which is based on the paradigms associated with classical tonal functions. We discuss different approaches, among others the morphological theory of Noll and the approach of Mazzola based on chains of triads.
We have already seen in the previous discussion of metrical, rhythmical, and motivic issues that the question concerning the position of a compositional item leads to topological considerations. This will also hold for the harmonic perspective. The topology comes in by the fact that the syntagmatic network of a composition is filtered by a paradigmatic assimilation of units such that this simplified structure can be read as a logical one. It is not the scope of this book to establish dogmatic theories of how this logic should look. We rather want to make clear what general processes are involved when logical constructions are elaborated from the crude pitch configurations. There are two general differentiations of such logical methodologies: First, the harmonic functions must be related to the basic address where harmonic facts take place, i.e., the address of the local and/or global compositions which we investigate. This will in fact turn out to generate fundamentally different harmonic ontologies, as has been made clear by Noll's dissertation [400]. Second, harmonic functions germinate from prototypical harmonic units, such as major or minor triads, chains of thirds and the like, where one starts on a firm harmonic (pre)judice and then wants to extend the commonly accepted to less common chords. The point here is the extension approach from common germs to general situations. But it seems that the topological flavor of all the known harmonies is the common denominator, and that, beyond this, the basic constructors of harmony have very little to do with the special pitch domains.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
We have therefore anticipated the chapter on harmonic topologies, in fact, as already stressed in that introduction, a chapter which is only accidentally directed towards harmony. We are not yet in the position to claim that all these topological approaches converge in the common topos-theoretic logic which of course specializes to the logic of open sets of topologies. But it is nevertheless important to open the path to a purely topological interpretation of musical paradigmatics, be it via combinatorial topology or via more or less exotic (but nevertheless canonically defined) sober topologies. The topos logic of predicates as described in section 18.3.2 does in fact suggest the following reinterpretation of harmonic topologies. In the truth form defined in formula (18.7)
$$\mathrm{TRUTH}(I) \xrightarrow[\mathrm{Id}]{} \mathbf{Power}(\mathrm{Val}(I)), \tag{25.1}$$
we took a special simple form $\mathrm{Val}(I)$ associated with a truth module $I$. Instead, we could have taken any form $F$ and considered the $F$-truth form
$$\mathrm{TRUTH}(F) \xrightarrow[\mathrm{Id}]{} \mathbf{Power}(F), \tag{25.2}$$
whose values stem from $F$ instead of $\mathrm{Val}(I)$. This means that the truth denotators at address $A$ are the $A$-addressed local compositions $a \subset @A \times F$. We have the usual Heyting algebra structure of subobjects, together with its logic, as discussed in 18.3.2. But now, we have more: the extension topology $ExTop_A(F)$ with its natural Heyting logic of open sets (see appendix C.2.2 and [314, p.51]). What we should keep in mind in the following discourse is this:
Principle 22 The extension topologies are a natural topos-theoretic framework for predicate logic.
On the one hand, this principle preconizes a seamless transition from logic to geometry, as already indicated in principle 12 in section 18.3.3.3; on the other, it confirms the fuzzy character of harmonic logic insofar as the extension topology is quite contrary to a common geometric space. Rather, it could be associated with the irreducible ring spectra in algebraic geometry.
25.1
Harmonic Signs-Overview
Summary. This section gives an overview of the levels of harmonic semantics.
Generally speaking, the semantics of harmonic signs has always been built via ontological "superspaces", such as the metaphysical level of the Pythagorean tetractys: Greek musicology was based upon transcendence, and the core activity of Pythagorean scholars was the introspection of this mystic unit via instrumental practice. It seems that, at least in the European tradition, the semantics of musical harmony has been attached to a transcendence of one type or another.
In modern times, especially with Hugo Riemann's never accomplished program after Rameau's incipit, harmonic semantics has been focused on the attribution of tonic, dominant, and subdominant meaning to chords. This semantics is a very mysterious one since it is quite formalistic on the one hand, but carries with it a certain amount of romantic connotation, in the spirit of Moritz Hauptmann's dialectic overloading of octave, fifth, and third [548], say. We shall refrain from such transcendental semantics and restrict our analysis to the structural framework which can be recognized as a machinery for attributing to given chord configurations their harmonic functionalities. The basic setup is always this (concrete examples follow below):
1. A pitch space form $F$ is chosen.
2. At a specific address $A$, a set StandardChords of local compositions in $F$ is selected, playing the role of standard chords on which the hoped-for semantics will be based, and where one knows from experience/history/practice what to expect.
3. A set of interpretations $S^I$ of local "scale" compositions $S$ at $A$ by determined subsets $I \subset$ StandardChords is defined; these interpretations define the admitted tonalities in the specific setup.
On this background, the harmonic semantics of any chord, i.e., any local composition $c$ at selected (envisaged by the specific approach) addresses $B$ in $F$, will be constructed by one of these two strategies:
Degree Theory. This approach relates $c$ to the given standard chords and/or tonalities by coverings, i.e., $c$ is included in a specific selection of standard chords and/or tonalities. This means that a chord is viewed as either a part of a standard structure, or else a part of a covering by standard structures, e.g., the seventh chord $G^7$ is covered by the two triadic degrees $V_C$, $VII_C$. The moral of this approach is that the semantical process is a completion of a partial sign to fit in a standard object.
Function Theory. This approach relates $c$ to the given standard chords and/or tonalities by topological proximity to a specific selection of standard chords and/or tonalities. Such a procedure presupposes a topological environment; in fact, we shall see that the extension topologies are the formal realization of this task. The moral of this approach is that the semantical process uses similarity to standard objects without necessarily being part of them as in degree theory, i.e., commonality instead of completion.
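The degree-theoretic strategy can be made concrete with a small sketch. It assumes pitch classes coded as integers mod 12 (c = 0) and takes the seven triadic degrees of C major as the standard chords; the names `DEGREES`, `minimal_covers`, and `completions` are hypothetical and chosen only for this illustration.

```python
from itertools import combinations

# Triadic degrees of C major as pitch-class sets (c = 0, ..., b = 11).
# This choice of standard chords is an assumption made for the example.
DEGREES = {
    "I": {0, 4, 7}, "II": {2, 5, 9}, "III": {4, 7, 11}, "IV": {5, 9, 0},
    "V": {7, 11, 2}, "VI": {9, 0, 4}, "VII": {11, 2, 5},
}

def minimal_covers(chord, standard=DEGREES):
    """Return all smallest selections of standard chords whose union contains chord."""
    chord = set(chord)
    for size in range(1, len(standard) + 1):
        hits = [names for names in combinations(standard, size)
                if chord <= set().union(*(standard[n] for n in names))]
        if hits:
            return hits
    return []

def completions(chord, standard=DEGREES):
    """Standard chords that contain the chord outright (completion of a partial sign)."""
    return [name for name, pcs in standard.items() if set(chord) <= pcs]

g7 = {7, 11, 2, 5}                      # the seventh chord {g, b, d, f}
covers = minimal_covers(g7)
# Coverings whose union is exactly the chord, with no added pitch classes.
exact = [c for c in covers if set().union(*(DEGREES[n] for n in c)) == g7]
```

Under these assumptions the seventh chord has several minimal coverings by two degrees, but only the pair V, VII covers it without adding pitch classes. The function `completions` shows the other face of the same ambiguity: a fifth {c, g} is contained in both a major and a minor triad on c.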
Evidently, both approaches imply degrees of fuzziness. Completion is fuzzy because it can be achieved in different ways: for example, an interval {c, g} may be completed to a major third degree C = {c, e, g} or else to a minor third degree Cm = {c, e♭, g}. Similarity is fuzzy as well: this same interval may represent the tonic function in C or the subdominant function in G, for example. But this is the very nature of the harmonic approach, which is a genuinely paradigmatic one. The problem of harmonic semantics is not its polyvalence, but the formally clear theory of such polyvalence, because only the precise analysis of polyvalence can qualify or disqualify a harmonic theory. We should stress that the degree-theoretic and function-theoretic approaches do not imply that harmonic functions can only be generated by the second approach. The degree-theoretic
approach may also be used to attribute functional values, but these values will then be calculated from the complete standard chords instead of functions which are calculated from similarity to standard objects.
Remark 10 We have to note another source of fuzziness which is often applied to both strategies: Instead of the given chord, one admits its deformation by a shift $c'$ of $c$ which is induced by an alteration (see section 7.5). As we have already discussed this deformation process, we shall restrict ourselves here to non-altered harmonic semantics. The decisive point here is that one should keep in mind the alteration theorem 1 in section 7.5.1, which states that, in its negative interpretation, any two chords may be shifts of each other if the alteration interval is large enough. In contrast to this not very elegant deformation rhetoric stands the empirical use of alterations in existing musical styles, which sometimes is quite subtle. In jazz harmony, for example, the fifth of a dominant seventh chord can be altered without problems (Dominant7/-5 or Dominant7/+5), but the seventh cannot (an alteration of the minor seventh invariably results in a loss of the dominant function). In a better approach to alterations, one should not develop harmony on the given form $F$, but on a "tangent form $F[\epsilon]$" (corresponding to the tangent module $M[\epsilon]$ in the representable case). Presently, no further theory is known in this direction.
25.2
Degree Theory
Summary. We give three short accounts of computer programs for degree theory.
25.2.1
Chains of Thirds
Summary. Software for analysis by chains of thirds: Mazzola's presto® chord analyzer.
The presto® software¹ analyzes zero-addressed chords in $PiMod_{12}$. The chords in presto® are abstracted from any local compositions in the four-dimensional space $OPLD_{\mathbb{Z}} = Onset \oplus Pitch \oplus Loudness \oplus Duration_{\mathbb{Z}}$ via projection to $PiMod_{12}$. The program also retains the local composition's lowest pitch, see figure 25.1. The user may then select a reference pitch class; by default the class of the retained lowest pitch is chosen. This means that in fact, the chord is not taken as a local composition, but as a global one with two charts: the given chord, and an additional chart, the singleton of the reference pitch, the latter being a representative of the so-called "fundamental" pitch in classical harmony. This is an interpretative activity and cannot be justified without a deeper analysis of the context of the given chord. So a priori, the reference pitch is a variable which has to be integrated within a context-sensitive calculus. The projected local composition $S$ is then confronted with all chains of thirds (see section 13.4.2) which start from the reference pitch and have a minimal cardinality such that they contain $S$. For example, the chord {0, 1, 2} has 23 minimal third chains when calculated from the reference 1. The list of all possible chains of thirds is shown in appendix L.2. Among these third chains
¹ See chapter 49 for details.
[Figure 25.1: screenshot of the presto® "Chord analysis" window, showing the chord class 78.1*, the chord symbols field, the variant number (No 3), the reference pitch control, and an OK button.]
Figure 25.1: The sound-event set of ascending small filled squares to the left is analyzed by the software's chordmaster routine (middle small window entitled "Chord analysis"). This one shows the chord's pitch classes in third chain sequence from below. The black notes are those of the chord, the white ones in between are the added pitches from the embedding third chain. The isomorphism class of this chain (78.1*) is indicated, as well as the standard denotation of the chain in jazz chord symbols. The variant number (No 3 here) is the number of the chain out of the total number of possible chains. The reference pitch is the lowest pitch, it can be changed by the user to any other pitch class (even outside the chord).
the chains with absolute minimal cardinality can be found; they form the set 3Chain(S) defined in 13.4.2. This set is independent of the reference assumptions. Although it does not determine the isomorphism class of $S$ in general, it gives a lot of information for the harmonic semantics of the chord. The absolutely minimal third chains of the chord are not yet function-theoretic constructs, but they can and will be used to build function-theoretic values, see section 25.3.3. The complete list of third chains in appendix L.2 also contains the symbols from the American jazz notation. It is well known that there are many and not really optimal variants of this notation. The presto® implementation of our list has been charged with one variant of the jazz lead-sheet notation that can be justified by the total number of third chains (including 12-chains, unlike traditional (though not the new!) jazz practice, where chords only have a restricted number of notes). Basically the procedure is this. We have chosen one of the four 12-tone chains to build the reference interval step succession. It is reasonable to choose the one which is lexicographically first (most packed to the left) with respect to the "white keys", i.e., to the diatonic scale from C. This maximal chain of thirds has its initial subchains which all
are named with the usual numbers. So the first chain from C is $C = \{0, 4, 7\}$, the second is $C7 = \{0, 4, 7, 10\}$, etc., until the last and total chain
$$C23 = \{0, 4, 7, 10, 2, 5, 8, 11, 3, 6, 9, 1\} \tag{25.3}$$
$$\phantom{C23} = \{C, E, G, B\flat, D, F, A\flat, B, E\flat, G\flat, A, D\flat\}. \tag{25.4}$$
All degree names will be related to this reference chain, e.g., if we have a "13" in a symbol, this relates to A♭ with respect to the C chain, and to the corresponding transposition in the general case. For the triadic chains, we use the traditional four types of major C, minor Cm = {0, 3, 7}, diminished C° = {0, 3, 6}, and augmented C+ = {0, 4, 8} chains. The larger chains are built upon these basic cases, and the added symbols refer to the above degree codification. For example, the symbol Cm7+/11 means that we add the major seventh (pitch class 11) and go up to the 11-chain, including the ninth ("9", adding pitch class 2) and the eleventh ("11", adding pitch class 5). So the deviations from the normal chain are indicated by m, °, + for the triadic initial piece, and then by the alterations x+, x- for the other deviations from the norm. In some cases, the single alterations are not sufficient; for example, the chain nr. 46, {0, 4, 8, 11, 3, 7}, would have the notation C+7+/9+/11++. Alternatively, we can write C+7+/9+/13+/$\widehat{11}$ to indicate that 11 has been omitted and instead, 13+ was added. This hat notation (well known to mathematicians) can also be used to describe precisely the pitch classes of the given chord: we add a hat for every omitted ingredient, and we write this information after the standard symbol. For example, the fifth {C, G} would read as C/$\hat{3}$ or Cm/$\hat{3}$. The set of third chains 3Chain(S) of a non-empty chord $S$ evidently is a simplex in the nerve of the third chain interpretation of $PiMod_{12}$. We shall see in section 25.3.2 that harmonic functions starting from degree theory can be viewed as fuzzy truth functions on this nerve, i.e., functions defined on each vertex and then extended in the affine way to the geometric realization of the nerve. This type of function was already encountered in the metric and melodic analysis under the label of "weight functions".
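The degree codification can be sketched in a few lines. The encoding below is a hypothetical one chosen for illustration (it is not the presto® implementation): a chain is described by its triad type, the highest degree number reached, and a table of alterations in semitones, with "o" standing for the diminished sign.

```python
# Reference chain (25.3): the odd degree numbers 1, 3, ..., 23 name its twelve pitch classes.
REFERENCE_CHAIN = [0, 4, 7, 10, 2, 5, 8, 11, 3, 6, 9, 1]

# The four triadic initial pieces: major, minor, diminished ("o"), augmented.
TRIADS = {"": [0, 4, 7], "m": [0, 3, 7], "o": [0, 3, 6], "+": [0, 4, 8]}

def degree_pc(degree, alteration=0, root=0):
    """Pitch class named by an odd degree number, shifted by `alteration` semitones."""
    return (root + REFERENCE_CHAIN[(degree - 1) // 2] + alteration) % 12

def chain(triad="", top=5, alterations=None, root=0):
    """Third chain up to degree `top`; `alterations` maps a degree to a semitone shift."""
    alterations = alterations or {}
    pcs = [(root + p) % 12 for p in TRIADS[triad]]
    for degree in range(7, top + 1, 2):
        pcs.append(degree_pc(degree, alterations.get(degree, 0), root))
    return pcs

cm7_plus_11 = chain("m", 11, {7: 1})               # Cm7+/11
chain_46 = chain("+", 11, {7: 1, 9: 1, 11: 2})     # C+7+/9+/11++
```

With this encoding, `chain("", 23)` reproduces the full reference chain, and the two symbols discussed in the text evaluate to the pitch-class lists given there. Parsing of the lead-sheet strings themselves, and the hat notation for omitted degrees, are left out of the sketch.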
So the topological flavor of the degree theory reduces to the combinatorial nerve of the standard chords (the third chains in this case). The question of the function values of standard chords with respect to specific tonalities will be discussed in section 25.3.2.
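The search for minimal third chains from a reference pitch, as described at the beginning of this section, can be sketched as a breadth-first enumeration. The sketch assumes that a chain of thirds is a sequence of pairwise distinct pitch classes whose consecutive steps are minor or major thirds (3 or 4 semitones); the function name is hypothetical, and the actual presto® routine may differ.

```python
def minimal_third_chains(chord, reference):
    """All shortest chains of thirds starting at `reference` and containing `chord`.

    A chain is a tuple of distinct pitch classes mod 12 whose consecutive
    steps are 3 or 4 semitones (assumed definition, see lead-in).
    """
    chord = {p % 12 for p in chord}
    frontier = [(reference % 12,)]
    while frontier:
        hits = [c for c in frontier if chord <= set(c)]
        if hits:
            return hits
        # Extend every chain by a minor or major third, avoiding repeated classes.
        frontier = [c + (nxt,)
                    for c in frontier
                    for nxt in ((c[-1] + 3) % 12, (c[-1] + 4) % 12)
                    if nxt not in c]
    return []

chains = minimal_third_chains({0, 1, 2}, reference=1)
```

Under this assumed definition, the chord {0, 1, 2} with reference 1 yields 23 minimal chains of eight pitch classes each, in agreement with the count quoted in the text.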
25.2.2
American Jazz Theory
Summary. John Amuedo's MAX programs CHORD-CLASSIFIER, SCALE-FINDER, and SCALE-MONITOR for analysis of AST-based jazz harmonies.
In his music theory thesis [15], Amuedo has programmed a number of LISP routines for AST and jazz-theoretic degree classification of chords. We have already mentioned the AST part of this work in section 11.5.2.3 and section 16.3. This part of the software is used in the three harmonic analysis routines. The 76 standard ("fundamental" in Amuedo's wording) chords as well as the 18 standard scales of this approach are retrieved from relevant American jazz literature². Like presto®'s chord analyzer, the present chord analysis works on pitch classes (pcs) in $PiMod_{12}$,
² Above all, it is [67]; for further literature, see [15, Bibliography: Jazz harmony].
however, the class of the lowest pitch in the original integer pitch space is retained to specify the chord and scale names. The chord names or symbols are derived from seven groups of "foundation chords". Their list is given in the table below (see the list in section 11.5.2.3 for the definition of the different "norm" symbols). This list is complemented by Amuedo's "Shortest List" of 18 "essential jazz scales" shown below.
The CHORD-CLASSIFIER routine takes the input chord with integer coefficients (also, like presto®, not necessarily as a 'vertical' chord), and retains the pcs and the lowest note's pc. The output is a list of seven entries. Apart from the general AST-classification routine SET-SLAVE, this information is deduced from the standard chord list and from the essential scale list. The last four entries are just different labelings in terms of AST norms, such as DNR, DNF, DPF, INF; we refrain from this accounting overhead. The first entry, "RootPos.", exhibits the lowest pitch class, together with a standard chord from the above list, possibly extended to higher third-chain notes. If this task fails, MAX's LISP routine inserts the chord's DNR in this position. The second entry, "Inversion:", exhibits other possible chords from the standard list which could describe the given chord by use of another fundamental note when applying inversions. The third entry, "PC-Aliases:", lists all possible chord names irrespective of the given DNF.
The SCALE-FINDER routine finds scales which are "compatible" with a given chord in the sense of pc-subset relations. This routine outputs all scale types, together with possible transpositions such that they contain the chord's pcs.
The third routine, SCALE-MONITOR, is more pedagogical and just for exercises in selected scales.
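The pc-subset test behind a scale finder of this kind can be sketched as follows. This is not Amuedo's LISP code: the function name `find_scales` is hypothetical, and only three of the 18 essential scales are entered, with the representatives given in the scale table.

```python
# Three scales from the "Shortest List", as pitch-class sets on C (c = 0).
SCALES = {
    "Major": {0, 2, 4, 5, 7, 9, 11},       # C,D,E,F,G,A,B
    "Penta_Major": {0, 2, 4, 7, 9},        # C,D,E,G,A
    "Whole_Tone": {0, 2, 4, 6, 8, 10},     # C,D,E,Gb,Ab,Bb
}

def find_scales(chord, scales=SCALES):
    """Map each compatible scale type to the transpositions that contain the chord's pcs."""
    chord = {p % 12 for p in chord}        # integer pitches are reduced to pitch classes
    result = {}
    for name, pcs in scales.items():
        roots = [t for t in range(12) if chord <= {(p + t) % 12 for p in pcs}]
        if roots:
            result[name] = roots
    return result

compatible = find_scales({60, 64, 67})     # the C major triad given as integer pitches
```

For the C major triad, the sketch reports the major scales on C, F, and G and the major pentatonic scale on C, while no whole-tone scale is compatible.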
On this basis, a harmonic analysis is reduced to a set of degree symbols (lead-sheet notation) for chords (together with fundamental notes) which are grouped according to their typical initial triad, as used in presto® and other degree notation. The enrichment with respect to presto®'s notation consists in a more function-theoretic flavor of the names, in the case of "dominant" chord types, for example. This information is, however, also present in presto®'s fundamental note selection option, such that the distinction between Am7 and C6 is managed by the distinction between the fundamental A and C in the two cases, respectively, and also with the consequence that the minimal third chains in these cases are not the same. In both cases, we end up with a determined list of standard chords (including standard scales in the shape of "large chords") which describe the given chord and reduce its study to the harmonic semantics of these empirically or systematically retrieved standard objects.
Lead-sheet symbol | DNR   | ZNF     | Representative
Major Type (Ma)
Ma                | 317.0 | 0,4,7   | C,E,G
Ma6               | 437.3 | 0,3,5,8 | C,E,G,A
Ma7               | 416.1 | 0,1,5,8 | C,E,G,B
Dominant Type (Do)
Do                | 439.3 | 0,3,6,8 | C,E,G,Bb
Dob5              | 430.3 | 0,2,6,8 | C,E,Gb,Bb
Do5#              | 425.2 | 0,2,4,8 | C,E,Ab,Bb
Augmented Type (Au)
Au                | 318.0 | 0,4,8   | C,E,Ab
Au7               | 434.2 | 0,3,4,8 | C,E,Ab,B
Minor Type (mi)
mi                | 314.0 | 0,3,7   | C,Eb,G
mi6               | 428.2 | 0,2,5,8 | C,Eb,G,A
mi7               | 437.2 | 0,3,5,8 | C,Eb,G,Bb
miM7              | 413.1 | 0,1,4,8 | C,Eb,G,B
Diminished Type (di)
di                | 313.0 | 0,3,6   | C,Eb,Gb
di6               | 440.0 | 0,3,6,9 | C,Eb,Gb,A
di7               | 428.1 | 0,2,5,8 | C,Eb,Gb,Bb
diM7              | 412.1 | 0,1,4,7 | C,Eb,Gb,B
Major Flat Five Type (Mb5)
Mb5               | 316.0 | 0,4,6   | C,E,Gb
Mb5M7             | 415.1 | 0,1,5,7 | C,E,Gb,B
Sustained Four Type (Su)
Su2               | 308.0 | 0,2,5   | C,D,F
Su5               | 310.2 | 0,2,7   | C,F,G
Su7               | 310.1 | 0,2,7   | C,F,Bb
Added to the common foundation chords by Amuedo
Sustained Root Type (R)
R25               | 310.0 | 0,2,7   | C,D,G
R27               | 307.1 | 0,2,4   | C,D,Bb
Stacked Perfect-Fourth/Perfect-Fifth (P4, P5)
P4*3              | 427.1 | 0,2,5,7 | C,F,Bb,Eb
P5*3              | 427.2 | 0,2,5,7 | C,G,D,A
Name            | DNF  | Representative
Alt_Dominant+5  | 834  | C,Db,Eb,E,F#,G,Ab,Bb
Augmented       | 646  | C,Eb,E,G,Ab,B
Bebop_Major     | 832  | C,D,E,F,G,G#,A,B
Bebop_Seventh   | 817  | C,D,E,F,G,A,Bb,B
Blues_(6-note)  | 656  | C,Eb,F,Gb,G,Bb
Blues_(7-note)  | 708  | C,Eb,E,F,Gb,G,Bb
Diminished      | 835  | C,D,Eb,F,Gb,Ab,A,B
Harmonic_Minor  | 738  | C,D,Eb,F,G,Ab,B
Hung/Gypsy/Byz  | 729  | C,D,Eb,Gb,G,Ab,B
Locrian+n9      | 815  | C,Db,D,Eb,F,Gb,Ab,Bb
Major           | 743  | C,D,E,F,G,A,B
Major_b6        | 742  | C,D,E,F,G,Ab,B
Melodic_Minor   | 739  | C,D,Eb,F,G,A,B
Penta_Dom       | 549  | C,D,E,G,Bb
Penta_Lydian    | 550  | C,E,F#,Ab,B
Penta_Major     | 5501 | C,D,E,G,A
Penta_Minor     | 555  | C,D,Eb,G,A
Whole_Tone      | 674  | C,D,E,Gb,Ab,Bb

25.2.3
Hans Straub: General Degrees in General Scales
Summary. Hans Straub has initiated a general harmony, including modulation theory, as a generalization of Mazzola's original approach [327]. Here we just give a summary of Hans Straub's approach to scales and degrees.
In the author's original approach to harmony in [327], the diatonic scale $X$ was covered by triadic degree chords to yield the triadic interpretation $X^{(3)}$ discussed in section 13.4.2. The corresponding cadence and modulation theory was subsequently generalized by Daniel Muzzulini [390], and this one still more generalized by Straub [514]. In Straub's general approach, a scale is not given as a starting structure, but as a result, in fact the union, of a set $S$ of "degree" zero-addressed chords $s \subset 0@PiMod_{12}$. No further requirement upon similarity or structure of the chosen chords is imposed. For example, the "Hora Cero" interpretation consists of the set of seven three-element chords
$$S_{Hora\,Cero} = \{C, Dm, Em, F, G, Am, B\flat\}$$
with a nerve that looks like a belt which has been pinched to a point (the G vertex) in a position. Another interpretation is called "Dorico Flamenco" because of its Flamenco origin,
its seven chord atlas is
$$S_{Dorico\,Flamenco} = \{Dm, C, B^\circ, Am, G, F, E\}$$
both generating scales of type 17.1 in the chord classification appendix L.1. The spirit in Straub's approach is not so much the corresponding generality of cadence and modulation theory (which will be taken up in the following chapters), but the insight that much of existing music traditions, in particular popular music or American jazz, as we have seen in section 25.2.2, is defined by a grown chord vocabulary that is not subjected to a scale. On the contrary, it generates 'scales' that issue from the chord vocabulary, however not scales that will play a melodic role, only harmonically driven background material. This approach is however a purely degree-theoretic one, even in a very radical vein: No approach to theoretical background structures behind the allowed chords is envisaged. You are just allowed to play a number of chords per "tonality" and to use cadences and modulations according to determined rules, but no further function theory is addressed. The question of harmonic semantics is indeed not explicit here. Although the jazz idiom has a strong affinity to European function semantics, it is not true that this and other ethnological/cultural contexts are or can be cast in this kind of abstraction. Some idiosyncratic usages of chord vocabularies happen to look like absolute pitch perception: No comparison between different pitches leads to identification, each pitch is an individual. In harmony, this would mean that some usages of chords relate to a kind of "absolute harmonic position": every used chord is irreducible, in fact a semantic atom of harmony.
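The nerve of such a chord atlas is easy to compute. The following sketch does so for the "Hora Cero" interpretation, under the assumption that its seventh chord is the B♭ major triad (the printed symbol is hard to read in some copies); the function name `nerve` is hypothetical.

```python
from itertools import combinations

# "Hora Cero" atlas as pitch-class sets (c = 0); the last chart is assumed to be B-flat major.
HORA_CERO = {
    "C": {0, 4, 7}, "Dm": {2, 5, 9}, "Em": {4, 7, 11}, "F": {5, 9, 0},
    "G": {7, 11, 2}, "Am": {9, 0, 4}, "Bb": {10, 2, 5},
}

def nerve(cover, dim):
    """Simplices of dimension `dim`: sets of dim+1 charts sharing at least one pitch class."""
    return [names for names in combinations(cover, dim + 1)
            if set.intersection(*(cover[n] for n in names))]

scale = sorted(set().union(*HORA_CERO.values()))   # the generated 'scale'
triangles = nerve(HORA_CERO, 2)
at_g = [t for t in triangles if "G" in t]          # 2-simplices meeting the G vertex
```

Under this assumption the nerve has six triangles forming a band, and the two triangles through the G vertex have no other vertex in common, which matches the description of a belt pinched at G. The generated scale has eight pitch classes.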
25.3
Function Theory
Summary. We discuss the topological similarity approach to harmonic functions, as it has been prepared in the previous chapter 24.
In this section, we consider two addresses, $A$, $B$, a retraction $a : A \to B$, a form $F$, and a $B$-absorbing point $z : A \to F$. In traditional harmony, for example in Schönberg's treatise [478], the harmonic band between two chords is the set of common notes. We have seen in section 13.4.2 that the Möbius strip shaped nerve of the triadic interpretation of a diatonic scale represents the information about harmonic bands among certain sets of triadic degrees. In that naive setup, the harmonic relation was given by common notes. Following the empirical insight 11.3.6, the deeper semantics of a (zero-addressed) chord may be recognized from its symmetries [400], or, to put it in the wording of harmonic topology, from the self-addressed objective closure (24.2.3). Putting this together with our empirical insight, one may consider the harmonic bands between the objective closures of chords, instead of the original, usually zero-addressed objects. This motivates Noll's definition of allomorphic chords for a given semigroup of symmetries ([400]). Here is our rephrasing of that definition (Noll does not refer to topological structures) in the general terms of harmonic topologies.
Definition 85 Let $m$ be a monoid in $InTop(F)$. Then the $B$-allomorphic extension of $m$ is the open set
$$AllExt_B(m) = \bigcup_{n \prec m} Ext_B(n)$$
of all points in $ExTop_B(F)$ which contain some non-trivial submonoid $n$ of $m$, i.e., some non-closed monoid $n$ which is topologically dominated by $m$. The elements of $AllExt_B(m)$ are called the $B$-allomorphs of $m$.
Evidently, $Ext_B(m) \subset AllExt_B(m)$. In order to make the common endomorphisms precise, we have locally closed³ subsets
$$AllExt_B(n, m) = Ext_B(n) - \bigcup_{n \prec l \prec m} Ext_B(l)$$
for all non-trivial dominance relations $n \prec l \prec m$ for monoids $l$ strictly larger than $n$. This gives a partition
$$AllExt_B(m) = \coprod_{n \prec m} AllExt_B(n, m) \tag{25.5}$$
of an allomorphic extension into locally closed subsets. Such a subset $AllExt_B(n, m)$ is called the strict $B$-extension of $n$ in $m$. Except the open part $AllExt_B(m, m) = Ext_B(m)$, all strict extensions have empty interiors. Observe that in general, the intersection of two allomorphic extensions is not an allomorphic extension, so these open sets do not define a basis of a topology.
Example 45 We take the classical situation $B = 0$, $A = \mathbb{Z}_{12}$, $F = @A$, however with the usual identification of elements of $F$ as multiples of fifths. Then Noll's "second dominant morpheme in C major" is $D^* = \langle \alpha^* = e^{8}.3, \beta = e^{9}.8 \rangle_e$, a bigeneric monoid.
Allomorphic extensions are a means for defining harmonic functions via extensions; this is the idea in [400, 1.3.1, Definition 4]. Given specific monoids $M_{m,n}$ which "signify" certain harmonic functions (see below for details), the representation of these functions extends to the (zero-addressed) chords in $AllExt_0(M_{m,n})$. This approach reveals the thoroughly intuitionistic logic of topological spaces [314, 1.7/8]. In fact, by the irreducibility of $ExTop_B(F)$, extensions, and a fortiori allomorphic extensions, are never disjoint. So there is little chance to have a unique functional representation for a given chord. Even on the level of strict extensions, we may have non-empty intersections $AllExt_B(n, m) \cap AllExt_B(n', m')$ for different pairs $(n, m)$, $(n', m')$. We have this lemma on empty strict extensions:
Lemma 40 With the above notation, if the defining monoid $m$ is saturated, i.e., equal to its saturation $\bar{m}$, and its non-trivial submonoid $n$ is not, then $AllExt_B(n, m) = \emptyset$.
Proof. From the construction of the saturation sheaf it is clear that a monoid inclusion $n \subset m$ implies $\bar{n} \subset \bar{m}$. Therefore, if $m = \bar{m}$ and $n$ is not saturated, $x \in Ext_B(n)$ implies $x \in Ext_B(\bar{n})$ and therefore $AllExt_B(n, m) = \emptyset$.
³ Intersections of open and closed sets, see appendix H.1.4, definition 168.
Remark 11 In general, it is difficult to give these functional specifications a quantitative weight, in terms of percentages, say. The only evident measure of the role of a chord $X$ in $AllExt_B(m)$ is the position of its strict extension, i.e., of the unique non-trivial submonoid $n \subset m$ such that $X \in AllExt_B(n, m)$, in the lattice (the closure of $m$) of submonoids. If the entire descent from monoid to submonoids is finite, one may for example take the relative depth of descent from the top to define a numerical weight of a submonoid.
We still do not know very much about the relations between the saturation sheaf and the topological properties of the extension topology. But here are some exercises for straightforward connections:
Exercise 59 With the above notation, prove these statements:
1. If $U \subset ExTop_B(F)$ is open, then the basic extension of the monoid of sections $Sat_B(U)$ satisfies $U \subset Ext_B(Sat_B(U))$.
2. If $n \prec m$ for two monoids, then $AllExt_B(n) \subset AllExt_B(m)$.
3. We have
$$Sat_B(AllExt_B(m)) = \bigcap_{n \prec m,\ n \text{ saturated and non-trivial}} n.$$
If $m$ verifies the descending chain condition⁴, then
$$Sat_B(AllExt_B(m)) = \bigcap_{n \prec m,\ n \text{ minimal among saturated and non-trivial}} n.$$
25.3.1
Canonical Morphemes for European Harmony
Summary. We discuss the construction principle for bigeneric morphemes, the crucial objects for Noll's reconstruction of traditional European harmony.
We have so far prepared the semantic field via extension and allomorphic extension concepts. So we are ready to look for monoids which may recover the harmonic semantics known from classical function theory. As above in example 45, we are now working in the situation $B = 0$, $A = \mathbb{Z}_{12}$, $F = @A$, the case studied in [400], with the elements of $F$ being interpreted as multiples of fifths. Following the suggestion from section 24.1.2 concerning the structure of certain common chords, in particular circle chords, the construction of special monoids (or their objective trace under an absorbing point, $z = Id_{\mathbb{Z}_{12}}$ in the present case) has been set up following a bigeneric approach. In fact, the circle chords are the zero-constant traces of self-addressed chords $monex_z(\langle f, e^{c}.0 \rangle_e)$ associated with a bigeneric monoid, whereas the example $V^7$ is the zero-objective trace of $monex_z(\langle e^{8}.3, e^{2}.0, e^{7}.0 \rangle_e)$, which is associated with a trigeneric monoid. The bigeneric approach then takes two generators $\alpha_d = e^{d}.3$, $\beta_a = e^{a}.8$ (d for "diminished", a for "augmented", as will be justified below) and considers the "morpheme" monoids⁵
$$M_{d,a} = \langle \alpha_d, \beta_a \rangle_e \tag{25.6}$$
⁴ Every descending chain of submonoids is stationary, for example for a finite endomorphism monoid End(F).
⁵ In [400], only the generated semigroups are considered, but the results are the same.
generated by $\alpha_d$, $\beta_a$ for any coefficients $a, d \in \mathbb{Z}_{12}$. The experimental evidence of the crucial role of these monoids is given by the following proposition. It uses the affine Lie bracket $[xy] = x.y - y.x$, which is defined on any monoid of endomorphisms $X@X$ of a module $X$. Further, we need to encode the translation classes of some zero-addressed chords in $\mathbb{Z}_{12}$. We need the class numbers 2, 5, 6, 7, 10, and 14; see appendix L.1 for the class numbers. Class 2 contains two translation classes, 2.0 (= 2 in the class table) and 2.1. Classes 5, 6, 7 are translation classes. Class 10 splits into four translation classes, two for each subclass 10, 10.1 of the table. We denote them again by 10.1, 10.2, 10.3, 10.4 according to the lexicographic order. Class 14 splits into two translation classes 14.1, 14.2.
Proposition 46 [400, III.3.3, Lemma 2] Given two coefficients $a, d \in \mathbb{Z}_{12}$, the translation class of the zero-constant part $|M_{d,a}|_0$ of the morpheme monoid $M_{d,a}$ is a function of the Lie bracket $[\alpha_d \beta_a] = e^{5d+2a}$, and we have these translation classes:

Translation class of constant part of M_{d,a} | Lie invariant 5d + 2a
2.0                                           | 0
5                                             | 3 or 9
6                                             | 4 or 8
7                                             | 6
10.1                                          | 1
10.2                                          | 11
10.3                                          | 5
10.4                                          | 7
14.1                                          | 10
14.2                                          | 2
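The dependence of the constant part on the Lie invariant alone can be checked by brute force. The sketch below is an illustration, not the method of [400]: it represents an affine endomorphism $e^{t}.s$ of $\mathbb{Z}_{12}$ as a pair (t, s), generates the monoid spanned by $\alpha_d = e^{d}.3$ and $\beta_a = e^{a}.8$, and collects its constant maps. All function names are hypothetical, and translation classes are identified by a smallest translate rather than by the class numbers of appendix L.1.

```python
N = 12  # Z_12, elements read as multiples of fifths

def compose(f, g):
    """Affine maps are pairs (t, s) meaning x -> t + s*x (written e^t.s); returns f after g."""
    return ((f[0] + f[1] * g[0]) % N, (f[1] * g[1]) % N)

def monoid(generators):
    """All finite products of the generators, including the identity (0, 1)."""
    elements, frontier = {(0, 1)}, [(0, 1)]
    while frontier:
        f = frontier.pop()
        for g in generators:
            h = compose(g, f)
            if h not in elements:
                elements.add(h)
                frontier.append(h)
    return elements

def constant_part(d, a):
    """Zero-constant part of M_{d,a}: the shifts t of its constant maps e^t.0."""
    return frozenset(t for t, s in monoid([(d, 3), (a, 8)]) if s == 0)

def normal_form(chord):
    """Representative of a translation class: the smallest sorted translate containing 0."""
    return min(tuple(sorted((p - t) % N for p in chord)) for t in chord)

# Group the constant parts of all 144 morphemes by their Lie invariant 5d + 2a.
classes = {}
for d in range(N):
    for a in range(N):
        key = (5 * d + 2 * a) % N
        classes.setdefault(key, set()).add(normal_form(constant_part(d, a)))
```

Each invariant is found to determine a single translation class, and the twelve invariants give ten distinct classes, with 3 and 9 as well as 4 and 8 coinciding. In fifth coordinates (c = 0, g = 1, e = 4), invariant 1 gives the major triad {0, 1, 4}.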
The point of this proposition is that these classes are very common. In fact, the Lie invariant $5d + 2a$ yields these chord types for the constant parts of the above morphemes:
1. $5d + 2a = 1$ gives a major chord such as {c, e, g},
2. $5d + 2a = -1$ gives a minor chord such as {c, e♭, g},
3. $5d + 2a = -2$ gives a (minor) seventh chord without fifth, such as {g, b, f},
4. $5d + 2a = 2$ gives a (minor) sixth chord without tonic, such as {e♭, g, a}.
This means that we get very classical chords for the Lie invariants ±1, ±2. These Lie invariants are taken to define the functional semantics of tonic, dominant, and subdominant values in [400]. Before we step over to the tonal function calculus in 25.3.2, we should understand the general principle which governs the construction of bigeneric morphemes. In [400], the situation
was uniquely $F = @\mathbb{Z}_{12}$, and no systematic account was given for the "bi" in the bigeneric approach. It was also not clear what the relation between the two generators should be. This is what we want to complete now. To this end, recall from appendix C.2.3, definition 125, the idempotent component $\sqrt{x}$ of an idempotent element $x \in End(F)$ of the endomorphism monoid of a form $F$. Recall also from appendix C.2.3, proposition 66, that for a representable form $F = @Q$, the idempotent components are all conjugate under translations to idempotent components of linear idempotents $x_{U,V}$ which correspond to direct decompositions $Q = U \oplus V$. In $\mathbb{Z}_{12}$, we have the four direct decompositions
$$\mathbb{Z}_{12} = \mathbb{Z}_{12} \oplus (0), \quad \mathbb{Z}_{12} = (0) \oplus \mathbb{Z}_{12},$$
$$\mathbb{Z}_{12} = 3.\mathbb{Z}_{12} \oplus 4.\mathbb{Z}_{12} \cong \mathbb{Z}_4 \oplus \mathbb{Z}_3, \quad \mathbb{Z}_{12} = 4.\mathbb{Z}_{12} \oplus 3.\mathbb{Z}_{12}$$
which are pairwise coupled by the exchange of factors, corresponding to the exchange $x \mapsto 1 - x$ of idempotents. The second group of idempotents is the Sylow decomposition of $\mathbb{Z}_{12}$, the only non-trivial direct decomposition. The above selection of generators $\alpha_d$, $\beta_a$ corresponds exactly to the Sylow decomposition; we take $\alpha_d \in \sqrt{e^{x}.pr_{\mathbb{Z}_4}}$, $\beta_a \in \sqrt{e^{y}.pr_{\mathbb{Z}_3}}$. More precisely,
$$\alpha_d = e^{d}.\pi_{\mathbb{Z}_4},\ \pi_{\mathbb{Z}_4} = -pr_{\mathbb{Z}_4} = (3), \quad \text{and} \quad \beta_a = e^{a}.\pi_{\mathbb{Z}_3},\ \pi_{\mathbb{Z}_3} = -pr_{\mathbb{Z}_3} = (8),$$
twice the negative projections as linear factors, and any shift coefficients $a$, $d$, which in fact can be arbitrary because of the choice of the linear part!
So the recipe is this: Take any non-trivial direct decomposition of the underlying module $Q = U \oplus V$ and the associated idempotents $p_U$, $p_V$, take two elements $\alpha \in \sqrt{e^{x}.p_U}$, $\beta \in \sqrt{e^{y}.p_V}$ in idempotent components which are conjugate to the components $\sqrt{p_U}$, $\sqrt{p_V}$, and build the bigeneric monoid $\langle \alpha, \beta \rangle_e$. So the bigeneric approach corresponds to the direct decomposition into two factors; the relation among these generators is the complementary role of the direct factors in the idempotent relation $x \mapsto 1 - x$.
Let us apply this generalization to the just tuning space $EulerZModule = \mathbb{Z}.q \oplus \mathbb{Z}.t$. This rank-two $\mathbb{Z}$-module has infinitely many direct decompositions into two rank-one factors, for example the above one, defining the projections $p_1$, $p_2 = 1 - p_1$ onto the factors $\mathbb{Z}.q$, $\mathbb{Z}.t$, respectively, or the decomposition $\mathbb{Z}.q \oplus \mathbb{Z}.K_t$ via the syntonic comma $K_t$. Let us look at two canonical generators for the idempotents $p_1$, $p_2$. Set $\pi_1 = -p_1$, $\pi_2 = -p_2$, and then
This entails and a~,e
= aa,e, f3~,b = f3.."b.
a~,e =
em .P1, f3~,b = em ·P2,
Then we have this zero-constant chord:
e
which yields the major triad $I_C = \{\binom{0}{0}, \binom{1}{0}, \binom{0}{1}\}$ for $\eta = \xi = 0$, a = b = 1. So the direct decomposition approach not only yields standard chords in 12-tempered space, but also in just tuning. No further developments have been investigated to date in the just tuning space. So if ever the form is representable, F = @Q for a module Q, if $Q = U \oplus V$ is a direct decomposition with corresponding projections $p_U, p_V$, and if $x, y \in Q$, we have the generators of a bigeneric monoid $\alpha_x^U = e^x.(-p_U)$, $\beta_y^V = e^y.(-p_V)$ which live in the idempotent components $\sqrt{e^{x - p_U(x)}.p_U}$, $\sqrt{e^{y - p_V(y)}.p_V}$, respectively. If $s = e^t.s_0 \in GL(Q)$ is any invertible endomorphism of Q, then the conjugate generators are
$$s.\alpha_x^U.s^{-1} = \alpha_{s_0(x)}^{s(U)}, \qquad s.\beta_y^V.s^{-1} = \beta_{s_0(y)}^{s(V)},$$
and we have the conjugate monoid
$$s.\langle \alpha_x^U, \beta_y^V \rangle_e.s^{-1} = \langle \alpha_{s_0(x)}^{s(U)}, \beta_{s_0(y)}^{s(V)} \rangle_e$$
whose zero-constant part is
This means:
Proposition 47 The transformations via s E GL(Q) of all zero-constant chords in @Q derived from bigeneric morphemes are the zero-constant chords derived from s-conjugate generators. In particular, all translates of zero-constant chords derived from bigeneric morphemes are again of this type. In particular, the full translation classes of common zero-addressed chords in @Z12 derived from bigeneric morphemes are derived in this way, this is the situation encountered in [400]. Generalizations of this approach are obvious: One may step over to direct decompositions into three, four, etc., factors and corresponding multigeneric monoids, one may also take into account more general bigeneric chords with not so common zero-constant parts, and, last, but not least, one may reconsider the whole theory for more general addresses B, not just the zero address, and for more general forms (representable or not).
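The direct decompositions of Z12 that underlie the bigeneric recipe can be checked by brute force. The following is a minimal sketch, not part of the original text: it assumes only that a direct decomposition of Z12 corresponds to a linear idempotent p (with complement 1 - p), and the function names `idempotents`, `image` and `decompositions` are hypothetical helpers introduced for illustration.

```python
N = 12  # modulus of the pitch-class module Z12


def idempotents(n):
    """Return all x in Z_n with x*x == x (mod n)."""
    return [x for x in range(n) if (x * x) % n == x]


def image(x, n):
    """Image of the linear endomorphism z -> x*z of Z_n, as a sorted list."""
    return sorted({(x * z) % n for z in range(n)})


def decompositions(n):
    """Pair each idempotent p with its complement 1 - p.

    Returns tuples (p, 1 - p, image of p, image of 1 - p).
    """
    result = []
    for p in idempotents(n):
        q = (1 - p) % n
        result.append((p, q, image(p, n), image(q, n)))
    return result


if __name__ == "__main__":
    for p, q, U, V in decompositions(N):
        print(f"p = {p}, 1 - p = {q}, U = {U}, V = {V}")
    # Linear parts of the generators: the negative projections mod 12.
    print("pi_Z4 =", (-9) % N, " pi_Z3 =", (-4) % N)
```

Under these assumptions the only idempotents are 0, 1, 4 and 9; the pair (9, 4) gives the Sylow decomposition, and the negatives of these two projections are the linear factors 3 and 8 quoted in the text.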
Remark 12 At present, we do not know how far the canonical transitive GL(2, Z)-action on the set of direct, non-trivial decompositions $\mathbb{Z}^2 = U \oplus V$ could be connected to the idea of the abelian group SL(2, Z)/(SL(2, Z), SL(2, Z)) being identified with the set Z12 of pitch classes (see section 24.1.1.1).
25.3.2
Riemann Matrices
Summary. The Riemann program suggests a special matrix associated with every possible chord. The coefficients of this Riemann matrix are accounts of the role of the chord within the coordinates of tonalities, modes, and function values. We give the formal definition of the Riemann matrix and its construction principles.
We are now ready for considerations of tonal functions built upon the preceding mechanisms from degree and function theory6. The essence is this:
Principle 23 Any function theory should aim at defining tonal functions for any existing chord7, i.e., relations between instances of the concept of a tonality and "semantic pointers", the harmonic function values, which provide the syntactic structure with means for understanding harmonic coherence.

This statement is not meant as a cryptic sentence to cheat humanities, rather do we delimit our subject from levels of understanding which are not the scope of exact science, viz. textual predication in the sense of section 18.2. In this vein, functional values are just names that are attributed to chords by means of precise evaluation modalities. So the only intrinsic meaning we can offer is extensionality, the determination of fibers lying over specific functional value symbols. The names are not really relevant, except that they may evoke some paratextual mechanism. Our approach yields textual predicates which give every chord a truth value in some truth module, and such that this evaluation is built upon the structure theory thus far developed. The collection of tonal functions is defined by three domains: a name set TON of "tonalities", a name set VAL of "tonal function values", and a form space TRUTH(I) of "truth denotators" in a specific truth module (or, more generally, a truth form) I and at a specific address B for a form F. Formally speaking (see also formula (18.7)), a tonal function $TF_{f,t}$, $(f, t) \in VAL \times TON$, is a parametrized set of objective textual predicates which are encoded by maps
on the supporting domain $ExTop_B(F)$. For such a collection of predicates, the Riemann matrix is the matrix $TF = (TF_{f,t})_{(f,t) \in VAL \times TON}$ of predicates. Given such a matrix, for any chord $a \in ExTop_B(F)$, its Riemann matrix is by definition the matrix8 of all values, i.e.,
TF(a)
= (TFf,t(a))U,t)EVALXTON
E MVAL,TON('ll'f),
(25.7)
and therefore, we have the Riemann matrix map
Here, a tonality symbol $t \in TON$ is codified by its value vector $TF_t(a) = (TF_{f,t}(a))_{f \in VAL}$ on all chords $a \in ExTop_B(F)$. This is a fuzzy evaluation of tonal roles of chords with respect to the function symbols in VAL.
6 Recall that we are still given two addresses A, B, a form F, and an absorbing retraction a : A → B.
7 See also Dahlhaus' paper [100] on Riemann theory.
8 The set of all matrices with rows in a set m, columns in a set n, and coefficients in a space X, is denoted by $M_{m,n}(X)$.
25.3.3
Chains of Thirds
Summary. This section gives an elementary construction of a Riemann matrix, based on chains of thirds and minimal embeddings of chords in such chains.
This setup works in the zero-address and form F with Fun(F) = @Z12. The tonality set is the set of "tonics" with just one tonic per pitch class
TON = {C, D♭, D, E♭, E, F, G♭, G, A♭, A, B♭, B}.
The value set is the classical six-element set for tonic, dominant, subdominant, each in major and minor mode:
VAL = {T, D, S, t, d, s},
with the minor problem of symbol confusion (D is doubly coded, but this is the tradition). The truth module is the already discussed fuzzy group $I = S^1 = \mathbb{R}/\mathbb{Z}$ (see example 36), and the values will be just the denotators associated with the half open interval subsets $[0, \varphi[$ of $S^1$. From section 25.2.1 we learned that this approach deals with the third chain simplex 3Chain(c) of a chord c. Given a tonality symbol f in our context, and a third chain or singleton a, we shall define the function values $TF_{f,t}(a)$ and then extend this evaluation to any chord c by the formula
$$TF_{f,t}(c) = at\Big(\frac{1}{card(3Chain(c))} \sum_{a \in 3Chain(c)} at^{-1}(TF_{f,t}(a))\Big) \qquad (25.8)$$
which includes the case of third chains and singletons as zero simplexes of the third chain interpretation. Here, the auxiliary function
$at : [0, \infty[ \to [0, 1[$ is supposed to be a strictly increasing bijection9, for example $at(x) = \frac{2}{\pi}\arctan(x)$. The reader may generalize this formula by more sophisticated contributions of the simplex vertexes; we just show the principle here. The principle is that a chord's tonal function is some kind of average of the tonal function taken over all minimal third chains, i.e., the "standard" chords covering the (possibly exotic) given chord. We are left with the definition of the tonal function for singletons and non-trivial third chains. Again, this is a prototypical formalism, to be adapted by the reader for specific needs. Suppose that a third chain a is given by the sequence $a_1, a_2, ... a_k$, then we suppose given the non-negative values and then set
9This normalization is used to cast the raw values in the unit interval of the circle group.
with a non-negative function 3CH on the singleton arguments, e.g., the sum of their second powers. One may in particular implement singleton values which yield good truth values for major, minor and similar common triads. The formulas for the singletons x should be such that the values for basic tonality symbol f = C yield the values for general tonality symbols, according to the formula (25.9)
if the symbol f corresponds to the tonic t. So this method is based on an elementary truth evaluation on the singletons relative to one reference tonality and then uses different constructors via third chain evaluation and then minimal covering families of third chains. This method has been implemented in RUBATO®'s HarmoRUBETTE®, see section 41.3 for details. The above approach seems to be numerically oriented. It is however still a logical approach in the fuzzy logic module $S^1$. Nonetheless, such a specific truth value assignment is not mandatory: one may think of much more abstract truth modules J and then, after having defined a method that works on J, transform such abstract values to more 'practical' values via appropriate truth module morphisms J → I.
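The averaging principle of formula (25.8) can be sketched numerically. This is only an illustration under stated assumptions: the normalizing bijection is taken to be the example at(x) = (2/π) arctan(x), the truth values of the minimal third chains are assumed to be given as plain numbers in [0, 1[, and the names `at`, `at_inv` and `chord_value` are hypothetical. It is not the HarmoRUBETTE implementation.

```python
import math


def at(x):
    """Strictly increasing bijection [0, inf[ -> [0, 1[ (example choice)."""
    return (2.0 / math.pi) * math.atan(x)


def at_inv(y):
    """Inverse of at, defined for 0 <= y < 1."""
    if not 0.0 <= y < 1.0:
        raise ValueError("truth value must lie in [0, 1[")
    return math.tan(0.5 * math.pi * y)


def chord_value(chain_values):
    """Formula (25.8) for one pair (f, t).

    chain_values holds the values TF_{f,t}(a) for all third chains a in
    3Chain(c). They are mapped to raw values in [0, inf[, averaged, and
    the mean is mapped back into [0, 1[.
    """
    if not chain_values:
        raise ValueError("3Chain(c) must contain at least one third chain")
    mean_raw = sum(at_inv(v) for v in chain_values) / len(chain_values)
    return at(mean_raw)


if __name__ == "__main__":
    # Hypothetical truth values of three minimal third chains covering a chord.
    print(chord_value([0.2, 0.5, 0.7]))
```

Because the mean is taken on the raw scale, a chord covered by a single third chain keeps that chain's value, and the result always lies between the smallest and the largest contributing value.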
25.3.4
Tonal Functions from Absorbing Addresses
Summary. This section discusses the tonal functions of chords associated with absorbing addresses, generalizing the idea of self-addressed morphemes in [400].
This approach does not take values of a numerical nature, but gets off on a purely topological level of truth levels. The philosophy is that the evaluation is not so much information about truth or falsity, but about the esthetic position within a 'neighborhood system of harmonic centers'. Harmonic evaluation in this spirit follows a 'logic' where the location is more relevant than the decision between dichotomic alternatives. It is in fact an open question whether Riemann's harmonic logic ever was meant to be a restatement of classical binary logic, and whether its role was ever limited to yield decisions between correct and false musical syntax. We start on the truth value form functor I = ExTop(F) for a pitch form F (e.g., F = PiMod12, Euler Z Module), defined on the subcategory of addresses with retractions as morphisms. This generates the truth form TRUTH(I) and, by definition, its functor values $B@TRUTH(I) = B@\Omega^{ExTop(F)}$ at address B. So a subset $U \subset ExTop_B(F)$, i.e., an element $U \in B@2^{ExTop(F)}$, gives rise to a truth value $U^{\sim} \in B@TRUTH(I)$, and we have integrated topologically deduced subsets of the B-addressed extension topology of form F. On this basis, the evaluation $TF_{f,t}(c)$ of a chord c in $ExTop_B(F)$ follows this scheme: The tonality symbol t is associated with an interpretation $S_t^{M_t}$ of an A-addressed scale $S_t \subset ExTop_A(F)$. The elements of $M_t$ are images $U_m = monex_z(m)$ of monoids10 $m \in InTop(F)$. The monoids are deduced from the interpretation $S_{t_0}^{M_{t_0}}$ for a basic tonality $t_0$ by conjugation with the translation11 that corresponds to the tonic shift from $t_0$ to t, or another transformation for more involved collections of tonal symbols.
10 The "harmonic morphemes" according to the terminology of Noll [403].
11 See the calculations of morpheme transformations for bigeneric morphemes in section 25.3.1.
This method's function symbols relate to the basic interpretation $S_{t_0}^{M_{t_0}}$ and its charts. If m is such a chart's monoid, some specific specialization monoids, i.e., non-trivial submonoids $n_{m,i}$, i = 1, ... q(m), are chosen to define the function symbols $f_{m,i}$. For a general tonality symbol t, the function symbols are associated with the correspondingly transformed monoids and submonoids. This being true, the function value $TF_{f,t_0}(c)$ for a symbol f corresponding to the specialization pair $n_{m,i} \prec m$ is the locally closed allomorphic extension set $AllExt_B(n_{m,i}, m)$ iff c is an element of this set, and ∅ else.

In what follows, we want to give some examples of tonal functions from absorbing addresses. They are all focused on the classical context of pitch form F = PiMod12 with fifth identifier, addresses A = Z12, B = 0, and absorbing address retraction z : A → 0.

Example 46 Bigeneric Major Tonality [400]. We take up the bigeneric morpheme monoids $M_{d,a}$ from equation (25.6) and their associated self-addressed chords $M_{d,a}^z = monex_z(M_{d,a})$. We are given the usual twelve tonality symbols TON
= {C, D♭, D, E♭, E, F, G♭, G, A♭, A, B♭, B}
and calculate the pitch class p corresponding to tonality symbol X in multiples of fifths, e.g., G corresponds to p = 1; we write $G = e^1 C$, to be short. The $X = e^p C$-major tonality is defined by the twelve morpheme charts with the respective functional symbols as follows: We choose d = 1 and set
Function Symbol | Consonant Mode | Dissonant Mode
The corresponding predicates are related to the extensions (the unique open allomorph) $Ext_0(M_{d,a}^z)$ and take symbols T, D, S, t, d, s for the "consonant mode", and T*, D*, S*, t*, d*, s* for the "dissonant mode" (terminology from [400]), i.e.,
VAL = {T, D, S, t, d, s, T*, D*, S*, t*, d*, s*},
and the value
$$F_{e^pC,v}(c) = \begin{cases} Ext_0(M_{a(p,v),d(p,v)}) & \text{if } c \in Ext_0(M_{a(p,v),d(p,v)}), \\ \emptyset & \text{else,} \end{cases}$$
with $v \in VAL$, and
$$(a(p, T), d(p, T)) = (1, p + 4), \quad (a(p, D), d(p, D)) = (11, p + 9), \quad \ldots, \quad (a(p, s^*), d(p, s^*)) = (2, p + 8),$$
according to the above table. Recall that the zero-constant parts of these morpheme charts are classical triads, as discussed in proposition 46 of section 25.3.1 and the remarks following that proposition. We also see that the terminology of a "consonant" or "dissonant" mode is now plausible by the triads in the dissonant mode being related to seventh and tritone intervals. This example can be enriched by adding selected allomorphs to the above open allomorphs. A priori, we would advocate the full allomorphic extension of each of the above morphemes, and then build a harmonic function theory upon the complete 'allomorphic spectrum'.

Exercise 60 Calculate and draw the nerve of the twelve-chart interpretation by the semigroup morphemes13 $exz(\langle \alpha_d, \beta_a \rangle)$ and the sub-nerve of its zero-constant trace by the bigeneric morphemes in example 46. The nerve of the semigroup submorphemes tells more about the intersection configurations because the identity of a monoid 'collapses' all information about other common points.

Example 47 Third Degree Tonality. We have already defined the triadic interpretation $X^{(3)}$ of a diatonic scale X in section 13.4.2. This was a zero-addressed situation which we now 'blow up' in the self-addressed context14. Again, we have the twelve pitch class related tonalities TON
= {C, D♭, D, E♭, E, F, G♭, G, A♭, A, B♭, B}.
We first stick to the basic X = C and start on seven monoid morphemes, $M_I = Int(I_C)$, $M_{II} = Int(II_C)$, ... $M_{VI} = Int(VI_C)$, $M_{VII} = Int(VII_C)$. We have $Int(I_C) = M_{1,4}$, i.e., the generators $\alpha, \beta$ generate the entire monoid $Int(I_C)$, which is the semigroup generated by $\alpha, \beta$, united with the identity15. The analog is true for the other monoids $Int(II_C)$, ... $Int(VI_C)$ since degrees II to VI are isomorphic to degree I. However, $Int(VII_C)$ is more complicated. The symmetry group of $VII_C$ is a Klein group and the sub-semigroup of non-invertible elements is not bigeneric in the sense discussed above. Here it is:
$$Int(VII_C) - Sym(VII_C) = \{ e^{11}0, e^{2}0, e^{5}0, e^{5}6, e^{11}6, e^{8}3, e^{8}9, e^{1}2, e^{7}2, e^{3}10, e^{9}10, e^{1}8, e^{7}8, e^{10}8, e^{3}4, e^{6}4, e^{9}4 \}$$
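The list of non-invertible elements can be reproduced by brute force. The sketch below is an illustration only, under two assumptions that are not spelled out at this point of the text: that $Int(VII_C)$ consists of the affine endomorphisms $e^a.b : x \mapsto a + b\,x$ of Z12 mapping $VII_C$ into itself, and that the fifth identifier sends a semitone value s to the p with 7p = s (mod 12). The helper names are hypothetical.

```python
from math import gcd

N = 12


def to_fifths(semitone):
    """Fifth identifier: semitone s corresponds to p with 7*p == s (mod 12)."""
    return (7 * semitone) % N


# VII_C = {b, d, f}, i.e. semitones 11, 2, 5, rewritten in multiples of fifths.
VII_C = {to_fifths(s) for s in (11, 2, 5)}


def stabilizing_maps(chord):
    """All pairs (a, b), read as e^a.b : x -> a + b*x, that send chord into itself."""
    return [(a, b) for b in range(N) for a in range(N)
            if all((a + b * x) % N in chord for x in chord)]


def split_by_invertibility(maps):
    """Separate maps with invertible linear part (gcd(b, 12) == 1) from the rest."""
    invertible = [(a, b) for a, b in maps if gcd(b, N) == 1]
    non_invertible = [(a, b) for a, b in maps if gcd(b, N) != 1]
    return invertible, non_invertible


if __name__ == "__main__":
    sym, rest = split_by_invertibility(stabilizing_maps(VII_C))
    print("Sym(VII_C):", sym)
    print("non-invertible part:", len(rest), "elements")
    for a, b in rest:
        print(f"e^{a}.{b}")
```

Under these assumptions the search returns four invertible maps, each of order at most two (a Klein group), and the seventeen non-invertible maps listed above.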
13 These can also be defined as being the intersection of the monoid morphemes with the semigroup of all non-invertible elements of End(F).
14 Recall that we work in pitch space PiMod12 with the fifth identifier.
15 We know from appendix C.2.3 that
$$\sqrt{Idempot(\mathbb{Z}_{12}@\mathbb{Z}_{12})} = \sqrt{1} \cup \sqrt{4} \cup \sqrt{e^{3}4} \cup \sqrt{e^{6}4} \cup \sqrt{e^{9}4} \cup \sqrt{9} \cup \sqrt{e^{4}9} \cup \sqrt{e^{8}9} \cup \sqrt{0} \cup \sqrt{e^{1}0} \cup \sqrt{e^{2}0} \cup \sqrt{e^{3}0} \cup \ldots \cup \sqrt{e^{11}0}$$
is the idempotent decomposition of Z12@Z12 into invertible endomorphisms, those above linear factor 4, those above linear factor 9, and those above linear factor zero. Since these linear factors commute, the components, which are conjugate with each other above a fixed linear idempotent (appendix C.2.3, proposition 66), are all semigroups.
We denote the monoid generated by these non-invertible elements of $Int(VII_C)$ by $Int_0(VII_C)$. The charts of our self-addressed triadic interpretation are the six charts $M_I = intex_z(I_C)$, $M_{II} = intex_z(II_C)$, ... $M_{VI} = intex_z(VI_C)$, and the degree seven chart $M_{VII} = monex_z(Int_0(VII_C))$. The function values are symbolized according to all possible allomorphs of the seven chart monoids, which yield the truth values as in the preceding example. We leave the details to the reader. The intersection configuration is remarkable. The nerve of the objective traces of the exz-charts from the sub-semigroups of non-invertible elements16 in the monoids $Int(I_C)$, ... $Int(VI_C)$, $Int_0(VII_C)$ is a priori a simplicial complex containing the nerve of the zero-constant part. In this case, no new intersection configuration intervenes: As in the zero-constant case, the nerve of the self-addressed extension of the triadic interpretation of a diatonic scale is again a Möbius strip. Figure 25.2 shows the intersection configuration in detail.
Figure 25.2: The non-invertible elements of the self-addressed charts built upon the endomorphisms of classical triads define a nerve which is identical to the nerve from the constant parts of the self-addressed triads! The figure shows the self-addressed chart intersections for C tonality, everything calculated in the fifth-identifier form PiMod12.
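For the zero-constant case, the nerve that serves as reference here can be computed directly from the seven triads of C major. The sketch below is only an illustration: it works with pitch classes in semitones, treats the charts as plain sets, and uses hypothetical helper names. It counts simplexes and the Euler characteristic, which is consistent with a Möbius strip but does not by itself prove it.

```python
from itertools import combinations

C_MAJOR = [0, 2, 4, 5, 7, 9, 11]  # pitch classes in semitones


def triadic_degrees(scale):
    """Seven degrees: every second scale tone, starting on each scale step."""
    n = len(scale)
    return [frozenset(scale[(i + 2 * k) % n] for k in range(3)) for i in range(n)]


def nerve(charts):
    """Abstract nerve: index tuples whose charts share a point, keyed by dimension."""
    simplexes = {}
    for k in range(1, len(charts) + 1):
        faces = [idx for idx in combinations(range(len(charts)), k)
                 if frozenset.intersection(*(charts[i] for i in idx))]
        if not faces:
            break  # no k-element simplexes, hence no larger ones either
        simplexes[k - 1] = faces
    return simplexes


def euler_characteristic(simplexes):
    """Alternating sum of the numbers of simplexes per dimension."""
    return sum((-1) ** dim * len(faces) for dim, faces in simplexes.items())


if __name__ == "__main__":
    n = nerve(triadic_degrees(C_MAJOR))
    print({dim: len(faces) for dim, faces in n.items()})
    print("Euler characteristic:", euler_characteristic(n))
```

The same `nerve` function can be applied to any finite covering given as a list of sets, for instance to the self-addressed charts once their elements have been enumerated.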
Remark 13 The passage from zero-addressed to self-addressed degrees maintains the nerve, but if we take one more step from the self-addressed to the functorial setup, the Möbius strip is enriched by new simplexes. More concretely, we take the self-addressed degrees X ⊂ Z12@Z12 and consider the associated functorial local compositions X ⊂ (Z12@Z12)~ and the global composition $Z^I$ defined by this covering I. In this setup, the nerve of the self-addressed configuration is of course part of the nerve of the functorial configuration $Z^I$, but we encounter new simplexes. Specifically:
16 These are all elements except the identity.
Exercise 61 Show that the functorial degrees I~ and II~ have a non-empty intersection and thereby extend the Möbius strip of the self-addressed triadic covering. Hint: Find a morphism f : Z12 → Z12 which yields an f-slice f@I~ ∩ f@II~ ≠ ∅.
Chapter 26
Cadence
Was ist C-Dur? Wahrlich, Herr Mazzola, eine gute Frage! (What is C major? Truly, Mr. Mazzola, a good question!)
Rudolf Wille [575]
Summary. Cadences are shorthand representations of tonalities. (We do not discuss the other meanings of this typically homonymic term, e.g., the solo cadence in the sense of a concert climax.) There is a variety of approaches to realize such a representation. We give an explicit definition of the concept of a cadence with respect to varied addresses and ambient spaces. In particular, we present the very classical cadences, those related to self-addressed function theory, and more exotic self-addressed cadences which relate to symmetries rather than to tones or sets of such objects.
The concept of a cadence as a final cadence or a cadential formula to consolidate a tonality goes back to Italian theorists of the 16th century. In this chapter, we shall discuss this meaning of the word and not that of the concert solo cadence. In this sense, a cadence is a good example for the construction of sign structures in music. The signifier expression is fixed in a cadential formula, such as I-IV-I-V-I, which is supposed to obtain its signification in a specific context, and thereby points to its signified content: the "uniquely determined tonality" or the "composition's conclusion", respectively. We shall only deal with the former meaning because this one is also the structural substrate of the latter: the conclusion is expected because a specific, uniquely determined tonality is recognized from a cadential formula. If this tonality (whatever the technical meaning of this word may be) is the significate of the given cadence formula, a mathematical function of type "cadence formula = function(tonality)" is given. The characteristic of this function is that it is injective. Two tonalities are associated with different cadence formulas: $I_C$-$IV_C$-$I_C$-$V_C$-$I_C$ differs from $I_D$-$IV_D$-$I_D$-$V_D$-$I_D$. Implicitly, the existence of a cadence function depends upon two conditions which usually are not noticed by theorists because they seem straightforward:
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
• The first deals with the very concept of a tonality. Making this concept precise is a condition for a unique determination of its instances whatsoever. The recognition of a tonality from its cadence presumes that you are able to distinguish it from musical objects that are not tonalities or are so, but share a different flavor which you would eliminate in the present context. For example, the recognition of the European C-major tonality from the classical Rameau cadence $I_C$-$IV_C$-$I_C$-$V_C$-$I_C$ presupposes that we would not search in a set of Turkish or other non-European tonalities. So we may suppose being given a set X of global compositions which define the extension of the concept of a tonality.
• The second condition relates to the range of the possible cadence formulas. In fact, searching for a formula presupposes that one will find it in a delimited domain of musical cadence expressions. In other words, one should know what is a cadence formula in the present context and what is not. Call P the set of admitted cadence formulas, or "cadence parameters" as the elements of P parametrize the domain of formulas. We shall see that for all known situations, P can be chosen to be a set of global compositions.
26.1
Making the Concept Precise
Summary. This section describes the concept of a cadence.
Definition 86 Given two sets X, P of tonalities and cadence parameters, respectively, as introduced above, and a map $\kappa : X \to P$, we say that $\kappa$ is cadential in a tonality $x \in X$ if the fiber $\kappa^{-1}\kappa(x)$ is the singleton {x}, i.e., $\kappa$ is injective in x. The map $\kappa$ is called cadential or a cadence, iff it is cadential in every tonality $x \in X$.

We should however be aware that the general nature of a cadence map cannot be restrained to mathematical or even functorial properties. A cadence is a relation which can issue from any motivation, be it historical, systematic or arbitrary. Essentially, the above definition is the one used in [327, 328, 340]. In the present theory language, we would prefer restating that definition in terms of textual predicates. Disregarding the fact that global compositions are not denotators, we would have to extend the cadence map to a 'global predicate' Cad outside X by the value $\kappa(y) = y/Cad = \emptyset$ for $y \notin X$. Inside X, we then would have the values x/Cad in the set of A-addressed global compositions whose charts live in the space associated with a specific cadence truth form F. Whenever we use this wording for cadences, keep in mind that no extension of the category Den of denotators to global objects has been defined so far, except for powerset denotators, i.e., local compositions. Nonetheless, we shall use this sloppy language because the environments are evident. Before discussing the examples, we should emphasize the dramatic ambiguity of these concepts between truth and beauty. The truth values for a cadence are global compositions, i.e., objects of musical esthetics rather than poor truth denotators. A cadence gives you a parametrization of musical objects (tonalities in this case) in an auxiliary space of musical objects in order to obtain a shorthand representation of these objects. Rather than evaluating these objects by truth or falsity, the cadence positions them in an esthetic parameter space.
Observe also that we did not give a specific delimitation of the tonality concept; it is just a word which will be filled with meaning in the following examples. Nonetheless, we should recall the discussion of function theory in section 25.3. There, "tonality" was introduced as a symbol in an abstract set TON. Its usage was the construction of a bunch of textual predicates $TF_{f,t}$ which give each chord a tonal function (in this case: an allomorphic set in the topological extension space). These predicates may be used to define cadences, but the effort is considerable; we leave this as an exercise1. The link between the function predicates and the cadence predicates is that the function predicates are evaluated by use of global compositions (such as the triadic interpretation of a scale). These global compositions correspond to the tonalities in the set X for cadences. Notice that the scope of tonal functions is not the identification of tonalities, but the attribution of a tonal value for any given chord, and this is much more than mere identification.
26.2
Classical Cadences Relating to 12-tempered Intonation
Summary. This first example reviews the classical cadences of European tonal music in relation to pitch classes in PiMod12.
The first group of cadences relates to zero-addressed (commutative) compositions, more precisely, to compositions stemming from the pitch class ambient spaces Z12 for 12-tempered tuning and Z2 for just temperament. These cases are also prominent since we have extensive modulation theorems for them. These approaches are also prototypical in so far as the used atlases consist of quite special charts (typically triadic coverings) which could be generalized to richer and more realistic coverings (and partly have been by Straub's investigations). But it is just this simplistic perspective which shows that even under elementary conditions, interesting and experimentally significant structures may emerge. The experimental investigations will be explicated in chapter 27 dedicated to modulation phenomena in classical scores from European literature.
26.2.1
Cadences in Triadic Interpretations of Diatonic Scales
Summary. This section describes the cadences related to triadic degrees.
We start with the classical example. Let $X = Dia^{(3)}$ be the twelve-element set of zero-addressed triadic interpretations of all major scales in Z12. Take as cadence parameter set P the set of global, zero-addressed compositions with charts in Z12. For a given sequence $J = (1 \le j_1 < j_2 < \ldots < j_k \le 7)$ of indices for triadic charts, we define the map $\kappa_J(S^{(3)}) = \{J_{j_1}, J_{j_2}, \ldots J_{j_k}\}$ with image the sub-interpretation of the covering defined by the charts from the subsequence J. For example, if J = 1, 4, we get the value $\kappa_J(E^{(3)}) = \{I_E, IV_E\}$, a two-chart interpretation of the five-element pitch class set {e, g♯, b, a, c♯}. The extension of such a function to other triadic interpretations, for example of melodic minor scales, would evidently destroy cadentiality. On the other hand, ignorance of the precise formula (first and fourth triadic degrees) would make it impossible to retrace the argument. For example, if we had also at our hands the first and fifth triadic degrees, it could not be decided whether the interpretation is the image $\{I_E, IV_E\}$ or $\{I_A, V_A\}$. Although the function $\kappa_J$ is a cadence for the given tonality domain $Dia^{(3)}$, the usual selection method for 'good' sequences asks for more. The idea is that you only know that the associated interpretation is defined by some J, but one is not sure about which one has been applied. So the requirement is that the tonality should be uniquely determined whatever J may be the defining selection parameter. This leads to the concept of a cadential set of triadic degrees.
1 Speaking of a cadence in the context of tonal function predicates, this cadence would be formalized by the function vector $TF_t$ discussed in section 25.3.2.
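The behaviour of the maps $\kappa_J$ described above can be checked on the twelve major scales. The following sketch is an illustration only: tonalities are represented by their tonic in semitones (E = 4, A = 9), charts are plain pitch-class sets, and the function names are hypothetical.

```python
MAJOR_STEPS = [0, 2, 4, 5, 7, 9, 11]


def major_scale(tonic):
    """Pitch classes (semitones) of the major scale on the given tonic."""
    return [(tonic + s) % 12 for s in MAJOR_STEPS]


def degrees(scale):
    """Triadic degrees I..VII: every second scale tone from each scale step."""
    n = len(scale)
    return [frozenset(scale[(i + 2 * k) % n] for k in range(3)) for i in range(n)]


def kappa(J, tonic):
    """Sub-interpretation selected by the 1-based degree indices in J."""
    d = degrees(major_scale(tonic))
    return frozenset(d[j - 1] for j in J)


def is_cadence(J):
    """kappa_J is cadential iff it is injective on the twelve major tonalities."""
    images = [kappa(J, t) for t in range(12)]
    return len(set(images)) == 12


if __name__ == "__main__":
    print("kappa_(1,4) injective:", is_cadence((1, 4)))
    # Ambiguity when the selection J itself is unknown:
    print("{I_E, IV_E} == {I_A, V_A}:", kappa((1, 4), 4) == kappa((1, 5), 9))
    support = frozenset().union(*kappa((1, 4), 4))
    print("pitch classes covered by {I_E, IV_E}:", sorted(support))
```

For a fixed J = (1, 4) the map separates the twelve tonalities, while the coincidence of the charts $\{I_E, IV_E\}$ and $\{I_A, V_A\}$ shows why knowledge of J is needed.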
Definition 87 Given a set X of interpretations of objective, local A-addressed compositions of form space F, a set $C \subset I$ of charts of an element $S^I \in X$ is called cadential in $S^I$ iff $S^I$ is the only interpretation $T^J$ in X such that $C \subset J$. The cadential set C is called minimal cadential in $S^I$ iff it has no proper subset which is also cadential in $S^I$. A family $(C_x)_{x \in X}$ of (minimal) cadential sets $C_x$ for each interpretation $x \in X$ is called a (minimal) cadential family for X.

The classical example $X = Dia^{(3)}$ yields five index sequences for minimal cadential families:
$$J_1 = (II, III),\ J_2 = (II, V),\ J_3 = (III, IV),\ J_4 = (IV, V),\ J_5 = (VII).$$
This means that the family $J_1$, for example, defines its set $\{II_{S^{(3)}}, III_{S^{(3)}}\}$ for the triadic interpretation $S^{(3)}$. The fourth one is the reminiscence to Rameau's cadence I-IV-I-V-I, except that the first degree triad is superfluous since no tonic is stressed in this triadic interpretation.

Remark 14 Observe that in the present definition of cadential sets, no syntactical position is defined, so the sequence in the Rameau cadence I-IV-I-V-I is not reflected, only the set {I, IV, I, V, I} is of interest in this context. A further specification of syntactical relations is necessary for the concrete applications, but this is another subject which we do not open up here.
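The five minimal cadential sets for $Dia^{(3)}$ can be recovered by exhaustive search directly from Definition 87. This is a sketch under simplifying assumptions, not the program used in [390]: charts are identified with their pitch-class sets in semitones, and `triadic_interpretation`, `is_cadential` and `minimal_cadential_sets` are hypothetical names.

```python
from itertools import combinations

MAJOR_STEPS = [0, 2, 4, 5, 7, 9, 11]
ROMAN = ["I", "II", "III", "IV", "V", "VI", "VII"]


def triadic_interpretation(tonic, steps=MAJOR_STEPS):
    """List of the seven triadic degrees of the scale tonic + steps."""
    scale = [(tonic + s) % 12 for s in steps]
    n = len(scale)
    return [frozenset(scale[(i + 2 * k) % n] for k in range(3)) for i in range(n)]


def is_cadential(charts, X, x):
    """True iff x is the only interpretation in X containing all given charts."""
    owners = [y for y in X if all(c in X[y] for c in charts)]
    return owners == [x]


def minimal_cadential_sets(X, x):
    """Index tuples of charts of x that are cadential with no cadential proper subset."""
    charts = X[x]
    found = []
    for k in range(1, len(charts) + 1):  # increasing size guarantees minimality
        for idx in combinations(range(len(charts)), k):
            if any(set(m) <= set(idx) for m in found):
                continue  # contains a smaller cadential set
            if is_cadential([charts[i] for i in idx], X, x):
                found.append(idx)
    return found


# The twelve triadic interpretations of major scales, keyed by tonic.
DIA3 = {t: triadic_interpretation(t) for t in range(12)}

if __name__ == "__main__":
    for idx in minimal_cadential_sets(DIA3, 0):
        print(tuple(ROMAN[i] for i in idx))
```

Passing another step pattern to `triadic_interpretation` (for instance a harmonic minor pattern) gives a way to compare with the counts reported for other scale orbits.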
In [390], Daniel Muzzulini calculated all minimal cadential sets for the following tonality sets X (by computer). We take the translation orbits $X = e^{\mathbb{Z}_{12}}.S^{(3)}$ of triadic interpretations of seven-element scales $S \subset \mathbb{Z}_{12}$ by the seven degrees consisting of those three-element subsets such that every second pitch class in the clock order is selected. The number of cadential sets varies between 5 and 21, the first number being taken by the triadic major scale interpretations $Dia^{(3)}$ while the maximum of 21 cadence sets is taken by the orbit of harmonic minor scale interpretations $Har^{(3)} = e^{\mathbb{Z}_{12}}.c^{(3)}$; in fact, here every pair (X, Y) of two different degree indexes defines a minimal cadence set, and we have $\binom{7}{2} = 21$ sets.

A more modern definition of cadential sets runs as follows. We consider a set X of A-addressed interpretations
$$x = S^I \to S \to @A \times F$$
in form space F. A cadential interpretation for $S^I$ is an interpretation
$$C^J \to C \to @A \times F$$
which factorizes through $S^I$, i.e., there is a morphism
$$C^J \to S^I$$
such that the triangle
$$C^J \to S^I \to @A \times F \qquad (26.1)$$
commutes. This implies that each chart of $C^J$ is associated with a chart of $S^I$, and that this is done in such a way that the points of the charts are mapped into each other in accordance with the underlying embeddings of the local support compositions S, C. And we ask that there is no factorization through an interpretation $y \neq x$ in the set X. Morally (which in this book reduces to topos theory), such a cadential interpretation is a point in the full sieve @x which characterizes x within the collection X. So cadential sets relate to selecting 'separating' points $c_x$ in full sieves @x, $x \in X$. A general theory of such constructs has not been developed to date.
26.2.2
Cadences in More General Interpretations
Summary. This section describes the cadences related to generalized degrees in general scales, as investigated by Hans Straub.
Starting from the generalization of minimal cadence sets for the seven-element scales studied in [390], Hans Straub [514] has undertaken a vast generalization of cadence set types on seven-element scales. A (minimal) cadential set is the same as above, but it need not cover the scale, it is a more abstract representation not of the single scale notes, but of the degree representation of the scale. For example, one may consider as degrees all the triadic chords of minor, major, diminished, or augmented types, which are contained in a given scale. Complete computer-aided (here, computers become definitively mandatory) calculations have been carried out for minimal cadence sets consisting of a number of three-element degrees of any type, no classical triadic chords are preferred. If the degrees are all the triadic chords of minor, major, diminished, or augmented types in a scale, the nerve dimensions may vary from zero to seven, and not every scale is covered by this degree system. But this degree system has the advantage that all degree chords are classical, which is not the case in Muzzulini's generalization. However, in Straub's general concept framework, one finds degree systems consisting of seven triadic chords which have no cadence sets at all, for example for the scales of type 60 and 62 (in our classification table L.1). It turns out that 21 is the absolute maximum of possible minimal cadence sets for systems of seven three-element chords in seven-element scales, and that it is achieved by the majority of scales.
CHAPTER 26. CADENCE
In the very general setup, Straub recognizes that a particular type of degree systems is interesting. According to the degree-theoretic approach which includes coverings by standard structures (as sketched in section 25.1), Straub therefore considers irreducible degree systems DEC in the sense that no element of DEC can be written as a union of other elements of DEC. Evidently, systems consisting of elements of equal cardinality, such as triadic systems, are irreducible. Also, a system is always built by a unique irreducible subsystem. A classification of all irreducible degree systems has not been accomplished to date. It is however important to embed the classical European degree concepts in this variational environment to understand how special or singular the classical choice is. According to Straub's investigations, the distinguished position of the classical diatonic, harmonic, and melodic scales does not emerge on the general level of degree systems and the associated cadence systems, but only with respect to characteristic properties in modulation theory (see chapter 27 for this discussion). This latter aspect has however not been investigated in depth. Exercise 62 Show that a degree system of a finite scale is always built by a unique irreducible subsystem.
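The irreducibility condition above can be tried out on small examples. The following Python sketch is only an illustration under stated assumptions: degrees are modeled as finite sets of pitch classes, the function name `irreducible_subsystem` and the sample system are hypothetical, and the empty degree (an empty union) is treated as reducible. It uses the observation that any representation of a degree as a union of other degrees can only involve proper subsets of that degree.

```python
from itertools import chain


def irreducible_subsystem(degrees):
    """Return the degrees that are not a union of other degrees of the system."""
    system = {frozenset(d) for d in degrees}
    core = set()
    for d in system:
        # Only proper subsets of d can take part in a union that equals d.
        smaller = [e for e in system if e < d]
        union = frozenset(chain.from_iterable(smaller))
        if union != d:
            core.add(d)
    return core


# Hypothetical degree system: three triads of the C-major scale plus one
# seventh chord that is the union of the first two triads.
system = [{0, 4, 7}, {4, 7, 11}, {2, 5, 9}, {0, 4, 7, 11}]
core = irreducible_subsystem(system)
print(sorted(sorted(d) for d in core))
```

In this sample the seventh chord is dropped and the three triads remain, in line with the remark that systems of elements of equal cardinality are irreducible.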
26.3
Cadences in Self-addressed Tonalities of Morphology
Summary. Relating to the interpretation of self-addressed tonalities, minimal cadential sets have been calculated by Thomas Noll; we present and discuss the results.
In [403], the basic form is the classical 12-tempered pitch class space $F = PiMod_{12}$, with fifth identifier. In the vein of our example 47 in chapter 25, major and minor tonalities are constructed as interpretations of $\mathbb{Z}_{12}$-addressed local compositions. Elements $e^a.b$ are denoted by [a, b]. We restrict to the description of C-major and C-minor, the other 22 tonalities being defined via transposition. The basic local composition (the 'scale') has 23 elements, eight of them being constant, i.e., stemming from zero-addressed tones:
C-major = { [0,0], [1,0], [2,0], [3,0], [4,0], [5,0], [8,0], [11,0], [1,3], [2,3], [3,3], [8,3], [11,3], [0,4], [3,4], [9,4], [4,8], [8,8], [9,8], [11,8], [0,9], [4,9], [8,9] },
whereas the interpretation is given by 10 charts which are related to the function symbols T, D, S, D*, s, s* in major and t, d, s, s*, D, D* in minor. Each of these main function symbols is associated with a "Gegenklang" and a "Parallelklang" symbol g,p, respectively. We have these
major tonality charts²:
Tg = [1,3], [4,9], [1,0], [4,0]
Tp = [4,8], [0,4], [0,0], [4,0]
Sg = [3,3], [0,9], [0,0], [3,0]
Sp = [11,8], [3,4], [11,0], [3,0]
Dg = [11,3], [8,9], [2,0], [5,0]
Dg* = [8,3], [8,9], [11,0], [5,0]
Dp = [9,8], [9,4], [1,0], [5,0]
sG = [11,3], [8,9], [11,0], [8,0]
sG* = [2,3], [8,9], [2,0], [8,0]
sP = [8,8], [0,4], [0,0], [8,0]
which are motivated by the allomorphs to the bigeneric morpheme construction from section 25.3.1 (see also [403]).
Exercise 63 Draw the nerve of this interpretation, and the nerve of the constant subcharts. Compare them to the nerve (harmonic strip) of the triadic interpretation from example 47 in chapter 25.
With these settings, the set X of tonalities is defined by the 24-element set of 12 major tonalities C-major, D♭-major, ..., B-major, and 12 minor tonalities C-minor, D♭-minor, ..., B-minor. The cadence maps are given by 56 minimal cadential sets. Half of them are applied to major tonalities only (!), the other half operates on minor tonalities. This means that the 28 candidates for major tonalities characterize any major tonality within X, whereas the other half does the job for the minor tonalities. The following list from [403] shows the candidates for major tonalities. With each chart Z, the number in brackets indicates the number of zero-addressed chords with up to four tones which are contained in the extension $Ext_0(Z)$. Observe that in general, replacing the chart by its zero-addressed extension is not cadential. The point is that in a concrete score, only zero-addressed chords will appear, at least without any additional constructions. So the question of how to represent a self-addressed chord on the zero-addressed level arises. To look for zero-addressed extensions means that one switches to the topological framework and seeks zero-addressed chords in the extension of a chart Z. A refined method would consist in the consideration of allomorphs as discussed in the function-theoretic logic above. This method should however be worked out in more detail. In fact, it is possible that a zero-addressed chord is contained in several extensions, and, worse than that, in extensions stemming from different cadential sets.
So the overall information is that a zero-addressed chord is contained in some extensions for some cadential sets. The problem is this: How can one decide which tonality is the best candidate for the placement of the given chord? Such a question becomes important if one needs recognition algorithms in a systematic tonality analysis of given scores, on a software platform such as RUBATO®, for example. Still more generally, a zero-addressed chord need not be viewed as a local object, but it may be interpreted by subchords which live in some extension sets, much like the degree concepts in non-irreducible chords of Straub's above approach. No systematic research has been carried out to date.
² The symbol construction in [403] is not thoroughly logical.
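The chart data of the C-major tonality listed earlier in this section can be checked mechanically. The sketch below is an illustration, not part of [403]: it encodes each element [a, b] as a Python pair `(a, b)`, verifies that the ten charts cover the 23-element local composition, counts the constant elements (second entry 0), and computes the pairs of charts with a common element, i.e., the 1-simplexes of the nerve asked for in Exercise 63. All variable names are assumptions made for this example.

```python
from itertools import combinations

# Pairs (a, b) stand for the Z12-addressed elements [a, b] of the text.
C_MAJOR = {
    (0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (5, 0), (8, 0), (11, 0),
    (1, 3), (2, 3), (3, 3), (8, 3), (11, 3), (0, 4), (3, 4), (9, 4),
    (4, 8), (8, 8), (9, 8), (11, 8), (0, 9), (4, 9), (8, 9),
}

CHARTS = {
    "Tg": {(1, 3), (4, 9), (1, 0), (4, 0)},
    "Tp": {(4, 8), (0, 4), (0, 0), (4, 0)},
    "Sg": {(3, 3), (0, 9), (0, 0), (3, 0)},
    "Sp": {(11, 8), (3, 4), (11, 0), (3, 0)},
    "Dg": {(11, 3), (8, 9), (2, 0), (5, 0)},
    "Dg*": {(8, 3), (8, 9), (11, 0), (5, 0)},
    "Dp": {(9, 8), (9, 4), (1, 0), (5, 0)},
    "sG": {(11, 3), (8, 9), (11, 0), (8, 0)},
    "sG*": {(2, 3), (8, 9), (2, 0), (8, 0)},
    "sP": {(8, 8), (0, 4), (0, 0), (8, 0)},
}

# Union of all charts and the constant (zero-addressed) elements.
covered = set().union(*CHARTS.values())
constants = {p for p in C_MAJOR if p[1] == 0}

# 1-skeleton of the nerve: pairs of charts sharing at least one element.
edges = {
    (a, b)
    for a, b in combinations(sorted(CHARTS), 2)
    if CHARTS[a] & CHARTS[b]
}

print("charts cover the scale:", covered == C_MAJOR)
print("elements:", len(C_MAJOR), "constant:", len(constants))
print("1-simplexes:", sorted(edges))
```

Higher simplexes of the nerve can be obtained in the same way by testing triples, quadruples, and so on for common elements.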
Minimal cadence sets in major self-addressed tonalities.
Dg* (26) Dg* (26) 8g (12) 8g (12) 8g (12) 8g (12) sG* (26) sG* (26) 8p (32) 8p (32) sP (32) sP (32) 8g (12) 8g (12) 8g (12) 8g (12) sG (12) sG (12) sG (12) sG (12) sP (32) sP (32) sP (32) sP (32) 8p (32) 8p (32) 8p (32) 8p (32)
Tg (12) Tp (32) Dg* (26) Dg (12) Dp (32) sG* (26) Tg (12) Tp (32) Dg* (26) Dp (32) Dg (12) 8g* (26) sG (12) sG (12) sP (32) sP (32) Dg (12) Dg (12) Dp (32) Dp (32) Dg (12) Dg (12) Dp (32) Dp (32) sG (12) sG (12) sP (32) sP (32)
Tg Tp Tg Tp Tg Tp Tg Tp Tg Tp Tg Tp Tg Tp Tg Tp
(12) (32) (12) (32) (12) (32) (12) (32) (12) (32) (12) (32) (12) (32) (12) (32)
The corresponding minor list is deduced from the preceding one by these exchanges: Tg $\to$ tG, Tp $\to$ tP, Sg $\to$ dG, Sp $\to$ dP, Dg $\leftrightarrow$ sG, Dg* $\leftrightarrow$ sG*, Dp $\leftrightarrow$ sP.
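As a small illustration of how the exchange rule acts on a single cadential set, the following Python sketch applies the renamings and swaps to a tuple of chart symbols. The dictionary and function names are hypothetical, and the sample tuple is only an illustrative combination of chart symbols, not a claim about a specific row of the list.

```python
# One-way renamings and two-way swaps, as read from the exchange rule above.
RENAME = {"Tg": "tG", "Tp": "tP", "Sg": "dG", "Sp": "dP"}
SWAPS = [("Dg", "sG"), ("Dg*", "sG*"), ("Dp", "sP")]

EXCHANGE = dict(RENAME)
for a, b in SWAPS:
    EXCHANGE[a] = b
    EXCHANGE[b] = a


def to_minor(cadential_set):
    """Translate a tuple of major chart symbols into minor chart symbols."""
    return tuple(EXCHANGE[chart] for chart in cadential_set)


print(to_minor(("Dg*", "Tg")))
```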
26.4
Self-addressed Cadences by Symmetries and Morphisms
Summary. More exotic cadences can be introduced by use of symmetries instead of chords. This example will be applied in the analysis of L. van Beethoven's "Hammerklavier" Sonata in section 28.2.
The preceding examples have dealt with different addresses, but they had in common that the addresses of the tonality sets and of the cadence parameter sets were the same in each case. One can also try to associate tonalities and cadences on different addresses. One example of such a procedure, which we shall use in the discussion of the modulation architecture in Beethoven's op. 106 (section 28.2), will be discussed hereafter. We start on the space $ExTop_0(PiMod_{12})$ of zero-addressed local compositions on pitch classes. To each such local composition Ch, the continuous injection in formula (24.13)

$$intex_{Z,B} : ExTop_0(PiMod_{12}) \to ExTop_{\mathbb{Z}_{12}}(PiMod_{12})$$

associates the 'self-addressed chord' $intex_{Z,B}(Ch)$ of all endomorphisms. The idea is that one may associate to this fractal chord a self-addressed chord defined by a specific sub-monoid of endomorphisms. Since $intex_{Z,B}$ is injective, it is a cadence map on any set of 'tonalities' $X \subset ExTop_0(PiMod_{12})$. So if X is any set of zero-addressed local compositions of pitch classes, playing the role of scales, we have a first example of a cadence via $intex_{Z,B}|_X$.

The example we are aiming at is the selection of a specific group of automorphisms in $intex_{Z,B}(Ch)$. For example, we may consider a group G of automorphisms of $PiMod_{\mathbb{Z}_{12}}$ and then consider the intersection (in the naive sense, not as functors) $intex_{Z,B,G}(Ch) = monex_Z(G) \cap intex_{Z,B}(Ch)$. For example, take the group $G = TI_0$ of inversions and transpositions on $\mathbb{Z}_{12}$ and then take the intersection $intex_{Z,B,TI_0}(Ch)$, i.e., the group of 'counterpoint pitch symmetries' on $PiMod_{\mathbb{Z}_{12}}$ which induce automorphisms of Ch. Or else take the full automorphism group $Aut(PiMod_{\mathbb{Z}_{12}}) \cong \overrightarrow{GL}(\mathbb{Z}_{12})$. Clearly, such a map is not injective on all of $ExTop_0(PiMod_{12})$ since the automorphisms of a zero-addressed chord are the same as those of its complement. If we do however restrict the cadence domain to the domain $X_6$, those objective zero-addressed local compositions with up to six elements, then the map

$$intex_{Z,B,\overrightarrow{GL}(\mathbb{Z}_{12})} : X_6 \to ExTop_{\mathbb{Z}_{12}}(PiMod_{12})$$

has a nice property of cadentiality:

Lemma 41 The map $intex_{Z,B,\overrightarrow{GL}(\mathbb{Z}_{12})}$ on the domain $X_6$ is cadential exactly in the three diminished seventh chords (of class number 37, pitch classes in semitone steps):

$$ITn_0 = \{1,4,7,10\}, \quad ITn_1 = e^1.ITn_0 = \{2,5,8,11\}, \quad ITn_2 = e^2.ITn_0 = \{0,3,6,9\}.$$
A proof results from the contemplation of the symmetry groups of chord classes in the class list of Appendix L.1. This means that the diminished seventh chords are the only ones that can be recovered from their symmetries! As with the self-addressed tonalities above, these maps have the drawback that their codomain values are somewhat above naive reality of score objects. But it is possible to give a more down-to-earth restatement of this map in terms of orbits!
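The statement of Lemma 41 can be explored by brute force. The following Python sketch is an illustration under explicit assumptions, not the proof via the class list of Appendix L.1: the symmetry group is taken to be the 48 affine maps $x \mapsto ux + t$ on $\mathbb{Z}_{12}$ with u invertible, $X_6$ is modeled as the non-empty sets of at most six pitch classes, and a chord is called cadential here if no other chord of $X_6$ has the same group of symmetries. The function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

N = 12
UNITS = [1, 5, 7, 11]  # invertible elements of Z12
AFFINE = [(u, t) for u in UNITS for t in range(N)]  # maps x -> u*x + t


def stabilizer(chord):
    """All affine maps sending the chord onto itself."""
    return frozenset(
        (u, t)
        for (u, t) in AFFINE
        if frozenset((u * x + t) % N for x in chord) == chord
    )


def cadential_chords(max_size=6):
    """Chords that are the only ones (up to max_size tones) with their symmetry group."""
    by_group = defaultdict(list)
    for k in range(1, max_size + 1):
        for tones in combinations(range(N), k):
            chord = frozenset(tones)
            by_group[stabilizer(chord)].append(chord)
    return sorted(sorted(chs[0]) for chs in by_group.values() if len(chs) == 1)


result = cadential_chords()
print(result)
```

Under these assumptions the search returns the three diminished seventh chords, since only they are invariant under the transposition by three semitones among chords with at most six tones.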
To this end, recall that the set $Dia^{(3)}$ of triadic interpretations of diatonic scales is the orbit of the counterpoint pitch group $TI_0$. Any subgroup $G \subset TI_0$ generates the partition orb(G) of G-orbits on $Dia^{(3)}$, an element of the set $Part(Dia^{(3)})$ of partitions of $Dia^{(3)}$. Let this set be the new parameter set $P = Part(Dia^{(3)})$ of our map, and take again $X = X_6$ as domain, i.e., $\kappa_{orb} : X_6 \to Part(Dia^{(3)})$. We define

$$\kappa_{orb}(Ch) = orb(intex_{Z,B,TI_0}(Ch))$$

with the obvious restatement of $\mathbb{Z}_{12}$-addressed elements as automorphisms. Then:

Lemma 42 The map $\kappa_{orb}$ is cadential exactly in the three diminished seventh chords.
We leave the proof as an exercise for the reader, but observe that there are exactly two orbits in the three cadential chords: We have
the others are transpositions thereof. These two orbits, the large W ("world") and the small W* ("antiworld") will play a crucial role in the analysis of the modulation architecture of Beethoven's op.106.
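The two orbits can be computed explicitly. The sketch below is an illustration under assumptions: the counterpoint pitch group is modeled as the 24 maps $x \mapsto \pm x + t$ on $\mathbb{Z}_{12}$, each triadic interpretation in $Dia^{(3)}$ is identified with its underlying diatonic scale, and scales are indexed by the number of semitones by which C-major is transposed. The names `orbit_partition`, `SCALES` and `TI` are hypothetical.

```python
N = 12
C_MAJOR = frozenset({0, 2, 4, 5, 7, 9, 11})
# SCALES[k] is the diatonic scale obtained by transposing C-major by k semitones.
SCALES = [frozenset((x + k) % N for x in C_MAJOR) for k in range(N)]
# Transpositions (u = 1) and inversions (u = 11, i.e. -1) on Z12.
TI = [(u, t) for u in (1, 11) for t in range(N)]


def act(g, tones):
    """Apply the map x -> u*x + t to a set of pitch classes."""
    u, t = g
    return frozenset((u * x + t) % N for x in tones)


def orbit_partition(chord):
    """Partition of the 12 scale indices into orbits of the chord's symmetry group."""
    group = [g for g in TI if act(g, chord) == chord]
    remaining = set(range(N))
    orbits = []
    while remaining:
        k = min(remaining)
        orbit = {SCALES.index(act(g, SCALES[k])) for g in group}
        orbits.append(sorted(orbit))
        remaining -= orbit
    return sorted(orbits, key=len)


for chord in ({1, 4, 7, 10}, {2, 5, 8, 11}, {0, 3, 6, 9}):
    print(sorted(chord), orbit_partition(frozenset(chord)))
```

For each of the three chords the partition consists of one orbit of four scales and one orbit of eight scales, matching the small and the large orbit mentioned in the text.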
26.5
Cadences for Just Intonation
Summary. For use in just modulation theory as it has been worked out by Hildegard Radl [429], we present minimal cadence sets in third-fifth and Pythagorean tuning.
We recall the context from just and "justest" tuning introduced in section 13.4.2.2. We are working in the EulerPlane space spanned by fifth and third axes, as well as in the Pythagorean subspace PythagorasLine spanned by the fifth axis alone, and related to the EulerPlane by the projection modulo third comma. In all the following cases, we shall only discuss the triadic interpretations of the C scale since the other scale interpretations are deduced from this one by transpositions, and the corresponding cadential sets are the transposed sets, so the names of degrees in cadential sets just vary by the scale symbol. We shall therefore omit this specification in order to keep everything readable.
26.5.1
Tonalities in Third-Fifth Intonation
Summary. The minimal cadential sets of degrees are described for major, (natural) minor, harmonic minor, and melodic minor tonalities.
The just triadic major scale interpretation $C^{(3)}$ has the seven degrees³ I, II, III, IV, V, VI, VII described in figure 13.12. The minimal cadential sets are these:

$$J_1 = (II),\; J_2 = (III, VI),\; J_3 = (III, IV),\; J_4 = (IV, V),\; J_5 = (VII),\; J_6 = (V, VI). \tag{26.2}$$
The just triadic natural minor tonality is the isomorphic image of the major tonality $C^{(3)}$ under the 180 degree rotation around the middle between tones C and g, see also figure 7.5 which shows the scale and the seven degrees. The minimal cadential sets follow from the transformation of the above major cadence sets (26.2) and are these⁴:

$$J_1 = (VII),\; J_2 = (III, VI),\; J_3 = (V, VI),\; J_4 = (IV, V),\; J_5 = (II),\; J_6 = (III, IV). \tag{26.3}$$
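The passage from the major list (26.2) to the natural minor list (26.3) can be replayed mechanically. The following Python sketch is only an illustration: it applies the degree correspondence of the rotation stated in footnote 4 (I to I, II to VII, III to VI, IV to V, V to IV, VI to III, VII to II) to each major cadential set and compares the result with the minor list; the variable names are hypothetical.

```python
# Degree correspondence induced by the 180 degree rotation (footnote 4).
ROTATION = {
    "I": "I", "II": "VII", "III": "VI", "IV": "V",
    "V": "IV", "VI": "III", "VII": "II",
}

# Minimal cadential sets J_1, ..., J_6 of (26.2) and (26.3).
MAJOR_SETS = [("II",), ("III", "VI"), ("III", "IV"), ("IV", "V"), ("VII",), ("V", "VI")]
MINOR_SETS = [("VII",), ("III", "VI"), ("V", "VI"), ("IV", "V"), ("II",), ("III", "IV")]


def transform(cadential_set):
    """Image of a set of major degrees under the rotation, as an unordered set."""
    return frozenset(ROTATION[d] for d in cadential_set)


images = [transform(s) for s in MAJOR_SETS]
matches = images == [frozenset(s) for s in MINOR_SETS]
print("minor list is the rotated major list:", matches)
```

The comparison is made on unordered sets, so that for example the image (VI, V) of $J_3 = (III, IV)$ is recognized as the minor set (V, VI).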
The just triadic harmonic minor scale interpretation $C_h^{(3)}$ works as follows: The harmonic minor scale and its degrees are shown in figure ?? The scale $C_h$ is selected out from the chromatic scale shown in figure 7.5, the degrees are uniquely determined by the tone names given in the chromatic scale. The minimal cadential sets are these:

$$J_1 = (III),\; J_2 = (II),\; J_3 = (VII),\; J_4 = (I, IV), \tag{26.4}$$
$$J_5 = (I, V),\; J_6 = (I, VI),\; J_7 = (IV, V),\; J_8 = (IV, VI),\; J_9 = (V, VI). \tag{26.5}$$
The just triadic melodic minor scale interpretation $C_m^{(3)}$ works as follows: The melodic minor scale and its degrees are shown in figure ?? The scale $C_m$ is selected out from the chromatic scale shown in figure 7.5, the degrees are uniquely determined by the tone names given in the chromatic scale. The minimal cadential sets are these:

$$J_1 = (I),\; J_2 = (II),\; J_3 = (III),\; J_4 = (IV, V),\; J_5 = (VI),\; J_6 = (VII). \tag{26.6}$$

26.5.2
Tonalities in Pythagorean Intonation
Summary. The minimal cadential sets of degrees are described for Pythagorean major, (just) minor, harmonic minor, and melodic minor tonalities.
The Pythagorean scale interpretations are defined by the comma projection $EulerPlane \to PythagorasLine$, applied to the synonymous just tonalities discussed above.
Exercise 64 Draw all the above major and minor scales in the Pythagorean tuning.
³ We do not consider the alternative degrees II* and VII* from figure 13.12.
⁴ The rotation transforms major degrees into minor degrees as follows: $I \mapsto I$, $II \mapsto VII$, $III \mapsto VI$, $IV \mapsto V$, $V \mapsto IV$, $VI \mapsto III$, $VII \mapsto II$.
Here are the minimal cadential sets:
Major:

$$J_1 = (VII),\; J_2 = (II, III),\; J_3 = (II, V),\; J_4 = (III, IV),\; J_5 = (IV, V). \tag{26.7}$$

Natural minor: here, the degrees are coupled by a reflection on PythagorasLine⁵ and accordingly yield these minimal cadential sets:

$$J_1 = (II),\; J_2 = (VII, VI),\; J_3 = (VII, IV),\; J_4 = (VI, V). \tag{26.8}$$

Harmonic minor:

$$J_1 = (III) \text{ and all pairs of any two different degrees} \neq III. \tag{26.9}$$

Melodic minor:

$$J_1 = (III),\; J_2 = (I, II),\; J_3 = (I, V),\; J_4 = (I, VII),\; J_5 = (II, IV),$$
$$J_6 = (II, VI),\; J_7 = (IV, V),\; J_8 = (IV, VII),\; J_9 = (V, VI),\; J_{10} = (VI, VII).$$

⁵ The symmetry $e^1.(-1)$ on the line transforms major degrees into minor degrees as follows: $I \mapsto I$, $II \mapsto VII$, $III \mapsto VI$, $IV \mapsto V$, $V \mapsto IV$, $VI \mapsto III$, $VII \mapsto II$.
Chapter 27
Modulation
Denn das Wesentliche an einer Modulation ist nicht das Ziel, sondern der Weg. (For the essential thing about a modulation is not the goal, but the way.)
Arnold Schönberg [478]
Summary. This chapter deals with the central issue of modulation between two given tonalities. It involves explicit models of tonalities, of cadences and, even more crucial, of the transition process from one tonality to its successor. The present model involves the analogy to elementary particle physics: Modulation is viewed as a 'force interaction' between two 'tonality particles' which is mediated by a 'modulation quantum'. The model allows for a complete calculation of fundamental degrees of modulation in congruence with A. Schönberg's harmony [478]. The model is realized for diatonic tonalities in 12-tempered and just tuning. It has been extended to all 7-element scales in 12-tempered tuning and to a number of scales in just tuning. The 12-tempered extension reveals a privileged position of the diatonic scale with regard to this modulation theory. We conclude the chapter with a discussion of the basic role of modulation models and their application to optimize harmonic paths in the sense of section 27.2.
In the historical development of mathematical music theory, the modulation models were the initial major topic. The first results were traced in [326, 327]. The original problem was to understand the role of the diminished seventh chord in the modulation strategy of Beethoven's "Hammerklavier" sonata op. 106. It turned out that the inner symmetries of that chord seemed to govern the possible modulatory transitions and their specific stamp. This entailed the idea that modulation could be described by use of 'modulation symmetries' between start and target tonality, and that the pivotal chords (those which enforce the switch between the involved tonalities) could be extracted from a set of 'modulatory tones' which admits such modulation symmetries as inner symmetries. The first (not so satisfactory) attempts in fact focused on the diminished seventh chords, but then, the definition of such modulatory sets turned out to work perfectly when coupled to cadential sets for the target tonalities instead of the rather artificially chosen diminished seventh chord.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
Interestingly enough, no essential extensions of those early methods have been proposed to date. Which does not mean that the results have not been greatly extended to many general scales and to just tuning. But the general setup of the involved concepts is still the old one. This is why we shall start this subject with a general view on the modulation methodology thus far developed in the following section 27.1.3. This will also reveal a fascinating perspective towards a "grand unification" of harmony and counterpoint as it has been suggested by Noll's definition of Riemann consonances and dissonances [400]. It is important to review the crucial role of the very concept of modulation in music. The medieval concept of "modulatio", as it is encountered in Augustinus' famous early definition musica est ars bene modulandi, has the quite general meaning of rational organization of tones and rhythms or, in the later medieval tradition, and more generally, musical composition. The modern meaning of a tonal modulation is a specialization of the original concept and was only introduced in the 18th century (see article "modulatio" in [457]). The following approach essentially relates to tonal modulation, but the mathematical formalism and related canonical generalizations suggest a broader conceptualization. This means that the idea of a tonal modulation is in fact a special case of a potential theory of transitions between instances of a particular musical structure. Transition between tonalities can be paralleled by transitions between motives, rhythms, global objects, and by transitions between more sophisticated representations of tonalities (such as the self-addressed setup), motives, rhythms, and global objects. So the theory would have to describe transition mechanisms and their expressions in terms of compositional instances of practical use. 
In this spirit, and it is the mathematical universality which preconizes the development, modulation should be rephrased in the understanding of medieval tradition, i.e., as a rational construction of music in all its structural streams. Unfortunately, this latter perspective is by no means a contemporary vein of musicology or even computer music science. But it turns out that any future system of music composition technology should deal with the general problem of transitional structures in all musical layers, such as harmony (in its most general setup), motivic, and rhythmical layout-including their interactions on local and global levels.
27.1
Modeling Modulation by Particle Interaction
Summary. This section introduces the motivation for developing modulation models. These models are designed to convey a harmonic transformation process. We discuss the 'particle interaction' model which allows us to simulate A. Schonberg's fundamental modulation degrees wherever he described direct modulations. The model is thoroughly formalized for 12-tempered and just tuning. The corresponding modulation theorems are proven and commented.
To begin with, this typical case of a model in musicology should be understood with care: It is not a dogmatic prescription of what a modulation should be, and a fortiori not a declaration of what kind of tonalities, cadences and modulation mechanisms should be englobed. Everyone can modulate as she likes. Modulation is even (as quite often in popular music) processed without any reflection: In the commercial shortcuts, after a song unit (following the four-period blues scheme, for example) a song is suddenly transposed by a number of semitone
steps and restarts without any preparation or transitory passage on a new shifted tonality. In our approach, we evidently are more interested in the description of possible relations between adjacent tonalities in musical compositions, and in the question of how such relations should be expressed explicitly on the material level of harmonic syntax.
27.1.1
Models and the Anthropic Principle
Summary. We sketch the role of mathematical models as a means to investigate the anthropic principle of music history, i.e., the historical selection of distinguished solutions of theoretical and practical approaches to musical rule systems within a variety of fictitious alternatives.
The following modulation model is a first and very simple construction which however yields very good results and has the typical property of mathematical models in music: It enables a quasi-automatic generalization to situations where the classical music theory for which the model was constructed has no answer. In the case of modulation, which originally was modeled for major scales, the generalization extends to arbitrary scales. The property of extensibility of a mathematical model relocates the existing music theory (which it models) in a field of potential, fictitious theories. This puts the historically grown facticity into a relation with the potential 'worlds of music'. The purely historic justification of existing modulation rules, for example, does not give us reasons for this choice, and this makes the purely historical approach a poor knowledge basis: We know that something is the case, but not why, and why other possibilities are not. In contrast, the mathematical approach gives us a field of potential theories wherein the actual one can be asked for its possible special properties with respect to non-existing variants. This differentia specifica is a remarkable advantage of mathematical methodology against the historical approach of musicology which cannot, in its poor intellectual performance, embed the facts in a variety of fictions and thereby understand the selection of what is against what is not. This evokes Leibniz's idea that the existing world is the best of all possible worlds: Is the existing music theory the best possible choice? Or is it at least a distinguished one? In cosmology, this idea has been restated under the title of the "anthropic principle" [40].
It says that the physical laws are the best possible for the existence of humans, more precisely (and less radically), it is the theorem stating that a slight variation of the fundamental constants, such as the gravitational constant, or the electric charge of electrons and protons, would make any higher molecular complexity as is necessary for carbon-based biochemistry impossible.
27.1.2
Classical Motivation and Heuristics
Summary. We describe Schönberg's tripartite modulation scheme and the task of formalization.
In his classical text on harmony [478], Schönberg has described tonal modulation from one tonality to another as a three-part process of neutralization, turning point to the new tonality, and cadential confirmation, see figure 27.1. The central statement is the list of modulation degrees in the middle turning point part.

Figure 27.1: The three parts of Schönberg's modulation scheme in the case of a modulation from C-major to F-major. Panel labels: Modulation Degrees ($II_F$, $IV_F$, $VII_F$); New Tonality Cadence Degrees ($II_F$ & $V_F$).

A mathematical model must start with a precise restatement of the musicological terms and facts in a rigid mathematical terminology. So we have to answer the questions
1. What is the set of admitted tonalities?
2. What is a degree?
3. What is a cadence?
4. What is the underlying modulation mechanism?
5. How can the modulation degrees be deduced from a solution of the preceding questions?

To begin with, we want to work in the context of well-tempered tuning, and we shall only look for pitch classes, i.e., for zero-addressed points in $PiMod_{12}$. So our pitch class set is identified with the cyclic group $\mathbb{Z}_{12}$. We consider twelve possible diatonic scales, $C = \{0, 2, 4, 5, 7, 9, 11\}$ and its eleven translates $F, B_\flat, E_\flat, \ldots, A, D, G$ in the cycle of fourths. To define tonalities, we then consider the set

$$Dia^{(3)} = \{C^{(3)}, F^{(3)}, B_\flat^{(3)}, E_\flat^{(3)}, \ldots, A^{(3)}, D^{(3)}, G^{(3)}\}$$

of triadic interpretations. The degrees in tonality $T^{(3)}$ are identified with the triadic charts $X_T$, X = I, II, ..., VII, everything being visualized by means of the nerve, i.e., the harmonic strip of a triadic interpretation. This means that a seventh chord, e.g., {d, f, a, c} in $C^{(3)}$, is not simply $II_C$, but must be 'interpreted' as a union $II_C \cup IV_C$ of degrees in C. And the chord {d, f♯, a} is not a second degree in C with "sharpened third"; degrees are strictly triadic degrees within the respective tonalities!

We should recall the Yoneda philosophy from section 9.3 in this context. Accordingly, an object X is identified by the functor of its 'perspectives', i.e., by the system of all morphisms $f : Y \to X$ in the given category. In the category Glob of global compositions, where we are presently working, our perspective is the canonical morphism $X^{(3)} \to X$ onto the basic scale X. Another perspective, such as a modal interpretation (see section 13.4.2), or a combined perspective of triads and modal tone, would yield a variant of the present approach.
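The strict notion of degree used here is easy to make concrete. The Python sketch below is an illustration with hypothetical names: it builds the seven triadic degrees of the C-major scale by stacking scale thirds, and then tries to interpret a chord as the union of the degrees it contains. Pitch classes are numbered in semitones from c = 0, so that {d, f, a, c} is {2, 5, 9, 0} and {d, f♯, a} is {2, 6, 9}.

```python
C_MAJOR = [0, 2, 4, 5, 7, 9, 11]
NUMERALS = ["I", "II", "III", "IV", "V", "VI", "VII"]


def triadic_degrees(scale):
    """Degrees I..VII: every second scale tone, three tones per degree."""
    n = len(scale)
    return {
        NUMERALS[i]: frozenset(scale[(i + 2 * k) % n] for k in range(3))
        for i in range(n)
    }


def as_union_of_degrees(chord, degrees):
    """Names of the degrees contained in the chord if they cover it, else None."""
    chord = frozenset(chord)
    inside = [name for name, d in degrees.items() if d <= chord]
    covered = frozenset().union(*(degrees[name] for name in inside))
    return inside if covered == chord else None


degrees = triadic_degrees(C_MAJOR)
print(as_union_of_degrees({2, 5, 9, 0}, degrees))  # seventh chord {d, f, a, c}
print(as_union_of_degrees({2, 6, 9}, degrees))     # {d, f#, a}: no reading in C
```

The first chord is recovered as the union of the degrees II and IV, while the second one admits no interpretation by degrees of C, in agreement with the refusal of alterations.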
Remark 15 For the traditionally educated in harmony, this self-control is hard to understand. It seems that one can no longer understand everything. But understanding everything amounts to understanding nothing. We had already seen in section 13.4.2.1 that Riemann's attempt to define tonal functions on all chords had to fail. And the technique of alterations is questionable in harmony since it does not lead to contradictions like function theory, but produces meaningless statements. We know from Mason's theorem in section 7.5.1 that two sharps and three flats suffice to interpret any chord in a seven-note scale as an alteration of a chord in any other seven-note scale! Mason's result tells us that with a sufficiently high number of alteration signs, nothing is in fact said if one interprets a chord in this way as an alteration of a chord in a determined scale.

With these definitions, and with the concept of a cadence being related to minimal sets of degrees (see definition 87 in section 26.2.1) defining uniquely the surrounding tonality, the mechanism of modulation from tonality $S^{(3)}$ to tonality $T^{(3)}$ is defined by a symmetry which transforms $S^{(3)}$ into $T^{(3)}$. As in particle physics, we will materialize such a transformation force by a modulation quantum M. By definition, this is a zero-addressed local composition in $PiMod_{12}$, together with an interpretation $M^{(3)}$ by triadic chords such that the modulation symmetry is an inner symmetry of M, plus two more technical properties. The point of this modulation quantum is that it is 'musical matter' (much like the physical modulation quanta are materialized forces) and therefore preconizes a set of degrees to be played in the middle part of Schönberg's process shown in figure 27.1. As to the technical properties of $M^{(3)}$, the first one states that the intersection $M \cap T$ is rigid, i.e., the group of symmetries $Sym(M \cap T)$ is trivial. It guarantees that the modulation symmetry $S^{(3)} \to T^{(3)}$ is uniquely determined. The second property requires that the modulation quantum be a minimal set of pitch classes which satisfies the other properties. This is an economical condition designed to exclude superfluous tones.

The precise definitions will be given below in section 27.1.4. The central statement of this modulation model is that such modulation quanta exist for all pairs of tonalities, and that they yield exactly the modulation degrees described by Schönberg [478]. Figure 27.2 shows such a quantum $M^{(3)}$ for the modulation from C to B♭. We recognize the two harmonic strips (light) as well as the connecting quantum (dark) in its nerve representation. The modulation degrees are exactly those vertexes of $M^{(3)}$ which lie in the harmonic band (four degrees in the example of figure 27.2) of the target tonality. The statements of this model and its background mechanism, viz. the inherent symmetry transformation, will also be tested on the repertoire of classical music, such as Beethoven's op. 106, for example (see section 28.2). For the latter, it is possible to obtain a deeper understanding of its well-known intricate and complex modulation plan, switching back and forth between a "world" around the main tonality B♭-major, and the "anti-world" around tonality B-minor (theses of Erwin Ratz and Jürgen Uhde, see section 28.2.2).

The above modulation model illustrates perfectly the "anthropic principle" subject which emerges from mathematical modeling. The model will be extended to any seven-tone scale and corresponding harmonic bands, see section 27.1.5, and one can thereby locate the case of the major scale in a world of potential variants. The main result in this variational context is that the major scale occupies a singular position in that its modulation arsenal turns out to be the minimal possible variety among all scales which admit modulations between any two tonalities. So the historical dominance of the major scale receives a supplementary justification in a perfectly systematic framework, and one which the purely historical approach can a priori not provide.

Figure 27.2: Visualization of a modulation quantum $M^{(3)}$ for the transition from C to E♭. The nerve representation of $M^{(3)}$ shows the connecting middle structure in dark color, in particular it contains two five-dimensional simplexes (symbolized by horizontal bars) above and below a three-dimensional simplex (the tetrahedron).
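The cadence ingredient of the model can be computed directly in the 12-tempered case. The following Python sketch is an illustration under an assumption about definition 87, which is not reproduced in this section: a set of degrees of C is taken to be cadential if C is the only one of the twelve triadic interpretations in $Dia^{(3)}$ having all these triads among its degrees, and minimal if no proper subset is cadential. All names in the code are hypothetical.

```python
from itertools import combinations

N = 12
C_MAJOR = [0, 2, 4, 5, 7, 9, 11]
NUMERALS = ["I", "II", "III", "IV", "V", "VI", "VII"]


def degree_list(k):
    """Triadic degrees I..VII of the diatonic scale transposed by k semitones."""
    scale = [(x + k) % N for x in C_MAJOR]
    return [frozenset(scale[(i + 2 * j) % 7] for j in range(3)) for i in range(7)]


TONALITIES = {k: degree_list(k) for k in range(N)}


def is_cadential(chords, tonic=0):
    """True if `tonic` is the only tonality having all chords as degrees."""
    hosts = [k for k, degs in TONALITIES.items() if all(c in degs for c in chords)]
    return hosts == [tonic]


def minimal_cadential_sets():
    """Search degree sets of C by increasing size, skipping supersets of hits."""
    degs = TONALITIES[0]
    found = []
    for size in range(1, 8):
        for idx in combinations(range(7), size):
            if any(set(m) <= set(idx) for m in found):
                continue
            if is_cadential([degs[i] for i in idx]):
                found.append(idx)
    return [tuple(NUMERALS[i] for i in idx) for idx in found]


cadences = minimal_cadential_sets()
print(cadences)
```

Under this reading the search yields five minimal sets, namely (VII), (II, III), (II, V), (III, IV) and (IV, V), the same five sets that are listed for the Pythagorean major tonality in section 26.5.2.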
27.1.3
The General Background
Summary. The general situation is sketched in order to establish a connection to the topos-theoretic perspective and grand unification options.
In the preceding classical model of modulation, the space form was $PiMod_{12}$, and we were concerned with a collection $X = Dia^{(3)}$ of triadic interpretations $S^{(3)}$ of scales S, i.e., zero-addressed local compositions in $PiMod_{12}$. Moreover, these interpretations were all isomorphic with each other under symmetries which are given on the space $PiMod_{12}$; more precisely, we considered a full orbit $X = G.C^{(3)}$ under the action of a group G of symmetries on the space form. Further, the cadence was chosen to be the one with minimal cadence sets (according to definition 87). Finally, the modulation quantum $M^{(3)}$ was a triadic interpretation of a zero-addressed local composition in $PiMod_{12}$. We have not yet proven that such quanta exist, nor when they do so in the specific setups. We shall discuss this topic in detail later in sections 27.1.4, 27.1.6. For the time being, we are only interested in an adequate generalization of this conceptual setup in terms of the theory of global compositions. We should again stress that this generalization has not yet been completed by corresponding theorems as they exist in some classical cases. The evident drawback of the classical model is the restriction to isomorphic interpretations. In general, transitions between instances of a type of musical structures cannot be
restricted to isomorphic instances; proper deformations and specializations should be possible. In particular, tonal modulation should also work between non-isomorphic tonalities, modulations between non-isomorphic motives should be the common case, and modulations between completely different rhythms are just what any interesting composition should encompass after a century of jazz. We shall come back to rhythmical modulations between isomorphic rhythms in section 28.3.2; this example uses the present theory in the onset dimension. It is remarkable that modulation between non-isomorphic tonalities has not yet been treated in a systematic way in traditional music theory. The reason may be that in traditional music cultures, the selection of tonalities is a quite stable preference which identifies the culture, so that a culture would contradict its own message if it allowed the isomorphism class of its tonalities to change.

Let us now set up a more general approach to modulation. We first have to identify the basic objects. The obvious generalization for tonalities and modulation quanta is that they are interpretations of local compositions at an address $A$ in a form space $F$. We first have to consider a set $X$ of interpretations. Such an interpretation $S^I$ maps epi onto its local support $S$, which means that we have a composed arrow (not a mono, in general!)

$$S^I \longrightarrow S \rightarrowtail @A \times F \tag{27.1}$$
of global and local, not necessarily objective, compositions. We next give a morphism between such global compositions (27.2) which should generalize the isomorphism in the classical case. But we cannot consider 'abstract' global morphisms since we also want to apply these morphisms to the modulation quantum. Therefore, we are given a morphism of the underlying local compositions $f/1 : S \to T$ which is, by definition, induced from a "symmetry" $h : F \to F$ of the form space. Notice that it is not necessary to suppose that $h$ is an isomorphism. So we are looking for global morphisms $f^{\iota}/1 : S^I \to T^J$ which extend the local morphism $f/1$ to the chart configuration, i.e., the diagram

$$\begin{array}{ccccc}
S^I & \longrightarrow & S & \longrightarrow & @A \times F \\
\downarrow {\scriptstyle f^{\iota}/1} & & \downarrow {\scriptstyle f/1} & & \downarrow {\scriptstyle 1 \times h} \\
T^J & \longrightarrow & T & \longrightarrow & @A \times F
\end{array} \tag{27.3}$$
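with the evident horizontal arrows commutes. Next, we suppose that we are given an interpreted cadence composition, also $A$-addressed in $F$, for each tonality $S^I$ (27.4), which lives in $S^I$, i.e., we have a factorization of interpretations of local compositions in $F$:

[Diagram (27.5): commutative diagram whose top row is the interpreted cadence, its support $C_{S^I}$, and $@A \times F$, and whose bottom row is $S^I \to S \to @A \times F$; the vertical arrows map the cadence interpretation into $S^I$ and its support into $S$, and the right-hand vertical arrow is the identity on $@A \times F$.]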
CHAPTER 27. MODULATION
and this means that not only is the support $C_{S^I}$ a subcomposition of $S$, but the degrees of the cadence are also associated with degrees of this tonality via this factorization map. In the above classical case, this means that the triadic degrees of the cadence set must be triadic degrees in the encompassing tonality. Cadentiality in this situation means that whenever we have two such factorizations of the interpreted cadence through $S^I$, $T^J$ in the tonality collection $X$, then we must have $S^I = T^J$. The last of our objects is the modulation quantum. This is again an $A$-addressed interpretation

$$M^L \longrightarrow M \rightarrowtail @A \times F \tag{27.6}$$

and we ask that the modulation morphism be an endomorphism of this quantum; more precisely, the diagram

$$\begin{array}{ccccc}
M^L & \longrightarrow & M & \longrightarrow & @A \times F \\
\downarrow {\scriptstyle m^{\lambda}/1} & & \downarrow {\scriptstyle m} & & \downarrow {\scriptstyle 1 \times h} \\
M^L & \longrightarrow & M & \longrightarrow & @A \times F
\end{array} \tag{27.7}$$
commutes under a suitable morphism $m^{\lambda}/1$, a condition which reduces to the condition that we find a chart map $\lambda$ which is compatible with the restriction endomorphism $m$ of the ambient 'symmetry' transformation $1 \times h$. So in the setup of the harmonic topologies, we just ask that $M \in Ext_A(h)$, and that this local condition be compatible with the charts of $M$'s interpretation. The classical condition that the target tonality $T^J$ as well as the modulation quantum should contain the cadence set now means that we have a commutative diagram

[Diagram (27.8): commutative square relating the interpreted cadence, $M^L$, $T^J$, and the intersection $T^J \cap M^L$.]
where the intersection¹ $T^J \cap M^L$ is the fiber product $T^J \times_{@A \times F} M^L$. The rigidity condition now states that the only invertible symmetry $k : F \to F$ which induces an endomorphism of the intersection $T^J \cap M^L$ is the identity. This condition may be enriched by additional conditions, for example, that $k$ must also induce an isomorphism which transforms interpretations within $X$ into each other. In the classical definition of the modulation quantum $M^{(3)}$, there is a last requirement, i.e., that the intersection $T \cap M$ be covered by the degrees of $T^{(3)}$ which are contained in $M$. In the general setup this reads as follows: For each subset $J^*$ of the given atlas $J$ of $T^J$, we have a sub-composition $T^{J^*} \rightarrowtail T^J$ and therefore a corresponding fiber product diagram

[Diagram (27.9): fiber product square of $T^{J^*}$ and $M^L$ with intersection $T^{J^*} \cap M^L$.]

Take the unique maximal sub-composition $T^{J^*_{max}}$ such that the projection $T^{J^*} \cap M^L \to T^{J^*}$ has a section $\sigma : T^{J^*_{max}} \to T^{J^*} \cap M^L$. (The empty sub-composition evidently has this property, so existence is evident.) Intuitively, this means that we take all charts in $T$ which also live in $M$. Then we ask that the canonical composed map $T^{J^*_{max}} \xrightarrow{\sigma} T^{J^*} \cap M^L \longrightarrow T \cap M$ is an epimorphism.

The philosophy of this approach is that a modulation looks like a perspective from the old tonality into the new one, and this is quite the same as the approach of Noll to Riemann's concept of consonance and dissonance (see section 30.2.1), which views this qualification as an expression of the morphisms between two chords, i.e., as a perspective view of relative consonance/dissonance of one chord with respect to another. The point in modulation theory is that we do not consider the perspectives (morphisms) as an ultimate structure, but want them to be an expression of a materialization (the quantum) in order to grasp the material trace of 'modulation forces'.

¹ Observe that even when the factors are objective, their intersection will not be so in general!
27.1.4 The Well-Tempered Case
Summary. The 12-tempered modulation theorem is proven for diatonic scales. We visualize the nerves of the modulation quanta and compare to A. Schönberg's lists in [478].
This technical section follows the general modulation theorem as exposed in Muzzulini's paper [390]. This modulation theorem generalizes the earlier approaches in [327, 328, 340]. Muzzulini considers a seven-element scale $S$ in $PiMod_{12}$ and its interpretation $S^{(3)}$ by seven three-element degrees $X_S \subset S$ defined by taking all subsets of the form $X_S = \{s_i, s_{i+2}, s_{i+4}\}$ with indexes modulo 7 and referring to any indexation $S = \{s_0, s_1, \ldots, s_6\}$ of the $S$-elements along the circle line. The naming of these degrees is not relevant for the interpretation; however, one usually enumerates the degrees as with ordinary diatonic scales, i.e., as I, II, ..., VII, although the naming is a function of the indexation of $S$. By construction, any such triadic interpretation has a Möbius strip as its geometric nerve. From the classification table in appendix L.1, we see that there are 66 translation classes, 38 translation-inversion classes, and 25 affine classes of seven-element scales. The first two groups also conserve interpretations, and their orbits are therefore also orbits of triadic interpretations. All minimal cadential sets have been calculated by computer for the 66 translation classes (this can be done on the basis of the 38 translation-inversion classes). The numbers of these minimal cadential sets are indicated in the third column of the table in appendix N.1.1. We already know the five minimal cadential sets for major scales (No. 38.1 in the classification appendix L.1) from the cadence discussion in section 26.2.1. The maximal number of cadential sets appears for harmonic minor scales (and only here): Here, each pair of different triadic degrees is a minimal cadential set, so we have 21 such sets. Besides the major diatonic scale, there are two other scale orbits with the same number of minimal cadence sets as the major scale: No. 52 and No. 62. The melodic minor scale has 15 minimal cadence sets. Call a scale $S$ rigid iff its symmetry group $Sym(S)$ is trivial. If an inner symmetry of $S$ induces an automorphism of the triadic interpretation $S^{(3)}$, we say that it is an inner symmetry
of $S^{(3)}$. If the group of inner symmetries of $S^{(3)}$ is trivial, $S^{(3)}$ is also called rigid. The following lemma from [390] is clear:

Lemma 43 No translation of a seven-element scale is an inner symmetry, and any inversion in $Sym(S)$ is an inner symmetry of $S^{(3)}$. The only possible non-trivial inner symmetries of $S^{(3)}$ are inversions. In particular, $S^{(3)}$ is rigid iff $Sym(S)$ contains no inversions.
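The construction of the triadic degrees and the statement of Lemma 43 can be checked by brute force on a single scale. The following Python sketch is only an illustration and not Muzzulini's program: it assumes the usual encoding of pitch classes as integers modulo 12 with C = 0, writes a symmetry $x \mapsto ux + t$ as a hypothetical pair `(u, t)`, and uses C major as the test scale.

```python
from itertools import product

def triadic_degrees(scale):
    """Degrees {s_i, s_(i+2), s_(i+4)}, indexes modulo 7, along the circle."""
    s = sorted(scale)
    return {frozenset((s[i], s[(i + 2) % 7], s[(i + 4) % 7])) for i in range(7)}

def apply(sym, tones):
    """Apply the affine map x -> u*x + t (mod 12) to a set of pitch classes."""
    u, t = sym
    return frozenset((u * x + t) % 12 for x in tones)

def inner_symmetries(scale):
    """Affine maps on Z_12 that fix the scale and its set of triadic degrees."""
    scale = frozenset(scale)
    degrees = triadic_degrees(scale)
    result = []
    for u, t in product((1, 5, 7, 11), range(12)):
        if apply((u, t), scale) != scale:
            continue  # not a symmetry of the scale itself
        if {apply((u, t), d) for d in degrees} == degrees:
            result.append((u, t))
    return result

C_MAJOR = {0, 2, 4, 5, 7, 9, 11}  # assumed encoding: C = 0, D = 2, ...
print(inner_symmetries(C_MAJOR))
```

For C major the search returns the identity and the single inversion $x \mapsto 4 - x$, in line with the lemma: no non-trivial translation appears, and the only non-trivial inner symmetry is an inversion.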
Given a pair $S^{(3)}, T^{(3)}$ of different triadic interpretations in the same translation orbit $T_{12}.X^{(3)}$, a modulator is a symmetry $g$ which induces an isomorphism $g : S^{(3)} \xrightarrow{\sim} T^{(3)}$. A modulator must be of type $g = e^t.h$, where $h$ is an inner symmetry of $S^{(3)}$. By the preceding lemma, $h$ must be an inversion or the identity. Hence a modulator must be a translation or an inversion.

Definition 88 Given a pair $S^{(3)}, T^{(3)}$ of different triadic interpretations in the same translation orbit $T_{12}.X^{(3)}$, a modulation from $S^{(3)}$ to $T^{(3)}$ is a pair $(g, \mu)$ consisting of a modulator $g$ for $S^{(3)}, T^{(3)}$, together with a minimal cadential set $\mu$ for the target tonality $T^{(3)}$.
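As a small illustration of the modulator concept, the sketch below enumerates all translations and inversions of $\mathbb{Z}_{12}$ that carry the triadic interpretation of C major onto that of G major. The integer encoding of pitch classes (C = 0) and the pair notation `(u, t)` for $x \mapsto ux + t$ are assumptions made for this example only.

```python
def triadic_degrees(scale):
    """Degrees {s_i, s_(i+2), s_(i+4)}, indexes modulo 7, along the circle."""
    s = sorted(scale)
    return {frozenset((s[i], s[(i + 2) % 7], s[(i + 4) % 7])) for i in range(7)}

def apply(sym, tones):
    """Apply x -> u*x + t (mod 12) to a set of pitch classes."""
    u, t = sym
    return frozenset((u * x + t) % 12 for x in tones)

def modulators(source, target):
    """Translations (u = 1) and inversions (u = 11) mapping S(3) onto T(3)."""
    src_deg = triadic_degrees(source)
    tgt_deg = triadic_degrees(target)
    found = []
    for u in (1, 11):
        for t in range(12):
            if {apply((u, t), d) for d in src_deg} == tgt_deg:
                found.append((u, t))
    return found

C_MAJOR = frozenset({0, 2, 4, 5, 7, 9, 11})
G_MAJOR = apply((1, 7), C_MAJOR)  # translate by a fifth, p = 7
print(modulators(C_MAJOR, G_MAJOR))
```

The two modulators found are the translation $e^7$ and the inversion $x \mapsto 11 - x$, which is the translation $e^7$ composed with the inner symmetry $x \mapsto 4 - x$ of C major.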
Given a modulation $(g, \mu)$ from $S^{(3)}$ to $T^{(3)}$, the axiomatic properties of a modulation quantum $Q \subset PiMod_{12}$ are these:

Property 1
(1) The modulator $g$ is an element of $Sym(Q)$.
(2)$_\mu$ All triads in $\mu$ are subsets of $Q$.
(3) The intersection $TI_{12} \cap Sym(T \cap Q)$ is trivial, and $T \cap Q$ is covered by triads of $T^{(3)}$.
(4) The quantum $Q$ is a minimal set with properties (1) and (2)$_\mu$.
The main problem of this construction is the existence of a modulation quantum. A modulation is said to be quantized iff it has at least one modulation quantum. The modulation theorem below will give an exhaustive answer to this problem. If a modulation $(g, \mu)$ has a quantum $Q$, denote by $(Q \cap T)^{(3)}$ the interpretation of the trace of $Q$ in $T$ by the triads of $T$ which cover $T \cap Q$. The interpretation of $Q$ by triads of $S^{(3)}$ and $T^{(3)}$ is denoted by $Q^{(3)}$. The degrees of $(Q \cap T)^{(3)}$ are called the pivots of this quantized modulation. Observe that, by condition (3) of property 1, the modulator is uniquely determined by the quantum. In the following modulation theorem [390], we distinguish the 38 $TI_{12}$-orbits of seven-element scales.

Theorem 30 In the following, let $0 \neq p \in \mathbb{Z}_{12}$, and $T^{(3)} = e^p.S^{(3)}$.
• Scales with rigid triadic interpretation: For each scale $S$ of the 28 $TI_{12}$-orbits with rigid interpretations, and for arbitrary $p$, there exists at least one quantized modulation from $S^{(3)}$ to $T^{(3)}$. The maximum of 226 quantized modulations occurs for the orbit of scale type No. 54.1 (the harmonic minor scale), while the minimum of 53 quantized modulations occurs for the orbit of scale type No. 41.1.
Figure 27.3: Visualization of all nerves of interpreted modulation quanta $Q^{(3)}$ for modulations on $Dia^{(3)}$. Certain nerves are higher than three-dimensional. There, we have symbolized the higher-dimensional simplexes by bars which connect all vertexes of the related lower-dimensional simplexes. For example, the bars of the quantum for 3/2, 3/4 symbolize a five-dimensional simplex connecting the vertexes of two triangular surfaces, each .
• Scales with non-rigid triadic interpretations: For the orbits of scale types No. 52 and No. 55, there exist quantized modulations except for $p = 1$ and $p = 11$. For the orbits of scale types No. 38 and No. 62, there exist quantized modulations except for $p = 5$ and $p = 7$. The six remaining orbits have at least one quantized modulation for each $p$. The maximum of 114 quantized modulations occurs at the orbit of scale type No. 47.1 (melodic minor scale). Among the scales with quantized modulations for each $p$, the minimum of 26 occurs at the orbit of scale type No. 38.1 (diatonic major scale).

Idea of proof. The proof [390] used a computer program to calculate quanta and pivots as follows: For a given pair $S^{(3)}, T^{(3)} = e^p.S^{(3)}$, choose a corresponding modulation $(g, \mu)$. Then there is exactly one candidate $Q$ for a quantum which fulfills properties (1), (2)$_\mu$, and (4) of property 1. In fact, $Q$ is the orbit of the tones of $\mu$ under the group of symmetries generated by the modulator. This candidate is rejected if property (3) does not hold, and we may proceed with a new modulation.
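The procedure described in the idea of proof can be sketched in a few lines of Python. This is not the program used in [390]; it is a minimal illustration under stated assumptions: pitch classes are encoded as integers modulo 12 (C = 0), a symmetry $x \mapsto ux + t$ is written as a hypothetical pair `(u, t)`, the example modulates from C major to G major with the inversion $x \mapsto 11 - x$ as modulator, and the single degree VII of G major is assumed to form a minimal cadential set.

```python
def triadic_degrees(scale):
    """Degrees {s_i, s_(i+2), s_(i+4)}, indexes modulo 7, along the circle."""
    s = sorted(scale)
    return {frozenset((s[i], s[(i + 2) % 7], s[(i + 4) % 7])) for i in range(7)}

def apply(sym, tones):
    """Apply x -> u*x + t (mod 12) to a set of pitch classes."""
    u, t = sym
    return frozenset((u * x + t) % 12 for x in tones)

def candidate_quantum(modulator, cadence_set):
    """Orbit of the tones of the cadence set under the group generated by the modulator."""
    tones = frozenset().union(*cadence_set)
    seen, current = set(tones), tones
    while True:
        current = apply(modulator, current)
        if current <= seen:          # no new tones: the orbit is complete
            return frozenset(seen)
        seen |= current

def check_property_3(quantum, target):
    """Trace T ∩ Q: trivial symmetry under translations/inversions, covered by pivots."""
    trace = frozenset(quantum) & frozenset(target)
    pivots = [d for d in triadic_degrees(target) if d <= quantum]
    covered = frozenset().union(*pivots) == trace
    symmetries = [(u, t) for u in (1, 11) for t in range(12)
                  if apply((u, t), trace) == trace]
    return (covered and symmetries == [(1, 0)]), pivots

G_MAJOR = frozenset({0, 2, 4, 6, 7, 9, 11})
MU = [frozenset({6, 9, 0})]           # degree VII of G major (assumed cadential)
Q = candidate_quantum((11, 11), MU)   # modulator x -> 11 - x
ok, pivots = check_property_3(Q, G_MAJOR)
print(sorted(Q), ok, sorted(sorted(p) for p in pivots))
```

Properties (1), (2)$_\mu$, and (4) hold by construction of the orbit, so only property (3) needs an explicit test, which is the step where a candidate may be rejected.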
Figure 27.3 shows the nerves of all quanta for modulations in the orbit $Dia^{(3)}$ of diatonic major scales. For lists of modulation data, such as quanta, cadence sets, and pivot sets, for different scale classes, please consult the tables N.1.1, N.1.2, N.1.3, and N.1.4 in appendix N.1.
27.1.5 Reconstructing the Diatonic Scale from Modulation
Summary. We discuss and comment on the modulation theorem 30 for 12-tempered diatonic 7-tone scales. The classical diatonic and melodic or harmonic minor scales turn out to be in a distinguished position among all 7-tone scales with respect to their overall behavior in the present modulation model. They can thus be reconstructed from their role in modulation processes; this gives modulation theory a new perspective within the historical development of scales for musical composition, which exemplifies the anthropic principle of mathematical models in music theory.
The guarantee of the existence of quantized modulations is the alias of the historically grown rule canon within the mathematical model. This theorem in fact guarantees quantized modulations for all couples in $Dia^{(3)}$, and the pivotal degrees coincide with the pivotal degrees in Schönberg's treatise [478] wherever he considers direct modulations. But the mathematical model also applies to the non-diatonic scales of seven tones and to any translation pair of triadic interpretations. So the modulation model immerses the classical case $Dia^{(3)}$ in a variety of modulation scenarios which have never been dealt with in historical contexts. This power of variation is a characteristic feature of the systematic approach in mathematical music theory. Let us briefly review the historical approach in order to understand this point. It cannot reduce to a pure tracing of what is the case in a diachronic development. This was also remarked by Dahlhaus [103, Bd.10, pp.104-105]:

If, accordingly, even among historians the extent to which chronology should count as "substantial", as "essential" for the "connection" of phenomena (. . .), is at least disputed, then a systematician should all the more be allowed, without exaggerated chronological timidity, to relate ideas, institutions, and practices to one another and to gather them into "structures" whose times of origin differ by decades, but whose inner connection needs no strict outer simultaneity in order to attain the status of a scientific, rather than a merely speculative, hypothesis. And as little as the constructive character of the procedure can be denied, it should be equally indisputable that without it neither a systematics nor a historiography that does not exhaust itself in sheer annalistics would be possible at all. (. . .) Only the systematic construction opens one's eyes to which facts belong to a history that seems worth telling.

In the terminology of Saussurean semiology, Dahlhaus recognizes that the diachronic dimension of music must also be understood as an expression and extension of the system of music, and that this extension only becomes worth considering qua instance of an overall system.
Clearly, the surface character of the diachronic axis with respect to the systematic background is paralleled by the surface character of the synchronic axis. In fact, this axis refers to the ethnological dimension of music, i.e., its cross-cultural variability. It would be as unsatisfactory as pure historiography to catalog ethnological diversification if the fundamental question of systematic connections across the cultural localities could not steer ethnological musicology. In other words, the spatio-temporal extension and diversification of music could not be understood without its characteristic nature qua system. A semiotic system cannot be reduced to a bunch of records over time and space. This is the basic axiom of Saussure's original linguistic approach, even if it turns out to be hard to recognize the system's laws.

So why is the modulation theorem in its present form (theorem 30) necessary to the comprehension of the music system? Wouldn't it suffice to have the original theorem for $Dia^{(3)}$? In the synchronic axis it would in fact: It gives the different transposition distances a common basis from which pivotal degrees can be deduced. The quantum model of modulation is the system, and the different transposition distances are synchronic places in a certain time slice of music history. But that does not explain why the European corpus of classical compositions could grow to such a prominent and voluminous status, a status which is strongly based upon some standard scales (the diatonic major, the melodic minor, and the harmonic minor scales) and an associated harmony whose top performance crystallizes in tonal modulation. We do not claim that the modulation theorem explains all reasons for the prominent status of these scales, but we claim that it gives the prominent scales a prominent position in modulation processes. From this, we deduce that the historical success of these scales could relate to their modulatory excellence.
The position of the harmonic minor scale is such that its number of 226 quantized modulations is maximal among all (seven-tone) scales with rigid triadic interpretation. Among the scales with non-rigid triadic interpretation and quantized modulations for each translation, the maximal number of 114 quantized modulations occurs for the melodic minor scale. So these minor scales have a maximal freedom of modulatory actions. Among the scales with non-rigid triadic interpretation and quantized modulations for each translation, the diatonic major scale is charged with the minimum of 26 quantized modulations. The scale No. 41.1 with a minimum of 53 quantized modulations has no interpretation in the historical scale repertory, to our knowledge. To understand these extremal positions, observe that the modulation theorem does not prefer any of these scales in its conceptual framework or hypotheses. So it is not to be expected a priori that the three scales of diatonic major, melodic minor, and harmonic minor could play any special role in their variety of quantized modulations. The scale No. 41.1 is also in a distinguished position in this context, but we do not have any interpretation of this fact. However, the historic "choice" of scales does coincide with three of the four scales that are exhibited by this generic modulation model. So the two final questions remain:
• How can we recognize the modulation model in the historic traces?
• If one accepts the model's presence in the systematic background of the historic development, to what degree has the historic development been guided by system parameters which are related to the given modulation model?
The first point will be attacked in the experimental chapter 28 on applications of the theorem to classical compositions of the European tradition. The second question is hard to tackle since it relates to the innermost reflections or mere intuitions which may have guided composers and, to a lesser degree, theorists. Independently of its role as an explanatory mechanism for historical phenomena, the present model can very well be used to produce modulations in new contexts. For instance, one may try to write compositions in exotic scales and to use modulation rules from this model. Or one may also apply this model to other parameter spaces. In fact, the modulation model does not presuppose the pitch domain in its structural premises! One may also apply the modulation rules to time, loudness, or other musical parameters in order to cope with Augustinus' philosophy that music is the art of good 'modulation'. This latter approach has been tested in a composition Synthesis [339] for piano, percussion, and bass. In the first movement of that composition, modulation was applied to the time domain, i.e., to rhythmical modulation instead of tonal modulation. We shall also discuss this experiment in section 28.3.2. The table in appendix N.1.5 shows examples of direct schematic and short modulations (as usual in the modulation texts) for every fourth translation p = 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, starting from C-major. The degrees, which are separated by commas, are meant to be played in time order, one after the other.
27.1.6 The Case of Just Tuning
Summary. This section is devoted to the proof of the modulation theorem for just tuning, presented in [331, 340] for major scales, and elaborated by Hildegard Radl [429] to harmonic, melodic minor, and other scales. The theorem sheds some light on the particular chromatic scale in just tuning, as considered by Martin Vogel [547].

27.1.6.1 Just Scales and their Triadic Interpretations
We shall first of all deal with just scales in the EulerPlane as introduced in section 7.2.1.2. The scales $S$ which we shall encounter in the sequel will all be generating, i.e., $\mathbb{Z}S = \mathbb{Z}^2$, the ambient module whose first axis encodes fifths while the second encodes thirds. Therefore, it will be equivalent to speak about automorphisms and about inner symmetries. For any zero-addressed local composition $C$ in the EulerPlane, we say that it is rigid iff $Aut(C) = Id$. We are going to discuss these scale types (refer to section 7.2.1.2 for just scales and section 13.4.2.2 for triadic interpretations):
• The set $Maj$ of major scales, i.e., the translates $X = e^t C$ of just C-major $C$, together with the set $Maj^{(3)}$ of triadic interpretations by the seven degrees I, II, ..., VII, excluding the variants II*, VII*.
• The set $NatMin$ of natural minor scales, i.e., the translates $X = e^t C_{nat}$ of natural C-minor $C_{nat}$, together with the set $NatMin^{(3)}$ of triadic interpretations.
• The set $HarMin$ of harmonic minor scales, i.e., the translates $X = e^t C_{har}$ of harmonic C-minor $C_{har}$, see figure 27.4, together with the set $HarMin^{(3)}$ of triadic interpretations.
• The set $MelMin$ of melodic minor scales, i.e., the translates $X = e^t C_{mel}$ of melodic C-minor $C_{mel}$, see figure 27.5, together with the set $MelMin^{(3)}$ of triadic interpretations.
• The 32 translation orbits $AltMaj(i)$ of seven-element scales $C(i)$ which are defined as follows, see figure 27.6: Take the fifth pairing c, g. Take the a priori C-scale frame shown on top of figure 27.6. Then, choose any additional five tones which are either the major tones f, d, a, e, b or one of their alterations, whence we obtain $2^5 = 32$ variants. Of course, these known special cases are included: $C = C(1)$, $C_{nat} = C(5)$, $C_{mel} = C(9)$, $C_{har} = C(11)$. The corresponding triadic interpretation sets are denoted by $AltMaj(i)^{(3)}$.

On the one hand, we shall discuss modulations within the orbits $Maj^{(3)}$, $NatMin^{(3)}$, $HarMin^{(3)}$, $MelMin^{(3)}$, $AltMaj(i)^{(3)}$; on the other, we shall also switch between major and natural minor scales, this latter case meaning that we take the larger orbit under translations and 180°-rotations.

Figure 27.4: The harmonic minor scale and its triadic interpretation.
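The count of $2^5 = 32$ altered scales is a plain enumeration of binary choices, which the following Python sketch spells out. It is purely combinatorial: the marker `*` for an alteration is a hypothetical label, the Euler-plane coordinates of the tones are not modeled, and the order of the generated variants is not meant to reproduce the numbering $C(1), \ldots, C(32)$ of the text.

```python
from itertools import product

MAJOR_TONES = ("f", "d", "a", "e", "b")  # the five tones added to the frame c, g

def altered_scale_choices():
    """All ways to keep or alter each of the five major tones around the frame c, g."""
    variants = []
    for flags in product((False, True), repeat=len(MAJOR_TONES)):
        # "*" is a hypothetical marker meaning "take the alteration of this tone"
        tones = ["c", "g"] + [t + ("*" if alt else "") for t, alt in zip(MAJOR_TONES, flags)]
        variants.append(tuple(tones))
    return variants

variants = altered_scale_choices()
print(len(variants))
print(variants[0])
```

The first variant, with no alteration chosen, is the major scale itself; every variant consists of the frame c, g plus five further tones.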
27.1.6.2 Modulations and Quanta
We now have to adapt the crucial definitions from 12-tempered modulation to just modulation.
Figure 27.5: The melodic minor scale and its triadic interpretation.
Definition 89 Fix a subgroup $G \subset GL(\mathbb{Z}^2)$. Given a pair $S^{(3)}, T^{(3)}$ of different triadic interpretations of one of the above scale types which live in the same orbit $G.X^{(3)}$, a modulation from $S^{(3)}$ to $T^{(3)}$ is a pair $(g, \mu)$ consisting of a modulator $g \in G$ for $S^{(3)}, T^{(3)}$, together with a minimal cadential set $\mu$ for the target tonality $T^{(3)}$.

Given a modulation $(g, \mu)$ from $S^{(3)}$ to $T^{(3)}$, the axiomatic properties of a modulation quantum $Q \subset EulerPlane$ are these:

Property 2
(1) The modulator $g$ is an element of $Sym(Q)$.
(2)$_\mu$ All triads in $\mu$ are subsets of $Q$.
(3) The intersection $T \cap Q$ is rigid, and $T \cap Q$ is covered by triads of $T^{(3)}$.
(4) The quantum $Q$ is a minimal set with properties (1) and (2)$_\mu$.

27.1.6.3 Automorphisms of Triadic Interpretations of Seven-Element Scales

Let $S^{(3)}$ be the triadic interpretation of a generating seven-element scale $S \subset EulerPlane$,
for example the above types in section 27.1.6.1. Let $\alpha \in Sym(S)$ be an automorphism which induces an automorphism $\alpha \in Sym(S^{(3)})$. Since this also induces an automorphism of the nerve $n(S^{(3)})$, which is a Möbius strip, we have combinatorial information on the possible permutations induced by $\alpha$. Clearly, $\alpha$ is of finite order. Since the boundary of the nerve is a circle, and since the adjacency relations of the seven vertexes on this circle must be preserved, the induced permutation lies in the dihedral group of the regular heptagon, i.e., it is the identity, an order-two reflection, or an order-seven rotation of the vertexes; and since the elements of the scale are recovered by the two-simplexes, the same is true for the scale. On the other hand, if we consider any finite subgroup $H \subset GL_2(\mathbb{Z})$, then Minkowski's theorem (see appendix C.3.5, theorem 43) tells us that the composed projection

$$H \rightarrowtail GL_2(\mathbb{Z}) \longrightarrow GL_2(\mathbb{Z}_3)$$

is injective. But we have $card(GL_2(\mathbb{Z}_p)) = (p-1)^2 p (p+1)$ for any prime $p$ (see appendix C.3.4), which is 48 for $p = 3$. Since 7 does not divide 48, applying this fact to the linear part of our automorphism shows that order 7 is forbidden for our automorphisms, and we are left with orders 1 and 2. This means that if $\alpha = e^a.A$, then we have $A^2 = Id$, $A.a + a = 0$ for $A \in GL_2(\mathbb{Z})$, $a \in \mathbb{Z}^2$.

Figure 27.6: The 32 altered scales are derived from the C-major frame on top.

Suppose we are given a modulator $\phi : S^{(3)} \to T^{(3)}$ between two just tonalities; then it has the shape $\phi = \tau.\alpha$ with a translation $\tau = e^b$ and an automorphism $\alpha = e^a.R$ of $S^{(3)}$ as above. Since the modulation quantum must be a full orbit of the tones of the cadence $\mu$ under the group $\langle\phi\rangle$ generated by $\phi$, it is important to know the shape of an orbit $\langle\phi\rangle(x)$ of a point $x \in \mathbb{Z}^2$ of the EulerPlane space. Here is the formula ([429, p.12]):

$$\langle\phi\rangle(x) = e^{\mathbb{Z}.(R+Id)b}\,x \;\cup\; e^{\mathbb{Z}.(R+Id)b}.\phi(x). \tag{27.10}$$

So we have one 'branch' of this orbit which is just the translation orbit of $x$ under the translation $e^{(R+Id)b}$, and the other 'branch' which is the translation orbit of $\phi(x)$ under the same translation. If $\alpha = Id$, both branches are shifted against each other by $b$, and they are generated by a translation orbit under $e^{2b}$.
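The two-branch shape of the orbit follows from $\phi^2 = e^{(R+Id)b}$, which in turn uses $R^2 = Id$ and $R.a + a = 0$. The following Python sketch checks this numerically on forward iterates of $\phi$. All concrete values are assumptions for illustration: the matrix `R` is chosen so that $(R+Id)b = b_2.(-1,2)$, the value quoted in the proof of proposition 48, the vector `a` is a hypothetical choice satisfying $R.a + a = 0$, and `b` is an arbitrary shift.

```python
def mat_vec(m, v):
    """Product of a 2x2 integer matrix with a vector of Z^2."""
    return (m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1])

def add(v, w):
    return (v[0] + w[0], v[1] + w[1])

def times(k, v):
    return (k * v[0], k * v[1])

R = ((-1, -1), (0, 1))  # assumed involution: R.R = Id
a = (1, 0)              # hypothetical vector with R.a + a = 0
b = (2, 3)              # arbitrary shift vector

def phi(x):
    """phi = e^b . e^a . R, i.e. x -> R.x + a + b."""
    return add(add(mat_vec(R, x), a), b)

step = add(mat_vec(R, b), b)  # the translation vector (R + Id).b

x = (0, 0)
point = x
for k in range(10):
    # even iterates lie on the branch through x, odd ones on the branch through phi(x)
    branch_start = x if k % 2 == 0 else phi(x)
    assert point == add(branch_start, times(k // 2, step))
    point = phi(point)
print(step)
```

Negative powers of $\phi$ extend both branches in the opposite direction, since $\phi^{-2}$ is the translation by $-(R+Id)b$.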
Exercise 65 Draw an image of the orbit of C-major under $b = (9,2)$ and $\alpha$ being the unique non-trivial automorphism $A$ of C-major, as described in formula 8.6.

27.1.6.4 Finiteness of Modulation Domains
Definition 90 The modulation domain of a tonality $S^{(3)}$ is the set of shift vectors $b \in \mathbb{Z}^2$ such that there is a quantized modulation from $S^{(3)}$ to $T^{(3)} = e^b S^{(3)}$.

In a good number of cases, the modulation domain of a tonality is finite. We investigate this situation. In order to control the possible intersections between the target tonality and the orbit $\langle\phi\rangle(X)$ of a set $X$ of tones in the target tonality which stands for the potential modulation quantum, we should control the cases where the different transforms of a scale are disjoint from each other.
Proposition 48 With the above notation, if $C$ is the major scale in EulerPlane, then the transforms of $C$ under the group $\langle\phi\rangle$ are mutually disjoint except for a finite number of shift vectors $b$.

Proof. The translation vector $b = (b_1, b_2)$ induces a translation vector $(R + Id)b = b_2.(-1,2)$. So clearly, the two branches $e^{\mathbb{Z}.(R+Id)b}C$ and $e^{\mathbb{Z}.(R+Id)b}.\phi(C)$ are disjoint if $|b_1|$ is sufficiently large. On the other hand, the vertical shift component by $b_2$ of the scale $C$ to the second branch $e^{\mathbb{Z}.(R+Id)b}.\phi(C)$ under $\phi$ cannot hit the first branch for $1 < |b_2|$ since the vertical component of $b_2.(-1,2)$ is twice the vertical shift component $b_2$, and thus, the branches cannot intersect in the vertical progression. QED.
Corollary 18 For modulations among just major tonalities, the modulation domain is finite.
Proof. Consider the minimal cadence sets $J_1, \ldots, J_6$ as defined in formula (26.2). It is easily seen that the union of every cadence set, except $J_1 = \{II\}$, $J_5 = \{VII\}$, has no rigid superset in the scale. So only $J_1 = \{II\}$, $J_5 = \{VII\}$ can be candidates for cadence sets in the quantum construction of property 2. But these two degrees are not rigid. Now, the quantum, if it exists, must be the orbit of either of these degrees under the group $\langle\phi\rangle$. But this is just one of these two degrees in every transformed instance of the scale. So, if the transformed instances are all disjoint from each other, the trace of the quantum in the target scale is either II or VII, so it is not rigid. Now, by the above proposition, this is the case for all but a finite number of shift values $b$, and we are done. QED.
27.1.7 Quantized Modulations and Modulation Domains for Selected Scales
Summary. We present the quantized modulations for modulations between major scales, between natural minor scales, between major and natural minor scales, between harmonic minor scales, and between melodic minor scales. We finally discuss Radl's results concerning the 32 altered scales introduced in 27.1.6.1 and their modulation domains.
In the following sections, we present the modulation domains and the pivotal degrees for a number of standard scales. The proofs are omitted since they amount to tedious and lengthy, but not very illuminating, case distinctions. The details are found in Radl's work [429], which in its algorithmic part should be incorporated in a computer-aided algorithm.

Theorem 31 This variety of modulation results, i.e., the existence of modulation quanta, is not written in the closed form of a theorem. Nevertheless, it should be read as a highly segmented existence theorem of just modulation theory. We shall refer to this list of results when referring to the modulation theorem for just tuning.

The main ideas are these:
• to use the finiteness statement of corollary 18 and to work through the different cases in this finite set of possibilities;
• to use the fact that a modulation by a shift $b$ can be used to calculate the modulation in the negative direction $-b$.

Exercise 66 Calculate the $-b$-case from the results for the $b$-case.

We should also mention that Radl has equally calculated some modulation results for other situations in the Euler context, such as scales in the Pythagorean tuning, and in the three-dimensional EulerSpace built from third, fifth, and seventh components. We refrain from discussing these ramifications.
27.1.7.1 Modulation Between Major Tonalities
Suppose that we have to modulate from $S^{(3)}$ to $T^{(3)} = e^b S^{(3)}$. We have two modulators in this case: (27.11) (27.12)
The modulations are listed in detail in table appendix N.2.1. Here, we just present the modulation domain in figure 27.7. We see that this domain covers all the tones of the major scale (except the tonic itself, which is excluded by definition) as well as six external tones. The modulation domain is very similar to Vogel's chromatic. From this compositional point of view, Vogel's chromatic is optimally chosen as a modulatory domain. Other chromatics, such as Michel's [374] or Roederer's [462], are less adapted to this modulation domain.

Figure 27.7: The modulation domain from just C-major.
27.1.7.2
Modulation Between Natural Minor Tonalities
We have the two modulators
The modulations are listed in detail in the table in appendix N.2.2. We just present the modulation domain in figure 27.8. As may be expected from the symmetry relation between the major and natural minor types, this domain is the same as that of major tonalities, but only in absolute terms. Relative to the scale, the domain is rotated.
27.1.7.3
Modulation From Natural Minor to Major Tonalities
We have the two modulators
[Figure 27.8: The natural minor modulation domain from natural C-minor.]

The modulations are listed in detail in the table in appendix N.2.3. We just present the modulation domain in figure 27.9.
[Figure 27.9: The modulation domain from natural C-minor to major tonalities.]

This is a remarkable result since in the well-tempered situation, there is no distinction between the scales for major and natural minor; it is just the tonic which differs. So here, we have a modulation between different translation classes.

27.1.7.4
Modulation Steps From Major to Natural Minor Scales
We have the two modulators (same as above)
$$e^b \cdot A, \qquad e^b \cdot B, \qquad B = \begin{pmatrix} 1 & 0 \\ 1 & -1 \end{pmatrix}.$$
The modulations are listed in detail in table appendix N.2.4. We just present the modulation domain in figure 27.10. This is the symmetric result to the above one.
[Figure 27.10: The modulation domain from C-major to natural minor tonalities.]
27.1.7.5
Modulation Steps Between Harmonic Minor Scales
We have the unique translation modulator
The modulations are listed in detail in table appendix N.2.5. We just present the modulation domain in figure 27.11.
[Figure 27.11: The modulation domain from C harmonic minor tonalities.]

This modulation domain is extremely large compared to the others. Perhaps this is a reason why harmonic minor is so prominent in classical European harmony, and also, to some degree, in composition.
27.1.7.6
Modulation Steps Between Melodic Minor Scales
We have the two modulators
The modulations are listed in detail in table appendix N.2.6. We just present the modulation domain in figure 27.12.
[Figure 27.12: The modulation domain from C melodic minor tonalities.]
27.1.7.7
General Modulation Behavior for 32 Altered Scales
This investigation in Radl's work was undertaken to get a global view of possible scale behavior in modulation when deriving scales from the major scale by determined alterations as described above. From the table N.2.7, we can distinguish these cases.
A scale
• has no modulations if its modulation domain is empty (always excluding the start tonality!),
• has infinite modulations if its modulation domain is infinite,
• has modulations if its modulation domain is not empty (always excluding the start tonality!),
• has limited modulations if the transitive closure (all tonics which can be reached by successive modulations from relative modulation domains) of its modulation domain is not the total space.

The cases (15, 19) with no modulation are pathological. The cases with infinite modulations (21, 22, 24, 29, 30, 32) are too indeterminate since we can modulate within a set of tonalities which cannot be distinguished by their perceivable pitch structure; see also the discussion of valence theory in appendix B.2. The cases (8, 25) of limited modulations mean that modulations cannot be concatenated to yield an arbitrary target tonality.

The remaining cases are the reasonable ones. There, the cases 2, 7, 13, 14, 23 reduce to modulations with just four target tonalities, and the same holds for the cases 10, 16, 17, 18, 20, 26, 27, 31. So these cases have very small domains compared to the remaining cases, which all reduce to one of the special tonality types of major, natural minor, harmonic minor, or melodic minor, which have relatively large modulation domains (eight and more targets against four in the previous cases 2, 7, ...). The only case which remains is number 6, and this is a variant of the melodic scale. So we may summarize that, within the frame of the reasonable scales, the known scales have a much larger number of target domains than the other scales. This is also a confirmation of Muzzulini's results on the distinguished position of major, melodic minor, and harmonic minor in the well-tempered case.
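The case distinction above can be illustrated by a small sketch. It is only an illustration under a strong simplifying assumption: tonics are taken in Z_12 instead of the Euler lattice of just tuning used in this section, so the "infinite modulations" case cannot occur. The function names and the example domains are hypothetical and are not taken from table N.2.7.

```python
# Illustration only: tonics are taken in Z_12 (an assumption), not in the
# Euler lattice of just tuning that the text works in.

def reachable_tonics(domain, modulus=12):
    """Tonics reachable from tonic 0 by successive shifts taken from `domain`."""
    reached = {0}
    frontier = [0]
    while frontier:
        tonic = frontier.pop()
        for shift in domain:
            target = (tonic + shift) % modulus
            if target not in reached:
                reached.add(target)
                frontier.append(target)
    return reached


def classify(domain, modulus=12):
    """Classify a scale by its (finite) modulation domain, given as a set of shifts."""
    domain = {shift % modulus for shift in domain} - {0}  # exclude the start tonality
    if not domain:
        return "no modulations"
    if len(reachable_tonics(domain, modulus)) < modulus:
        return "limited modulations"
    return "modulations"


# Hypothetical domains, not taken from table N.2.7.
examples = {name: classify(dom) for name, dom in
            {"empty": set(), "thirds": {4, 8}, "fifths": {5, 7}}.items()}
```

In this toy setting the transitive closure is computed by a plain graph search; in the Euler lattice one would instead have to decide whether the generated set of shifts exhausts an infinite lattice.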
27.2
Harmonic Tension
Summary. Harmonic tension regards the valuation of syntagmatic chord complexes. This concept adds to the isolated semantics of chords a semantic charge of complex syntagmatic arrangements. We discuss a distance-based concept of harmonic tension and related mathematical structures, such as the Riemann tensor algebra. The chapter concludes with a technical discussion of optimal harmonic paths, including algorithms for optimal path search.
The harmonic evaluation of isolated chords does not give the full information about a preferable choice of harmonic evaluation in the syntagmatic context. Classical harmony has not dealt with this problem except in the rudimentary concept of cadences. We shall here set up a formalism to grasp harmonic context structures from given Riemann matrix data. This is an important subject since it is very difficult to define a global syntagmatic equilibrium on the harmonic level. The Riemann tensor algebra will give a first approach to the formal treatment of the problem of local and global harmonic tension. This approach has been used in the HarmoRUBETTE® and will be discussed on a more practical level in section 41.3.
27.2.1
The Riemann Algebra
Summary. We introduce the Riemann algebra as a quiver algebra that is built on the Riemann matrix.
We take up the formalism of Riemann matrices introduced in section 25.3.2, including the basic data of that context, i.e., a name set TON of "tonalities", a name set VAL of "tonal function values", and a form space TRUTH(I) of "truth denotators" in a specific truth module (or, more generally, a truth form) I and at a specific address B for a form F. To make the ideas concrete, and also related to the HarmoRUBETTE® module of RUBATO® (see section 41.3 for this tool), we shall work in the truth module $I = \mathbb{R}$. The form F can be any form which encompasses the parameters for harmony (this is however not a formal condition in the following discussion). The address B will be any real vector space. We first need a mathematical construction: the Riemann algebra.
Definition 91 Given value and tonality sets VAL, TON, a space form F with address B (a real vector space), the Riemann algebra (over the real numbers) is the quiver algebra² $Rie(VAL, TON, F, B)$ over the reals for the complete quiver³ (the Riemann quiver) over the set $VAL \times TON \times ExTop_B(F)$.

² See appendix D.1.1.1 for this concept.
³ See appendix C.2.2, definition 65.

The definition is motivated by the usual situation in harmony: When one analyzes a piece of music, there is a sequence of chords which are given certain function values with respect to certain tonalities. Such sequences are formally restated as being paths in the Riemann quiver. The Riemann algebra is needed in order to make calculations on different harmonic paths in the Riemann quiver, and in order to evaluate the best ones among a given selection of such paths. This is needed in order to decide which path should be taken when we have a certain way of giving weights to special sub-paths according to specific approaches to harmony. The basic data in such a calculation are real-valued weights which we assign to certain paths. For example, one may exhibit a classical cadence path of shape $I_x \to IV_x \to V_x \to I_x$ and give it a high positive weight in order to express that such sequences are good paths. Or one may like to express that a passage $V_x \to I_x$ is a good relaxation movement, whereas the other direction $I_x \to V_x$ is not and receives negative values, say. So these weight assignments regard more or less long paths in the Riemann quiver and should help in determining the quality, i.e., the weight, of a given path through a piece of music.

Before delving into details, observe that this approach is still a relative one in the following sense. Suppose we are given a sequence of chords $a_0, a_1, a_2, \ldots, a_n$, and suppose that we have two harmonic interpretations of them in terms of two Riemann paths
$$(v_0, f_0, a_0) \to (v_1, f_1, a_1) \to \cdots \to (v_n, f_n, a_n),$$
$$(v'_0, f'_0, a_0) \to (v'_1, f'_1, a_1) \to \cdots \to (v'_n, f'_n, a_n)$$
which we want to compare: Which one is harmonically preferable? One may therefore calculate "weights" for these paths and decide upon these weights. This is what we shall achieve. But there is also a question concerning absolute weights: What is the harmonic weight of the last chord $a_n$, compared to the first one $a_0$? Is it legitimate to say that this is just the weight of a selected Riemann path? It is certainly not, because the local values of the weights could add up to some path weight which always gives the last chord a higher weight than the first one. This question is in fact the question about the global development of harmonic tension. To date, no musicological theory has ever dealt with this problem: Is it possible to calculate global harmonic tension from local knowledge? It is certainly not true that every piece of music should have a global slope of zero, i.e., the beginning tension being equal to the ending tension. For example, in a Scriabin sonata, such as op. 72 Vers la flamme, the final tension is expected to be much higher than the beginning one. So how should one calculate global 'absolute' tension? We shall come back to this issue in the discussion of the HarmoRUBETTE® software module in section 41.3.
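The relative comparison of two interpretations can be sketched as follows. This is a minimal illustration, not the construction of the next section: vertices are reduced to (value, tonality) pairs, the degrees I, IV, V of the cadence example are read as the values T, S, D (an assumption), and all weights and both readings are hypothetical.

```python
def subpaths(path, n_arrows):
    """All contiguous sub-paths of `path` with exactly `n_arrows` arrows."""
    return [tuple(path[j:j + n_arrows + 1]) for j in range(len(path) - n_arrows)]


def path_weight(path, pattern_weights):
    """Sum the weights of every weighted sub-path occurring in `path`."""
    total = 0.0
    for n_arrows in range(len(path)):
        for sub in subpaths(path, n_arrows):
            total += pattern_weights.get(sub, 0.0)
    return total


# Vertices are (value, tonality) pairs; the weights below are hypothetical.
T, S, D = ("T", "C"), ("S", "C"), ("D", "C")
PATTERN_WEIGHTS = {
    (T, S, D, T): 2.0,   # classical cadence: a good path
    (D, T): 1.0,         # relaxation movement
    (T, D): -1.0,        # the opposite direction
}

# Two hypothetical interpretations of the same four chords.
reading_a = [T, S, D, T]
reading_b = [("D", "F"), ("T", "F"), D, T]
weight_a = path_weight(reading_a, PATTERN_WEIGHTS)
weight_b = path_weight(reading_b, PATTERN_WEIGHTS)
preferred = "a" if weight_a > weight_b else "b"
```

Both weights are only meaningful relative to each other, which is exactly the limitation discussed above: nothing in such a sum says how the tension of the last chord compares to that of the first.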
27.2.2
Weights on the Riemann Algebra
Summary. Weights on the Riemann algebra are predicates which measure the quality of a harmonic development path.
So let us now work on the weight functions on harmonic paths. Suppose that we are given a harmonic, real-valued default weight function $w$ on all paths $(v_0, f_0, a_0) \to (v_1, f_1, a_1) \to \cdots \to (v_i, f_i, a_i)$ of lengths $i = i_1, i_2, \ldots, i_k$. Then we may choose
• a set of coefficients $\mu_i$, $i = i_1, i_2, \ldots, i_k$, and,
• for each such $i$, a system $\lambda_i$ of coefficients in the sub-path operators,
and define a linear form O(p) on the Riemann algebra by formula (27.13) [the right-hand side of (27.13) is missing here], where the linear form $w$ extends the original $w$ on the spaces spanned by the paths in the images of the sub-path operators.

The idea of this definition is that we suppose that certain paths under these endomorphisms are already weighted (in fact by $w$). We then extend the weights to all paths by use of the sub-path operators. These operators take the given weights into account, each with its weight coefficient from the systems $\lambda_i$. These systems give a sub-path its specific weight according to its position within the argument $p$. This is reasonable since we will in general give a sub-path a different harmonic meaning in a path according to whether it stays in the beginning, the middle, or the end. After this, the different contributions of sub-paths of specified lengths $i = i_1, i_2, \ldots, i_k$ are weighted against each other according to the role of the sub-paths of these lengths in the present harmonic analysis. For example, if the harmonic concept gives more importance to short sub-paths of length 2, against lengths above 5, say, then the coefficient $\mu_2$ should be significantly larger than the coefficients above $\mu_5$. It may even happen that certain lengths play a completely different role than others, as we shall see in a moment.

So let us make a concrete example concerning the default weight function. The address is $B = 0$, and we are working in the function-theoretic context discussed in section 25.3.3. This means
$$Fun(F) = @\mathbb{Z}_{12},$$
$$TON = \{C, D\flat, D, E\flat, E, F, G\flat, G, A\flat, A, B\flat, B\},$$
$$VAL = \{T, D, S, t, d, s\}$$
and the circle group truth module $I = S^1$ for the function theory, assigning a half-open interval $TF_{f,t}(a) = \phi(a)^{\sim} = [0, \phi(a)[$ of $S^1$ to each chord $a \in ExTop_0(F)$, function value $f \in VAL$, and tonality $t \in TON$. Based on this information, we want to define the linear form $w$ on all paths of the Riemann quiver. By definition, the present form will vanish on all paths of length larger than 1. So this tension theory is very local, just limited to contributions from isolated chords and from transitions between two successive chords.

Let us first deal with the vertices, i.e., the isolated chords. We have to define the values $w(f, t, a)$ for function values $f \in VAL$, tonalities $t \in TON$, and chords $a \in ExTop_0(F)$. Suppose that $TF_{f,t}(a) = \phi(a)^{\sim}$ with $\phi(a) \neq 0$. Then we set $w(f, t, a) = \ln(\phi(a))$. If $\phi(a) = 0$, the corresponding predicate is $\phi(a)^{\sim} = \emptyset$. This means that in a given harmonic path, we encounter a chord which has "False" as its harmonic truth value. Such a path should fail to be a competitor in the selection of a best harmonic path. More generally, we would like to eliminate paths containing chords with a value $\phi(a) \leq \phi_{min}$. So we replace $w(f, t, a) = \ln(\phi(a))$ by $w(f, t, a) = -\infty$ if $\phi(a) \leq \phi_{min}$. Of course, this infinity value is incompatible with the definition of a common linear form. But if we read the linear form codomain $\mathbb{R}$ as the set of intervals $]-\infty, r[$, $r \in \mathbb{R}$, together with the transported structure of $\mathbb{R}$-module, we can extend the set by the empty interval $\emptyset$ and add the rules $s.\emptyset = \emptyset$, $s \in \mathbb{R}$, and $\emptyset + ]-\infty, r[ = ]-\infty, r[ + \emptyset = \emptyset$. This structure is still a semigroup under addition and has a bilinear scalar multiplication. The empty interval is however an absorbing element: Every linear expression involving the empty set is absorbed by this one and reduces to the empty set. So the values of our "linear form" live in this extended truth space. We have in fact defined the first piece of a predicate on tuples of chords, given by the predicates $TF_{f,t}$ and the limiting value $\phi_{min}$.

The second piece of our predicate uses the linear form on paths of length 1. This contribution deals with the tension between two positions in the Riemann matrix. We are given a path $(f_1, t_1, a_1) \to (f_2, t_2, a_2)$. We would like to measure the harmonic tension of this transition as such, i.e., independently of the involved chords. The value $w((f_1, t_1, a_1) \to (f_2, t_2, a_2))$ will therefore be just a function of the arrow $(f_1, t_1) \to (f_2, t_2)$ in the complete quiver over the Riemann matrix index set $VAL \times TON$; call this quiver the Riemann index quiver. Every point of this quiver has two components: the function value and the tonality symbol. In our case, the function values are also split into a set $VAL_{type} = \{T, D, S\}$ of function value types and a set $VAL_{mode} = \{min, maj\}$ of major or minor mode. So we shall restate the index quiver as the complete quiver over the triple product $VAL_{type} \times VAL_{mode} \times TON$. Given a path $(f_1^{type}, f_1^{mode}, t_1) \to (f_2^{type}, f_2^{mode}, t_2)$, we have to take into account the component steps in each coordinate. For each of these three coordinates, we set up a real-valued matrix to express the respective tension component. For the function type, the 3 × 3-matrix is
$$T_{VAL_{type}} = (t^{type}_{XY})_{X,Y \in VAL_{type}}.$$
This means that a 1-path where the type changes from X to Y enforces a tension of quantity $t^{type}_{XY}$. The corresponding meaning for mode changes is quantified by the 2 × 2-matrix
$$T_{VAL_{mode}} = (t^{mode}_{mn})_{m,n \in VAL_{mode}}.$$
Finally, the change of tonality is codified by the 12 × 12-matrix
$$T_{TON} = (t^{TON}_{st})_{s,t \in TON}.$$
Observe that none of these matrices will be symmetric or even distance-like, i.e., having zero diagonal etc. For instance, the change from major to minor is supposed to be lower valued than the converse change in classical European music. With these component contributions, we may finally define the transition default values as follows:
$$w((type_1, mode_1, tonality_1) \to (type_2, mode_2, tonality_2)) = (t^{type}_{type_1, type_2})^2 + (t^{mode}_{mode_1, mode_2})^2 + (t^{TON}_{tonality_1, tonality_2})^2.$$
Of course this Euclidean approach can be replaced by any equivalent distance definition; this one is only one typical solution. So we never have minus infinity here; the only "False" contributions stem from the values on isolated chords as explained above. To put everything together, the zero-length and one-length components are combined to yield the formula (27.13), i.e., formula (27.14) [the display of (27.14) is missing here], where the $\mu$-coefficients give a relative weight to the path-length contributions. We shall see in section 41.3 that the HarmoRUBETTE® implements exactly this type of formula with two distinguished $\lambda$-coefficients equal to 1, and zero else, except that the nominal values in that implementation are the exponential values $e^v$ of our values $v$. This construction yields an objective tension predicate
$$Tension(\lambda_{\cdot}, \mu_{\cdot}, w, \ldots)$$
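The default weights of this example can be put together in a short sketch. It assumes one plausible reading of the combination step: a coefficient `mu0` for the sum of the chord (vertex) weights and a coefficient `mu1` for the sum of the transition weights, with all position coefficients equal to 1. The names `tension`, `mu0`, `mu1`, the representation of "False" by minus infinity, and all matrix entries are assumptions made for illustration; they are not the HarmoRUBETTE® implementation.

```python
import math

FALSE = -math.inf  # stands in for the absorbing empty interval ("False")


def vertex_weight(phi, phi_min=0.0):
    """ln(phi) for a chord truth value phi, or FALSE when phi <= phi_min."""
    return FALSE if phi <= phi_min else math.log(phi)


def transition_weight(src, dst, t_type, t_mode, t_ton):
    """Sum of the squared type, mode, and tonality tension components."""
    (type1, mode1, ton1), (type2, mode2, ton2) = src, dst
    return (t_type[type1][type2] ** 2
            + t_mode[mode1][mode2] ** 2
            + t_ton[ton1][ton2] ** 2)


def tension(path, phis, matrices, mu0=1.0, mu1=1.0, phi_min=0.0):
    """mu0 * (sum of vertex weights) + mu1 * (sum of transition weights)."""
    vertex_weights = [vertex_weight(phi, phi_min) for phi in phis]
    if FALSE in vertex_weights:
        return FALSE  # absorbing element; also avoids 0 * -inf = nan
    arrows = sum(transition_weight(a, b, *matrices)
                 for a, b in zip(path, path[1:]))
    return mu0 * sum(vertex_weights) + mu1 * arrows


# Hypothetical, deliberately non-symmetric tension matrices.
T_TYPE = {"T": {"T": 0.0, "D": 2.0}, "D": {"T": 1.0, "D": 0.0}}
T_MODE = {"maj": {"maj": 0.0, "min": 1.0}, "min": {"maj": 2.0, "min": 0.0}}
T_TON = {"C": {"C": 0.0, "G": 3.0}, "G": {"C": 3.0, "G": 0.0}}
MATRICES = (T_TYPE, T_MODE, T_TON)

path = [("T", "maj", "C"), ("D", "maj", "C"), ("T", "maj", "C")]
value = tension(path, phis=[1.0, 0.5, 1.0], matrices=MATRICES)
rejected = tension(path, phis=[1.0, 0.0, 1.0], matrices=MATRICES)
```

The early return for a "False" chord mirrors the absorbing rule for the empty interval: once one vertex is rejected, no choice of coefficients can rescue the path.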
27.2.3
Harmonic Tensions from Classical Harmony?
Summary. We discuss a particular "harmonic perspective" of harmonic tension proposed by Anja Fleischer in [154].
The above construction of a tension predicate has the advantage that it works for any chord, as proposed by Hugo Riemann. But it is not the usual approach in traditional harmony. In this latter context, it happens that most chords are not given any specific weight. We present one such approach as it was explicated in a diploma thesis by Anja Fleischer [154]. In that approach, she took the classical binary logic and gave selected paths values of either "True" or "False" in the circle group truth module. Moreover, the value domains are chosen more in the flavor of Riemannian harmony as it is taught in music schools. The function list is this:
VAL = {T, t, Tv, tv, D, d, Dv, dv, S, s, Sv, sv},
which are major tonic, dominant, and subdominant values, together with their minor and substitute (German: "Vertreter") variants. The tonality list is this:
TON = {DurC, DurCis, DurD, DurDis, DurE, DurF, DurFis, DurG, DurGis, DurA, DurB, DurH, Mollc, Mollcis, Molld, Molldis, Molle, Mollf, Mollfis, Mollg, Mollgis, Molla, Mollb, Mollh}.
The 24 symbols relate to German major (=Dur), minor (=Moll), and the twelve tonality names. Here is the list of core chords plus values in the sense that these values are those where a chord takes the "True" value in the harmony predicate. The chord data in the first coordinate relate to the same form $F = PitchMod_{12}$ as before.

True = {
(0,2,3,tv), (0,2,4,t), (0,2,5,s), (0,2,6,sv), (0,2,7,t), (0,2,7,tv), (0,2,9,s), (0,2,9,sv),
(0,3,7,Tv), (0,4,5,s), (0,4,7,T), (0,4,9,t), (0,4,9,d), (0,4,10,t), (0,4,11,t),
(0,5,7,s), (0,5,7,sv), (0,5,8,Sv), (0,5,9,S), (0,7,9,t), (0,7,10,tv), (0,7,11,t), (0,7,11,tv), (0,9,11,t), (0,9,11,d),
(1,4,9,tv), (1,4,9,sv), (1,7,9,tv), (1,9,11,tv), (1,9,11,dv),
(2,4,5,s), (2,4,6,sv), (2,4,7,t), (2,4,7,d), (2,4,8,tv), (2,4,8,dv), (2,4,9,s), (2,4,9,sv),
(2,4,11,t), (2,4,11,tv), (2,4,11,d), (2,4,11,dv), (2,5,7,d), (2,5,7,dv), (2,5,9,s), (2,5,11,D),
(4,5,7,t), (4,5,7,d), (4,5,8,tv), (4,5,8,dv), (4,5,8,sv), (4,5,9,s), (4,5,11,t), (4,5,11,tv), (4,5,11,d), (4,5,11,dv),
(4,7,9,t), (4,7,9,tv), (4,7,9,d), (4,7,9,dv), (4,9,11,t), (4,9,11,tv), (4,9,11,d), (4,9,11,dv),
(7,9,10,dv), (7,9,11,d)}.

The other chords of the same interval structure are given the corresponding values relative to the related translation operations. This is a kind of historical "kernel" of harmonic knowledge. The question is whether one should just give "False" values to all chords that are not covered by this list. I would not, but it is difficult to establish a transparent rule set to evaluate the other chords (and these are most of the total chord system). The choice of falsity is the worst-case scenario, but the present list does not give a good alternative; one just did not find general rules to be applied in the negative cases. We have included this example to make evident the dramatic gap between what is known and what is fuzzy and problematic in historical contexts.
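How the core list extends to translated chords can be sketched as a lookup. The sketch uses only a four-entry excerpt of the list above, and it reports the translation as a plain shift in Z_12; how such a shift is to be read as one of the 24 tonality symbols is left open here, since the text does not spell it out. The function name `true_values` is hypothetical.

```python
# Excerpt of the core list; each entry is (pitch-class set, function value).
CORE_EXCERPT = [
    ({0, 4, 7}, "T"),
    ({0, 3, 7}, "Tv"),
    ({0, 5, 9}, "S"),
    ({2, 5, 11}, "D"),
]


def true_values(chord):
    """Pairs (value, shift) such that `chord` is a core chord translated by `shift`."""
    chord = frozenset(pc % 12 for pc in chord)
    hits = []
    for core_chord, value in CORE_EXCERPT:
        for shift in range(12):
            if frozenset((pc + shift) % 12 for pc in core_chord) == chord:
                hits.append((value, shift))
    return sorted(hits)


g_major = true_values({7, 11, 2})   # the triad g, b, d
unlisted = true_values({0, 1, 2})   # a cluster not covered by the excerpt
```

A chord that returns an empty list is exactly one of the "negative cases" discussed above: the core list is silent about it, and declaring it "False" is only the worst-case choice.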
27.2.4
Optimizing Harmonic Paths
Summary. This section deduces algorithms for valuations of entire paths under modulation constraints. The latter subject reveals the exorbitant complexity of harmonic analysis and throws new light on the question of complexity and depth in the humanities.
In the preceding optimization procedure, changes of tonalities (for example in the distance matrix $T_{TON}$) could be accounted for by numerical weights such that a tonality change is much more expensive than a conservation of the current tonality. Nonetheless, there is no finer tuning of the neighborhood of a harmonic development when a tonality change is recognized. The following algorithm is adequate for such a purpose.

Let us reconsider the above case described in formula (27.14). Each chord of our chord sequence $(a_0, a_1, \ldots, a_n)$ has its Riemann matrix evaluation, and the replacement by the "False" value for values below $\phi_{min}$. Suppose that every chord of the sequence has a Riemann matrix with at least one non-false coefficient. Call the sequence coherent iff there is at least one tonality symbol such that each Riemann matrix in the sequence has at least one non-false coefficient at this tonality. A coherent sequence allows a harmonic path without "false" chord values living in one and the same tonality. Suppose further that one is interested in getting a minimum of tonality changes. This requirement is met as follows. Start with the maximal coherent subsequence $(a_0, a_1, \ldots, a_{m_1})$ starting from the first chord (no gaps allowed). Then consider all maximal coherent subsequences (no gaps allowed) starting from $a_1, \ldots, a_{m_1}, a_{m_1+1}$ (if possible). From these sequences, take the sequence with maximal last index: $a_{i_2}, \ldots, a_{m_1}, a_{m_1+1}, \ldots, a_{m_2}$, then repeat this procedure starting from index $i_2$ and so on. After a finite iteration, we end up with the covering of the whole sequence by coherent subsequences $(a_0, a_1, \ldots, a_{m_1}), (a_{i_2}, \ldots, a_{m_2}), \ldots, (a_{i_k}, \ldots, a_n)$. In each sequence, we may select a best path in a common tonality. This results in a spline-like configuration of paths $p_1, p_2, \ldots, p_k$ with possibly overlapping parts. These parts can be taken as modulatory regions, more precisely: regions of neutralization which are followed by regions of pivotal chords and cadences. A deeper analysis of such splining regions with regard to the modulation theorems (subsections 27.1.4 ff. and 27.1.6 ff.) is pending.
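The covering procedure just described can be sketched as follows. Each chord is represented only by the set of tonalities at which its Riemann matrix has a non-false coefficient. The function names and the example data are hypothetical, and the tie-breaking rule (keep the earliest start among candidates with the same last index) is an assumption, since the text does not fix it.

```python
def longest_coherent_run(tonality_sets, start):
    """Last index and common tonalities of the maximal gap-free coherent run from `start`."""
    common = set(tonality_sets[start])
    end = start
    while end + 1 < len(tonality_sets) and common & tonality_sets[end + 1]:
        common &= tonality_sets[end + 1]
        end += 1
    return end, common


def coherent_cover(tonality_sets):
    """Cover the chord sequence by overlapping coherent runs (start, end, tonalities)."""
    if not tonality_sets:
        return []
    if any(not s for s in tonality_sets):
        raise ValueError("every chord needs at least one non-false tonality")
    start = 0
    end, common = longest_coherent_run(tonality_sets, start)
    cover = [(start, end, common)]
    while end < len(tonality_sets) - 1:
        best = None
        # Candidates start at a_{start+1}, ..., a_{end+1}.
        for candidate in range(start + 1, end + 2):
            cand_end, cand_common = longest_coherent_run(tonality_sets, candidate)
            if best is None or cand_end > best[1]:  # ties: keep the earliest start
                best = (candidate, cand_end, cand_common)
        start, end, common = best
        cover.append(best)
    return cover


# Hypothetical sequence of six chords with their admissible tonalities.
sets = [{"C"}, {"C", "G"}, {"C", "G"}, {"G"}, {"G", "D"}, {"D"}]
cover = coherent_cover(sets)
```

In the example, the runs overlap on chords 1 to 2 and on chord 4; these overlaps are the candidates for the modulatory regions mentioned above. The loop terminates because the candidate starting right after the current run always reaches a strictly larger last index.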
Chapter 28
Applications

From the fact that the structures and forms created by the theorists have mostly turned out to be artificial and sometimes misguided, it does not follow that there is no general structure whatsoever that might one day be uncovered by a better analysis of music, one which takes into account all of its manifestations in time and space.
Claude Lévi-Strauss [294]

Summary. This chapter deals with illustrations of the modulation model in chapter 27. It treats short and longer examples from Bach to modern jazz. This exercise should make clear the methodological background of these musicological experiments and illustrate the theory exposed in chapter 4.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002

Before working out the concrete examples, we should emphasize that this approach is not meant to be a confirmation of any classical music theory. It is rather a discussion of a selection of empirical material in the light of the previous modulation theorems. The delicate point here is that the modulation theorems do not prescribe any historical specification; they couldn't care less. These theorems give a list of pivotal degrees and not the answer to the question of historical adequacy. For such questions, one would have to look for other criteria. A principal branch of such criteria comes from the communicative dimension of musical topography. When a work is analyzed, it is first taken in its neutral setup as a determined denotator of a more or less complex structure. As such, it has several pointers to poietic determinants, for example by the composer's name and/or the year of publication. When considering the poietic coordinate, one could ask for the adequacy of the modulation model in question with respect to the historical position of the composer and the composer's knowledge base. Questions such as this one arise: "Did the composer (Beethoven, for instance) use the type of reflections as described in the modulation model to construe the modulations in the given score (op. 106, for example)?" Up to unknown and highly unlikely secret historical legacies, the answer is clearly "No!". But this is definitely not the discourse of our enterprise. We are not discussing conscious compositional strategies. Rather we are confronting the result of compositional efforts, the score (denotator), with the present modulation models. We are, so to speak, in the physicist's position to investigate nature's laws without ever knowing if and what and how a divine being could have built nature.

The default answer of a musicologist would be that we are not physicists in so far as we know quite a bit about Beethoven's approach to modulation or at least, we know that he was not in a state of considering those mathematical tools used in the mathematical model of modulation. And therefore (this is the argumentation of the musicologist) any such modeling remains disconnected from the historical reality. But it is precisely this latter concept of historical reality which we have to question in view of Dahlhaus' request for a systematic (re)construction of the dia- and synchronic system of music cited in section 27.1.5. Reality is not just a set of phenomena, but the totality of agents which shape the historical dynamics of the system. It is precisely the invisible and counter-intuitive strata of a system which usually are responsible for its unfolding. The absence of conscious traces of Beethoven's modulatory constructs does not explain these modulations and it does a fortiori not explain the global architecture of these modulations in the development of a large piece, such as Beethoven's op. 106. Moreover, the incredible organization and process of such masterworks cannot be explained by biographic factors; these things are of a much finer nature than the gross facts of a composer's life.
And even if the composer had made statements about his poietic activity, this would just be one perspective upon the work's neutral (objective) data. There is no guarantee that this perspective must be the best possible. It is very likely that the intuitive and ingenious forces cover a strong majority of the work's characteristic and outstanding instances. So what can we retain from an application of a mathematical model of modulation if not the historical adequacy? We can retain the explanatory power, i.e., the adequacy of model and instance, theory and experiment. How much of the given example can the model explain, and in which detail? This is what we shall explain in the following examples, and not whether or not the composer would have thought along the same lines. A last and personal remark about composers: Of all the composers that I have known, the great majority (also those who work with explicit constructive methods) was never in a state of explaining their own works to a satisfactory degree. Good works have always been far from algorithmic runs, even with composers of a knowledgeable mathematical flair or training, such as Tom Johnson or Jan Beran or myself. This is not a dogmatic statement, of course. It is the insight that good music is an enormously complex affair for which the present knowledge is not yet in a state of giving exhaustive construction algorithms.
28.1
First Examples
Summary. This section discusses short examples from Johann Sebastian Bach, Wolfgang Amadeus Mozart, and Claude Debussy.
In all subsequent examples, we shall refer to the modulation theorem in the well-tempered case. This seems legitimate in the historical context, starting from Bach. But it is not a priori clear which theorem should be applied in general.
28.1.1
Johann Sebastian Bach: Choral from "Himmelfahrtsoratorium"
Summary. The example exposes the first eight bars of the composition and shows a remarkable congruence of the model with a piece of "nature" which is far from being poietically conceived according to the model or its expression in music theories.
[Figure 28.1: Johann Sebastian Bach: Choral from the Himmelfahrtsoratorium [33, p. 41] (with kind permission of the Bärenreiter Publishers).]

In this example, we consider bars 1 through 8 (see figure 28.1). For our analysis we strictly take the perspective of triadic degree interpretations, i.e., the global objects contained in $Dia^{(3)}$. Chords are always zero-addressed and are deduced from onsets in the sense that whenever there is a new onset, one considers the chord consisting of all pitch classes of events which start here or did not end at this onset or before. According to our triadic degree perspective, such a chord will not remain uninterpreted; it will be covered by degrees of a triadic degree interpretation in the sense already explicated in section 27.1.2. For chords which are not triadic degrees, we choose an interpretation by a minimal number of triadic degrees. For example, the interval chord $\{d, a\}$ could be embedded in the major triad $\{d, f\sharp, a\}$ or in the minor triad $\{d, f, a\}$. Since this embedding is not unique, we have listed our choices in figure 28.2.

[Figure 28.2: Two successive modulations with Bach from D-major to A-major, and back. In both cases, the pivotal degrees correspond to those predicted by the (well-tempered) modulation theorem. Moreover, within the symmetric degree neighborhood of the turning point of the first modulation, the modulator, i.e., the inversion $U_{g/g\sharp}$, becomes evident.]

We shall view this process as a sequence of two modulations, the first from $D^{(3)}$ to $A^{(3)}$, the second back from $A^{(3)}$ to $D^{(3)}$. The scope is to compare the harmonic development with instances of the well-tempered¹ modulation theorem. The modulator is the inversion $U_{g/g\sharp}$ (recall that we write $U_{g/g\sharp}$ for the inversion $U_3$, a reflection at the point between $g$ and $g\sharp$). The pivot degrees for the modulation $A^{(3)} \rightsquigarrow D^{(3)}$ are $III_A, V_A, VII_A$, whereas the modulation $D^{(3)} \rightsquigarrow A^{(3)}$ admits the pivotal degrees $II_D, IV_D, VII_D$ (see appendix N.1). The modulator symmetry $U_{g/g\sharp}$ gives rise to the following correspondences:
[The display listing these correspondences is missing here.]
The first bar and the first two onsets of bar 2 are neutral since all appearing neutral degrees I, III, V, VI of $D^{(3)}$ are also degrees of $A^{(3)}$. The third onset in bar 2 presents the pivots $V_A, VII_A$ according to table appendix N.1.2 and may therefore be viewed as the turning point of this modulation to $A^{(3)}$. The following onsets until the fermata at the beginning of bar 4 are a cadence of the target tonality $A^{(3)}$ with minimal cadence set $\{VII_A\}$, or, distributed among several onsets, with $\{II_A, V_A\}$. Beyond this it is interesting to observe the neighborhood of the turning point: it makes the modulator evident. The last four onsets before the turning point present $I \cup III \cup V$, $V$, $I$, and (less weighted as onset of the fifth to the delayed $V_A$) $VI_D$.

¹ We often use the sloppy wording "well-tempered" instead of "12-tempered", but are conscious that there are subtle differences in the terms.
After the turning point and before the cadence towards the end of bar 3, there appear two onsets of $VI$, $IV \cup II$ of $A^{(3)}$. These three degrees of $A^{(3)}$ correspond (under the modulator) to the degrees I, III, V of $D^{(3)}$ before the turning point. The modulation in the reversed direction brings the neutral degrees in bar 4, then in the first two onsets of bar 5 the turning point with the admitted pivots $II_D$ and $VII_D$, and finally, in bar 6, follows the cadence according to the same pattern as observed in the first modulation.
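The onset-based deduction of chords used in this example can be sketched as follows. This is only an illustration under stated assumptions: events are modeled as hypothetical `(onset, duration, pitch)` triples, and the sample data is a toy fragment, not a transcription of Bach's score.

```python
from typing import NamedTuple


class Event(NamedTuple):
    onset: int      # onset time in some integer unit (e.g. eighths)
    duration: int   # duration in the same unit
    pitch: int      # pitch in semitones (e.g. MIDI key number)


def chords_at_onsets(events):
    """For every distinct onset, collect the pitch classes of all events
    that start there or are still sounding (started before, not yet ended)."""
    onsets = sorted({e.onset for e in events})
    chords = []
    for t in onsets:
        chord = frozenset(
            e.pitch % 12
            for e in events
            if e.onset <= t < e.onset + e.duration
        )
        chords.append((t, chord))
    return chords


# Toy data: d sustained over two units, f sharp only on the first, a on the second.
toy = [Event(0, 2, 62), Event(0, 1, 66), Event(1, 1, 69)]
for onset, chord in chords_at_onsets(toy):
    print(onset, sorted(chord))
```

In the toy fragment the second onset yields the interval chord {d, a}, the kind of chord that then needs an embedding into a triadic degree.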
Figure 28.3: Wolfgang Amadeus Mozart: Zauberflöte, piano excerpt (Soldan), second Scene No. 18, Chor der Priester [383, p.118] (with kind permission of the C.F. Peters Publishers, Frankfurt/Main).
28.1.2
Wolfgang Amadeus Mozart: "Zauberflöte", Choir of Priests
Summary. This second example yields, apart from a provocative congruence, an interesting text-based semantic interpretation of the modulatory structure, an interpretation which seemingly did not appear in more traditional harmonic analyses.
We are considering bars 1-18 of scene No. 18: "Chor der Priester" (see figure 28.3). The interpretation via chord onsets relating to triadic interpretations in $Dia^{(3)}$ is shown in figure 28.4. We however have to comment on this figure since three onsets in bar 8 are not contained in any triadic structure from $Dia^{(3)}$: they present the diminished seventh chord $\mathfrak{M}_2 = \{f\sharp, a, c, e\flat\}$. It is interpretable via degrees as a union of $VII_G$, $VII_{B\flat}$, $VII_{D\flat}$, and $VII_E$.
Figure 28.4: The configuration of triadic degrees as it is shown here from the perspective of modulation theory. In bar 8, the diminished seventh chord plays a special role since it is not interpretable within $Dia^{(3)}$.

To begin with, the score text poses a problem in bar 8 since we now leave D-major, where the degrees were positioned until bar 7. But we also leave $Dia^{(3)}$. In fact, bars 8 and 9 can be interpreted as these degrees in $g_h^{(3)}$ (see also 26.2.1): five times $II \cup VII$, then $VII$, and eight
times I. After that we return to D-major, and we have no more such conflicts until degree $VII_{E\flat}$ at the beginning of bar 14, which can also be viewed as $VII_{e\flat}$ or $II_c$ in harmonic minor tonalities. If we interpret the strict neighborhood of the diminished triad as lying in A-major, it appears as being isolated and cannot be understood easily. It is interesting that these two places are in a special position when related to the text semantics. The diminished seventh chord, when exiting the orbit $Dia^{(3)}$, is the moment of "düst'rer Nacht"; the diminished triad $VII_{E\flat}$ corresponds to the text which tells about the dark night, death, the polar concept against "life".
Figure 28.5: Part a) of this figure shows the modulation paths within the circle of fourths. Here, E♭ is the pole to A, and to its neighboring tonalities D, E which are symmetrically positioned with respect to the polar axis A, E♭. The two semantic poles "night" (first E♭) and "life" (second E♭) are placed on distinguished tonality positions in front of the contextual tonalities D, A, E. Part b): The five modulators of the present modulations are partitioned into three groups: two pairs of orthogonal axes, $U_{e/f}$ and $U_{g/g\sharp}$, as well as $U_{b/c}$ and $U_{g\sharp/a}$, and the axis $U_{g\sharp}$ which lies symmetrically with respect to these two pairs.
Let us try to set up a connection between the text-semantic extra position of $\mathfrak{M}_2$ and of $VII_{E\flat}$ within the modulatory configuration. To this end, we write $III_{E\flat}$ in bar 9, and in bar 14, we write $II_E$ after $VII_{E\flat}$. Bar 8, i.e., $\mathfrak{M}_2$, is not interpreted by degrees, but as a representation
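of modulators. Here is our modulation scheme (the pivots are bold-faced):

\[
\begin{array}{l|cccccc}
\text{Modulator:} & U_{e/f} & U_{e/f} & U_{g/g\sharp} & U_{g\sharp} & U_{b/c} & U_{g\sharp/a}\\
\text{Tonality:} & D \rightsquigarrow E\flat & E\flat \rightsquigarrow D & D \rightsquigarrow A & A \rightsquigarrow E\flat & E\flat \rightsquigarrow E & E \rightsquigarrow A\\
\text{Circle of fourths:} & 5. & 7. & 11. & 6. & 5. & 1.\\
\text{Appearance:} & b.9 & b.10 & b.13 & b.14 & b.14 & b.16
\end{array}
\]

The pivots of each modulation are taken from the degrees II, III, (IV), V, VII.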
In figure 28.5, part a), the modulation paths are drawn on the circle of fourths. One recognizes the two segments in E♭ as a pole to A and its symmetrically positioned neighbors D, E. The two semantic poles "night" (first E♭) and "life" (second E♭) are placed on distinguished tonality positions in front of the contextual tonalities D, A, E. Also, the modulators appear in a neat symmetric way: Part b) of figure 28.5 shows the axes of five different inversions and the diminished seventh chord $\mathfrak{M}_2$. These five modulators are divided into three groups: The pair of orthogonal axes (minor thirds!) associated with $U_{e/f}$ and $U_{g/g\sharp}$ is contained in the symmetry group of $\mathfrak{M}_2$. It controls the first three modulations, i.e., the 'edges' D - E♭ and D - A in figure 28.5, part a). Hence, $\mathfrak{M}_2$ appears as a 'support' of two modulators. These $\mathfrak{M}_2$-symmetries are in fact visible in bar 8: on the change $a \mapsto f\sharp = U_{g/g\sharp}(a)$ from the first to the second eighth onset, on the change $c \mapsto a = U_{e/f}(c)$ from the second to the third, on the change

\[
\{a, e\flat\} \mapsto \{f\sharp, c\} = U_{g/g\sharp}(\{a, e\flat\}) = U_{e/f}(\{a, e\flat\}) = e^{3}(\{a, e\flat\})
\]

from the fifth to the sixth, and on the change $c \mapsto a = U_{e/f}(c)$ from the sixth to the seventh eighth onset. Symmetrically to this, the two 'edges' E - E♭ and E - A in part b) of figure 28.5 are mediated by the equally orthogonal axes associated with $U_{b/c}$ and $U_{g\sharp/a}$. And the middle modulation $A \rightsquigarrow E\flat$ can be viewed as 'symmetry edge' on part b) of figure 28.5; its modulator $U_{g\sharp}$ is equally inserted in a symmetric way between the double pair of orthogonal axes. This beautiful symmetry, which is semantically accompanied by the linguistic text, has the advantage that we can recognize an inner and semantic connectivity among the four tonalities D, A, E, E♭ instead of five tonalities D, A, E, E♭, and $g_h$, without any semantical connectivity.
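The symmetry statements of this example lend themselves to a quick numerical check. The sketch below assumes pitch classes modulo 12 with c = 0, tonalities modeled as plain major-scale pitch-class sets, and an inversion $U_{p/q}$ written as $x \mapsto p + q - x$; the names `inv`, `major` and `path` are illustrative only.

```python
MAJOR_STEPS = (0, 2, 4, 5, 7, 9, 11)
PC = {"c": 0, "d": 2, "eb": 3, "e": 4, "f": 5, "f#": 6,
      "g": 7, "g#": 8, "a": 9, "b": 11}


def major(tonic):
    """Pitch-class set of the major scale on the given tonic."""
    return frozenset((tonic + s) % 12 for s in MAJOR_STEPS)


def inv(p, q=None):
    """Inversion U_p (q omitted) or U_{p/q}: x -> p + q - x (mod 12)."""
    k = 2 * p if q is None else p + q
    return lambda pcs: frozenset((k - x) % 12 for x in pcs)


def transpose(t):
    return lambda pcs: frozenset((x + t) % 12 for x in pcs)


U_ef, U_ggs = inv(PC["e"], PC["f"]), inv(PC["g"], PC["g#"])
U_gs, U_bc, U_gsa = inv(PC["g#"]), inv(PC["b"], PC["c"]), inv(PC["g#"], PC["a"])
D, A, E, Eb = (major(PC[n]) for n in ("d", "a", "e", "eb"))

# The six modulations of the scheme: (source, modulator, target).
path = [(D, U_ef, Eb), (Eb, U_ef, D), (D, U_ggs, A),
        (A, U_gs, Eb), (Eb, U_bc, E), (E, U_gsa, A)]
scheme_ok = all(u(src) == dst for src, u, dst in path)

# The first pair of axes leaves the diminished seventh chord invariant.
M2 = frozenset(PC[n] for n in ("f#", "a", "c", "eb"))
m2_ok = U_ef(M2) == M2 and U_ggs(M2) == M2

# The dyad change in bar 8 under the three symmetries.
dyad = frozenset({PC["a"], PC["eb"]})
images = {U_ggs(dyad), U_ef(dyad), transpose(3)(dyad)}
print(scheme_ok, m2_ok, images)
```

With these conventions each modulator maps its source scale onto its target scale, and the three images of {a, e♭} coincide in {f♯, c}.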
28.1.3
Claude Debussy: "Preludes", Livre 1, No.4
Summary. This analysis again produces a possibly unexpected congruence with the model and succeeds in giving a very explicit interpretation to the title's "parfum".
It is particularly important to recall from the introduction to this chapter that we do not make any statements about the poietic rationales of these compositions in our examples, and that our analysis is nonetheless a scientifically legitimate and valid approach, much as human physical models of nature. We would strongly suggest setting up comparative discourses between traditional music analysis, jazz analysis, and the mathematical approach. For obscure reasons, this type of discourse has been applied very rarely². We would however prefer to lead such a discourse on the level of neutral analysis, and not under the petitio principii perspective that knowing the composer's innermost agitations (who will ever know that?!) preconizes a distinguished analysis. We would rather like to see the objective advantages in the understanding of what is written down in the score.
Figure 28.6: Claude Debussy: Les sons et les parfums ..., "Préludes", Livre 1, Nr. IV [113] (with kind permission of the Henle Publishers).

We want to study a piece in Debussy's "Préludes", Livre 1. It is No. 4, "Les sons et les parfums tournent dans l'air du soir", a title which is taken from Baudelaire's "Harmonie du soir", third verse. We want to deal with the modulation from A♭-major to A-major (this is what the signature suggests), bars 33-37, see figure 28.6. The analysis starts on the last fourth (indication: serrez ...) of bar 33 and terminates on the arpeggio at the beginning of bar 37. The degree onsets are shown in figure 28.7. In contrast to the preceding example, there is no problem here to immerse chords within triadic interpretations. Rather, we encounter two problems: the mixed variety of tonalities and the ambiguity of most onsets with respect to tonal specification. It is particularly remarkable that a cadence of A-major does not occur. Moreover, the rhythmically simplistic sequence of mixed chords, which alternates between $I_C$, $V_C$ and $V_A$, $I_A$ (bars 35 to 2/3 of bar 36, indication: la basse un peu appuyée), is close to shocking in its strangeness to the context. If we suppose that there is a modulation from A♭ to A, what could be the role of C? One can testify the presence of C by the cadence IV, V, in fact the mentioned $I_C$-$V_C$ sequence, by the arpeggio of the chord $I_C \cup VI_C$ of the left hand, and by the melody line of the right hand. The role of C cannot be that of an intermediate tonality since C incessantly interplays with the target tonality. Also, it is impossible that the modulation targets the intersection of A and C, since the end clearly points away from C. The only possibility which remains is that C does not play the role of a modulatory station, but of a modulator, if we stick to the present model.

² An exceptional case is found in Musiktheorie, Vol. 4/2000.
Figure 28.7: In the modulation from A♭ to E (and A), the modulator, i.e., the inner symmetry of C and G♭, is evidenced by chordical and motivic representation of this double scale material.

The triadic interpretation $C^{(3)}$ has exactly one non-trivial inner symmetry, namely the inversion $U_d$. There is a hint for this symmetry in the line of the left-hand voice in bars 35-36: Here, we have the mentioned arpeggio of $I_C \cup VI_C$, a chord which is symmetrical with respect to $U_d$, a fact which is felt as a to-and-fro movement within the chordically completed arpeggio (indication to play the bass!). Moreover, we observe three tones d♭, e♭, f at the second onset of bar 34, which lie in $IV \cup V$ of $G\flat^{(3)}$, and the last onset of the same bar can also be specified as $III$ of $G\flat^{(3)}$. However, regarding inner symmetries, G♭ and C are equivalent; i.e., the inversion $U_d$ is 'represented' by both C and G♭. But if $U_d$ is the modulator, then it operates via $A\flat^{(3)} \rightsquigarrow E^{(3)} = U_d(A\flat^{(3)})$, and not to target $A^{(3)}$. This should happen with the pivots II, III, V, VII or with II, IV, V, VII (second cadence option). This is exactly what happens in the beginning of bar 34: $II_E \cup VII_E$ appear as unambiguous pivots towards $E^{(3)}$. The other degrees which do not pertain to C or to G♭
are drawn in figure 28.7. They are all in E as well as in A, except the last onset $I_{A\flat}$ in bar 36, which (together with the onsets in G♭) concludes in an atmospheric flavor the tonality A♭, which is in symmetrical position to E. This not only helps to understand the modulatory function of the tritonally related tonalities C and G♭, it also makes evident the ambiguity of the modulation, and thereby the charm of this passage: The piece modulated to E, but the cadence of E did not take place, one remains within the intersection $E \cap A$, and the first degree of A in bar 37 is a fact that can only be interpreted in an ambiguous way; it could also be $IV_E$. So tonality E has been introduced in a defined way, but without cadencing, rather by neutralizing it with respect to A. In this view, the title receives a structural reading. The tones are really turned within the modulation (via $U_d$), the "parfum" of A♭ remains mixed with the "parfums" of A and E until the end, it keeps returning like a memory.
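The role of $U_d$ in this analysis can be checked on pitch-class sets. The following is a minimal sketch, assuming tonalities are modeled as major scales modulo 12 (c = 0) and restricting attention to translations and inversions; the names `act`, `tI_symmetries` and `checks` are illustrative, not notation from the text.

```python
MAJOR_STEPS = (0, 2, 4, 5, 7, 9, 11)


def major(tonic):
    return frozenset((tonic + s) % 12 for s in MAJOR_STEPS)


def act(t, sign, pcs):
    """Translation (sign = 1) or inversion (sign = -1): x -> t + sign * x."""
    return frozenset((t + sign * x) % 12 for x in pcs)


def U_d(pcs):
    """Inversion at d (pitch class 2): x -> 4 - x (mod 12)."""
    return act(4, -1, pcs)


def tI_symmetries(pcs):
    """All translations/inversions leaving the set invariant."""
    return [(t, s) for s in (1, -1) for t in range(12) if act(t, s, pcs) == pcs]


C, Gb, Ab, E, A = major(0), major(6), major(8), major(4), major(9)
arpeggio = frozenset({0, 4, 7, 9})  # I_C united with VI_C

checks = {
    "C has only identity and U_d": tI_symmetries(C) == [(0, 1), (4, -1)],
    "Gb has the same inner symmetry": tI_symmetries(Gb) == [(0, 1), (4, -1)],
    "U_d maps Ab to E": U_d(Ab) == E,
    "U_d does not reach A": U_d(Ab) != A,
    "arpeggio chord is U_d-symmetric": U_d(arpeggio) == arpeggio,
}
for claim, ok in checks.items():
    print(f"{claim}: {ok}")
```

Under these assumptions all five checks hold, in line with the reading that the modulation is directed to E rather than to A.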
28.2
Modulation in Beethoven's Sonata op.106, 1st Movement
Summary. This longer example illustrates complex connections of modulatory processes within the Allegro movement of the "Hammerklavier" Sonata. We give an interpretation and precision of the famous theses by Erwin Ratz [434] and Jürgen Uhde [534] with respect to the overall architecture of the modulation processes.
28.2.1
Introduction
Summary. This introduction sketches the key position of the "Hammerklavier" Sonata in the history of European piano music.
In contrast to the preceding examples, this one will not deal with short modulatory passages but with the overall organization of tonalities and architecture of modulatory passages. The methods will be the same as in the preceding examples, but we shall only sketch the details, except for some punctual zooms. A detailed discussion, including motivic aspects ("motivisch-thematische Arbeit") and their connection to harmony, can be found in [328]; some of the motivic aspects have also been discussed in example 24 of section 14.2. We want to investigate the Allegro movement in Beethoven's "Hammerklavier" sonata op.106 [46], a very famous and equally difficult late work which the deaf Beethoven composed in 1817-1819. In his description of op.106 [257], Joachim Kaiser starts with the lapidary phrase: "Größte Sonate der Musikgeschichte." The sonata's public premiere was given in 1836 in Paris by Franz Liszt. But the "Hammerklavier" sonata is not only a technical and mnemotechnical challenge for the interpreter, it is also a mystery for the sonata theory (developed by Adolf Bernhard Marx [320] in the 1850s) which established the sonata form as an accepted norm. This sonata form ("Sonatenhauptsatzform") is an architectural and processual scheme of a large musical form which is framed by a syntactic subdivision into exposition, development, recapitulation, and coda (see figure 28.8).
The comparison with the classical sonata form will only be a starting point of our analysis of the present individual composition. It is well known that the sonata scheme is not adequate for more than a statistical estimation of really existing sonata movements. However, as a starting point, we can use the scheme in order to evaluate knowledge and limits of traditional musicology with the complexity of tonal and modulatory structures of the present sonata.

Figure 28.8: The sonata scheme of the first movement of the "Hammerklavier" sonata op.106, compared to the normed sonata scheme.

Our analysis is not thought of as a contradiction but as a confirmation and elaboration of existing analyses. In particular, we take over the fundamental world-antiworld thesis of Erwin Ratz [434] and the "catastrophe theory" of Jürgen Uhde [534] which relates to the specific modulatory role of the diminished seventh chords in the sonata. With regard to the formal explicitness of our model, the take-over will however include precision, completion, and unification of the aforesaid world-antiworld thesis and catastrophe theory.
28.2.2
The Fundamental Theses of Erwin Ratz and Jürgen Uhde
Summary. This section presents the thesis of Ratz and its weltanschauung, as well as the thesis of Uhde. A restatement of these theses in group-theoretic terms is given. As its consequence, a particular modulation architecture is predicted. This will be verified in section 28.2.3.
In [434], Ratz expressed the idea that the "empire of tonalities" in op.106 is polarized into a "world" around the pole B♭-major, the sonata's main tonality, and an "antiworld" around the counterpole b minor. In [534], Uhde supplemented this thesis in so far as two so-called "catastrophes" occur in the Allegro movement when the "world" is left in order to enter the "antiworld". Catastrophes are dramatic modulatory processes which differ significantly from normal modulations (see figure 28.9). We discuss the following thesis:
Thesis 4 Let $\mathfrak{M}_0 = \{c\sharp, e, g, b\flat\} \subset PiMod_{12}$ be the local composition of the diminished seventh chord. Then the modulation structure of op.106 is determined by the modulators in the sense of theorem 30 which are contained in $Aut(\mathfrak{M}_0)$.

What is the relation of this thesis with Ratz' world/antiworld? To begin with, the group $Aut(\mathfrak{M}_0)$ solves the following purely group-theoretic problem: We look for a maximal subgroup $M$ of the group $TI_{12}$ of inversions and translations on $PiMod_{12}$ which acts as modulator group on $Dia^{(3)}$, but under the restriction that no group element transforms $B\flat^{(3)}$ to $D^{(3)}$. Here, the interpretation $D^{(3)}$ stands for the minor tonality b of ionian mode. Under the action of such a subgroup, the set $Dia^{(3)}$ is divided into at least two orbits (one for B♭, and one for b). A priori, it is not clear whether there are several such subgroups, and how many orbits such maximal subgroups will produce. But it turns out that the subgroup is uniquely determined, viz., $M = Sym(\mathfrak{M}_0) \cap TI_{12}$; these are the inversions and translations which leave $\mathfrak{M}_0$ invariant. We have the surjection $Sym(\mathfrak{M}_0) \to Aut(\mathfrak{M}_0)$, and it is easily seen that its restriction

\[
M \to Aut(\mathfrak{M}_0)
\]

is an isomorphism. So we may identify these groups. Moreover, there are exactly two orbits under this automorphism group, namely the world

\[
W = Aut(\mathfrak{M}_0).B\flat = \{C, D\flat, E\flat, E, G\flat, G, A, B\flat\}
\]

and the antiworld

\[
W^* = Aut(\mathfrak{M}_0).D = \{D, F, A\flat, B\}.
\]
We are now in a position to make Ratz' hypothesis more precise: The tonalities of $W$ are those of Ratz' "world", whereas the complementary set $W^*$ defines Ratz' "antiworld". This constrains our possibilities in two ways: we are given a precise dichotomy $W/W^*$ and the admitted modulators, i.e., the elements of the automorphism group $Aut(\mathfrak{M}_0)$.
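The orbit decomposition can be reproduced by brute force. The sketch below is an illustration under stated assumptions: tonalities are modeled as the twelve major scales modulo 12 (c = 0), the group is taken to be the translations and inversions leaving $\mathfrak{M}_0$ invariant, and the names `GROUP`, `orbit`, `world` and `antiworld` are chosen here for convenience.

```python
MAJOR_STEPS = (0, 2, 4, 5, 7, 9, 11)
NAMES = ["C", "Db", "D", "Eb", "E", "F", "Gb", "G", "Ab", "A", "Bb", "B"]
M0 = frozenset({1, 4, 7, 10})  # c sharp, e, g, b flat


def major(tonic):
    return frozenset((tonic + s) % 12 for s in MAJOR_STEPS)


def act(t, sign, pcs):
    """Translation (sign = 1) or inversion (sign = -1): x -> t + sign * x."""
    return frozenset((t + sign * x) % 12 for x in pcs)


# Translations and inversions that leave the diminished seventh chord invariant.
GROUP = [(t, s) for s in (1, -1) for t in range(12) if act(t, s, M0) == M0]
BY_SCALE = {major(t): NAMES[t] for t in range(12)}


def orbit(name):
    """Names of all major tonalities reachable from `name` under GROUP."""
    scale = major(NAMES.index(name))
    return {BY_SCALE[act(t, s, scale)] for t, s in GROUP}


world = orbit("Bb")
antiworld = orbit("D")
print(len(GROUP), sorted(world), sorted(antiworld))
```

With these conventions the group has eight elements (four translations, four inversions), and the twelve tonalities split into an orbit of eight and an orbit of four.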
Figure 28.9: Ludwig van Beethoven: Two modulations in the first movement of op.106 [46] (with kind permission of the C.F. Peters Publishers, Frankfurt/Main). The first (A), from G to E♭, is a common one, whereas the second (B), from E♭ to b-minor, is a catastrophe modulation in the sense of Uhde.

The modulatory situation here is that of a restriction of modulators to the group $Aut(\mathfrak{M}_0)$. What is the meaning of this condition for the modulation model? As long as the modulator which is described by the modulation theorem is contained in $Aut(\mathfrak{M}_0)$, there is no problem to apply the theorem and to look for pivots. But a modulation to the third circle of fourths, like $C \rightsquigarrow E\flat$ or $A \rightsquigarrow C$, only admits translations in our restricted framework. In this case we must refrain from rigidity condition (3) (the triviality of the intersection $TI_{12} \cap Sym(T \cap Q)$) in property 1 of section 27.1.4. In fact, this condition, which guaranteed the uniqueness of the modulator, is superfluous now: there is only this translation symmetry, no other candidate! But then, there is a corresponding theorem where the modulators are the translations $e^{\pm 3}$. Therefore, one should modulate in the sense of restricted modulators and pivots within $W$ and within $W^*$, whereas modulation between two tonalities living in different worlds should yield a catastrophe in the sense of Uhde, which means in particular that in such a situation,
diminished seventh chords (which are the 'creators' of the catastrophe) should become visible at the surface of the score.
28.2.3
Overview of the Modulation Structure
Summary. We present the modulatory architecture and the modulations which split into "ordinary" cases and "catastrophes" in the sense of Uhde.
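We now have to test these postulates in the total plan of modulations of the Allegro movement. The modulation plan looks like this:

\[
\begin{array}{ccccccccccccccccc}
W & & W & & W & & W^* & & W^* & & W & & W & & W & & W\\
B\flat & \overset{e^{-3}}{\rightsquigarrow} & G & \overset{U_g}{\rightsquigarrow} & E\flat & \overset{!}{\rightsquigarrow} & D/b & \overset{U_{d/d\sharp}}{\rightsquigarrow} & B & \overset{!}{\rightsquigarrow} & B\flat & \overset{U_{b\flat}}{\rightsquigarrow} & G\flat & \overset{U_{a\flat/a}}{\rightsquigarrow} & G & \overset{e^{3}}{\rightsquigarrow} & B\flat.
\end{array}
\]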
In order to view these modulations on the circle of fourths and relative to the world/antiworld dichotomy, see figure 28.10. For each modulation, the modulators from $Aut(\mathfrak{M}_0)$ are indicated. Here are the single modulations in detail:
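The modulation plan can be tested step by step against the restricted modulator group. The following sketch assumes major scales modulo 12 (c = 0) as stand-ins for the tonalities (D/b is represented by the D scale) and encodes each modulator as a pair `(t, sign)` for the map $x \mapsto t + sign \cdot x$; `PLAN` and `check_step` are names introduced here for illustration.

```python
MAJOR_STEPS = (0, 2, 4, 5, 7, 9, 11)
TONICS = {"C": 0, "Db": 1, "D": 2, "Eb": 3, "E": 4, "F": 5,
          "Gb": 6, "G": 7, "Ab": 8, "A": 9, "Bb": 10, "B": 11}
M0 = frozenset({1, 4, 7, 10})


def major(name):
    return frozenset((TONICS[name] + s) % 12 for s in MAJOR_STEPS)


def act(t, sign, pcs):
    return frozenset((t + sign * x) % 12 for x in pcs)


GROUP = [(t, s) for s in (1, -1) for t in range(12) if act(t, s, M0) == M0]

# (source, modulator or None for a catastrophe, target)
PLAN = [
    ("Bb", (-3 % 12, 1), "G"),           # e^{-3}
    ("G", (2 * 7 % 12, -1), "Eb"),       # U_g
    ("Eb", None, "D"),                   # catastrophe into the antiworld
    ("D", ((2 + 3) % 12, -1), "B"),      # U_{d/d#}
    ("B", None, "Bb"),                   # return to the world
    ("Bb", (2 * 10 % 12, -1), "Gb"),     # U_{b flat}
    ("Gb", ((8 + 9) % 12, -1), "G"),     # U_{a flat/a}
    ("G", (3, 1), "Bb"),                 # e^{3}
]


def check_step(src, modulator, dst):
    """Ordinary step: modulator lies in GROUP and maps src to dst.
    Catastrophe: no element of GROUP connects src and dst."""
    if modulator is None:
        return not any(act(t, s, major(src)) == major(dst) for t, s in GROUP)
    t, s = modulator
    return modulator in GROUP and act(t, s, major(src)) == major(dst)


results = [check_step(*step) for step in PLAN]
print(results)
```

Under these assumptions every ordinary step is realized by an element of the group, while the two steps marked "!" cannot be realized by any of its elements.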
Figure 28.10: The graph shows the modulation plan in the Allegro movement of Beethoven's op.106 in the tonality system arranged on the circle of fourths. The start switches from B♭ to G. The inverse modulation occurs at the end, and both follow the same procedure, see sections 28.2.4 and 28.2.11. Except for these initial and terminal movements, the modulation plan is perfectly symmetric around the symmetry axis between $B\flat^{(3)}/E\flat^{(3)}$ and $A^{(3)}/E^{(3)}$.
28.2.4
Modulation $B\flat \rightsquigarrow G$ via $e^{-3}$ in $W$
The first modulation $B\flat \rightsquigarrow G$ in the transition (bars 39-46) to the second subject could in principle be performed by use of a "pedal modulation" [478]. We do however not encounter this modulation, but 'merely' a sequence of $VII_G$-degrees whose top notes are shifted by minor thirds from each other, i.e., exactly the situation of the pivot VII and the third translation, as predicted by the modulation under restricted modulators.
28.2.5
Modulation $G \rightsquigarrow E\flat$ via $U_g$ in $W$
This modulation is bipartite (first part: bars 124-127, second part: bars 128-129). Before we encounter the pivots VII - V - VII of E♭ according to modulation table appendix N.1 in part two, we hear tone g as an octave interval: pedal and stationary voice. Here the inversion at g is made evident (figure 28.11). On the one hand, this section is a cadence of $C_{mel}$ whose inner symmetry is $U_g$. One recognizes a bipartite contrapuntal motive structure of which the second part (bars 126-127) is the inversion of the first part (bars 124-125) in pitch c♯. But modulo octave (in the pitch class space), this inversion is the same as $U_g$. Since c♯ is not contained in the present scales, it is delegated to the octave in g. This means that the modulator $U_g$ is motivically evidenced in this first modulatory section preceding the pivots and the cadence.
Figure 28.11: If we omit tone f in bars 124-127 (it serves for the identification of $C_{mel}$) and transpose all pitches into one octave between the two gs (to the right), then we recognize the motivic inversion symmetry between bars 124-125 and bars 126-127.
28.2.6
Modulation $E\flat \rightsquigarrow D/b$ from $W$ to $W^*$
This modulation is a catastrophe in the sense of Uhde since it leads to the antiworld $W^*$. As we may recognize already from the score (B) in figure 28.9, bars 189-197 are of a dramatic shape. Any elaborate motivic, rhythmic or harmonic effort is postponed in favor of a pertinent presentation of diminished seventh chords. An approach to modulation $VI_{E\flat}$, $I_D$ (bars 189-192) fails; the resolution of all alteration signs indicates the exit from tonal space. We hear the "generator" of the catastrophe, the diminished seventh chord as such.
28.2.7
Modulation $D/b \rightsquigarrow B$ via $U_{d/d\sharp} = U_{g\sharp/a}$ within $W^*$
The process is resolved chordically here and impregnated by two simultaneous local meters (figure 28.12): in the left hand a 6/4 meter via triplets, in the right hand an 8/8 meter. At both espressivo indications (bars 205, 209), the triplets in the left hand present (bar 207, and repeated in bar 211) the motif $f\sharp - b - f\sharp$ and $b - f\sharp - b = U_{d/d\sharp}(f\sharp - b - f\sharp)$. And the right hand plays d and $d\sharp = U_{d/d\sharp}(d)$ in bars 205-209 as well as e and $c\sharp = U_{d/d\sharp}(e)$. In bar 209, degree $I_B$ appears at the beginning, followed in the right hand by a chromatic sequence which comprises the seven pitches $B^* = \{c\sharp, d, d\sharp, e, f, f\sharp, g\}$. Now, $B = U_{d/d\sharp}(D) = e^{5}.11(D)$. But by a concatenation with the fifth circle symmetry we produce $e^{5}.5 = U_{d/d\sharp}.e^{0}.7$ out of $U_{d/d\sharp}$. This result lies in $Sym(\mathfrak{M}_0)$ (!), and we exactly get

\[
B^* = e^{5}.5(B),
\]

so that we really are situated in B, up to an inner symmetry of $\mathfrak{M}_0$.

Figure 28.12: Example of two simultaneous local meters, corresponding to the left and right hand in bars 209-210 of the first movement of Beethoven's op.106.
28.2.8
Modulation $B \rightsquigarrow B\flat$ from $W^*$ to $W$
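The next modulation leads us back from the antiworld $W^*$ to the world $W$ (bars 214-226). Corresponding to the short stay in $W^*$, the return is an easy business. There follows a sequence of arpeggiated intervals which is strongly based upon $\mathfrak{M}_0$ and ends on $I_{B\flat}$:

\[
\begin{array}{lll}
\text{twice} & e, c\sharp & \text{within } \mathfrak{M}_0\\
\text{six times} & g, e & \text{within } \mathfrak{M}_0\\
\text{four times} & g\sharp, e & \\
\text{four times} & b\flat, e & \text{within } \mathfrak{M}_0\\
\text{four times} & b\flat, f & I_{B\flat}
\end{array}
\]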
28.2.9
Modulation $B\flat \rightsquigarrow G\flat$ via $U_{b\flat}$ within $W$
For this modulation within the world $W$, we can apply the normal modulator. The modulation is a fast one (bars 238-239), although the involved tonalities B♭ and G♭ are separated by four fourths. At the end of bar 238, the neutral degree $I_{B\flat}$ is followed by its inversion $VI_{G\flat} = U_{b\flat}(I_{B\flat})$, so the modulator is again put into evidence. Immediately after that, at the beginning of bar 239, follows $\{b\flat, f, a\flat\}$ which lies in $III \cup V$ of G♭ and corresponds to the pivots.
28.2.10
Modulation $G\flat \rightsquigarrow G$ via $U_{a\flat/a}$ within $W$
This second Uhde catastrophe (bars 249-262) is highly dramatic. At the beginning (bars 249-250), we remain within the large orbit $W$: I of G♭. As with the first catastrophe, we then encounter a pronounced series of diminished seventh chords. In bars 259-262, it terminates in the intersection $B \cap G\flat$: we do not know whether the change of orbits has been successful or not. The decision is only taken in bar 263 with pitch f, which is not in B, but in G♭. So we actually did not really leave the world, at least not unambiguously. This 'delusion' is particularly refined since we hear the "fanfare" with $III_G$ which could also be viewed as I of b-minor. But this third degree corresponds to a pivot of our model; the inversion of the bass in the moment of the forte onset at the end of bar 266, $b = U_{a\flat/a}(f\sharp)$, shows the same symmetry as for the above modulation $D/b \rightsquigarrow B$ within the antiworld after the first catastrophe. So we do not move from B to D as is suggested by the change of signature in bar 267 and by the pitch material before the appearance of f in bar 264. This superficial impression is an "allusion" to the antiworld situation, unveiled however as being an illusion, and is resolved in a consistent way. We have moved from G♭ to G.
28.2.11
Modulation $G \rightsquigarrow B\flat$ via $e^{3}$ within $W$
This last modulation proceeds completely regularly according to the scheme that we already know from the inverse modulation in the transition to the second subject as discussed above.
28.3
Rhythmical Modulation in "Synthesis"
Summary. The modulation model does not restrict to time from its logical structure. This fact is exploited in a composition of rhythmical modulation for percussion ensemble and piano.
The harmonic modulation model is based upon the pitch class construction modulo octave, i.e., the space $PiMod_{12}$ based on the module $\mathbb{Z}_{12}$ derived from the semi-tone pitch space based on the integers, modulo the submodule generated by the octave quantity 12. We have viewed the tonalities as being scales, together with their triadic interpretations by the seven well-known degrees. But there is nothing substantial to the choice of these spaces. The mathematical framework is completely insensitive to the forms and denotators which implement the underlying parameter and class spaces.
So why not apply the modulation theory to a concept framework which is mathematically the same as for harmony, but which is semantically different? More precisely: the new approach presents a space and its denotators in the context of onset time and its rhythms. We shall develop this switch from pitch to time in the next section and apply this theory to a rhythmical modulation in movement No.1 of Guerino Mazzola's jazz concert "Synthesis" [339] for piano, large percussion ensemble, and e-bass. Before delving into technicalities, we should however observe a fundamental difference between two time qualities involved in musical composition. We have known time as a form related to onset and duration. These dimensions are part of what defines events in scores. They are comparable to other event parameters such as pitch, loudness, etc. A local or global composition is a configuration of events which represent the composition's substance. This material data is without any logical or strategic specification. It does not include the composer's poietic construction plans or the analyst's structural evaluation. But modulation is a structure that is not only defined on the level of the effective neutral, pivotal, and cadence degrees; it is rather built on the tripartite strategy Neutralization-Turning Point-Cadence as defined by Schönberg. This is a logical construction on the syntactical level: a sequence of three functional units in the syntagmatic string of musical development. Like abstract logical schemes, such as modus barbara ("a implies b" and "b implies c" implies "a implies c"), this is not a priori a syntagmatic string in the material musical time. It regards logical time rather than material time. In harmonic modulation, we have three logical stages in the pitch domain, and their unfolding on the material time line is only a representation of an abstract process, not the process as such.
Substantially, in harmonic modulation we have an excellent example for Augustinus' definition of music as an art of instantiation of good rational strategies. As already mentioned in the previous discussion of Augustinus' definition, modulation could also regard rhythmical strategies, for example. It is very important to distinguish logical time from material time in this case since here, the syntagmatic string which embodies the logical time is superposed to the material time of rhythmical structures.
28.3.1
Rhythmic Modes
Summary. This section describes the transfer of harmonic modality to rhythmic categories, and from this derives the modulations in the rhythm domain.

The transfer of harmonic dimensions to rhythmic ones must deal with the semantic specifications of these dimensions. In fact, pitch is a space category which carries a strong connotation of sound quality, i.e., of instrumental realization. No abstract pitch quality has ever been used in harmony as soon as realistic music parameters are to be described. We could of course say that the abstraction from instrumental realities is only a question of habituation, and that abstract time could also be thought of in an abstract comprehension which already worked for harmony. However, it is easy to imagine a more or less good orientation in the pitch space whereas it is difficult to imagine an orientation in an abstract time space, since the distinction between later and earlier time onsets is only relative and risks failure if no supplementary time-dependent
structures can be found. Moreover, the superposition of logical and material time layers asks for a clear marking of material time events. This is why we would like to add supplementary, logically redundant markers of time such that one can use such redundancies to shape time in a rhythmical modulation theory. The idea is as follows: Suppose the onset time is parametrized by integers, $Onset_{\mathbb{Z}} \xrightarrow[Id]{} Simple(\mathbb{Z})$. Suppose further (in analogy with the octave period) that we have the translation $e^{12}$ in time on $Onset_{\mathbb{Z}}$ in the sense of a time period. This could be realized by a 12/8 meter, for example, where the eighth duration stands for the unit time in $Onset_{\mathbb{Z}}$. As a marker parameter, we select the form $Percussion \xrightarrow[Id]{} Simple(\mathbb{Z})$ whose zero-addressed denotators are percussion instruments as represented in a list with integer indexes. This could for example be a list of percussive sounds in a MIDI environment where the numbers stand for the program changes (see below for more concrete setups). Our events are situated in the combined space
$$PerOns \xrightarrow[Id]{} Simple(\mathbb{Z} \oplus \mathbb{Z}) \;\cong\; PO \xrightarrow[Id]{} Limit(Onset_{\mathbb{Z}}, Percussion)$$
and will be called percussion events in this context. But we want more, i.e., macro percussion events, in order to construct profiled markers. So we consider these macro percussion event spaces as they were introduced in section 13.4.3.1:
$$Knot_{PerOns} \xrightarrow[Id]{} Limit(PerOns, Makro_{PerOns})$$
with
$$Makro_{PerOns} \xrightarrow[f:\, F \,\cong\, 2^{F_K} \,\rightarrowtail\, \Omega^{F_K}]{} Power(Knot_{PerOns})$$
and $F = Fun(Makro_{PerOns})$, $F_K = Fun(Knot_{PerOns})$.
A macro rhythmic germ $R^*$ is a finite local composition in the ambient space $Knot_{PerOns}$. An infinite macro local Para-rhythm is paraphrased by the 12-periodic local composition $e^{[-\infty,\infty]12} R^*$. In the sequel, we shall only consider germs $R^*$ which are in bijection with their time projections $R_{Ons}$, and such that the $R_{Ons}$ are contained in the period interval $[0,12[$. This means that we may equivalently consider the rhythm classes modulo the time period 12. These objects are the residual classes of the macro rhythmic germs $R^*$, which we now identify with the germs because of our additional assumptions. So we are working in the macro class space:
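$$Knot_{PerOns_{12}} \xrightarrow[Id]{} Limit(PerOns_{12}, Makro_{PerOns_{12}})$$
with
$$Makro_{PerOns_{12}} \xrightarrow[f:\, F \,\cong\, 2^{F_K} \,\rightarrowtail\, \Omega^{F_K}]{} Power(Knot_{PerOns_{12}})$$
and $F = Fun(Makro_{PerOns_{12}})$, $F_K = Fun(Knot_{PerOns_{12}})$, deduced from
$$PerOns_{12} \xrightarrow[Id]{} Simple(\mathbb{Z}_{12} \oplus \mathbb{Z})$$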
in a construction analogous to scales in pitch class spaces. Intuitively, these macro rhythmic germs in $Makro_{PerOns_{12}}$ are just time scales whose points are each loaded with a rhythmic satellite object. As to modulation theory with such objects, we can apply the modulation model for 12-tempered tuning, but we have to take care of the satellites! There may be many rhythmic scales
with the same time projection! This means that the translation and time-inversion symmetries will have to carry over the (unaltered) satellites. This is as if the harmonic modulation were carried out with "colored" pitch classes, where the inversion of pitch classes would preserve colors.
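The requirement that time symmetries carry the satellites over unaltered can be illustrated by a small sketch. It is only an illustration under our own assumptions: a macro germ modulo 12 is modeled as a Python dictionary from onsets to a pair (percussion index, satellite), and the concrete onsets, percussion numbers, and satellites below are hypothetical, not taken from the "Synthesis" score.

```python
# Illustrative sketch (hypothetical data): a macro rhythmic germ modulo the
# time period 12, stored as onset -> (percussion index, satellite).
PERIOD = 12


def time_symmetry(germ, shift, invert=False):
    """Apply onset -> shift + (+/-1) * onset (mod 12).

    Only the onset is transformed; the percussion index and the satellite
    attached to each onset are carried over unchanged.
    """
    sign = -1 if invert else 1
    return {(shift + sign * onset) % PERIOD: payload
            for onset, payload in germ.items()}


def time_projection(germ):
    """The underlying 'rhythmic scale': the sorted list of onsets."""
    return sorted(germ)


# Hypothetical germ: each satellite is a three-element motif starting at (0, 0).
germ = {
    0: (35, ((0, 0), (1, 2), (3, 1))),
    3: (38, ((0, 0), (2, 0), (5, 4))),
    7: (42, ((0, 0), (1, 1), (2, 3))),
}

# A time inversion (retrograde) onset -> 7 - onset, with satellites transported.
retro = time_symmetry(germ, shift=7, invert=True)
print(time_projection(germ), time_projection(retro))
```

Two germs with equal time projections but different payloads remain different objects in this model, which is why the symmetry has to act on the whole dictionary and not just on the onset set.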
28.3.2
Composition for Percussion Ensemble
Summary. This final section discusses the concrete realization of the rhythmical modulation model, essentially by use of a large ensemble of percussive instruments.
The composition in question here is a rhythmical modulation in movement No. 1 of Mazzola's jazz concert "Synthesis" [339]. The modulation takes place after the exposition in the sonata scheme of this movement (3:18-5:48 on piece # 1 of [339]). This developmental start is written in 12/8 meter, in fact the meter which we need for the rhythmical modulation in a 12-periodic scheme as exposed above. The modulation is built upon the rhythmic scale (the time projection of the macro germ) corresponding to the complement $\overline{62}$ of No. 62 of the chord classification list in appendix L.1. The macro germ $G^*$ above $\overline{62}$ has the following shape. For each onset $x \in \overline{62}$, we have a corresponding denotator $((x, p_x), Sat_x)$ of form $Knot_{PerOns_{12}}$. The $p_x$-coordinate is an integer value for a percussion sound, whereas the satellite $Sat_x$ is a zero-addressed denotator of form $Makro_{PerOns_{12}}$. We choose for each satellite a zero-addressed three-element motif of form $PerOns_{12}$, with first element the origin $(0,0) \in \mathbb{Z}_{12} \oplus \mathbb{Z}$ (see also figure 28.13). The choice of these three-element satellites is due to the general construction principle of the "Synthesis" concert, i.e., the use of the 26 isomorphism classes of three-element motives for all melodic and rhythmic structures. This principle was already encountered in section 11.6.3 where we discussed the third movement of the concert. The modulation goes from the rhythmic macro germ $G^*$ to its symmetric image $H^* = R(G^*)$ under the retrograde motion $R$ which fixes the seventh tone (first tone of the seventh degree). The lower half of figure 28.13 shows the retrograde germ with the transported satellites. This is the germ for the "rhythmic target scale". According to the modulation theory (well-tempered case), there is a determined set of pivots related to a selected cadence in the target scale, and we can proceed to the explicit construction. In the "Synthesis" concert, we first set up six bars in order to define the start rhythm scale. In each of bars 1-5, a new tone of the start rhythm scale is added, and two tones are added in bar 6, so we have the complete scale. Observe that this addition of tones means that each added tone can be repeated in the successive bars and thereby enriches the previous rhythms. From bar 7 to bar 12, the tones of $G^*$ are successively removed so that we have a neutralization process here. The modulator is made evident by a rhythmic motif which is built around the above retrograde symmetry $R$, including its repetition during the whole second modulation segment presenting the pivots, from bar 13 to bar 23.
[Figure 28.13 graphic; labels in the figure: "Nr. 62" (No. 62), "Krebs von Nr. 62" (retrograde of No. 62), "Krebs der Einsätze" (retrograde of the onsets).]
Figure 28.13: Above: The rhythmic macro germ $G^*$ of the rhythmical modulation in the first movement of the "Synthesis" concert [339]. Below: The retrograde of $G^*$ with the satellites being carried over to the transformed onset-percussion events.
Already in bar 21, the new tonic of $H^*$, which is (by definition) the image of the old tonic with its characteristic rhythmic motif satellite, is played and remains alive until the end of the modulatory process. This means that the new first onset is no longer the old one, but occurs on onset 9/8 (modulo 12/8) of the old bar onset. This change is realized by a new bar onset at onset 9/8 of bar 24. From this point on, we have 5 bars to confirm the target rhythm and to terminate the modulation. Both cadences, that of the start and that of the target rhythm, are accompanied by a kind of regular falling drop sequence of tones of two eighths' duration in order to stress the beginning of the respective bar units. Recall from section 13.4.3.1 that the macro rhythms can be flattened down to the sets in the effective form $PerOns_{12}$ of rhythmic events. But the construction of the effective material had to be performed on a higher conceptual level of macro events. Probably this kind of conceptual upgrading of modulatory processes could also be used to understand existing and construct new harmonic modulations, or even melodic modulations if a modulation model of motivic structures (in $OnPiMod_{n,m}$, say) is available.
Part VII
Counterpoint
Chapter 29
Melodic Variation by Arrows
punctus est cuius pars nulla est
Euclid [215, book I]
Summary. The ideas of tangent objects described in section 7.5 are applied to define contrapuntal intervals as tangential "arrows" from the cantus firmus to the discantus tone. This formalism fits with the idea that the discantus is a kind of melodic variation around the cantus firmus line. The ring structure of the set of such arrows is discussed and motivated from the musical perspective.
The mathematical theory of counterpoint is an excellent subject to illustrate the idea of mathematical conceptualization of musical and musicological objects. It shows that mathematical subtleties are no formal overhead and can help to grasp music(ologic)al differences at the highest level. After the introduction of arrows as a formal restatement of the contrapuntal interval concept, we shall see in section 30.2.1 that this setup provides us with an astonishing relation between harmonic and contrapuntal objects, a relation which was never observed by musicological tradition. This insight could have deep consequences for the understanding of the hitherto unresolved transition process from polyphony to homophony, from counterpoint to harmony as a basis of musical composition.
29.1
Arrows and Alterations
Summary. Arrows are a conceptual refinement of ordered intervals in the pitch domain. We compare these and related concepts and work out their specific differences.
Most of the common modules of simple space forms describe aspects of elementary music objects which can be related to an abstraction from physical events. However, musical reflection
617 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
is a mental and symbolic approach, and this is not only the case on the higher level of compound object interaction as it was developed in the theory of global compositions. We already observe such a symbolism on the most elementary understanding of what we mean when talking about a tone. We are referring to the fact that in systematic musicology, some German specialists (see e.g., [205]) distinguish between "Ton" (tone) and "Tonort" (a tone's place) when discussing alterations. For example, in the 12-tempered system, the Tonort of f♯ is the same as the Tonort of g♭, whereas these two tones are thought to be different since their origin, one f, one g, expresses a musical thinking that cannot be traced anymore on the level of the common Tonort. This understanding is based on the concept of musical objects as being grown, dynamic entities and not merely static states. In this approach, a tone is not a point but a variation of a point, something that has a direction from past to present, in short: an 'arrow'. This unveils a fundamental stream of musical thinking: that of varied structures. In this part, we want to discuss a central instance of this type of thinking, namely counterpoint. The objective is to interpret contrapuntal intervals as "arrows" and to deduce the basic rules of counterpoint. We cannot present a general theory of varied structures yet. Instead, we shall give a prototypical sketch of how such a theory could look. In the following, we suppose that we are working in a form space $S_q$ of simple type, with module$^1$ $M$ and identifier automorphism $q: M \xrightarrow{\sim} M$ (giving rise to the functorial identifier automorphism $@q$). For example: $S_{Id} = EulerModule$ with $M = \mathbb{Q}^3$, or $S_1 = PiMod_{12}$ with $M = \mathbb{Z}_{12}$, or $S_{(7)} = FiPiMod_{12} = PiMod_{12,(7)}$, the fifth identifier pitch class space, etc.
In contrapuntal reasoning, one is not primarily interested in single pitches, but in "arrows" starting from a basic pitch $x$, the cantus firmus, and ending in a target pitch $y$, the discantus. In order to capture this information, one could come back to the arrow approach exposed in section 7.2.3, i.e., to the $\mathbb{Z}$-addressed denotators $D: \mathbb{Z} \rightsquigarrow S_q(ON = x, OFF = y)$. We refrain from this point of view$^2$ for the following reason: The $\mathbb{Z}$-addressed denotators have too little canonical algebraic structure for our purposes. We shall see this in a moment. The formalization we choose for counterpoint is the dual numbers approach described in section 7.5. The module $M$ is enriched to the dual numbers module $M[\varepsilon]$, and we also admit the presence of the identifier module automorphism $q: M \xrightarrow{\sim} M$ which induces the canonical automorphism $q[\varepsilon]$ of $M[\varepsilon]$; call the corresponding space $S_q[\varepsilon]$. For example we would have $EulerModule_q[\varepsilon] \xrightarrow[@q]{} Simple(\mathbb{Q}^3[\varepsilon])$, and more generally, with the notations of appendix G.5.3:
$$S_q[\varepsilon] \xrightarrow[Id(S_q)[\varepsilon] \,=\, @(q[\varepsilon])]{} Simple(coord(S_q)[\varepsilon])$$
for an automorphism $q$ of the coordinator module $coord(S_q)$. In this setting, if $T$ is an address, a $T$-addressed contrapuntal interval of space form $S_q$ is a $T$-addressed denotator $D: T \rightsquigarrow S_q[\varepsilon](x + \varepsilon.i)$, $x + \varepsilon.i \in T@M = T@coord(S_q)$. If $q = Id_M$, we also omit the $q$-index, and without any further address specification, we tacitly suppose the zero address. There are four canonical surjections$^3$ associated with the first (cantus firmus) and second (interval) projections $M[\varepsilon] \to M$ and with the synonymous alterators of sweeping and hanging orientation $\alpha_+, \alpha_-$ introduced and motivated musicologically in section 7.5. In section 29.5 we shall also consider more general addresses (in particular self-addressed contrapuntal intervals), but for the time being, we stick to the zero-addressed contrapuntal intervals. The enrichment of the dual number modules $M[\varepsilon]$ does not lie in the $R$-linear structure; it is based on the $R$-algebra structure of $R[\varepsilon]$ with $\varepsilon^2 = 0$. In other words, the well-known possibility of address killing X@B@S_q = X ~ B@S_q described in section 11.2 could lead to an identification of $\mathbb{Z}$-addressed arrows in $S_q$ with zero-addressed arrows in $\mathbb{Z}@S_q$, but we need an additional algebraic structure on the latter form space.
1 Always over a commutative ring $R$ in this context.
2 This point of view is however the adequate one for harmonic considerations, as we have already learned in the self-addressed theory.
3 Pay attention to the fact that these surjections are operating on the form functors and not on the frames.
29.2
The Contrapuntal Interval Concept
Summary. This section presents a formal definition of contrapuntal intervals or arrows, together with the two possible orientations of sweeping and hanging counterpoint.
The point of departure of this theory is the fact that the known physical and mathematical concepts of consonance and dissonance are not adequate to the musical paradigm of an antagonism of interval categories since they only conceptualize gradual changes in sonance. This unsatisfactory situation was also recognized by Hugo Riemann [456] in his critique of Euler and Helmholtz. Our problem is to turn this antagonistic paradigm into a mathematical concept framework in order to deduce the announced counterpoint model. Our requirements would not be satisfied if we thought of an interval as being a set of two pitch events. This would in particular not do justice to the concept of voices. The cantus firmus could not be distinguished from the discantus, and the crossing of voices could not be conceived. It is adequate to contrapuntal reasoning to distinguish a basic cantus firmus tone from the dependent discantus tone, i.e., to consider arrows as defined above, and to indicate the orientation. In the sense of this musical requirement, we shall now describe an oriented contrapuntal interval formally as a pair $(\alpha_\pm, D: 0 \rightsquigarrow S_q[\varepsilon](x + \varepsilon.i))$ of a sweeping or hanging orientation plus a contrapuntal interval $D$ as defined above. When the rest is clear, we shall also reduce the description to the simple pair $(\alpha_\pm, x + \varepsilon.i)$ or even $x + \varepsilon.i$, if the orientation is clear. Here, $x$ is the cantus firmus of the contrapuntal interval, whereas the quantity $x \pm i$ is the discantus of the interval. Within a given orientation, the two-part style "note against note" is now interpreted as a sequence $x_s + \varepsilon.i_s$, $s = 1,2,3,\ldots$ of arrows. If a change of orientation happens, the indexes have to be split into regions of constant orientation. The historical development of the counterpoint of many parts is now reflected in
the historically variable characteristic types of geometric variations of the Gregorian chant of the cantus firmus' melodic line by arrows. Other tone attributes, such as duration or loudness, are neglected in this elementary motivic exposition. Except for some minor remarks concerning just pitch spaces, we shall focus our discussion on the 12-tempered pitch class space $PiMod_{12,q}$ and its contrapuntal space $PiMod_{12,q}[\varepsilon]$ of pitch class arrows, $q$ being an automorphism of $\mathbb{Z}_{12}$, e.g., one of the four circle automorphisms $q = (1), (5), (7), (11)$. This is quite congruent with contrapuntal reasoning. For example, the octave extension of a perfect consonance is again perfect, idem for imperfect consonances and for dissonances [468]. So after this reduction we are left with 144 contrapuntal intervals for each orientation, the arrows in $PiMod_{12,q}[\varepsilon]$.
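As a minimal sketch of this reduction, one may model an arrow $x + \varepsilon.i$ in the pitch class setting as a pair `(x, i)` of residues modulo 12 and read off the discantus according to the orientation. The function name `discantus` and the encoding of the orientations as the strings `"+"` and `"-"` are our own illustrative choices.

```python
from itertools import product

MOD = 12


def discantus(x, i, orientation):
    """Discantus of the arrow x + eps.i: x + i (sweeping) or x - i (hanging)."""
    if orientation == "+":
        return (x + i) % MOD
    if orientation == "-":
        return (x - i) % MOD
    raise ValueError("orientation must be '+' (sweeping) or '-' (hanging)")


# All arrows x + eps.i with cantus firmus x and interval i in Z12.
arrows = list(product(range(MOD), repeat=2))

print(len(arrows))                                   # 144
print(discantus(2, 3, "+"), discantus(2, 3, "-"))    # 5 11
```

The same pair `(2, 3)` thus yields two different discantus tones, which is why the orientation has to be recorded alongside the arrow.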
29.3
The Algebra of Intervals
Summary. The set of arrows is canonically provided with the structure of an algebra. The formal definition and first mathematical properties are exposed.
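The module $\mathbb{Z}_{12}[\varepsilon]$ of $PiMod_{12,q}[\varepsilon]$ is not only a $\mathbb{Z}_{12}$-module but also a $\mathbb{Z}_{12}$-algebra, i.e., a module with a bilinear, associative and unitary multiplication defined by the nilpotent element $\varepsilon$ with $\varepsilon^2 = 0$ (see also example 76 in appendix D.1.1). The product of contrapuntal intervals $a + \varepsilon.b$, $c + \varepsilon.d$ is defined by
$$(a + \varepsilon.b).(c + \varepsilon.d) = ac + \varepsilon.bc + \varepsilon.ad + \varepsilon^2.bd = ac + \varepsilon.(bc + ad)$$
which yields a new contrapuntal interval. An invertible element of $\mathbb{Z}_{12}[\varepsilon]$ is an element $a + \varepsilon.b$ with $a = 1,5,7,11$, and the inverse element is $(a + \varepsilon.b)^{-1} = a - \varepsilon.b$ (recall that $a^2 = 1$ in $\mathbb{Z}_{12}$ for each of these four values). Corresponding to the symmetries in $\mathbb{Z}_{12}$, the symmetries in $\mathbb{Z}_{12}[\varepsilon]$ are the affine endomorphisms (over the ring $\mathbb{Z}_{12}[\varepsilon]$)
$$e^{a+\varepsilon.b}(u + \varepsilon.v)$$
which is invertible iff $u = 1,5,7,11$.
29.3.1
The Third Torus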
Summary. The third torus is the geometric representation of $\mathbb{Z}_{12}$ associated with its Sylow decomposition$^4$. We discuss the mathematical structure and its metrical properties.
The module isomorphism $T: \mathbb{Z}_{12} \xrightarrow{\sim} \mathbb{Z}_3 \oplus \mathbb{Z}_4 : z \mapsto (z \bmod 3, -z \bmod 4)$ with its inverse $T^{-1}(z_3, z_4) = 4.z_3 + 3.z_4$ identifies the cyclic module $\mathbb{Z}_{12}$ with a discrete torus. Under this isomorphism, we have an isomorphism of forms $T: PiMod_{12} \xrightarrow{\sim} PiThirds_{3,4}$ with the third class form $PiThirds_{3,4} \xrightarrow[Id]{} Simple(\mathbb{Z}_3 \oplus \mathbb{Z}_4)$. Hereby, the first component of a $T$-transformed pitch class is the number of its major thirds while the second component is the number of minor thirds that add to the given pitch when counted from zero. For example, $T(7) = (1,1)$, i.e., "fifth = major third plus minor third", see figure 29.1.
4 See theorem 41 in appendix C.3.
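The isomorphism $T$ and its inverse are easy to check by machine. The following sketch simply transcribes the two formulas above; the function names `T` and `T_inv` are ours.

```python
def T(z):
    """Z12 -> Z3 (+) Z4, z -> (z mod 3, -z mod 4)."""
    return (z % 3, (-z) % 4)


def T_inv(z3, z4):
    """Inverse map, (z3, z4) -> 4.z3 + 3.z4 (mod 12)."""
    return (4 * z3 + 3 * z4) % 12


# T_inv undoes T on every pitch class, so T is a bijection.
assert all(T_inv(*T(z)) == z for z in range(12))

print(T(7))  # (1, 1): fifth = one major third plus one minor third
print(T(4), T(3))  # (1, 0) and (0, 1): major third and minor third
```

In these coordinates a major third (4) moves one step along the $\mathbb{Z}_3$ circle and a minor third (3) one step along the $\mathbb{Z}_4$ circle, which is the reading of the two components given in the text.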
Figure 29.1: On the torus of thirds, the third relations among pitch classes and intervals are represented in a geometric way. The sequence of semi-tone steps appears as an entwined closed spiral.
To describe 'pure intervals' in pitch classes, starting from the form
$$IntMod_{12} \xrightarrow[Id]{} Syn(PiMod_{12})$$
for interval quantities in pitch classes, we also use the third torus structure as a synonym for interval quantities in terms of thirds, i.e.,
$$IntThirds_{3,4} \xrightarrow[Id]{} Syn(PiThirds_{3,4}).$$
As in differential geometry, one has the space of contrapuntal intervals $IntMod_{12,q}[\varepsilon]$ or the corresponding third tangent torus $IntThirds_{3,4,q}[\varepsilon]$ in terms of a cantus firmus torus where twelve "tangent" tori are attached: The contrapuntal intervals are viewed as tangents to their cantus firmus points; a tangent torus $I_x = x + \varepsilon.\mathbb{Z}_{12}$ is attached at each of its points $x$, see figure 29.2.
Figure 29.2: The contrapuntal intervals viewed as tangents to their cantus firmus points.
Let us review the affine automorphism group $\overrightarrow{GL}(\mathbb{Z}_{12})$ in terms of the Sylow representation of $\mathbb{Z}_{12}$ as a discrete torus. The group is evidently generated by these four symmetries:
$C_1 = e^0.11$: multiplication by $-1$; inversion
$C_2 = e^0.5$: multiplication by 5; make fourth circle
$C_3 = e^3.1$: addition of constant 3; minor third chain
$C_4 = e^4.1$: addition of constant 4; major third chain
This system is not only musically meaningful and therefore turns the group $\overrightarrow{GL}(\mathbb{Z}_{12})$ into a musically meaningful group by the concatenation principle 2, it also shows that all automorphisms preserve the third distance in the following sense: Let $x, y \in \mathbb{Z}_3 \oplus \mathbb{Z}_4$. Define $d(x, y)$ as the minimal number of ascending or descending minor or major third steps on the third torus to move from $x$ to $y$; for example: $d(0,9) = 1$ since $0 - 3 = 9$, $d(0,1) = 2$ since $0 + 4 - 3 = 1$, etc. The distance is just the minimal number of edges on the discrete third torus that connect the two points $x$ and $y$. This distance, in fact a metric, is left invariant under each of the four generators $C_1$ to $C_4$, see figure 29.3 for the evidence. Summarizing:
Proposition 49 The group $\overrightarrow{GL}(\mathbb{Z}_{12})$ leaves invariant the third distance, i.e., it is a group of isometries of the interval torus $IntThirds_{3,4}$.
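The invariance stated in Proposition 49 can be checked exhaustively for the four generators. The sketch below is only a brute-force verification under our own modeling choices: it works with residues in $\mathbb{Z}_{12}$ (identified with the torus via $T$), computes the third distance by breadth-first search over steps of $\pm 3$ and $\pm 4$, and tests every pair of points. The names `third_distance`, `GENERATORS`, and `preserves_distance` are ours.

```python
from collections import deque

STEPS = (3, -3, 4, -4)  # ascending/descending minor and major thirds


def third_distance(x, y):
    """Minimal number of third steps from x to y in Z12 (breadth-first search)."""
    start, goal = x % 12, y % 12
    dist = {start: 0}
    queue = deque([start])
    while queue:
        p = queue.popleft()
        if p == goal:
            return dist[p]
        for s in STEPS:
            n = (p + s) % 12
            if n not in dist:
                dist[n] = dist[p] + 1
                queue.append(n)
    raise RuntimeError("unreachable: third steps generate Z12")


GENERATORS = {
    "C1": lambda z: (11 * z) % 12,  # inversion
    "C2": lambda z: (5 * z) % 12,   # fourth circle
    "C3": lambda z: (z + 3) % 12,   # minor third chain
    "C4": lambda z: (z + 4) % 12,   # major third chain
}


def preserves_distance(g):
    """True if d(g(x), g(y)) == d(x, y) for all x, y in Z12."""
    return all(third_distance(g(x), g(y)) == third_distance(x, y)
               for x in range(12) for y in range(12))


print(third_distance(0, 9), third_distance(0, 1))  # 1 2
print({name: preserves_distance(g) for name, g in GENERATORS.items()})
```

Since every element of the group is a concatenation of these generators, invariance under the generators is all that needs to be tested.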
29.4
Musical Interpretation of the Interval Ring
Summary. This section deals with musical interpretations of the operations in the interval ring. We take this occasion to deepen the topic of mathematical "overhead" structures and their role in building musical concepts.
[Figure 29.3, panels a), b), c), d)]
Figure 29.3: The elementary symmetries on the third torus. The generator $C_1$ is a 180° rotation around the axis through 0 and 6; $C_2$ is a reflection at the torus' equatorial plane; $C_3$ is a 90° rotation around the polar axis; $C_4$ is a tilting movement by 120°. Each of these generators preserves the third distance on the torus.
In this section we only deal with invertible symmetries. We want to understand what could be the musical meaning and interpretation of multiplication in $PiMod_{12}[\varepsilon]$, or, mathematically equivalent, in $IntMod_{12,q}[\varepsilon]$. This is indeed a crucial situation. A priori, music theory never considered such a multiplication of intervals, and a conservative attitude could very well prohibit such an extension as an inadmissible mathematical overhead. But this is a classical situation with mathematical objects in the sciences, be it in physics, chemistry or musicology: Mathematical structures have more properties than their application needs. So there is always a degree of mathematical overhead. The question is rather whether ingredients of that overhead can be enriched by meaning within the applying science. More precisely, we are not advocating the power of a conceptual oracle with mathematical properties. We simply associate mathematical properties with musical properties (in our case) and try to profit from the semantical added value in favor of a richer mathematization of musical phenomena.
As already exposed in section 8.3, we shall again use the concatenation principle 2 here. This means that understanding a symmetry $e^t.g \in \overrightarrow{GL}(\mathbb{Z}_{12}[\varepsilon])$ means understanding its factors in a determined factorization. As in section 8.3, we shall present a system of 'elementary', musically understandable symmetries which generates all symmetries. To begin with, we shall treat the translations $e^t = e^t.1$ and the multiplications $g$ separately. For a translation quantity $t = a + \varepsilon.b$, we have $e^t = e^{a+\varepsilon.b} = e^a.e^{\varepsilon.b}$. Furthermore, $g = u + \varepsilon.v = u.(1 + \varepsilon.uv) = u.(1 + \varepsilon.1)^{uv}$. This yields the following system:
1. $s_1 = e^a$,
2. $s_2 = e^{\varepsilon.b}$,
3. $s_3 = u$, $u = 1,5,7,11$,
4. $s_4 = 1 + \varepsilon.1$.
Here are the musical interpretations of the system's symmetries:
1. $s_1 = e^a$. Symmetry $s_1$ causes a transposition of the interval by $a$ since we have $s_1(x + \varepsilon.y) = (a + x) + \varepsilon.y$.
2. $s_2 = e^{\varepsilon.b}$. Symmetry $s_2$ causes a transposition of the discant voice by $b$ while the cantus firmus remains unaltered, i.e., $s_2(x + \varepsilon.y) = x + \varepsilon.(y + b)$. This constant enlargement of the distance between cantus firmus and discantus corresponds to a traditional technique of double counterpoint.
An example of an interesting symmetry, a concatenation of types 1 and 2, is given for $t = a \pm \varepsilon.1$. For $t = a + \varepsilon.1$, minor intervals are transformed into corresponding major intervals under the transposition by $a$. Conversely, $t = a - \varepsilon.1$ transforms major intervals into corresponding minor ones under the transposition by $a$. This means that also not 'strict' parallels of thirds, such as
$$2 + \varepsilon.3, \quad 7 + \varepsilon.4 = e^{5+\varepsilon.1}(2 + \varepsilon.3), \quad 11 + \varepsilon.3 = e^{4-\varepsilon.1}(7 + \varepsilon.4)$$
are viewed as translation symmetries.
3. $s_3 = u$, $u = 1,5,7,11$. Symmetries of this type multiply the cantus firmus, and the distance between the voices, and hence also the discantus, by $u$. Since we cannot reduce this discussion to the involved pair of tones, and hence not to the basic pitch domain $\mathbb{Z}_{12}$, we must have a closer look at this situation.
3.1 $u = 11$. The cantus firmus is reflected at the origin $c = 0$ whereas the voice distance $11.b$ means the octave complement to $b$. For example, the sweeping minor third $d, f\ (= 2 + \varepsilon.3)$ is transformed into the sweeping major sixth $b\flat, g\ (= 10 + \varepsilon.9)$. Observe that, by definition, symmetries do not alter the underlying orientation.
3.2 $u = 5$. Musically, the distance $b$ of the interval tones can be of equal interest as a multiple of minor seconds (= 1) or of fourths (= 5). The symmetry $u = 5$ connects intervals of equal values in these two perspectives. In a multiplication by 5, the minor second distance $b$ of an interval $a + \varepsilon.b$ is transformed into the fourth distance $5.b$.
3.3 $u = 7$. For this value, we have an analogous argument as for the preceding case, with fifths instead of fourths. One could also invoke the concatenation principle and use the factorization $7 = 5.11$ to reduce this case to the cases 3.1 and 3.2.
4. $s_4 = 1 + \varepsilon.1$. With this symmetry, the cantus firmus remains fixed whereas the voice distance increases by the distance of the cantus firmus from the origin $c = 0$; indeed, $(1 + \varepsilon.1).(x + \varepsilon.y) = x + \varepsilon.(y + x)$. Thereby, the cantus firmus acquires a new function, i.e., to appear itself as a 'discantus' with respect to the origin. Thus its distance to the origin is added to the given discantus as a fixed reference quantity. If we repeat the application of this symmetry to the resulting interval, the same reference quantity is again added to the discantus, etc. Repeated application of $s_4$ therefore generates a circle on the discantus of the original interval, a circle whose step width is defined by the cantus firmus' distance to the origin. Musically speaking, the origin could be imagined as being a tonic. The appearance of the origin as a reference pitch is only justified by our choice of the generator $s_4$ in the given system. If we took $e^{-\varepsilon.b}.s_4$ instead, then $b$ would play the role of the reference pitch.
Example 48
$$3 + \varepsilon.4 \mapsto 3 + \varepsilon.7 = (1 + \varepsilon.1).(3 + \varepsilon.4) \mapsto 3 + \varepsilon.10 = (1 + \varepsilon.1).(3 + \varepsilon.7) \mapsto 3 + \varepsilon.1 = (1 + \varepsilon.1).(3 + \varepsilon.10) \mapsto 3 + \varepsilon.4 = (1 + \varepsilon.1).(3 + \varepsilon.1)$$
Example 49
$$6 + \varepsilon.3 \mapsto 6 + \varepsilon.9 = (1 + \varepsilon.1).(6 + \varepsilon.3) \mapsto 6 + \varepsilon.3 = (1 + \varepsilon.1).(6 + \varepsilon.9)$$
By the way, this symmetry type is the only one within our generator system which connects the cantus firmus and the discantus components in an irreducible manner. This symmetry type crystallizes an essential difference to the common reasoning in the pitch class space $PiMod_{12}$.
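The generator system and the two examples can be replayed numerically. In the following sketch, an element $a + \varepsilon.b$ of $\mathbb{Z}_{12}[\varepsilon]$ is encoded as the pair `(a, b)`; the helper names `mul`, `add`, `symmetry`, and `orbit_under_s4` are our own and only serve this illustration.

```python
MOD = 12


def mul(p, q):
    """(a + eps.b).(c + eps.d) = ac + eps.(bc + ad) in Z12[eps]."""
    (a, b), (c, d) = p, q
    return ((a * c) % MOD, (b * c + a * d) % MOD)


def add(p, q):
    """Componentwise sum in Z12[eps]."""
    return ((p[0] + q[0]) % MOD, (p[1] + q[1]) % MOD)


def symmetry(t, g):
    """The affine map e^t.g : z -> t + g.z on Z12[eps]."""
    return lambda z: add(t, mul(g, z))


S4 = (1, 1)  # the generator s4 = 1 + eps.1


def orbit_under_s4(start):
    """Successive images of `start` under multiplication by s4, until it returns."""
    orbit, z = [start], mul(S4, start)
    while z != start:
        orbit.append(z)
        z = mul(S4, z)
    return orbit


print(orbit_under_s4((3, 4)))              # Example 48: [(3, 4), (3, 7), (3, 10), (3, 1)]
print(orbit_under_s4((6, 3)))              # Example 49: [(6, 3), (6, 9)]
print(symmetry((5, 1), (1, 0))((2, 3)))    # e^{5+eps.1}(2 + eps.3) = (7, 4)
print(symmetry((4, 11), (1, 0))((7, 4)))   # e^{4-eps.1}(7 + eps.4) = (11, 3)
print(mul((11, 0), (2, 3)))                # case 3.1: (10, 9)
```

The orbit length under $s_4$ is the additive order of the cantus firmus in $\mathbb{Z}_{12}$, which is why the orbit in Example 48 has four elements and that in Example 49 only two.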
29.5
Self-addressed Arrows
Summary. Since arrows formally behave much like tones, self-addressed arrows can be introduced as a natural generalization of self-addressed tones and (ordinary) arrows via address change according to 8.3.4. This extension is described, together with a canonical projection which will play a major role in the theory of consonances and dissonances in section 30.2.1.
In this section, we want to concentrate on the algebraic relations between self-addressed tones and contrapuntal arrows. Fix an arbitrary commutative ring $R$ and consider the canonical $R$-linear injection $i: R \to R[\varepsilon]$. We also have the functorial $R$-linear injection $[\varepsilon]: R@_R R \to R[\varepsilon]@_{R[\varepsilon]}R[\varepsilon] : f \mapsto f[\varepsilon]$ defined in section 7.5. Furthermore, we have two $R$-linear address change injections $c: R \xrightarrow{\sim} 0_R@_R R \to R@_R R$, $c[\varepsilon]: R[\varepsilon] \xrightarrow{\sim} 0_R@_R R[\varepsilon] \to R[\varepsilon]@_{R[\varepsilon]}R[\varepsilon]$. This entails the following commutative diagram
$$\begin{array}{ccc} R & \xrightarrow{\;\;i\;\;} & R[\varepsilon] \\ {\scriptstyle c}\downarrow & & \downarrow{\scriptstyle c[\varepsilon]} \\ R@_R R & \xrightarrow{\;[\varepsilon]\;} & R[\varepsilon]@_{R[\varepsilon]}R[\varepsilon] \end{array} \qquad (29.1)$$
of $R$-linear injections. Whereas the left lower corner parametrizes the self-addressed tones in a simple form of module $R$, and the right upper corner parametrizes the contrapuntal intervals in the module $R$, the right lower corner parametrizes the self-addressed contrapuntal intervals, as opposed to the zero-addressed tones in the module $R$, or the prime counterpoint intervals, respectively. The four-dimensional $R$-module $R[\varepsilon]@_{R[\varepsilon]}R[\varepsilon]$ can also be viewed as a left $R[\varepsilon]$-module by the ordinary composition $a.x = e^0.a.x$ with linear endomorphisms $e^0.a$, $a \in R[\varepsilon]$. Then we have a direct decomposition in one-dimensional $R[\varepsilon]$-modules:
$$R[\varepsilon]@_{R[\varepsilon]}R[\varepsilon] = R[\varepsilon].e^{-\varepsilon}1 \oplus R[\varepsilon].e^{1}0 \qquad (29.2)$$
i.e., two two-dimensional $R$-modules. Moreover, this decomposition is also a kernel-image decomposition with respect to the idempotent$^5$ right multiplication endomorphism $?e^{\varepsilon}0$ of $R[\varepsilon]$-modules, i.e., $Ker(?e^{\varepsilon}0) = R[\varepsilon].e^{-\varepsilon}1$, $Im(?e^{\varepsilon}0) = R[\varepsilon].e^{1}0$. Now, the image is exactly the image of $R[\varepsilon]$ under $c[\varepsilon]$, whereas $?e^{\varepsilon}0$ maps the image of $R@_R R$ under $[\varepsilon]$ isomorphically (as $R$-module) onto the image $Im(?e^{\varepsilon}0)$. In other words:
Theorem 32 With respect to the embeddings of diagram (29.1), the projection $?e^{\varepsilon}0$ associates bijectively the self-addressed tones with the contrapuntal intervals and leaves the original tones in $R$ invariant. This means that self-addressed tones and contrapuntal intervals are put under a canonical algebraic correspondence in the large space of self-addressed arrows.
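The content of Theorem 32 can be made tangible for $R = \mathbb{Z}_{12}$. The sketch below rests on our own encoding, which is an assumption for illustration only: a dual number $a + \varepsilon.b$ is a pair `(a, b)`, and an affine endomorphism $e^t.g$ of $R[\varepsilon]$ is a pair `(t, g)` of dual numbers acting by $z \mapsto t + g.z$. Right multiplication by the constant map $e^{\varepsilon}0$ is then composition with `CONST_EPS`.

```python
MOD = 12


def d_add(p, q):
    return ((p[0] + q[0]) % MOD, (p[1] + q[1]) % MOD)


def d_mul(p, q):
    """(a + eps.b).(c + eps.d) = ac + eps.(bc + ad)."""
    (a, b), (c, d) = p, q
    return ((a * c) % MOD, (b * c + a * d) % MOD)


def compose(f, h):
    """(e^t.g) o (e^s.k) = e^{t + g.s}.(g.k) for affine maps given as pairs."""
    (t, g), (s, k) = f, h
    return (d_add(t, d_mul(g, s)), d_mul(g, k))


CONST_EPS = ((0, 1), (0, 0))  # e^eps.0: the constant map with value eps


def project(f):
    """Right multiplication ?e^eps 0: precompose f with the constant map eps."""
    return compose(f, CONST_EPS)


def self_addressed_tone(a, b):
    """Image under [eps] of the self-addressed tone e^a.b with a, b in R."""
    return ((a, 0), (b, 0))


# The self-addressed tone e^a.b is sent to the constant map with value a + eps.b.
print(project(self_addressed_tone(3, 4)))   # ((3, 4), (0, 0))
# The kernel generator e^{-eps}.1 is sent to the zero map.
print(project(((0, 11), (1, 0))))           # ((0, 0), (0, 0))
```

Under these assumptions the projection sends the tone $e^a.b$ to the constant map with value $a + \varepsilon.b$, i.e., to the contrapuntal interval with cantus firmus $a$ and interval $b$, and it fixes every constant map, in particular the original tones.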
29.6
Change of Orientation
Summary. Orientation within a contrapuntal sequence may change, and this enforces a special treatment of contrapuntal steps from sweeping orientation to hanging orientation or vice versa. We show how such a change can be reduced to an orientation preserving situation by means of the regular embedding of the algebra of dual numbers in the linear endomorphism ring.
5 See also appendix C.2.3, example 68.
The left-regular embedding$^6$ identifies the two-dimensional dual number algebra with a subspace of the ring of two-by-two matrices over $R$. In this embedding, the dual number algebra is generated by the dual number multiplication
$$(\varepsilon) = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$$
with the relation $(\varepsilon)^2 = 0$. The sweeping and hanging orientations are interpreted as projections
$$\alpha_+ = \begin{pmatrix} 1 & 1 \\ 0 & 0 \end{pmatrix}, \quad \alpha_- = \begin{pmatrix} 1 & -1 \\ 0 & 0 \end{pmatrix}$$
which are related by the multiplicative relation (29.3). The matrix algebra $M_{2,2}(R)$ is generated by two indeterminates $(\varepsilon)$, $\alpha_+$ with relations $(\varepsilon)^2 = 0$, $\alpha_+^2 = \alpha_+$, and $\alpha_+.(\varepsilon) + (\varepsilon).\alpha_+ = (1 + \varepsilon)$, and it is spanned by the linear basis $1, (\varepsilon), \alpha_+, \alpha_+.(\varepsilon)$. Relation (29.3) can be used to reinterpret hanging counterpoint in terms of sweeping counterpoint as follows: Suppose that we have a sequence $x_1 + \varepsilon.i_1$, $x_2 + \varepsilon.i_2$ with the first interval in sweeping, but the second one in hanging orientation. This means that the evaluation via orientation projections produces the two discantus instances $\alpha_+(x_1 + \varepsilon.i_1)$, $\alpha_-(x_2 + \varepsilon.i_2)$. In order to change the interpretation of orientations, use (29.3) and rephrase the second discantus so that we are dealing with sweeping orientation related to a new interval.
This technique will be used to deduce contrapuntal steps while orientation changes. Observe that the mediator factor $(1 + \varepsilon)$ in formula (29.3) enhances the musical meaning of the fourth generator symmetry $s_4$ in section 29.4: The multiplication by generator $s_4$ helps to reinterpret hanging orientation in terms of sweeping orientation.
6 See appendix D.1.
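The stated relations among $(\varepsilon)$ and $\alpha_+$ can be verified directly with two-by-two integer matrices. The sketch below assumes that $(\varepsilon)$ is the lower-triangular nilpotent matrix acting on column vectors $(x, i)$ that represent $x + \varepsilon.i$; this is the choice for which the relation $\alpha_+.(\varepsilon) + (\varepsilon).\alpha_+ = 1 + (\varepsilon)$ holds. The helper names are ours.

```python
def matmul(A, B):
    """Product of two 2x2 matrices given as tuples of row tuples."""
    return tuple(tuple(sum(A[r][k] * B[k][c] for k in range(2)) for c in range(2))
                 for r in range(2))


def matadd(A, B):
    return tuple(tuple(A[r][c] + B[r][c] for c in range(2)) for r in range(2))


def apply(M, x, i):
    """Apply M to the column vector (x, i) representing x + eps.i."""
    return (M[0][0] * x + M[0][1] * i, M[1][0] * x + M[1][1] * i)


ONE = ((1, 0), (0, 1))
ZERO = ((0, 0), (0, 0))
EPS = ((0, 0), (1, 0))        # multiplication by eps: x + eps.i -> eps.x
A_PLUS = ((1, 1), (0, 0))     # sweeping projection: discantus x + i
A_MINUS = ((1, -1), (0, 0))   # hanging projection: discantus x - i

print(matmul(EPS, EPS) == ZERO)                    # eps^2 = 0
print(matmul(A_PLUS, A_PLUS) == A_PLUS)            # alpha_+ is idempotent
print(matadd(matmul(A_PLUS, EPS), matmul(EPS, A_PLUS)) == matadd(ONE, EPS))
print(apply(A_PLUS, 2, 3), apply(A_MINUS, 2, 3))   # (5, 0) (-1, 0)
```

The first coordinate of `apply(A_PLUS, x, i)` and `apply(A_MINUS, x, i)` is the discantus $x + i$ and $x - i$, respectively, which matches the reading of the two orientations as projections.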
Chapter 30
Interval Dichotomies as an Expression of Contrast
enim vero sicut vitium mala virtus a nullo umquam morali philosopho dictum fuit, ita nec musicus umquam litteratus discordiantiam malam concordantiam nuncupavit.
Johannes Tinctoris [527, p.90]
Summary. For contrapuntal composition and theory, consonant and dissonant intervals are a dichotomic concept. We present the mathematical restatement and fundamental properties of the basic concept of an interval dichotomy. For the classification of dichotomies, a strong condition on unique symmetries of polarity between the two halves of dichotomies is added. It reveals a distinguished role of the consonance/dissonance dichotomy of classical counterpoint and of the major dichotomy (associated with the major scale). We discuss evidence of the consonance/dissonance dichotomy from theoretical and empirical points of view and open the discourse to an intercultural perspective guided by the classification of dichotomies, as investigated by Jens Richert [223].
Remark 16 In the following counterpoint chapters, we shall tacitly work in the fifth pitch and interval spaces, i.e., we take the automorphism q = (7) in the identifiers. We therefore shall, for example, speak of the fifth when addressing the unit 1. Whenever we deviate from this convention, the reader will be warned. We shall also stick to the forms built upon $\mathbb{Z}_{12}$ and tacitly carry over the distance structures of the isomorphic Sylow torus representation $\mathbb{Z}_3 \oplus \mathbb{Z}_4$ in order to keep notation simpler.
629 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
30.1
Dichotomies and Polarity
Summary. The technical definition of interval dichotomies in $IntMod_{12,q}$ and in the counterpoint arrow form $IntMod_{12,q}[\varepsilon]$ is given (see footnote 1). We discuss canonical polarities, in fact automorphisms of such dichotomies, as well as their topological behavior, which is measured by the diameter and span of a dichotomy. This is used to classify the system of dichotomies. Among the 26 classes, six classes admit unique polarities. These classes of strong dichotomies show a topologically distinguished position of the consonance/dissonance and the major dichotomy, the latter being defined by the six tonic-rooted (proper) intervals of major tonality. Among the strong dichotomies, the consonance-dissonance dichotomy itself (not only its class) is distinguished by a number of characteristics.
Let A be an addressed $\mathbb{Z}$-module. Then an objective, A-addressed local composition X in ambient space $IntMod_{12,q}$ is called an (A-addressed) marked interval dichotomy iff it is equipollent (footnote 2) to its complement $C(X)$ in the total local composition $A@IntMod_{12,q}$. Observe that the latter is not necessarily finite. Hence we have the complement action $i \mapsto C^i(X)$ of $\mathbb{Z}_2$ on the set $DiM(A, IntMod_{12,q})$ of A-addressed marked dichotomies. An interval dichotomy is a $\mathbb{Z}_2$-orbit of marked interval dichotomies; the orbit set is $Di(A, IntMod_{12,q})$.
A second left action is defined by the automorphism group $\overrightarrow{GL}(\mathbb{Z}_{12})$ of the ambient space $IntMod_{12,q}$. If $g \in \overrightarrow{GL}(\mathbb{Z}_{12})$, $X \in DiM(A, IntMod_{12,q})$, then we set $g.X = \{g.x \mid x \in X\}$, and $CiM(A, IntMod_{12,q})$ denotes the orbit space of this action; its elements are called the (A-addressed) marked dichotomy classes. Clearly the two actions commute, i.e., we have a left action of $\mathbb{Z}_2 \times \overrightarrow{GL}(\mathbb{Z}_{12})$ on $DiM(A, IntMod_{12,q})$, an induced action of $\overrightarrow{GL}(\mathbb{Z}_{12})$ on $Di(A, IntMod_{12,q})$, and one of $\mathbb{Z}_2$ on the space of marked dichotomy classes $CiM(A, IntMod_{12,q})$. The total orbit space
$$Ci(A, IntMod_{12,q}) = \mathbb{Z}_2 \times \overrightarrow{GL}(\mathbb{Z}_{12}) \backslash DiM(A, IntMod_{12,q})$$
is the space of (A-addressed) dichotomy classes.
The same constructions (mutatis mutandis) can be made on the space of counterpoint arrows $IntMod_{12,q}[\varepsilon]$. Let A be an addressed $\mathbb{Z}$-module. Then an objective, A-addressed local composition X in ambient space $IntMod_{12,q}[\varepsilon]$ is called an (A-addressed) marked counterpoint dichotomy iff it is equipollent to its complement $C(X)$ in the total local composition $A@IntMod_{12,q}[\varepsilon]$. On the set $DcM(A, IntMod_{12,q}[\varepsilon])$ of A-addressed marked counterpoint dichotomies, we have the complement action $i \mapsto C^i(X)$ of $\mathbb{Z}_2$. A counterpoint dichotomy is a $\mathbb{Z}_2$-orbit of marked counterpoint dichotomies; the orbit set is $Dc(A, IntMod_{12,q}[\varepsilon])$.
A second left action is defined by the automorphism group $\overrightarrow{GL}(\mathbb{Z}_{12}[\varepsilon])$ of the ambient space $IntMod_{12,q}[\varepsilon]$. If $g \in \overrightarrow{GL}(\mathbb{Z}_{12}[\varepsilon])$, $X \in DcM(A, IntMod_{12,q}[\varepsilon])$, then we set $g.X = \{g.x \mid x \in X\}$, and $CcM(A, IntMod_{12,q}[\varepsilon])$ denotes the orbit space of this action; its elements are called the (A-addressed) marked counterpoint dichotomy classes. Clearly the two actions commute, i.e., we have a left action of $\mathbb{Z}_2 \times \overrightarrow{GL}(\mathbb{Z}_{12}[\varepsilon])$ on $DcM(A, IntMod_{12,q}[\varepsilon])$, an induced action of $\overrightarrow{GL}(\mathbb{Z}_{12}[\varepsilon])$ on $Dc(A, IntMod_{12,q}[\varepsilon])$, and one of $\mathbb{Z}_2$ on the space of marked counterpoint dichotomy classes $CcM(A, IntMod_{12,q}[\varepsilon])$. The total orbit space
$$Cc(A, IntMod_{12,q}[\varepsilon]) = \mathbb{Z}_2 \times \overrightarrow{GL}(\mathbb{Z}_{12}[\varepsilon]) \backslash DcM(A, IntMod_{12,q}[\varepsilon])$$
is the space of (A-addressed) counterpoint dichotomy classes.
1 This means the interval form with the identifier defined by the automorphism q of $\mathbb{Z}_{12}$, i.e., $IntMod_{12,q} \xrightarrow{\;Id\;} Syn(PiMod_{12,q})$, etc.
2 ... has same cardinality as ...
Definition 92 A marked interval dichotomy X is called autocomplementary if it is isomorphic to its complement C(X), i.e., iff its dichotomy class coincides with its marked dichotomy class. The marked dichotomy X is called rigid if its symmetry group is trivial. It is called strong if it is autocomplementary and rigid.
If a marked dichotomy X is autocomplementary, so is its complement. If a marked dichotomy is rigid, so is its complement. Hence, if X is strong, so is its complement. So autocomplementarity, rigidity, and strength are invariants of the dichotomy classes. From the classification of zero-addressed objective local compositions in $PiMod_{12}$ in appendix L.1, we know that there are 34 classes of zero-addressed marked interval dichotomies (26 classes, numbers 63 to 88, complements not counted twice; count twice the 8 numbers without *). There are 26 interval dichotomy classes, 8 autocomplementary classes, and 6 strong classes. We often denote a marked dichotomy by (X/C(X)) and its class by [X/C(X)], whereas a dichotomy is denoted by (X|C(X)) and its class by [X|C(X)].
Since the group of symmetries acts on the set of dichotomies, one can say that a symmetry stems not only from an isomorphism of the underlying marked dichotomies but is also associated with an isomorphism of the associated dichotomy. More precisely, if we are given a symmetry $f : (X/Y) \to (U/V)$ between two marked dichotomies, we get an isomorphism of the interpretations that are associated with the partitions $X \cup Y$ and $U \cup V$. Conversely, an isomorphism between two such interpretations is induced by two isomorphisms on two pairs of charts. But each such isomorphism automatically gives rise to an isomorphism of the other chart pair; the only thing which we lose is the order of charts, i.e., we are left with a transformation among the (non-marked) dichotomies. In this sense the unique non-trivial inner symmetry p of a strong dichotomy (X|Y), i.e., p(X/Y) = (p(X)/p(Y)) = (Y/X), is also called its polarity.
Example 50 Here are the six strong (marked and unmarked) dichotomies, the numbers referring to the classification table of local compositions in appendix L.1. The polarity of dichotomy number n and identifier number q is denoted by $p_{q,n}$, or $p_n$ for q = 7, if the context is clear. If an index is omitted, we tacitly suppose the fifth representation with q = 7.
1. The dichotomy Nr. 64 $\Delta_{7,64} = (I_7/J_7) = (\{1, 2, 3, 4, 5, 11\}|\{0, 6, 7, 8, 9, 10\})$ with polarity $p_{7,64} = 7.p_{1,64}.7 = e^{11}.11$, corresponding to the dichotomy $\Delta_{1,64} = (I_1/J_1) = (\{2, 4, 5, 7, 9, 11\}|\{0, 1, 3, 6, 8, 10\})$ in the semitone representation (q = 1), with polarity $p_{1,64} = e^5.11$. The dichotomy $\Delta_{7,64}$ arises when considering all proper (non-vanishing) intervals in a major scale when counted from the tonic.
2. The dichotomy Nr. 68 $\Delta_{7,68} = (\{0, 2, 7, 8, 9, 11\}|\{1, 3, 4, 5, 6, 10\})$ with polarity $p_{7,68} = 7.p_{1,68}.7 = e^6.5$, corresponding to the dichotomy $\Delta_{1,68} = (\{0, 1, 2, 3, 5, 8\}|\{4, 6, 7, 9, 10, 11\})$ in the semitone representation with q = 1, with polarity $p_{1,68} = e^6.5$.
3. The dichotomy Nr. 71 $\Delta_{7,71} = (\{0, 1, 2, 6, 7, 9\}|\{3, 4, 5, 8, 10, 11\})$ with polarity $p_{7,71} = 7.p_{1,71}.7 = e^5.11$, corresponding to the dichotomy $\Delta_{1,71} = (\{0, 1, 2, 3, 6, 7\}|\{4, 5, 8, 9, 10, 11\})$ in the semitone representation with q = 1, with polarity $p_{1,71} = e^{11}.11$.
4. The dichotomy Nr. 75 $\Delta_{7,75} = (\{0, 2, 4, 7, 8, 11\}|\{1, 3, 5, 6, 9, 10\})$ with polarity $p_{7,75} = 7.p_{1,75}.7 = e^5.11$, corresponding to the dichotomy $\Delta_{1,75} = (\{0, 1, 2, 4, 5, 8\}|\{3, 6, 7, 9, 10, 11\})$ in the semitone representation with q = 1, with polarity $p_{1,75} = e^{11}.11$.
5. The dichotomy Nr. 78 $\Delta_{7,78} = (\{0, 2, 4, 6, 7, 10\}|\{1, 3, 5, 8, 9, 11\})$ with polarity $p_{7,78} = 7.p_{1,78}.7 = e^3.11$, corresponding to the dichotomy $\Delta_{1,78} = (\{0, 1, 2, 4, 6, 10\}|\{3, 5, 7, 8, 9, 11\})$ in the semitone representation with q = 1, with polarity $p_{1,78} = e^9.11$.
6. This is the classical dichotomy Nr. 82 $\Delta_{82} = \Delta_{7,82} = (K_7/D_7) = (\{0, 1, 3, 4, 8, 9\}|\{2, 5, 6, 7, 10, 11\})$ of contrapuntal consonances (left) and dissonances (right) in the fifth system. Its polarity is 'the' autocomplementarity function $p_{7,82} = 7.e^2.5.7 = e^2.5$ deduced from the known (footnote 3) autocomplementarity function $p_{1,82} = e^2.5$ of the consonance-dissonance dichotomy $\Delta_{1,82} = (K_1/D_1) = (\{0, 3, 4, 7, 8, 9\}|\{1, 2, 5, 6, 10, 11\})$ in the semitone representation, as discussed in section 24.1.1.
3 See [336], for example.
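The data of example 50 can be checked mechanically. The following is a minimal sketch, assuming that a symbol $e^t.s$ stands for the affine map $x \mapsto s \cdot x + t$ on $\mathbb{Z}_{12}$ and that passing from the semitone (q = 1) to the fifth (q = 7) representation means multiplying intervals by 7 and conjugating symmetries by 7. The function and variable names are hypothetical and only serve the illustration.

```python
# Semitone representation (q = 1) of the six strong dichotomies of example 50:
# class number -> (first half X, polarity (t, s) standing for e^t.s : x -> s*x + t mod 12).
STRONG_SEMITONE = {
    64: ({2, 4, 5, 7, 9, 11}, (5, 11)),
    68: ({0, 1, 2, 3, 5, 8}, (6, 5)),
    71: ({0, 1, 2, 3, 6, 7}, (11, 11)),
    75: ({0, 1, 2, 4, 5, 8}, (11, 11)),
    78: ({0, 1, 2, 4, 6, 10}, (9, 11)),
    82: ({0, 3, 4, 7, 8, 9}, (2, 5)),
}

Z12 = frozenset(range(12))


def apply_symmetry(sym, x):
    """Evaluate e^t.s at x, i.e. s*x + t modulo 12 (assumed reading of the notation)."""
    t, s = sym
    return (s * x + t) % 12


def is_polarity(sym, half):
    """True if the symmetry maps the half X onto its complement C(X)."""
    return {apply_symmetry(sym, x) for x in half} == Z12 - set(half)


def to_fifth_half(half):
    """Pass from q = 1 to q = 7 by multiplying every interval by 7."""
    return {(7 * x) % 12 for x in half}


def to_fifth_polarity(sym):
    """Conjugation 7.p.7: x -> 7*(s*(7*x) + t) = 49*s*x + 7*t modulo 12."""
    t, s = sym
    return ((7 * t) % 12, (49 * s) % 12)


def fifth_table():
    """Fifth representation (half, polarity) of every strong class."""
    return {n: (to_fifth_half(half), to_fifth_polarity(pol))
            for n, (half, pol) in STRONG_SEMITONE.items()}


if __name__ == "__main__":
    for n, (half, pol) in sorted(fifth_table().items()):
        print(n, sorted(half), "e^%d.%d" % pol, is_polarity(pol, half))
```

Running the script lists, for each class number, the first half and the polarity in the fifth representation together with a flag telling whether the polarity really exchanges the two halves.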
These strong dichotomies can also be represented as partitions of the interval torus in the Sylow representation $IntThirds_{3,4,q} \to Syn(IntThirds_{3,4})$ with the q-identifier (usually the fifth). Recall that we have the third distance d(x, y) on this torus, and that it is the same as the distance with identifier q = 1.
Definition 93 Let (X/Y) be a strong marked dichotomy in $IntThirds_{3,4,q}$. Then its diameter is defined by
$$\delta(X/Y) = \frac{1}{2} \sum_{u,v \in X} d(u, v).$$
By use of the polarity of a strong dichotomy one sees that $\delta(X/Y) = \delta(Y/X)$ so that we may define this number as the diameter $\delta(X|Y)$ of the strong dichotomy (X|Y). Since evidently all dichotomies of a dichotomy class have the same diameter (recall that symmetries of the torus are isometries for the distance), we may define the diameter $\delta[X|Y]$ of a dichotomy class [X|Y] by the diameter of any of its representatives. The diameter measures the average distance between points of one half of the dichotomy.
Definition 94 Let (X/Y) be a strong marked dichotomy in $IntThirds_{3,4,q}$ with polarity p. Then its span is defined by
$$\sigma(X/Y) = \sum_{u \in X} d(u, p(u)).$$
Since any polarity is an involution in our context, one sees that $\sigma(X/Y) = \sigma(Y/X)$ so that we may define this number as the span $\sigma(X|Y)$ of the strong dichotomy (X|Y). Since evidently all dichotomies of a dichotomy class have the same span (recall that symmetries of the torus are isometries for the distance), we may define the span $\sigma[X|Y]$ of a dichotomy class [X|Y] by the span of any of its representatives. Diameter and span of our six strong classes are visualized in figure 30.1.
Intuitively, the minimality of $\delta(K|D)$ means that the subsets K and D are separated in an optimal way on the torus (figure 30.1). The maximality of $\delta(I|J)$ means that I and J are optimally mixed on the torus. If we stay on a point of I, and we want to go to another point of I on a shortest path, then we often have to traverse a point of J, a phenomenon which never happens for (K|D): Between any two consonant intervals there is always a shortest path which does not leave the consonant half. By the given polarities, all these statements about K and I are also valid for D and J. The possible connections within K and within I (and the complements, respectively) are shown in figure 30.2. In contrast to the graph of I, the graph of K has no inner symmetry, i.e., every consonant interval is uniquely determined by its position on the graph.
This means that the consonance-dissonance dichotomy has a privileged position among all strong dichotomies. But this does not yet exhibit the precise representative (K/D) within the class [K|D]. The selection of the marked dichotomy (K/D) is in fact realized by an algebraic condition: to require that the first half X of an element (X/Y) of the class [K|D] be a multiplicative monoid. This condition was discovered by Noll [400]. This in fact exhibits the marked half of consonances against the dissonances; these two parts are not equivalent from this point of view. Until now, however, it is not clear what the structural consequences are of the predicates which uniquely exhibit the consonances.
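Diameter, span, and the monoid condition can be computed directly. The sketch below assumes that the third distance d(x, y) counts the minimal number of minor-third (3 semitones) and major-third (4 semitones) steps leading from x to y on the torus $\mathbb{Z}_3 \times \mathbb{Z}_4$, and that $e^t.s$ denotes $x \mapsto s \cdot x + t$; both readings are assumptions of this illustration, and all names are hypothetical. Under these assumptions the consonance half K has diameter 24 and span 16, whereas the major half I has diameter 29 and span 10.

```python
def third_distance(x, y):
    """Minimal number of third steps (+-3 or +-4 semitones) from x to y in Z12.

    Every z in Z12 is uniquely 3*a + 4*b with a in Z4 and b in Z3 (Sylow
    decomposition), so the distance is the sum of the two cyclic distances.
    """
    z = (y - x) % 12
    minor = (3 * z) % 4   # coefficient a of the minor-third generator
    major = z % 3         # coefficient b of the major-third generator
    return min(minor, 4 - minor) + min(major, 3 - major)


def apply_symmetry(sym, x):
    """Evaluate e^t.s at x, i.e. s*x + t modulo 12."""
    t, s = sym
    return (s * x + t) % 12


def diameter(half):
    """Definition 93: half the sum of d(u, v) over all ordered pairs in X."""
    return sum(third_distance(u, v) for u in half for v in half) // 2


def span(half, polarity):
    """Definition 94: sum of d(u, p(u)) over all u in X."""
    return sum(third_distance(u, apply_symmetry(polarity, u)) for u in half)


def is_multiplicative_monoid(half):
    """True if the subset contains 1 and is closed under multiplication mod 12."""
    return 1 in half and all((a * b) % 12 in half for a in half for b in half)


K1 = {0, 3, 4, 7, 8, 9}      # consonances, semitone representation
I1 = {2, 4, 5, 7, 9, 11}     # major dichotomy, semitone representation
K7 = {0, 1, 3, 4, 8, 9}      # consonances, fifth representation
D7 = {2, 5, 6, 7, 10, 11}    # dissonances, fifth representation

if __name__ == "__main__":
    print("K:", diameter(K1), span(K1, (2, 5)))
    print("I:", diameter(I1), span(I1, (5, 11)))
    print("K7 monoid:", is_multiplicative_monoid(K7))
    print("D7 monoid:", is_multiplicative_monoid(D7))
```

In the fifth representation the consonance half contains the unit 1 and is closed under multiplication, while the dissonance half is not, which illustrates how the monoid condition singles out the marked half.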
Figure 30.1: (a) Span $\sigma(X)$ and diameter $\delta(X)$ of the six strong dichotomies. The polar position of the consonance-dissonance dichotomy (Nr. 82) against the major dichotomy (Nr. 64) is visible. (b) The geometric meaning of the minimal diameter and the maximal span in (K/D) is evidenced by an optimal separation of the two halves of the dichotomy.
30.2
The Consonance and Dissonance Dichotomy
Summary. The consonance-dissonance dichotomy is in a canonical bijection with the Riemann consonances. We also give empirical evidence of this dichotomy from brain research. The section concludes with a cognitive interpretation of the specific role of music for the individual psyche.
Recall from section 24.1.1 that the consonance-dissonance dichotomy also appears in the context of just intonation and relates to the 12-tempered case by means of the enharmonic projection. We now want to investigate further remarkable properties of the (K/D) dichotomy. To begin with, we define a (marked) dichotomy $(X[\varepsilon]/Y[\varepsilon])$ of contrapuntal intervals for every (marked) dichotomy (X/Y) on the space form S by the rule $(X[\varepsilon]/Y[\varepsilon]) = (coord(S) + \varepsilon.X / coord(S) + \varepsilon.Y)$, where coord(S) is the coordinator module of S.
Figure 30.2: The graphs of all strong dichotomies, regarding the third connections. The graphs among the I (major, class 64) and the K (consonance, class 82) intervals show characteristic differences. While in I, all intervals are positioned on a line, the consonance graph shows two minimal paths (0-3-7; 0-4-7) between 0 and 7, for example. Observe that when starting from the prime, all imperfect consonances (thirds, sixths) can be reached directly, whereas the perfect fifth is the only non-neighbor of the prime.
30.2.1
Fux and Riemann Consonances Are Isomorphic
Summary. We present the Riemann dichotomy and its 1-1 correspondence with the Fux dichotomy based on the diagram discussed in 29.5.
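Recall the commutative diagram (29.1) of $\mathbb{Z}_{12}$-linear injections for the special ring $R = \mathbb{Z}_{12}$.
$$\begin{array}{ccc}
\mathbb{Z}_{12} & \xrightarrow{\;i\;} & \mathbb{Z}_{12}[\varepsilon] \\
\downarrow {\scriptstyle e} & & \downarrow {\scriptstyle e[\varepsilon]} \\
\mathbb{Z}_{12}@_{\mathbb{Z}_{12}}\mathbb{Z}_{12} & \xrightarrow{\;[\varepsilon]\;} & \mathbb{Z}_{12}[\varepsilon]@_{\mathbb{Z}_{12}[\varepsilon]}\mathbb{Z}_{12}[\varepsilon]
\end{array} \qquad (30.1)$$
We may view this configuration as a realization of determined forms as follows:
$$\begin{array}{ccc}
0@PitchMod_{12,q} & \xrightarrow{\;i\;} & 0@IntMod_{12,q}[\varepsilon] \\
\downarrow {\scriptstyle e} & & \downarrow {\scriptstyle e[\varepsilon]} \\
\mathbb{Z}_{12}@PitchMod_{12,q} & \xrightarrow{\;[\varepsilon]\;} & \mathbb{Z}_{12}[\varepsilon]@IntMod_{12,q}[\varepsilon]
\end{array} \qquad (30.2)$$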
Observe that the space in the right bottom corresponds to the larger space $\mathbb{Z}_{12}[\varepsilon]@\mathbb{Z}_{12}[\varepsilon]$. Also, the lower horizontal arrow is not a natural one in terms of ambient morphisms $f/\alpha$ for local compositions. To begin with, let us look at the Riemann dichotomy which was first defined in [400]. For a justification of this naming that relates to Riemann's concept of relative consonances and dissonances, see [400]. The Riemann dichotomy is the monoid Trans(D, T) generated by the "transporter" set Tr(D, T) consisting by definition of all not necessarily invertible symmetries $f : D = \{1, 2, 5\} \to T = \{0, 1, 4\}$ from the dominant triad D to the tonic triad T (in the fifth system). The monoid Trans(D, T) has 72 elements. So it naturally gives rise to a dichotomy on $\mathbb{Z}_{12}@PitchMod_{12,q}$ and also defines a point in the intension topology $InTop(PitchMod_{12,q})$.
Exercise 67 Show that card(Trans(D, T)) = 72.
On the other hand, consider the consonant contrapuntal intervals $K[\varepsilon] = \mathbb{Z}_{12} + \varepsilon.K$ in $0@IntMod_{12,q}[\varepsilon]$. The intension $Int(K[\varepsilon])$ is canonically identified with a local composition in $\mathbb{Z}_{12}[\varepsilon]@IntMod_{12,q}[\varepsilon]$ since the address $\mathbb{Z}_{12}[\varepsilon]$ is faithful (take the identity). The intersection W of $Int(K[\varepsilon])$ with the subspace $\mathbb{Z}_{12}@PitchMod_{12,q}$ consists of those endomorphisms $e^x.y$ of $PitchMod_{12,q}$ that induce endomorphisms of $K[\varepsilon]$. Using the isomorphism
from section 29.5, W identifies with the monoid $R \in InTop(PitchMod_{12,q})$ of left stabilizers of $v^{-1}(K[\varepsilon])$. In [400] it is shown that Trans(D, T) = W. More precisely, with the above identification, we have this theorem:
Proposition 50 Let $e^x.y$ be an endomorphism of $PitchMod_{12,q}$. Then it is in the left stabilizer (footnote 4) of $v^{-1}(K[\varepsilon])$ iff it is a product $e^x.y = \prod_i e^{x_i}.y_i$ of endomorphisms (footnote 5) which transport the dominant triad D = {1, 2, 5} into the tonic triad T = {0, 1, 4}. In particular, every element of $v^{-1}(K[\varepsilon])$ can be written as such a product of transporter endomorphisms.
The last statement follows from the fact that the consonance interval numbers K are a multiplicative monoid, and therefore, each element $e^x.k$, $k \in K$, is a stabilizer of $v^{-1}(K[\varepsilon])$. This implies the following:
Corollary 19 Let $f = 0 + \varepsilon.1$ be the fifth interval at the C tonic 0. For any consonant interval $c = x + \varepsilon.k$, there is a sequence $t_i = e^{x_i}.y_i$, $i = 1, \ldots, m$, of transporter endomorphisms $t_i : D \to T$ such that $c = t_m \cdot t_{m-1} \cdots t_1(f)$.
Evidently, there is a deep relation between Riemann theory (as it appears in Noll's perspective) and the Fux dichotomy of consonance-dissonance. Presently, we do not know more about the harmonic/contrapuntal implications of the above results. The straightforward hope is that the transition from polyphonic counterpoint to homophonic harmony relates to the above mathematical facts. But neither the systematic nor the historical consequences of these facts are within reach. It is however true that musicology has never understood the theoretical relations between counterpoint and harmony. There must be a fundamental relation because the development of homophony out of the polyphonic tradition cannot be a rupture without any inner coherence. Even if such a rupture were a historic fact, it would be a primordial question of systematic musicology to explain the structural, system-immanent rationales for such a rupture. The present results give first hints for the explanation of this lacuna.
4 I.e., for all endomorphisms $e^t.k$ with consonant component k, $e^x.y.e^t.k = e^s.k'$ also has a consonant component $k'$.
5 Including the "empty" product, i.e., the identity.
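The cardinality claimed in exercise 67 and the description of the stabilizer in proposition 50 can be explored by brute force. The following sketch assumes that an endomorphism $e^x.y$ acts on pitch classes as $z \mapsto y \cdot z + x$ modulo 12 and that a transporter is any such map sending D = {1, 2, 5} into T = {0, 1, 4}; the helper names are hypothetical. It is a numerical check, not a substitute for the proof asked for in the exercise.

```python
D = {1, 2, 5}               # dominant triad (fifth system)
T = {0, 1, 4}               # tonic triad (fifth system)
K7 = {0, 1, 3, 4, 8, 9}     # consonant interval numbers (fifth system)


def transporters():
    """All pairs (x, y), standing for e^x.y : z -> y*z + x, that map D into T."""
    return {(x, y) for x in range(12) for y in range(12)
            if all((y * d + x) % 12 in T for d in D)}


def compose(f, g):
    """Composition f o g of two affine maps given as pairs (shift, factor)."""
    x, y = f
    a, b = g
    return ((y * a + x) % 12, (y * b) % 12)


def generated_monoid(generators):
    """Closure of the generators under composition, including the identity."""
    identity = (0, 1)
    monoid = {identity}
    frontier = [identity]
    while frontier:
        f = frontier.pop()
        for g in generators:
            h = compose(g, f)
            if h not in monoid:
                monoid.add(h)
                frontier.append(h)
    return monoid


if __name__ == "__main__":
    gens = transporters()
    trans = generated_monoid(gens)
    print(len(gens), "transporters generate", len(trans), "endomorphisms")
    print(trans == {(x, k) for x in range(12) for k in K7})
```

Under these assumptions there are eight transporters, and the monoid they generate consists exactly of the maps $e^x.k$ with arbitrary shift x and consonant factor k, which gives 12 · 6 = 72 elements.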
30.2.2
Induced Polarities
Summary. We describe the autocomplementary functions induced on the contrapuntal dichotomies $(X[\varepsilon]/Y[\varepsilon])$ which are deduced from strong dichotomies (X/Y).
Suppose that we are given a strong dichotomy (X/Y) which bears the polarity $e^u.v$. Then the contrapuntal dichotomy is no longer rigid, but still autocomplementary. The precise situation is described as follows.
Proposition 51 Let $\Delta = (X/Y)$ be a strong dichotomy with polarity $p_\Delta = e^u.v$. Choose a cantus firmus point x. Then there is exactly one symmetry $p^x_\Delta$ on the counterpoint interval space $IntMod_{12,q}[\varepsilon]$ which is a polarity of $(X[\varepsilon]/Y[\varepsilon])$ and fixes the "tangent space" $I_x = x + \varepsilon.\mathbb{Z}_{12}$ at x. Call this the polarity at x. We have
and under this polarity, a tangent space $I_y$ is mapped onto the tangent space $I_{x+v(y-x)}$.
Exercise 68 Give a proof of proposition 51.
30.2.3
Empirical Evidence for the Polarity Function
Summary. A review of neurophysiological verifications of the presence of the Fux polarity in human depth EEG is presented.
Although this book is not a report on physiological or psychological correlates to music structures, it is important to give an overview of a pronounced evidence of electrophysiological correlates of the consonance-dissonance dichotomy. This is by no means a justification or even a proof of the adequacy of the mathematical investigations, but it must be considered as a fundamental relativization of traditional consonance-dissonance theories. These theories never produce dichotomies but yield degrees of consonance or dissonance, a quality which is completely irrelevant to the musical counterpoint dichotomy.
The aim of a project at the Neurology Department of the Zurich University Hospital which the epileptologist Heinz-Gregor Wieser, the author, and their collaborators conducted during the years 1984-1988 was to test mathematical principles of classical counterpoint by means of depth EEG responses to musical stimuli. In particular, it was planned to test the relative results of the EEG to different musical inputs (consonances vs. dissonances) and not the relation of responses to musical stimuli versus non-musical stimuli. The latter problem has been investigated with much success by Hellmuth Petsche and his collaborators [415].
It is important to stress that from our results, we do not draw any kind of conclusions concerning a possible genetic nature of musical understanding or a possible universal validity of classical European interval categories. Our investigations show that in some defined regions of the brain of some European humans, certain significant reactions take place, nothing more and nothing less. There is no reason to generalize whatsoever, but there are enough reasons to try to repeat these investigations in other research sites with a comparable infrastructure. This is all the more desirable since the qualitative results of the investigations by Wieser and the author (namely the prominent role of limbic structures for the judgment of musical pleasantness) have been confirmed by others; see [58], for example. For a more complete report of our results, we refer to [336, 337, 353, 570, 571, 572].
In this short review, we shall restrict ourselves to the two subtests concerning (a) isolated successive intervals and (b) the polarity between simultaneous consonances and dissonances. Our results confirm our hypothesis on (1) a significant differentiation of EEG responses to consonant vs. dissonant intervals in limbic and auditory brain areas and (2) a pronounced sensitivity of these areas in EEG responses to the fundamental polarity between consonances and dissonances. In particular, the quantitative measurement of these responses by use of the "spectral participation vector" has confirmed our belief that this vector may carry some of the semantic charge of EEG signals.
30.2.3.1
The EEG Test
The test concerned different contexts of consonances and dissonances as well as the test of the polarity $e^2.5$. We used EEG from the scalp (Hess system), stereotactic depth EEG following [569], and multipolar foramen ovale recordings [573]. The tests were applied to the rare cases of patients who suffered from medically intractable complex partial epileptic seizures of suspected mediobasal temporal lobe origin and underwent presurgical evaluation with a view towards surgical epilepsy therapy. None of the total 13 patients considered the voluntary 30-minute music test through monophonic earphones as being disagreeable. There are several reasons why, despite the particular state of epileptics, the tests remain comparable to tests with normal humans which cannot be conducted for evident reasons. First, our tests were performed during interictal periods. Second, localization of the focus gives a good estimation of its possible influence. Third, epileptiform potentials are easily distinguished from others by the expert.
For each patient, we recorded 700 time windows for fast Fourier transform (FFT) spectral analysis with 256 samples per second, each window for different EEG channels, different power windows ($\delta$ = 0-4 Hz, $\theta$ = 4-8 Hz, $\alpha$ = 8-14 Hz, $\beta$ = 14-40 Hz) and repetitions, in total 11,000 raw spectral data per patient. Unfortunately, the project could not be completed for extrascientific reasons and hence, only two patients have been thoroughly evaluated. These patients were C. J.-L., a 35-year-old academic, and V.S., a 31-year-old artisan. Both are male Europeans and prefer standard classical, light, and folklore music. Figure 30.3 shows the positions of bipolar depth EEG recordings which we are going to discuss. Notice that C. J.-L.'s recordings RCA, RH, LCA are homologous to V.S.'s recordings 4, 10, 14. Recording RCA lies within the right hippocampus, recording RH is positioned within Heschl's gyrus (auditory cortex), and recording LCA lies within the left hippocampus.
Figure 30.3: Implantation scheme for the two patients C. J.-L. and V.S. following X-rays showing the topographic position of depth electrodes. The three homologous regions are indicated by dashed ovals. For C. J.-L., RCA is electrode 2/1-3, RH is 6/5-6, and LCA is 8/1-3. For V.S., 4 is electrode 2/1-2, 10 is 4/5-6, and 14 is 6/1-2. (Numbers after the slashes indicate precise positions on the electrodes, where 1 = deepest position and 10 = position near surface.)
Nevertheless, we have been able to observe a great deal of visual evidence for EEG response to music stimuli in neo- and archicortical regions of all patients; for details, see [336, 570].
30.2.3.2
Analysis by Spectral Participation Vectors
We used four sounds for these tests: piano, sine wave, cello (without vibrato), and "test", a clear, organ-like sound, all synthesized from a Yamaha TX7 synthesizer and CX5M voicing program in order to avoid possible emotional artifacts associated with natural sounds from sociocultural premises. The music program was written on a precursor of the commercial composition software presto® [338]. The spectral analysis was executed on a CDC Cyber computer. We made use of the spectral participation vector
$$S(E) = (P(E),\ P(E)/P_\delta(E),\ P(E)/P_\theta(E),\ P(E)/P_\alpha(E),\ P(E)/P_\beta(E))$$
of an event E and its associated participation value
$$v(E) = P(E)/P_\theta(E) + P(E)/P_\alpha(E) + P(E)/P_\beta(E)$$
which is a length measure. Here, $P(E)$ is the total spectral power [423], $P_\delta(E)$ is the $\delta$-power of event E, $P_\theta(E)$ is the $\theta$-power, $P_\alpha(E)$ is the $\alpha$-power, and $P_\beta(E)$ is the $\beta$-power, see also [336]. From the results obtained thus far we conclude that this representation is well suited to give an adequate picture of the possible semantic charge of EEG signals. It is a measure motivated by, among others, the well-known vigilance-related $\alpha$-participation $S_\alpha = P(E)/P_\alpha(E)$ observed by Berger. We also use the $\delta$-participation $S_\delta = P(E)/P_\delta(E)$, the $\theta$-participation $S_\theta = P(E)/P_\theta(E)$, and the $\beta$-participation $S_\beta = P(E)/P_\beta(E)$; the latter has been recognized as being strongly related to higher cognitive brain activity by Giannitrapani (see [182]) and Petsche et al. (who also focus on the $\gamma$-band 30-50 Hz) [416].
30.2.3.3
Isolated Successive Intervals
We first focus our attention on a subtest concerning musically isolated successive intervals, i.e., the two tones of an interval are played one after the other without interruption. All the intervals were played in three orders: (1) All consonances, ordered according to their size; then all dissonances, also ordered according to their size. (2) All consonances, ordered according to complementarity size (if possible); then all dissonances, ordered according to complementarity (if possible). (3) A mixed succession of all intervals according to a particular dodecaphonic all-interval series. Having fixed a frequency band, $\theta$, say, and an interval with first tone event $E_1$ and second event $E_2$, we consider the quotient
$$Q_\theta(E_1, E_2) = S_\theta(E_1)/S_\theta(E_2)$$
of the theta participations of the first and second tones. If $Q_\theta(E_1, E_2) > 1$ or $Q_\theta(E_1, E_2) < 1$, respectively, then theta participation lowers or increases, respectively, from the first to the second tone. In order to compare these ratios for consonances and dissonances, we take the quotient
$$Q_\theta(K/D) = S_\theta(K)/S_\theta(D)$$
of the mean value $S_\theta(K)$ of all values $Q_\theta(E_1, E_2)$ for consonances $(E_1, E_2)$ and the analogous mean value $S_\theta(D)$ for dissonances. This construction is repeated for all recording positions and all frequency bands, including $\theta$, $\alpha$, $\beta$; band $\delta$ is omitted since it may be affected by noise. This test was performed four times with patient C. J.-L. and six times with V.S. A one-sided Wilcoxon test shows significantly higher quotients for consonances compared to dissonances, i.e., $Q_?(K/D)$ is significantly larger than 1 for many recording positions and frequency bands, see figure 30.4, and observe the similarity of distribution of these quotients for our patients with respect to homologous recordings and frequency bands. This means that for consonances, participation lowers more when the second tone appears than for dissonances.
30.2.3.4
Polarity
To test the response to the (K/D)-polarity, we confronted each consonant interval X with all the dissonant intervals Y, and we looked for particular responses in cases where Y was the interval which should correspond to X according to the polarity formula Y = 5X + 2, i.e., the polarity $e^2.5$. Here, we looked at simultaneous intervals. For each given consonance X, we played a sequence of six confrontations, i.e., immediate successions $(X, Y_1), (X, Y_2), \ldots, (X, Y_6)$ of X with each of the six dissonances $Y_1, Y_2, \ldots, Y_6$, see also figure 30.5. The duration of each interval was 0.68 seconds.
Figure 30.4: Graphical representation of the quotients $Q_\theta(K/D)$, $Q_\alpha(K/D)$, and $Q_\beta(K/D)$ for all locations and patients C. J.-L. and V.S. These numbers show that for consonances, participation lowers more when the second tone appears, compared to dissonances. The 1-level is indicated in the graphics; observe the places where this level is exceeded.
We looked for the least participation values among the dissonant intervals $Y_1, Y_2, \ldots, Y_6$, when confronted with a fixed consonance X. We then compared the effective hits to the a priori chance to hit the correct dissonance. This method was applied to every recording position and to the three above frequency bands. Figure 30.6 shows the numbers compared to 100%, the measure for a priori chance. Due to the small number of samples for this subtest, we did not apply any statistical test here. However, as figure 30.6 shows, the results are remarkable and similar for both patients, and we conclude that this pilot investigation strongly supports the presence of the (K/D)-polarity as a foundation of contrapuntal processes.
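The measures used in sections 30.2.3.2 to 30.2.3.4 can be sketched in a few lines. The code below is an illustration under stated assumptions, not the evaluation software of the project: band power is taken as the sum of spectral bins inside the band, total power as the sum of all bins, a "hit" means that the least participation value among the six dissonances occurs at the polar partner 5X + 2 of the consonance X (semitone numbering), and the comparison with a priori chance is expressed as a percentage of the chance level 1/6. All function names and the sample numbers in the tests are hypothetical.

```python
# Frequency bands in Hz as quoted in the text: delta, theta, alpha, beta.
BANDS = {"delta": (0, 4), "theta": (4, 8), "alpha": (8, 14), "beta": (14, 40)}


def band_powers(freqs, powers):
    """Return (power per band, total power) for a power spectrum given bin by bin."""
    per_band = {name: 0.0 for name in BANDS}
    for f, p in zip(freqs, powers):
        for name, (lo, hi) in BANDS.items():
            if lo <= f < hi:
                per_band[name] += p
    return per_band, float(sum(powers))


def participation_vector(total, per_band):
    """S(E) = (P, P/P_delta, P/P_theta, P/P_alpha, P/P_beta)."""
    order = ("delta", "theta", "alpha", "beta")
    return (total,) + tuple(total / per_band[b] for b in order)


def participation_value(total, per_band):
    """v(E) = P/P_theta + P/P_alpha + P/P_beta."""
    return sum(total / per_band[b] for b in ("theta", "alpha", "beta"))


def tone_quotient(s_first, s_second):
    """Q(E1, E2): participation of the first tone over that of the second."""
    return s_first / s_second


def class_quotient(consonance_quotients, dissonance_quotients):
    """Q(K/D): mean quotient for consonances over mean quotient for dissonances."""
    mean_k = sum(consonance_quotients) / len(consonance_quotients)
    mean_d = sum(dissonance_quotients) / len(dissonance_quotients)
    return mean_k / mean_d


def polar_dissonance(x):
    """Polar partner of the consonance x under e^2.5, i.e. 5*x + 2 modulo 12."""
    return (5 * x + 2) % 12


def hit_rate_percent_of_chance(trials, candidates=6):
    """Percentage of a priori chance; trials = [(consonance, {dissonance: value})]."""
    hits = sum(1 for x, values in trials
               if min(values, key=values.get) == polar_dissonance(x))
    return 100.0 * hits * candidates / len(trials)
```

A value above 100 returned by the last function corresponds to hitting the polar dissonance more often than chance would predict, which is the reading assumed here for the percentages of figure 30.6.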
30.2.4
Music and the Hippocampal Gate Function
Summary. The neurophysiological results, in particular their localizations in the emotional brain and the auditory cortex, are interpreted from the cognitive perspective. The gate function of the hippocampal formation suggests a key function of music in opening subconscious (preferably emotional) memory contents.
CHAPTER 30. INTERVAL DICHOTOMIES AS A CONTRAST
642
0 -> 1
0 3->1
®
4
6
2
3
11
3
4
0
5
0
\1
®
3
4
2
0
0
10
3
10
6
0
3
3
4
4
11
6
1
7 ->1
7
CD
10
a
9
5
2
a
@
9
5
Figure 30.5: The score of the confrontation test of each consonance with all dissonances; duration of a note in the test is 0.68 seconds. Music and emotions are intimately related, this is common knowledge. The above results suggest a mechanism that could explain this relation on the neurophysiological and cognitive level. We have seen that the emotional brain in its hippocampal structures has a pronounced response to elementary structures of harmony and counterpoints: the intervals in their consonance-dissonance dichotomy, and this is so independently of any sound color physics. Now, the classical thesis of Papez and MacLean [315] states that the limbic system, a prominent part of the archicortex, is responsible for emotional human behavior, this is why it is also called the emotional brain. So the hippocampal sensitivity to consonances vs. dissonances could relate to the emotional function of music, i.e., of musical intervals in our case. The question is, how musical signs which are by no means emotions by themselves (although Schonberg and other prominent music experts constantly evoke the notes' emotional and erotic life) can evoke and signify emotions in humans, and why this is done in such a way that the same music may evoke a great variety of such reactions and significations. Evidently these outputs are the result of a determined sample of music plus an individual human ingredient. The point is that the hippocampal formation has been recognized as a key structure for memory [501]. The neuroscientist Jonathan Winson has proposed a more specific theory of the hippocampal memory function [579], in that he argues that the hippocampus performs a gate
30.2. THE CONSONANCE AND DISSONANCE DICHOTOMY
Figure 30.6: Graphical representation of hitting frequencies for the polarity subtest for patients C. J.-L. and V.S., three homologous locations as well as $\theta$, $\alpha$, $\beta$ frequency bands. The frequent and pronounced values above 100% show that for both patients there was a strongly affirmative EEG response to the test of the correct values for the polarity. In addition, the topographic/spectral distribution of values for the $\alpha$ and $\beta$ bands is comparable for these patients.

function to the subconscious (he even evokes Freud's "Unbewusstes"), i.e., to memory contents of emotional character. This means that the hippocampus is a structure that plays the role of a gateway to hidden memory contents. It is well known that humans do not have free or controlled access to their memory contents, in particular not on the level of long-term and emotional memory, concerning early childhood, for example. This suggests that special mechanisms must be activated in order to open the hippocampal gate and unveil locked memory contents. It is straightforward from our neurophysiological findings and the gate function of the hippocampus that its musical stimulation could yield such a "key" to open the gate to hidden memory contents. If this were the case, two specifics of the relation of music and emotion would be explained at once: (1) The emotional contents are not generated by music, they are merely retrieved and evoked from a memory database, whence the individual emotional response to one and the same music would receive a logical explanation. (2) The musical stimulation of the hippocampus is very probably not independent of the human individual who undergoes this process; in other words: If the music is a key, each individual is likely to have his/her individual key to the "subconscious". This would explain why there are so many different musical tastes, beyond musical education and culture. This would also
explain why it is often a specific tune or musical mood that is the personal preference: If this tune played a role in the encoding of a specific emotional memory content, the same key-tune could play a role in the decoding process. Summarizing, we have this thesis:
Thesis 5 Consonant and dissonant intervals and associated harmonic or contrapuntal structures evoke a hippocampus state/process which activates a gateway to mainly subconscious memory contents. In other words, Winson's gate hypothesis of the hippocampal formation must also be stated in the sense of the existence of a musicogenic key to the gate. This thesis does not mean that music produces emotions, it only retrieves and reactivates them from a memory database. So it acts on the brain like a drug and produces psychic effects. In this metaphor, the 'chemical formula' of the music drug corresponds to the involved musical structure.
Chapter 31
Modeling Counterpoint by Local Symmetries

Der Rangunterschied zwischen den perfekten und den imperfekten Konsonanzen ermöglichte die Formulierung genereller Konsonanzfolgeregeln für eine indeterminata positio. [The difference in rank between the perfect and the imperfect consonances made it possible to formulate general rules of consonance succession for an indeterminata positio.]
Klaus-Jürgen Sachs [468, p. 114]

Summary. This chapter presents the counterpoint model in the form of a counterpoint theorem which guarantees the existence and exhibits an arsenal of admitted contrapuntal steps that come extremely close to the rules of classical counterpoint. The theorem is based on the concept of a contrapuntal symmetry and follows the paradigm of local symmetries as a rationale for forces in physics. Because of its generic concept framework, the theorem, which in this general form$^1$ was proved by Jens Hichert [223], is also valid for non-European scales. We discuss these extensions.
31.1
Deformations of the Strong Dichotomies by Contrapuntal Symmetries on $\mathrm{IntMod}_{12,q}[\epsilon]$
Summary. In the core theory of counterpoint [468], the concept of contrapuntal "tension" between successive, perfect and imperfect consonant intervals plays a crucial role. This idea is made precise in the framework of contrapuntal symmetries which deform the strong dichotomy. The separation property of contrapuntal symmetries is proven.
$^1$The original theorem is presented in [336, 340] and deals with the classical consonances and dissonances, when applied to European diatonic scales.
645 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
CHAPTER 31. MODELING COUNTERPOINT BY LOCAL SYMMETRIES
The following counterpoint theorem (section 31.3) is concerned with the elementary and core situation of classical counterpoint: "note-against-note". It is a theory which describes a system of rules for "allowed" sequences $(x_s + \epsilon.i_s, a_s)_s$ of oriented intervals, a system which essentially boils down to a set of rules for allowed successor pairings $(x_s + \epsilon.i_s, a_s)$, $(x_{s+1} + \epsilon.i_{s+1}, a_{s+1})$ within such contrapuntal sequences. We shall also restrict our investigation to sequences of constant sweeping orientation $a_s = a_+ = \text{const.}$; changes in the orientation are handled as described above in 29.6. The first elementary rule of counterpoint "note-against-note" says that we are not allowed to take other intervals than the consonances $\xi \in K[\epsilon]$. This seems evident, but it imposes a strong obstruction against another, more hidden directive: the idea of creating a tension between each interval and its successor. More precisely, the meaning of "contra" is not only that of a vertical opposition between cantus firmus and discantus. As Sachs has remarked in [468], the preposition "contra" equally means a horizontal opposition between successive intervals in the given sequence. This requirement is not very explicit, but it is reflected in the distinction between perfect consonances (prime, fifth, octave) and the others, the imperfect sixths and thirds, and in the idea of changing between perfect and imperfect consonances in order to create tension. This conceptual distinction seems to evoke a dissonant ingredient in the consonant character; although it does not really abolish the consonance, it is a kind of coloring effect. So the idea of contrapuntal tension is in some sense a contradictory requirement against the primordial rule of forbidden dissonances: We should like to behave as if there were dissonances within consonances and to create a tensed movement from consonances to dissonances and vice versa.
In order to solve this requirement in our mathematical remake of the contrapuntal rules, we introduce this technique: Given a symmetry $g$ of $\mathrm{IntMod}_{12,q}[\epsilon]$, we may apply $g$ to the consonance-dissonance dichotomy $(K[\epsilon]/D[\epsilon])$ and "deform" it to the dichotomy $g(K[\epsilon]/D[\epsilon])$. In general, the deformed dichotomy will have its parts in such a position that some real consonances $\xi$ are also $g$-deformed consonances, i.e., $\xi \in g(K[\epsilon])$, and some are $g$-deformed dissonances, i.e., $\xi \in g(D[\epsilon])$. This implies that we may restate the directive of creating contrapuntal tension in the sense that for a given pair of successive consonances $\xi, \eta$, the first one is a $g$-deformed dissonance while the other is a $g$-deformed consonance (or vice versa) for a determined symmetry $g$, in which case we say that the (unordered) pair $\xi, \eta$ is $g$-polarized. We shall see below in section 31.2 that the symmetries which we shall exhibit for this role are indeed local symmetries, which in physics are responsible for the creation of forces, i.e., deformational tension. Of course it is not evident that there is always a symmetry $g$ that polarizes two consonances $\xi, \eta$ in the above sense. Let us first discuss this topic. It shows that the strong dichotomies are exactly what is needed to guarantee this polarization property.
Proposition 52 Let $(X[\epsilon]/Y[\epsilon])$ be a strong dichotomy and let $\xi, \eta$ be two different intervals. Then there is a symmetry $g$ such that the pair $\xi, \eta$ is $g$-polarized.

Proof. If $\xi, \eta$ lie in different halves of $(X[\epsilon]/Y[\epsilon])$, then $g = Id$ does the job. So we may suppose that both $\xi$ and $\eta$ lie in one half, say in $X$; the other case is settled by a transformation of the pair of intervals to a pair within $X$ via the polarity of $(X[\epsilon]/Y[\epsilon])$. Let $\xi = i + \epsilon.j$, $\eta = k + \epsilon.t$ with $j, t \in X$. The symmetry is $g = e^{l+\epsilon.m}(n + \epsilon.o)$, $n^2 = 1$. Then we have these applications:

$$g.\xi = l + ni + \epsilon.(m + jn + oi), \qquad g.\eta = l + nk + \epsilon.(m + tn + ok).$$

We want the coefficients of $\epsilon$ to stay in $X$ and $Y$, respectively. If we try with $o = 0$, this means that $e^m.n(j)$ and $e^m.n(t)$ stay in different halves. Suppose that both are always in the same half. Try further $n = 1$. Then we have $j + m, t + m \in X$ or $j + m, t + m \in Y$ for all $m \in \mathbb{Z}_{12}$. Take any $a \in X$ and set $m = a - j$. Then $j + m = a$, and $t + m = (t - j) + j + m = (t - j) + a$. So adding the difference $t - j$ to any $a \in X$ is again an element of $X$. But then the symmetry $e^{t-j}$ is a non-trivial automorphism of $X$, contradicting the rigidity of $X$, and we are done. QED.

We shall give another proof of this fact below. Let us now formulate the properties of symmetries which we want to use for the deformations for a counterpoint rule set:
Definition 95 Let $\Delta[\epsilon] = (X[\epsilon]/Y[\epsilon])$ be a strong dichotomy. Let $\xi = x + \epsilon.i \in X[\epsilon]$. A symmetry $g$ is contrapuntal for $\xi$ iff

(i) $\xi \notin g(X[\epsilon])$,
(ii) $p^x_\Delta$ is a polarity of $g.\Delta[\epsilon]$,
(iii) the cardinality of $g(X[\epsilon]) \cap X[\epsilon]$ is maximal among those $g$ which have properties (i) and (ii).

The reason for these requirements is this. We have seen that every pair of intervals can be polarized by a specific symmetry. But we are interested in a rule set which guarantees more than the mere possibility of separation. We want to have those symmetries which admit a maximal number of polarized couples starting from a fixed interval. The second condition is introduced in order to relate the polarizing symmetry $g$ to the given polarity $p^x_\Delta$ of the dichotomy at the cantus firmus $x$. It can be shown [333] that for $x = 0$ this property is also equivalent to the commutativity condition $g.p^0_\Delta = p^0_\Delta.g$, so it is a generalized commutativity condition.
Definition 96 If a strong dichotomy $\Delta[\epsilon] = (X[\epsilon]/Y[\epsilon])$ and an interval $\xi \in X[\epsilon]$ are given, an interval $\eta$ is called an admitted successor of $\xi$ if it is contained in an intersection $g(X[\epsilon]) \cap X[\epsilon]$ for a contrapuntal symmetry $g$ for $\xi$.

31.2
Contrapuntal Symmetries Are Local
Summary. A closer look at contrapuntal symmetries shows their local character. In complete analogy with modern physics, local symmetries produce the looked-for forces of contrapuntal tension. Thus, the melodic variation of the cantus firmus in counterpoint is perfectly interpreted as a deformation caused by the "forces" from local symmetries.
Before we deal with the counterpoint theorem, we should explain the local character of contrapuntal symmetries. We shall consider the example of classical consonances and dissonances, i.e., the dichotomy $\Delta[\epsilon] = (K[\epsilon]/D[\epsilon])$. We compare two interpretations of the local composition of all zero-addressed intervals $I = 0@\mathrm{IntMod}_{12,q}[\epsilon]$. The first interpretation is induced by $\Delta$ and has the atlas $I_x$, $K_x = x + \epsilon.K$, $D_x = x + \epsilon.D$, $x \in \mathbb{Z}_{12}$.
The second interpretation is the analogous construction built on the deformed dichotomy $g.\Delta[\epsilon]$, i.e., its atlas is $I_x$, $gK_x = I_x \cap g.K[\epsilon]$, $gD_x = I_x \cap g.D[\epsilon]$, $x \in \mathbb{Z}_{12}$. Although the autocomplementary function $p^x_\Delta$ for the cantus firmus $x$ is also a polarity of the $g$-deformed dichotomy $g.\Delta[\epsilon]$ according to definition 95, its action on the two interpretations is qualitatively different (see also figure 31.1).
Figure 31.1: The autocomplementary symmetry on the deformed consonance-dissonance dichotomy is a local symmetry, whereas it is a global one on the original dichotomy. This fact resembles physical forces being induced by local symmetries.
Consider the example $g = e^{\epsilon.8}(5 + \epsilon.4)$ for the consonance $\xi = \epsilon.9$, i.e., at cantus firmus $x = 0$. We have

$$g.K[\epsilon] = (1 - \epsilon.4)\mathbb{Z}_{12} + \epsilon.e^{8}.5K.$$

For a cantus-firmus point $w$, this means

$$gK_w = w + \epsilon.(5K + 8 - 4w).$$

But under $p^0_\Delta$, $w$ is transported to $5w$, and we have

$$gK_{5w} = 5w + \epsilon.(5K + 8 + 4w).$$

So we recognize the following: On the first interpretation, $p^0_\Delta$ acts on $K_w$ via translation on the cantus firmus, $K_{5w} = e^{4w}(K_w)$, followed by the autocomplementarity symmetry on $I_{5w}$. But on the second interpretation, $p^0_\Delta$ does not operate on $gK_w$ by a translation plus autocomplementarity symmetry on $I_{5w}$; in fact, $gK_{5w}$ is different from $e^{4w}(gK_w)$! This can be understood in the sense that $p^0_\Delta$ acts on the second interpretation via $g$-deformation in the spirit of physics, i.e., as a local symmetry, instead of a global symmetry as is the case for the first interpretation. The latter can be viewed as a 'spiral turn': 'rotation' (= autocomplementarity symmetry on each chart $I_w$) plus translation ($K_{5w} = e^{4w}(K_w)$) 'along the rotation axis'. The former, however, does not shift the part $gK_w$ of $I_w$ to $e^{4w}(gK_w)$, but deforms it to

$$gK_{5w} = e^{\epsilon.8w}.e^{4w}(gK_w)$$

by the factor $\epsilon.8w$. This phenomenon is analogous to physics in the sense that forces in physics are induced by local symmetries [162]. In this understanding, local symmetries on the interval space seem to be responsible for the tension which controls the progression from interval to interval in our model.
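The fibre formulas of this example can be checked numerically. The following Python sketch is only an illustration: it assumes the consonance quantities $K = \{0, 3, 4, 7, 8, 9\}$ and represents an interval $x + \epsilon.y$ as the pair (x, y); the function names are hypothetical and not taken from the text.

```python
N = 12
K = {0, 3, 4, 7, 8, 9}  # assumed consonance quantities

def g(x, y):
    """Apply g = e^{eps.8}(5 + eps.4) to the interval x + eps.y over Z_12."""
    return (5 * x) % N, (5 * y + 4 * x + 8) % N

def deformed_fibre(w):
    """Interval quantities of gK_w, the part of g.K[eps] above cantus firmus w."""
    return {b for x in range(N) for k in K for a, b in [g(x, k)] if a == w}

def shift(values, m):
    """Translate a set of interval quantities by m modulo 12."""
    return {(v + m) % N for v in values}

for w in range(N):
    # gK_w = w + eps.(5K + 8 - 4w)
    assert deformed_fibre(w) == {(5 * k + 8 - 4 * w) % N for k in K}
    # gK_{5w} differs from the plain translate of gK_w by the factor eps.8w
    assert deformed_fibre(5 * w % N) == shift(deformed_fibre(w), 8 * w)
```

For $w = 1$ the two fibres $gK_1$ and $gK_5$ have different interval quantities, which is the deformation described above; for $w = 0$ the quantity 9 is absent from $gK_0$, in accordance with condition (i) of definition 95.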
31.3
The Counterpoint Theorem
Summary. This section presents the counterpoint theorem and its corollaries.
In this section, we prove the general counterpoint theorem. "General" means that we deal with all strong dichotomies and construct the lists of admitted interval successors for these cases, including the classical case as a special item. The general theorem was proved in Jens Hichert's thesis [223] and sheds a new light on the general problem of what counterpoint is about. We shall discuss the special case of the consonance-dissonance dichotomy in section 31.4.1.
Remark 17 In order to cope with Hichert's calculations and tables we shall use the interval space relative to the identity identifier, and not to the fifth identifier. In particular, the consonance quantities are $\{0, 3, 4, 7, 8, 9\}$ in this section.
31.3.1
Some Preliminary Calculations
Summary. We prove technical lemmata for the exhibition of Hichert's algorithm to be introduced in section 31.3.3.
We fix a marked dichotomy $\Delta = (X/Y)$. Let us first come back to the rigidity property of $X[\epsilon]$:

Lemma 44 The symmetry group of $X[\epsilon]$ is $Sym(X[\epsilon]) = e^{\mathbb{Z}_{12}}$.

Proof. Clearly $e^{\mathbb{Z}_{12}} \subset Sym(X[\epsilon])$. Therefore, $e^{z+\epsilon.t}(u + \epsilon.v)X[\epsilon] = X[\epsilon]$ iff $e^{\epsilon.t}(u + \epsilon.v)X[\epsilon] = X[\epsilon]$. This implies $t + vz + uk \in X$ for all $z \in \mathbb{Z}_{12}$, $k \in X$. Whence $t = 0$, $u = 1$ since $X$ is rigid. Therefore, for $z = 1$, we obtain $v + k \in X$ for all $k \in X$, i.e., $v = 0$. QED.
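For a concrete dichotomy, lemma 44 can be confirmed by exhaustive search. The sketch below is an illustration under the assumption $X = K = \{0, 3, 4, 7, 8, 9\}$ (the consonance quantities of remark 17); it enumerates all $e^{\epsilon.t}(u + \epsilon.v)$ and keeps those which map $X[\epsilon]$ into itself. The cantus-firmus translation $e^z$ is left out since it never changes the $\epsilon$-coefficient.

```python
from itertools import product

N = 12
UNITS = (1, 5, 7, 11)                 # invertible elements of Z_12
X = frozenset({0, 3, 4, 7, 8, 9})     # assumed representative: consonance quantities

def fixes_X_eps(t, u, v):
    """True iff e^{eps.t}(u + eps.v) maps X[eps] into (hence onto) itself.

    The symmetry sends a + eps.b to u.a + eps.(u.b + v.a + t).
    """
    return all((u * b + v * a + t) % N in X for a in range(N) for b in X)

stabilizer = [(t, u, v)
              for t, u, v in product(range(N), UNITS, range(N))
              if fixes_X_eps(t, u, v)]
print(stabilizer)
```

Only the identity coefficients $t = 0$, $u = 1$, $v = 0$ survive, so the full symmetry group consists of the twelve translations $e^z$, as the lemma states.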
Proposition 53 Let $H = e^{\epsilon.\mathbb{Z}_{12}}.GL(\mathbb{Z}_{12}[\epsilon])$. Then $X[\epsilon]$ is $H$-rigid, i.e., the orbit application

$$H \to \mathrm{ObLoc}_{\mathbb{Z}_{12}[\epsilon]} : g \mapsto g.X[\epsilon]$$

is injective, i.e., a cadence of the group $H$.

Exercise 69 Show that each consonant part $X_x = x + \epsilon.X$ of the tangent space $I_x$ is rigid.
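Lemma 45 With the notation of proposition 53, if $g = e^{\epsilon.t}(u + \epsilon.v) \in H$, and if $z \in \mathbb{Z}_{12}$, we set $g^{(z)} = g.e^{-\epsilon.vz}$. Then we have

(i) $(g^{(z_1)})^{(z_2)} = g^{(z_1+z_2)}$,
(ii) $e^z.g.X[\epsilon] = g^{(z)}.X[\epsilon]$.

Proof. The first formula is clear. The second follows from $e^z.g = e^{z+\epsilon.t}(u + \epsilon.v) = e^{\epsilon.t}(u + \epsilon.v).e^{z(u-\epsilon.v)} = g^{(z)}.e^{zu}$, since $e^{zu} \in Sym(X[\epsilon])$ by lemma 44. QED.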
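Corollary 20 For $g \in \overrightarrow{GL}(\mathbb{Z}_{12}[\epsilon])$, there is a symmetry $h \in H$ such that $g.X[\epsilon] = h.X[\epsilon]$.

In fact, there is a $u \in H$ such that $g = e^z.u$, so by lemma 45, $g.X[\epsilon] = u^{(z)}.X[\epsilon]$, and we have the solution $h = u^{(z)}$.

Lemma 46 Let $\xi = x + \epsilon.k$, $g \in \overrightarrow{GL}(\mathbb{Z}_{12}[\epsilon])$, and $z \in \mathbb{Z}_{12}$. Then, if $\xi \notin g.X[\epsilon]$ and $p^x_\Delta : g.X[\epsilon] \xrightarrow{\sim} g.Y[\epsilon]$, we also have $e^z\xi \notin e^z.g.X[\epsilon]$, $p^{z+x}_\Delta : e^z.g.X[\epsilon] \xrightarrow{\sim} e^z.g.Y[\epsilon]$, and $e^z.g.X[\epsilon] \cap X[\epsilon] = e^z.(g.X[\epsilon] \cap X[\epsilon])$, in particular $\mathrm{card}(g.X[\epsilon] \cap X[\epsilon]) = \mathrm{card}(e^z.g.X[\epsilon] \cap X[\epsilon])$.

Proof. It is clear that $e^z\xi \notin e^z.g.X[\epsilon]$, while by lemma 51,

$$p^{z+x}_\Delta.e^z.g.X[\epsilon] = e^z.p^x_\Delta.e^{-z}.e^z.g.X[\epsilon] = e^z.p^x_\Delta.g.X[\epsilon] = e^z.g.Y[\epsilon],$$

and finally, $e^z.g.X[\epsilon] \cap X[\epsilon] = e^z.g.X[\epsilon] \cap e^z.X[\epsilon] = e^z.(g.X[\epsilon] \cap X[\epsilon])$. QED.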
Proposition 54 The contrapuntal symmetries can be calculated if one knows the contrapuntal symmetries $g \in H$ at cantus firmus $x = 0$. More precisely, if $\xi = x + \epsilon.k \in X[\epsilon]$ and if $g$ is any symmetry such that properties (i) through (iii) of $g$ in definition 95 are true, then they are also true with unchanged set $g.X[\epsilon] = h.X[\epsilon]$ for a symmetry $h \in H$. Furthermore, to check this property for $h$, we may verify the properties (i) through (iii) for the interval $\epsilon.k$, the symmetry $h^{(-x)} \in H$, and the polarity $p^0_\Delta$. Finally, the intersection $h.X[\epsilon] \cap X[\epsilon]$ coincides with the translate $e^x(h^{(-x)}.X[\epsilon] \cap X[\epsilon])$, which means that we may just look for relative cantus firmus steps when building the rules of admitted steps.

Proof. The replacement of $g$ by $h$ follows from corollary 20. By lemma 45 we have $e^{-x}.h.X[\epsilon] = h^{(-x)}.X[\epsilon]$, and by lemma 46, with $z = -x$, we can verify the contrapuntality of $h$ on $e^{-x}\xi = \epsilon.k$, and on $p^{-x+x}_\Delta = p^0_\Delta$. The last statement follows from the translation formula for intersections in lemma 45. QED.
31.3.2
Two Lemmata on Cardinalities of Intersections
Lemma 47 Let $(X/Y)$ be a strong dichotomy in $0@\mathrm{IntMod}_{12,1}$, and $0 \leq i \leq 6$ an integer. Setting $G_i = \{g \in \overrightarrow{GL}(\mathbb{Z}_{12}) \mid \mathrm{card}(g.X \cap X) = i\}$, we have $\mathrm{card}(G_i) = \mathrm{card}(G_{6-i})$.

Proof. Let $p$ be the polarity of $(X/Y)$; the right multiplication with $p$ induces a permutation of order 2, $?p : \overrightarrow{GL}(\mathbb{Z}_{12}) \xrightarrow{\sim} \overrightarrow{GL}(\mathbb{Z}_{12})$. For $g \in G_i$, consider the dichotomy $g(X/Y)$ and its intersection with $X$. This gives $X = (g.X \cap X) \cup (g.p.X \cap X)$, whence $\mathrm{card}(g.p.X \cap X) = 6 - \mathrm{card}(g.X \cap X)$. Therefore, $g.p \in G_{6-i}$, and by the order two permutation $?p$ and the finiteness of all the involved sets, we have $G_i.p = G_{6-i}$. QED.

The next lemma basically guarantees the existence of admitted contrapuntal successor intervals, as we shall see in the next section.

Lemma 48 Let $K$ be a zero-addressed objective local composition in a finite cyclic group $M \xrightarrow{\sim} \mathbb{Z}_n$ and let $U \in GL(M)$. Then

$$\sum_{m \in M} \mathrm{card}(e^m.U(K) \cap K) = k^2.$$

Proof. Let $U(K) = \{u_1, \ldots, u_k\}$ be the image set with its $k = \mathrm{card}(K)$ elements. Let $\tau$ be a generator of $M$. Then

$$\sum_{m \in M} \mathrm{card}(e^m.U(K) \cap K) = \sum_{t=0}^{n-1} \mathrm{card}(e^{t\tau}.U(K) \cap K) = \sum_{t=0}^{n-1} \sum_{s=1}^{k} \mathrm{card}(\{e^{t\tau}u_s\} \cap K) = \sum_{t=0}^{n-1} \sum_{s=1}^{k} \chi_K(e^{t\tau}u_s) = \sum_{s=1}^{k} \Big( \sum_{t=0}^{n-1} \chi_K(e^{t\tau}u_s) \Big)$$

with the characteristic function $\chi_K$ for an element being in $K$ or not. Since $\tau$ is a generator of $M$, the expression $\sum_{t=0}^{n-1} \chi_K(e^{t\tau}u_s)$ adds up to $k$, and we have the result. QED.
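Both cardinality lemmata are easy to test on an example. The following sketch is an illustration, not part of the proofs: it assumes $M = \mathbb{Z}_{12}$ and the consonance quantities $K = \{0, 3, 4, 7, 8, 9\}$ as the zero-addressed composition, and the helper names are hypothetical.

```python
from collections import Counter

N = 12
UNITS = (1, 5, 7, 11)
K = frozenset({0, 3, 4, 7, 8, 9})  # assumed example composition, k = 6

def affine_image(t, u, subset):
    """Image of subset under the symmetry e^t.u of Z_12."""
    return frozenset((u * s + t) % N for s in subset)

# Lemma 48: for every linear U = u the intersection numbers sum to k^2 = 36.
lemma48 = {u: sum(len(affine_image(m, u, K) & K) for m in range(N)) for u in UNITS}

# Lemma 47: distribution of card(g.K & K) over all 48 symmetries g = e^t.u.
sizes = Counter(len(affine_image(t, u, K) & K) for t in range(N) for u in UNITS)

print(lemma48)
print(sorted(sizes.items()))
```

The second print lists, for each intersection number $i$, how many symmetries realize it; the counts for $i$ and $6 - i$ agree, as lemma 47 predicts for a strong dichotomy.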
31.3.3
An Algorithm for Exhibiting the Contrapuntal Symmetries
Summary. This section discusses Hichert's algorithm for the calculation of all contrapuntal symmetries and admitted contrapuntal steps by use of a specific software.
This section restates the three conditions (i) through (iii) of the contrapuntal symmetries in order to provide an algorithm for the calculation of all admitted contrapuntal interval successors. Given the strong dichotomy $\Delta = (X/Y)$, and according to proposition 54, we may start from an interval $\xi = \epsilon.k$, $k \in X$, and restrict to symmetries $g = e^{\epsilon.t}(u + \epsilon.uv) \in H$ ($uv$ instead of $v$ without restriction). Let us first reformulate conditions (i) and (ii). We have

$$g.X[\epsilon] = \bigcup_{x \in \mathbb{Z}_{12}} g.(x + \epsilon.X) = \bigcup_{x \in \mathbb{Z}_{12}} ux + \epsilon.(uvx + t) + \epsilon.uX = \bigcup_{y \in \mathbb{Z}_{12}} y + \epsilon.(vy + t) + \epsilon.uX = \bigcup_{y \in \mathbb{Z}_{12}} y + \epsilon.(e^{vy+t}u.X).$$

Setting $h(y) = e^{vy+t}u$, we have

$$g.X[\epsilon] = \bigcup_{y \in \mathbb{Z}_{12}} y + \epsilon.h(y).X.$$

Therefore, we have

$$g.X[\epsilon] \cap X[\epsilon] = \bigcup_{y \in \mathbb{Z}_{12}} y + \epsilon.(h(y).X \cap X).$$

This means that condition (i) is equivalent to $k \notin h(0).X$, which means $k \in h(0).p.X$, and this is equivalent to $\exists s \in X$ such that $k = h(0).p(s) = u.p(s) + t$. This is equivalent to the statement that there is an $s \in X$ such that

$$g.X[\epsilon] = \bigcup_{y \in \mathbb{Z}_{12}} y + \epsilon.(e^{vy+k-u.p(s)}u.X) \qquad (31.1)$$

so that we have

$$\mathrm{card}(g.X[\epsilon] \cap X[\epsilon]) = \sum_{y \in \mathbb{Z}_{12}} \mathrm{card}(e^{vy+k-u.p(s)}u.X \cap X). \qquad (31.2)$$

Further, condition (ii) means $p^0_\Delta.g = g.p^0_\Delta$, see the remarks after definition 95. If we have $p = e^r.w$, this means that

$$wt + r = ur + t. \qquad (31.3)$$

Therefore, conditions (i) and (ii) are equivalent to equations 31.1 and 31.3, whereas the maximal number is calculated upon formula 31.2. In order to calculate the number 31.2, we have to distinguish three cases concerning the values of $v$:
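1. $v$ is invertible. Then we have

$$\mathrm{card}(g.X[\epsilon] \cap X[\epsilon]) = \sum_{y \in \mathbb{Z}_{12}} \mathrm{card}(e^{y}.u.X \cap X). \qquad (31.4)$$

2. $v = 0$. This gives

$$\mathrm{card}(g.X[\epsilon] \cap X[\epsilon]) = 12\,\mathrm{card}(e^{k-u.p(s)}.u.X \cap X). \qquad (31.5)$$

3. $v = \pm\rho$, $\rho \in \{2, 3, 4, 6\}$. This gives

$$\mathrm{card}(g.X[\epsilon] \cap X[\epsilon]) = \rho \sum_{j=1}^{12/\rho} \mathrm{card}(e^{(j-1)\rho+k-u.p(s)}.u.X \cap X). \qquad (31.6)$$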
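The three cases can be compared with the direct sum 31.2 by brute force. In this hedged sketch the representative $X = \{0, 3, 4, 7, 8, 9\}$ is an assumption, the parameter t stands for the translation $k - u.p(s)$, and $\rho$ is read as $\gcd(v, 12)$, which covers $v = \pm 2, \pm 3, \pm 4, 6$.

```python
from math import gcd

N = 12
UNITS = (1, 5, 7, 11)
X = frozenset({0, 3, 4, 7, 8, 9})  # assumed representative

def f(u, m):
    """card(e^m.u.X & X)."""
    return len({(u * x + m) % N for x in X} & X)

def direct(u, t, v):
    """Formula 31.2 with t in place of k - u.p(s)."""
    return sum(f(u, v * y + t) for y in range(N))

def by_cases(u, t, v):
    """Formulas 31.4, 31.5, 31.6, selected by the value of v."""
    if v % N == 0:
        return N * f(u, t)                                   # 31.5
    rho = gcd(v, N)
    if rho == 1:
        return sum(f(u, y) for y in range(N))                # 31.4
    return rho * sum(f(u, (j - 1) * rho + t)                 # 31.6
                     for j in range(1, N // rho + 1))

assert all(direct(u, t, v) == by_cases(u, t, v)
           for u in UNITS for t in range(N) for v in range(N))
```

The agreement rests on the fact that $y \mapsto vy$ hits every multiple of $\rho$ in $\mathbb{Z}_{12}$ exactly $\rho$ times.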
If we recall lemma 48, the first case with formula 31.4 implies:

Fact 15 There are always at least 36 successors for a fixed given interval $\xi \in X[\epsilon]$.
We are now ready to state the algorithm which was implemented in Turbo Pascal by Hichert [223]. The algorithm starts from the fixed value $k \in X$ and first calculates all the possible coefficients of $g$. This means that we have to go through the loop which runs through all $g = e^{\epsilon.t}(u + \epsilon.uv)$ via $u \in \{1, 5, 7, 11\}$, $s \in X$, $v \in \mathbb{Z}_{12}$, $t = k - u.p(s)$, and then for each such value set calculate $\mathrm{card}(g.X[\epsilon] \cap X[\epsilon])$ according to the three cases 31.4, 31.5, 31.6. For each case, we update the set $C_k$ of intermediate candidates for contrapuntal symmetries, i.e., we add a new $g$ to the existing set if its intersection number is maximal among the already given candidates in $C_k$, and we remove all previous $g$ whose intersection cardinalities are smaller than the actual maximum.

Exercise 70 Write a C or Java program which implements the above algorithm. If you do not speak C or Java, write a Mathematica or Maple program. If you do not speak these languages either, interrupt reading this book, learn one of these languages, and proceed.

The complete lists of all contrapuntal symmetries, together with their intersection configurations, can be found in appendix 0.1. The admitted successors will be listed below. This information yields the following

Theorem 33 Let $\Delta = (X/Y)$ be a strong dichotomy, and let $\xi \in X[\epsilon]$. The number of admitted successors of $\xi$ is always at least equal to 36; in particular, there always exists a contrapuntal symmetry $g$ for $\xi$. An admitted successor of $\xi$ also always exists if one prescribes the cantus firmus of the successor interval. For each of the six strong dichotomy classes, a list of forbidden successor intervals is exposed below, after this theorem.
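The loop described before exercise 70 can be sketched in Python as follows. This is a minimal illustration and not Hichert's Turbo Pascal program: it assumes the consonance representative $X = \{0, 3, 4, 7, 8, 9\}$ of remark 17 with polarity $p = e^2.5$ (one checks $5X + 2 = Y$), it tests condition (ii) through equation 31.3, and it evaluates formula 31.2 directly instead of splitting into the three cases. All function names are hypothetical.

```python
from itertools import product

N = 12
UNITS = (1, 5, 7, 11)
X = frozenset({0, 3, 4, 7, 8, 9})   # assumed representative (consonance quantities)
R, W = 2, 5                         # assumed polarity p = e^R.W, i.e. p(s) = 5s + 2

def polarity(s):
    return (W * s + R) % N

def fibre(u, t, v, y):
    """h(y).X with h(y) = e^{vy+t}u, the part of g.X[eps] above cantus firmus y."""
    return frozenset((u * x + v * y + t) % N for x in X)

def contrapuntal_symmetries(k):
    """Return (maximal intersection number, set of (t, u, v)) for xi = eps.k."""
    best, winners = -1, set()
    for u, s, v in product(UNITS, sorted(X), range(N)):
        t = (k - u * polarity(s)) % N            # condition (i)
        if (W * t + R - u * R - t) % N != 0:     # condition (ii), equation 31.3
            continue
        size = sum(len(fibre(u, t, v, y) & X) for y in range(N))  # formula 31.2
        if size > best:                          # condition (iii): keep the maxima
            best, winners = size, {(t, u, v)}
        elif size == best:
            winners.add((t, u, v))
    return best, winners

def admitted_successors(k):
    """Pairs (d, l): successor d + eps.l admitted after eps.k (d = cantus firmus step)."""
    _, winners = contrapuntal_symmetries(k)
    return {(y, l) for t, u, v in winners
            for y in range(N) for l in fibre(u, t, v, y) & X}

best, winners = contrapuntal_symmetries(7)
print(best, winners)
```

Under these assumptions the fifth $k = 7$ has a single contrapuntal symmetry, the linear map $g = 7$, with intersection number 60; its admitted successors are all $d + \epsilon.l$ with $l \in \{0, 3, 4, 8, 9\}$, so the only forbidden quantity is $l = 7$, the parallel fifth.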
We shall say that an interval $\xi \in X[\epsilon]$ is a "cul-de-sac" under determined conditions if there is no admitted successor under these conditions. The following table is meant as follows: Each table section is related to a fixed representative $(X/Y)$ of the indicated class. The first column indicates the interval quantity $k$ of the "consonant" start interval $\xi = x + \epsilon.k \in X[\epsilon]$. The obstructions to the successor interval $\eta = y + \epsilon.l \in X[\epsilon]$ are visible in columns 2-13, where the difference $d = y - x = 0, 1, 2, \ldots, 11$ leads the column in the first row. For each couple $k, d$, we see the forbidden interval quantities $l$ of the target interval. For example, in class 64, the step $x + \epsilon.4 \mapsto x + 6 + \epsilon.l$ is forbidden for $l = 2, 4, 7, 9$ and admitted for all other $l$.

k | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11
Forbidden Successors for Dichotomy Class Nr. 64
2
2
4
2,4,7,9
5
5,11
7
2,4,7,9
9
9
11
11
2,4,7,9 5,11
5,11
2,4,7,9 5,11
2,4,7,9
5,11
2,4,7,9
2,4,7,9 5,11
2,4,7,9
5,11
5,11
2,4,7,9
5,11
2,4,7,9 5,11
2,4,7,9 9
11
11
11
11
11
11
11
11
5,11
5,11
2,4,7,9 9 11
11
0
0
11
Forbidden Successors for Dichotomy Class Nr. 68
0
0
0
2,8
2,8
1,3,5
0
0
2,8
2,8
0 2,8
0 2,8
1,3,5
1,3,5
2,8
2,8
2,8
1,3,5
5
2,8
1,3,5
5
8
8
8
8
2,8 5
8
8
8
Forbidden Successors for Dichotomy Class Nr. 71
0,3
0,3
0,3
1,2,7
1,2,7
1,2,7
1,2,7
1,2,7
1,2,7
2,3
2,3
2,3
2,3
2,3
2,3
2,3
2,3
2,3
2,3
2,3
3,6
3,6
3,6
1,2,7
1,2,7
3,6
3
3
1,2,7
1,2,7
3
0,3
3
0,3
3
3,6
3
0,3
2,3 3
1,2,7
3,6 1,2,7
Forbidden Successors for Dichotomy Class Nr. 75
0
0 0,1,4,5
0,1,4,5
2,5 4
0,1,4,5
2,5 0,1,4,5
0,1,4,5 2,5
8
0,1,4,5
2,8
0,1,4,5
2,8
0,1,4,5
2,5 0,1,4,5
2,5
0,1,4,5
2,5
2,8
2
0,1,4,5
2,5
0,1,4,5
2,5
2
2,8
Forbidden Successors for Dichotomy Class Nr. 78
0
4
0,2,6
0,2,6
0,2,6
0,2,6
0,2,6
0,2,6
1,2
1,2
1,2
1,2
1,2
1,2
1,2
1,2
1,2
1,2
1,2
1,4
1,4
0,2,6 10
0,2,6
1,10
1,4 0,2,6
1,2 1,4
0,2,6
1,4
0,2,6
0,2,6
1,10
1,4
Forbidden Successors for Dichotomy Class Nr. 82 0
0
3,9 4
3,9
0,4,8
7 8
8
9
3,9
0,4,8 7
7
3,9
7
7
3,9
0,4,8
0,4,8 7
0,4,8 7
0,4,8 7
8 3,9
3,9
2,8
1,3,5
3,9
7
8
31.3.4
Transfer of the Counterpoint Rules to General Representatives of Strong Dichotomies
Summary. The above counterpoint theorem was made explicit for one selected representative of a strong dichotomy. Following [223], we give rules to transfer these results and tables to arbitrary representatives of strong dichotomies.
Let $\Delta = (X/Y)$ be a strong dichotomy with polarity $p_\Delta$. Take an interval $\xi = \epsilon.k \in X[\epsilon]$, and let $\mathfrak{R} \subset H$ be the set of contrapuntal symmetries of $\Delta$ and $\xi$. Take any symmetry $g = e^t u \in \overrightarrow{GL}(\mathbb{Z}_{12})$, set $g_0 = e^{\epsilon.t}u$, and consider the transformed dichotomy $(L/M) = g.\Delta$. We have $g_0.X[\epsilon] = L[\epsilon]$, $g_0.Y[\epsilon] = M[\epsilon]$.

Proposition 55 [223, Satz 3.2] With the above notation, the conjugate set $\mathfrak{R}^g = g_0.\mathfrak{R}.g_0^{-1}$ is exactly the set of contrapuntal symmetries of $g.\Delta$ and $\xi^g = g_0\xi$. The numbers of admitted successors of $\xi$ and of $\xi^g$ coincide.
Attention: In general, the fact that $\eta$ is a successor of $\xi$ does not imply that $g_0\eta$ is a successor of $g_0\xi$! This result can be used to transform the tables of forbidden successors, as given in the table after the counterpoint theorem 33, for other representatives of the dichotomy classes. To this end, suppose that in such a table (for the dichotomy representative $\Delta = (X/Y)$), the couple $\xi = \epsilon.k$, $\eta = b + \epsilon.j$ is forbidden. This means that in our table, at the location of the row with interval quantity $k$ and the column with head number $b$, the coefficient $j$ appears as a forbidden quantity. According to proposition 55, in the transformed table, the row $g.k$ and the column $g.b$ must show a forbidden quantity $g.j$. So we have a recipe for transforming a table for $\Delta$ under a symmetry $g = e^t u$:

1. Permute the 12 columns of the given table by a multiplication of the column head numbers $0, 1, 2, 3, \ldots, 11$ by $u$ (mod 12) and rearrange the new numbers by increasing values.
2. Replace the leading column interval numbers $k$ by $g.k$ and rearrange the corresponding 6 rows by increasing values of the leading numbers.
3. Replace each forbidden item $j$ by the item $g.j$.
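The three steps of this recipe can be sketched as a small Python function. The data layout is an assumption made for illustration only: a table is stored as a dictionary that maps a pair (k, d) of row quantity and column head number to the set of forbidden quantities; the sorting of rows and columns is then a matter of display.

```python
def transform_table(table, t, u, n=12):
    """Transform a forbidden-successor table under g = e^t.u (u invertible mod n).

    table maps (k, d) -> set of forbidden quantities j, where k is the start
    quantity and d the cantus firmus difference heading the column.
    """
    def act(q):
        return (u * q + t) % n                    # g.q = u.q + t

    return {(act(k), (u * d) % n): {act(j) for j in forbidden}   # steps 2, 1, 3
            for (k, d), forbidden in table.items()}

# Toy fragment (hypothetical layout): fifth parallels forbidden in every column.
fragment = {(7, d): {7} for d in range(12)}
print(sorted(transform_table(fragment, 2, 5).items())[:3])
```

Since $u^2 = 1$ in $\mathbb{Z}_{12}$, applying the function again with the inverse symmetry $e^{-ut}u$ restores the original table, which is a convenient sanity check for hand-made transformations.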
31.4
The Classical Case: Consonances and Dissonances
Summary. This section deals with the classical case of the counterpoint theorem for the consonance-dissonance dichotomy of Palestrina-Fux Theory.
We give a specialized counterpoint theorem for the consonance-dissonance dichotomy, the sweeping orientation, and relating to ecclesiastical modes as defined in section 13.4.2.
Theorem 34 Let $\Delta = (K/D)$ be the consonance-dissonance dichotomy, and let $\xi \in K[\epsilon]$. The number of admitted successors of $\xi$ is always at least equal to 36; in particular, there always exists a contrapuntal symmetry $g$ for $\xi$. An admitted successor of $\xi$ also always exists if one prescribes the cantus firmus of the successor interval. If one restricts the admitted pitch classes to an ecclesiastical mode (see section 13.4.2), then an admitted successor of $\xi$ always exists, even if the cantus firmus is prescribed. So, under these conditions, there is no cul-de-sac. Parallels of fifth ($x + \epsilon.7 \mapsto y + \epsilon.7$) are generally forbidden. For all other parallels, no general obstruction exists. The admitted relative progressions are listed in the table after theorem 33, Class Nr. 82, whereas the progressions for the C-major scale are listed in appendix 0.2.
We shall discuss below the relation between strong dichotomies and scales, in particular the appearance of culs-de-sac.
31.4.1
Discussion of the Counterpoint Theorem in the Light of Reduced Strict Style
Summary. The concluding section gives an overview of the (strong) commonalities with and (weak) diversities from the classical rules of the Palestrina-Fux system and its reduction to pitch classes.
Figure 31.2: Out of the 287 possible progressions within a mode (modulo translations of the cantus firmus), the reduced strict style exhibits 54 inadmissible cases. According to the mathematical model of counterpoint, 37 progressions are not admitted. Out of these, 21 cases are inadmissible in Fux' sense. Intuitively, the commonalities of the two approaches can be described by use of a probabilistic argument: If somebody tries to hit at least 21 of the 54 inadmissible cases of the reduced strict style without knowing anything about counterpoint by 37 trials, the chance is less than $2 \cdot 10^{-8}$.

In this discussion, we refer to the codification of the strict style by Fux [174] (see also [528]) since its explicit and rigorous rule system is particularly useful for a qualitative and quantitative comparison of the mathematical model with the classical counterpoint rules. In order to establish a basis of comparison for the mathematical model, we first have to transfer the Fux rules to the interval space $\mathrm{IntMod}_{12,1}$. This yields a model of Fux' rules 'modulo octave', a rule system which we call the reduced strict style. The upshot of a detailed investigation [342]$^2$ states this:

Fact 16 In the reduced strict style, only the rule of forbidden fifth parallels and the tritone rules have an unrestricted validity.

Within an ecclesiastical mode, there are 287 a priori possible progressions [342]. According to the consonance-dissonance counterpoint theorem 34, 37 of them are forbidden. Among them, 21 coincide with the 54 Fux-inadmissible progressions. The remaining 16 forbidden progressions of the mathematical model deal with progressions which are bad or allowed by Fux. Out of the ten allowed ones, four concern tritone movements of the cantus firmus. Three of the remaining obstructions (in the mathematical model) concern progressions from the major third into the prime, from which one may be led to an "ottava battuta". The remaining three obstructions deal with progressions which leave the pitch material unaltered, see also figure 31.2. We should stress that the mathematical model does not formalize a switch between perfect and imperfect consonances. The concrete shape of a polarization which induces a determined progression is redefined for each individual progression. Further, the model can be applied to any scale. This unveils an interesting fact concerning the dominant role of the major scale: In the analysis for the three seven-element scale classes which consist exclusively of minor and major second steps (classes 38.1: major, 47.1: melodic minor, and 62: whole-tone scale, extended by one pitch class) the major scale is by far optimal for the degree of freedom in the choice of admitted successors. There is no cul-de-sac. Only for two progressions with prescribed start interval and cantus firmus progression is the successor uniquely determined. The melodic minor scale has less successor freedom, but there are no culs-de-sac.
In 16 cases there is only one possible solution. In the extended whole-tone scale, there are 18 culs-de-sac. The freedom of choice is minimal. We shall discuss this item in more generality in section 31.4.2.
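The probabilistic argument quoted in the caption of figure 31.2 can be made explicit under one possible reading, which is an assumption here since the text does not specify the model: the 37 trials are taken as 37 distinct progressions drawn uniformly from the 287, so that the number of hits among the 54 inadmissible cases is hypergeometrically distributed.

```python
from math import comb

def hypergeom_tail(population, marked, draws, hits):
    """P(at least `hits` marked items among `draws` draws without replacement)."""
    total = comb(population, draws)
    favourable = sum(comb(marked, h) * comb(population - marked, draws - h)
                     for h in range(hits, min(marked, draws) + 1))
    return favourable / total

def hypergeom_point(population, marked, draws, hits):
    """P(exactly `hits` marked items among `draws` draws without replacement)."""
    return (comb(marked, hits) * comb(population - marked, draws - hits)
            / comb(population, draws))

p_at_least = hypergeom_tail(287, 54, 37, 21)
p_exactly = hypergeom_point(287, 54, 37, 21)
print(f"P(at least 21 hits) = {p_at_least:.3e}")
print(f"P(exactly 21 hits)  = {p_exactly:.3e}")
```

Both values are of the order of $10^{-8}$; the precise figure depends on whether "at least 21" or "exactly 21" hits are counted and on the sampling model, so the sketch should be read as a plausibility check of the order of magnitude rather than as a reconstruction of the original computation.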
31.4.2
The Major Dichotomy: A Cultural Antipode?
Summary. Among the six strong dichotomy classes, we look for others than the Fux dichotomy and discuss their relative positions. Among the possible alternatives we especially focus on the major dichotomy, a topological antipode to the Fux dichotomy. The possibility to associate the major dichotomy with classical Indian scales is discussed.
In his thesis [223], Hichert has observed a number of interesting topological properties of the six strong dichotomies. Here is a representative record of these observations: The major and consonance-dissonance dichotomies are not only polar with respect to diameter (definition 93 in section 30.1) and span (definition 94 in section 30.1), see figure 30.1. They are also polar with regard to their number of contrapuntal symmetries and interdictions, see figure 31.3. In other words: 2The paper [342] was accepted for publication in the journal Musiktheorie, but never published for marketing reasons (!), an anecdote about German musicology which shares a particular flavor.
CHAPTER 31. MODELING COUNTERPOINT BY LOCAL SYMMETRIES
[Figure 31.3 graphic not reproduced; axis labels: contrapuntal symmetries, interdictions.]
Figure 31.3: From the proof of the counterpoint theorem, one deduces the numbers of contrapuntal symmetries and interdictions. These again position the major and consonance-dissonance dichotomies in a polar relation.
Fact 17 The consonance-dissonance dichotomy has a maximum of contrapuntal symmetries and a minimum of interdictions, as opposed to the polar major dichotomy.
We have already observed in section 31.4.1 that, among the 'diatonic' scales (only minor and major second steps), the major scale has a maximum of freedom of choice for the consonance-dissonance dichotomy. Conversely, the major scale has only culs-de-sac for the major dichotomy. So we are also interested in the scales with seven tones with respect to the major dichotomy! It is a further fact that no European scale appears among the seven-tone scales with no cul-de-sac for the major dichotomy. There is one such scale, namely $K^* = \{0, 3, 4, 7, 8, 9, 11\}$ of class 60, which is very interesting. To begin with, it is the consonance set $K$ plus an added 'leading' note 11. So if the major scale is good for consonances and dissonances, the major dichotomy is good for a scale which is intimately related to consonances! This scale really shows a character which relates to Indian raga music. The basic melakarta framework for ragas is known [103, Bd.8, p.265ff] to be built by the 72 melas. Only a small number of these mela scales, seven of them, have remained in use to the present day. One of them, "mayamalavagaula", Nr.15 = {c, d♭, e, f, g, a♭, h}, which is class 61 in our chord classification, can be represented by $\{0, 3, 4, 7, 8, 9, 1\}$. But this is very similar to the above scale $K^*$, we only have to switch the 11 to the 1, as opposed to 11 in 0.
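The relation between mela Nr.15 and $K^*$ stated above can be checked mechanically. The following is only a small sketch: the pitch-class numbering (c = 0, German h = 11) and the helper name `transpose` are assumptions made for illustration, not notation from the text.

```python
# Pitch classes of mela Nr.15 (mayamalavagaula), assuming c = 0 and German h = b natural = 11.
NOTE_TO_PC = {"c": 0, "d-flat": 1, "e": 4, "f": 5, "g": 7, "a-flat": 8, "h": 11}
MELA_15 = frozenset(NOTE_TO_PC.values())

# The scale K* of class 60 as given in the text.
K_STAR = frozenset({0, 3, 4, 7, 8, 9, 11})


def transpose(pcs, t):
    """Transpose a set of pitch classes by t semitones modulo 12 (hypothetical helper)."""
    return frozenset((p + t) % 12 for p in pcs)


shifted = transpose(MELA_15, 8)
print(sorted(shifted))           # [0, 1, 3, 4, 7, 8, 9]
print(sorted(shifted ^ K_STAR))  # [1, 11]: the two scales differ only in 1 versus 11
```

Transposing the mela by 8 semitones gives the representative $\{0, 3, 4, 7, 8, 9, 1\}$ quoted in the text, and its symmetric difference with $K^*$ consists of exactly the two pitch classes 1 and 11.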
So we are led to the question of how far there is a polarity not only in mathematical relations, between the consonance-dissonance dichotomy and the major scale on the one side and the major dichotomy and a 'consonance-dissonance' scale $K^*$ akin to mela Nr.15 in raga music on the other, but also a global polarity in musical cultures between European and Indian tradition. See also figure 31.4 for this polarization. This concluding discussion is an excellent example of the anthropic principle on the level of scale and interval interplay. Seemingly, the historical selection tends to optimize certain abstract
31.4. THE CLASSICAL CASE: CONSONANCES AND DISSONANCES
[Figure 31.4 graphic not reproduced; recoverable labels: strong dichotomies; 7-scales (major, mel. minor, whole-tone +1 tone, K*); DIATONIC; EXOTIC.]
Figure 31.4: Polarity between seven-tone scales and strong dichotomies shows a polarization between European and exotic (in particular: akin to Indian) scales. The size of darkened disks shows the number of progressions while the stars show culs-de-sac.
a priori properties of topological and transformational character. The possibility to compare such different musical cultures as European and Indian traditions opens a wide field of comparative musicology (relative ethnology) which is based upon systematic results instead of historiographic and ethnographic contingencies.
Part VIII
Structure Theory of Performance
Chapter 32
Local and Global Performance Transformations
C'est l'exécution du poème qui est le poème.
Paul Valéry
Summary. Performance includes a non-trivial transformation from the mental reality of a score to the physical reality of acoustic and, in the limit, gestural realization. We discuss a model of local and global transformation structures and present a preliminary discourse on the need to "shape mental reality" in performance. Local performance transformation structures, together with their syntactic combination to global structures, are formally developed. They involve extrapolation from discrete to continuous or differentiable data; the latter are induced by use of spline techniques. We give a justification of such a procedure from the musical and mathematical points of view, in particular with stress on expressive coherence.
This part, structure theory of performance, is a turning point in the entire theory of the topos of music. In fact, the preceding parts dealt with general structure theory and then, with a more musicological and music-theoretic focus, with mental perspectives of rhythm, motives, harmony, and counterpoint. In contrast, performance is concerned with the transformation of mental structures into physical ones. This is what traditionally happens in a concert where human artists are performing on physical instruments, including all the richness of human expression on the gestural, emotional, or structural level of physiological, social, and physical parameters. Evidently, a comprehensive theory of performance is out of reach as long as major constituents have not even been attacked on a scientific level. For example, there is no deeper understanding of the emotional function of music; one knows only some extremely elementary facts, for example those concerning the emotional impact of contrapuntal intervals on the emotional brain, as described in chapters 29, 30, and 31. Also on the level of instrumental factors in the quality of a performance, very little is known, for example, concerning the role of instrumental parameters in the communication of musical contents. Worse than that: There is not even a
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
CHAPTER 32. LOCAL AND GLOBAL PERFORMANCE TRANSFORMATIONS
commonly accepted structure theory of performance, i.e., a theory which deals with the precise and general description of what a performance is in its most elementary shape. For example, the discussion of tempo has not yet been carried to a point of general acceptance of what tempo can be, including a fundamental disagreement on its hierarchical ramifications. One of these discussions could germinate around the distinction between mental (symbolic, logical, call it as you wish) time and physical time. Naively, mental time, as it is encountered on the score notation, looks like something discrete, to be encoded on integers, or some isomorphic submodule O.Z of the real numbers, whereas physical time is parametrized by the full line of real numbers. This is also the point of view of Desain and Honing in [125]: They reduce mental time to discrete time intervals of a metrical structure, leaving the smooth part to the continuous time scales of tempo changes and expressive timing. This procedure is mathematically incorrect because
• the metrical time is infinitely divisible in itself: No positive lower limit for mental durations has ever been envisaged; metrical time is a topologically dense, not a discrete set in the field of real numbers. Hence, any reasonable (more precisely: uniformly continuous) time function from mental time $E$ to physical time $e$ can uniquely be extended to a time function on the reals (see [261]). There is no conceptual reason to restrict metrical time to a discrete subdomain of the reals.
• Tempo does not deal with something more continuous than metrical time. It is another concept¹: the inverse differential quotient of a function $E \mapsto e(E)$ between two copies of the real number axis with irreducibly different ontological specifications, namely the musical mental status of the score and the physical status of performed music.
Other misunderstandings floating around in musicological environments [189] maintain that tempo is a locally constant function, i.e., a step function, much like the medieval theories on velocity in the spirit of Oresme (see also our discussion in chapter 4). Oresme would decompose accelerated movements into a succession of uniform movements [485, contribution of Isabelle Stengers on Galileo Galilei]. In what follows, we have not tried to downsize the complexity of performance; it is indeed the most complex subject in musicology. It involves all kinds of considerations concerning the three basic realities of physics, mentality, and psychology. But beyond this ontological diversity, it is of a sophisticated structural nature, involving differential geometry, ODEs, and PDEs, and evidently all the music theory which is presupposed in any reasonable performance theory (though: not in everybody's performance, but blurred gesticulations of musicians "playing their ass off"² are not the subject of any performance theory).
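To make the notion of tempo as an inverse differential quotient tangible, here is a minimal numerical sketch. The function names `tempo` and `ritardando`, the units (quarters and minutes), and the particular performance curve are illustrative assumptions, not part of the theory.

```python
def tempo(e, E, h=1e-6):
    """Tempo T(E) = 1 / (de/dE), with de/dE estimated by a central difference."""
    slope = (e(E + h) - e(E - h)) / (2 * h)
    return 1.0 / slope


def ritardando(E):
    """Hypothetical performance curve: physical time (minutes) of mental onset E (quarters)."""
    return E / 120 + E ** 2 / 960


# Tempo is defined at every real mental onset, not only at notated ones.
for E in (0.0, 1.0 / 3.0, 4.0):
    print(E, round(tempo(ritardando, E), 3))
```

For this curve $de/dE = 1/120 + E/480$, so the tempo falls from 120 quarters per minute at $E = 0$ to 60 at $E = 4$, and it has a well-defined value at any intermediate onset such as $E = 1/3$.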
¹ We shall introduce and discuss such concepts in section 33.1.1.
² Answer of free jazz saxophonist Werner Lüdi to Cecil Taylor's question: "What's your concept?"
We shall however not discuss proper psychological aspects, but restrict ourselves to the mental and physical perspectives. For psychological concerns, refer to [272]. It may appear that we thereby omit an essential and basic point of view, and that such a restriction would
32.1. PERFORMANCE AS A REALITY SWITCH
hamper the entire discourse. This is true insofar as we omit an essential perspective. But it is false that this hampers the discourse. The performance discourse has to deal with structural descriptions of performance: What happens if a score is played in acoustical and gestural reality? This is beyond psychology. Nor do the rationales of a performance rely solely on the psychological reality (of emotions, as stressed in the naive romantic approach); they also rely, in the spirit of Theodor Wiesengrund Adorno or Walter Benjamin, on analytical facts or, in another approach, on gestural paradigms, as investigated by Johan Sundberg, for example. Performance research is only in its initial phase, in particular with respect to the high level of performing artists. But this is no reason to abbreviate the scientific path with the risk of misunderstanding the beauty and complexity of performance, and to fall into some crevasse of oversimplification. Hopefully, this should help the reader to follow the chapters on performance theory with due patience.
32.1
Performance as a Reality Switch
Summary. There is a sharp dichotomy of realities, communication, and semantical levels between a score and its performances. We expose these facts and their consequences for a theory of performance versus mental music theory.
Performance is more than simply playing an instrument "all'improvviso"; even in free jazz (if it merits that name), musicians always refer to an inner score. The concept of a score is used here in its generic meaning, i.e., a generic score is any written, imagined or conceived scheme for the execution of a musical composition (see [361] for details). In European or Japanese classical music, for example, a score is realized as a denotator structure of more or less complex form (see [378] for an explicit score form encompassing classical Viennese composition). In improvised music, the scheme is less of a structural nature than of a processual one. The jazz musician, for example, follows action and reaction patterns and rules according to blues schemes and individual dictionaries of motivic, rhythmic, and harmonic elements. We propose this informal but reliable definition:
Definition 97 Performance is defined as the physical realization of an interpretation of a generic score.
We leave the meaning of "interpretation" open in its common understanding, but we do include, and this is a core issue of the subsequent discourse, interpretation in the technical sense of interpretations of local compositions. We also stress the generic attribute "physical" which does include the acoustic realization, but does not exclude further performance parameters, such as the gestural dynamics of a performer. But we do, as already mentioned, exclude psychic parameters and therefore mean strictly physical performance. Notice that this subsumes intermediate technological strata as a special subsystem of physical realization.
32.2
Why Do We Need Infinite Performance of the Same Piece?
Summary. This section deals with the a priori necessity of infinite performance. The argumentation relates to infinite analysis (due to the Yoneda philosophy) and to its communication on the rhetoric level of expressive performance.
The basic problem here is a very common one: Why do we need infinite performance? We are talking about real performances in real concerts. Do we need new performances, again and again, and if so, why? Could it not happen that all possible or, at least, all relevant performances have definitively been played at a given moment, and that all successive performances are doomed to be superfluous variants of the arsenal of core performances? Seemingly, a musical composition, as it is fixed in a score and an associated denotator, is a finite object. So it would be a logical consequence of this finiteness that infinity is not inscribed in a musical composition and its performance. We therefore have two questions: Is there a substantial infinity in the interpretative variety of a given score? And, if this is the case, is there an associated infinity of performances, and why should such varieties make sense for the listener or for the artist? The first question is easily answered: Yes, there is an infinity of interpretations of a given finite score denotator. This was already shown in the discussion of iterated interpretations in section 13.4.2. This is an affirmative answer on the mental level, and one may easily add an infinity of evaluations of any such finite or infinite interpretation, for example on the level of rhythmic, motivic, or harmonic analyses as described previously in the respective chapters. One may also enrich the possibilities of viewing a given interpretation by a variation of the address, a technique which evokes Yoneda's lemma, of course, but which is also very concretely developed in the context of harmonic topologies. We shall see later in chapter 44.7 on performance operators that the analytical propaedeutics to performance also includes an important class of analyses: the analytical weights, i.e., numerical functions associated with more abstract analyses (such as harmonic topologies).
This type of numerical evaluation, which was already introduced in the power series Awo of section 13.4.2, is not just a "boiled-down" version of 'serious analysis'; it is just another way of looking at complex configurations. The point is that the categories of 'understanding' a (finite) musical composition are of an infinite character in many respects, and that there is no reason to claim complete understanding by any finitistic argument. So the expression of mental content relies upon an infinite arsenal:
Fact 18 Performance conveys an infinite message.
But infinity in performance also comes in from a second point of view: Performance is inevitably an experiment in understanding or advancing comprehension. The realization of a piece of music in physical spaces creates a view, a flight through a virtual landscape which may reveal new insights to the involved actors, insights which are not anticipated but emerge from a particular visual angle of a determined 'performance flight'. Such an experience is a mental
32.3. LOCAL STRUCTURE
experiment as it has been described in chapter 4, and as such it is a creative investigation, not only a reproductive activity for the sake of social coherence and psycho-hygiene. In this sense, performance need not be agreeable or pleasant: recognizing new aspects within an infinite repertory of analytical structures can be painful, but healthy. Glenn Gould's work is a brilliant piece of history in this investigative field of musical performance. Even his most awkward performances of Beethoven's piano music (such as the strange, cranky performance of the "Hammerklavier" sonata op. 106 or the funny and blasphemous review of the "Appassionata" sonata op. 57) are masterpieces in revealing new aspects of compositions, be it solely to learn how not to play them ...
32.3
Local Structure
Summary. Just as with the analytical work, the performance task is also composed of local subtasks which constitute the morphemes of performance transformations. This section describes the emergence of such units and their structure.
In a first approximation, performance p could be seen as a set map which associates with every element X of a local composition (in a score-related form) a physical event, encoded by an element x = p(X) in a second local composition whose parameters pertain to a form of physical signification. The coherence of such a performance map is however very rarely a global one. For example, the tempi may change instantaneously, by the indication istesso tempo, forcing the artist to restart from the initial tempo after a deformation via a sequence of agogical indications. Or else, left and right hands may follow their separate tempi-except for some meeting points where the onsets should coincide (in Chopin rubato, for example). In orchestral music, the tuning is a function of the instrument, although everything is written in the same orchestra score. Or it may happen that the local performance of an ornament (a complicated trill, say) is an autonomous shaping map, operating independently of the map on the events which neighbor the ornament. We therefore observe a global structure, a patchwork of local chart maps, much like morphisms on global compositions.
32.3.1
The Coherence of Local Performance Transformations
Summary. Arguments for building local units, based on local compositions, of performance transformations are given. The idea of making performance on local units coherent is explicated: The mapping on the unit's domain is defined by a coherent rule, typically described by continuously differentiable transformations on connected neighborhoods of the local unit.
In the theory of local and global compositions, the definition of morphisms was roughly this: You are given a set map between two sets of music objects and you want to express that this map is not just any set-theoretic map, but shares a type of coherence. This is done by the extensibility of such a set map to the ambient space by an affine space map. This is one
way of expressing coherence between the images of different points of the local composition domain: they are mapped under a formula of affine type. This kind of coherence is also required in classical algebraic geometry when defining morphisms between algebraic sets. For performance maps, the narrow affine formalism and also the formalism of polynomial maps are both too algebraic in nature. Let us give some arguments to make this clear. If we want to model a fermata tempo shape, we evidently cannot restrict ourselves to linear expressions since the fermata tempo must decrease in the beginning and increase towards the end of the fermata duration. Moreover, the cognitive rules of logarithmic perception of differences³ would enforce changes in tempo in the sense that the change of physical onset as a function of mental onset, i.e., $de/dE$ at onset $E$, is proportional to the actual value of physical onset $e$, i.e., $de/dE \sim e$, which means that $dE/de \sim 1/e$ and therefore $E \sim \ln(e)$, i.e., $e \sim \exp(E)$, so we cannot use polynomials for such a shaping of physical onset. The general shape of physical parameters against mental ones is also strongly documented in glissando and crescendo effects, i.e., changes of pitch $h$ or loudness $l$ as functions of mental time $E$. These two effects are also a strong argument for continuous mental time: Glissando and crescendo must be defined at each moment of a continuous time parameter, otherwise they cannot be realized, any more than the continuous (even differentiable) movement of the conductor's baton. Of course, such general function requirements could be worked around by gluing polynomial functions to polynomial spline functions. But this is exactly what we do not want here: We want a unique class of functional expressions for the local coherence. It is this principle: What we can achieve by gluing together local pieces should be left to the global theory instead of hiding the gluing technique on the level of function classes!
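The step from $de/dE \sim e$ to $e \sim \exp(E)$ can be checked numerically. The sketch below integrates the differential equation $de/dE = c \cdot e$ with a classical fourth-order Runge-Kutta scheme; the proportionality constant `c`, the initial value `e0`, and the function name are assumptions chosen only for illustration.

```python
import math


def integrate_onset(c, e0, E_end, steps=1000):
    """Integrate de/dE = c * e from E = 0 to E_end with classical RK4, starting at e(0) = e0."""
    h = E_end / steps
    e = e0
    for _ in range(steps):
        k1 = c * e
        k2 = c * (e + 0.5 * h * k1)
        k3 = c * (e + 0.5 * h * k2)
        k4 = c * (e + h * k3)
        e += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return e


# Compare the numerical solution with the closed form e0 * exp(c * E).
numeric = integrate_onset(c=1.0, e0=1.0, E_end=2.0)
print(numeric, math.exp(2.0))
```

The numerical solution agrees with the exponential to many digits, whereas no polynomial in $E$ satisfies the equation for $c \neq 0$ (except the zero function), since differentiation lowers the degree of a polynomial.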
32.3.2
Differential Morphisms of Local Compositions
Summary. This section describes the categories of local compositions with differential morphisms on real vector spaces. This formalism captures unreflected prerequisites of musicological approaches on performance instances, such as tempo or tuning.
³ See also appendix A.2.2.
Before constructing the categories of differential morphisms, we should specify their objects. As the existing performance theory is only developed for the zero address and for objective compositions, we shall only look at such objects. In general, score denotators live in complex form spaces, including components which are far from being performable, such as bar-lines or pauses. We will not consider these full-fledged structures, but only local compositions which consist of objects that are candidates for events to be performed in physical spaces. Such local compositions live in space forms which are limits (products are sufficient here), whose factor spaces are submodules of real vector spaces. For example, pitch can be encoded by integers (such as is the case for MIDI key numbers), or by tuples of rationals (as is the case with the Euler spaces built from (just) octaves, fifths, thirds, sevenths etc.), whereas onset is encoded by a module $\mathbb{Z}.x$ or $\mathbb{Z}[1/2]$ or $\mathbb{Q}$. Dynamics is usually encoded by integers (such as suggested by the MIDI key velocity numbers 0, 1, 2, ... 127). With these usual values, one may consider local compositions $(K, M)$ with $M$ a submodule of $\mathbb{R}^n$. The same observations are valid
when representing local compositions which are related to physical parameter space forms. We shall therefore, without loss of generality, assume that we are given local compositions $(K, S)$ in simple space forms $S \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R}^n)$ of modules⁴ $\mathbb{R}^n$, $n \in \mathbb{N}$. Later, we shall specify the form names when more concrete parameters will be discussed. Let us first explain the general construction of tangent compositions and their morphisms:
Definition 98 Given an objective, zero-addressed commutative local composition $(K, S)$ in a simple ambient space $S \underset{Id}{\longrightarrow} \mathrm{Simple}(M)$ with $R$-module $M$, then the tangent composition $TK$ of $K$ is the composition $(K \times R.K, S^2)$, where $S^t \underset{Id}{\longrightarrow} \mathrm{Limit}(S, \ldots S)$ is the $t$-fold product.
The local composition $K$ is called the basis of $TK$. For a natural number $t$, the $t$-fold tangent composition $T^tK$ is the tangent composition $T(T^{t-1}K) = K \times R.K \times (R.K \times R.K)^{t-1} \subset S^{2t}$, with special value $T^0K = K$. The tangent space $T_kK$ of $K$ at point $k \in K$ is the subset $\{k\} \times R.K$, identified with $R.K$ for its module structure. Given two tangent compositions $TK$ of $K$ and $TL$ of $L$ (both compositions over the same ring $R$), a tangent morphism $Tf : TK \to TL$ is a set map between the local compositions $TK, TL$ which factors through the canonical projections $p_K : TK \to K$, $p_L : TL \to L$ and a (necessarily uniquely determined) map $f : K \to L$, and such that all the fiber maps $Tf_k : T_kK \to T_{f(k)}L$ are linear. For $1 < t$, a $t$-fold tangent morphism $T^tf : T^tK \to T^tL$ is a tangent morphism $T(T^{t-1}K) \to T(T^{t-1}L)$ whose basis map is a $(t-1)$-fold tangent morphism. The obvious category of $t$-fold tangent compositions and $t$-fold tangent morphisms is denoted by $\mathrm{Tan}^t_R$.
Lemma 49 Let $\mathrm{ComLoc}_R$ be the category of commutative objective local compositions over the commutative ring $R$. Let $f : (K, S) \to (L, U)$ be a morphism in $\mathrm{ComLoc}_R$. Then the map
$$f \mapsto Tf = f \times R.f : TK \to TL$$
defines an injective natural transformation $\mathrm{ComLoc}_R \to \mathrm{Tan}^1_R$ onto the subcategory of those tangent morphisms $Tf : TK \to TL$ between tangent compositions such that the fiber maps $Tf_k$ are all the same linear map $T$, and we have $f(k_1) - f(k_2) = T(k_1 - k_2)$ for all couples $k_1, k_2 \in K$, i.e., the map $f$ is defined via $T$ and the value on one single point.
The proof of this lemma is left to the reader. For the real number field $R = \mathbb{R}$, we are not only interested in the tangent categories, but in those morphisms which extend to differentiable maps on the underlying vector space, or, more generally, in maps which extend to any maps of a specific category $Cat$:
Definition 99 A $t$-fold tangent morphism $T^tf : T^tK \to T^tL$ of $t$-fold tangent compositions with positive $t$ in $\mathrm{Tan}^t_{\mathbb{R}}$ is said to be $t$-fold differentiable iff there is a $t$-fold differentiable map $F$ between the underlying vector spaces such that $T^tF|_{T^tK} = T^tf$. For $0 \le t$, such a map is said to be $C^t$ iff it may be extended to a $C^t$ map $F$ between the underlying vector spaces.
⁴ The case $n = \infty$, the countable direct sum of copies of $\mathbb{R}$ for possible Fourier coefficients or similar parameters in sound color spaces, will not be considered except for some explicitly described special considerations.
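The characterization in Lemma 49 can be illustrated on a toy case. In the sketch below, an affine map $f(k) = A k + b$ on integer tuples plays the role of a morphism of local compositions, and its tangent map acts as $f$ on base points and as the linear part $A$ on tangent vectors. The matrix `A`, the shift `b`, and all function names are hypothetical choices for illustration.

```python
def affine(A, b):
    """Return the affine map k -> A.k + b on tuples (A given as a list of rows)."""
    def f(k):
        return tuple(sum(a * x for a, x in zip(row, k)) + bi for row, bi in zip(A, b))
    return f


def tangent_map(A, b):
    """Tangent map Tf = f x R.f: f on the base point, the linear part of f on the fiber."""
    f = affine(A, b)
    T = affine(A, [0] * len(A))  # linear part, the same on every fiber
    return lambda k, v: (f(k), T(v))


A = [[2, 0], [1, 1]]
b = [1, -1]
f = affine(A, b)
T = affine(A, [0, 0])
Tf = tangent_map(A, b)

k1, k2 = (1, 2), (3, 5)
difference = tuple(x - y for x, y in zip(k1, k2))
print(tuple(x - y for x, y in zip(f(k1), f(k2))), T(difference))
print(Tf(k1, (1, 0)))
```

Both printed tuples in the first line coincide, which is the identity $f(k_1) - f(k_2) = T(k_1 - k_2)$ of the lemma: the map is determined by its linear part and its value at a single point.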
Clearly, for positive $t$, the $t$-fold differentiable and the $C^t$ morphisms, respectively, define subcategories $\mathrm{LocDiff}$ and $\mathrm{Loc}C^t$ of $\mathrm{Tan}^t_{\mathbb{R}}$, respectively. More generally, if $Cat$ is a category of morphisms on real vector spaces, we denote by $\mathrm{Tan}^t_{Cat}$ the category of $t$-fold tangent morphisms which extend to morphisms from $Cat$. In practice, this mathematical catechism is not very handy; we shall rather use the wording of "an extension of a given $t$-fold tangent morphism to a morphism of the respective category $Cat$".
Example 51 In traditional performance research, tempo is an important feature. It is usually described via Mälzel's metronomic tempo indication (M.M.) of the type "x quarters per minute". This means that we compare mental time $E$ (quarters) to physical time $e$ (minutes) via the quotient $\Delta E / \Delta e$ as a function of $E$. Forgetting about the other parameters for the sake of simplicity, we have a (not necessarily finite) local composition of (real-valued) mental onsets $MyOnsets : 0 \rightsquigarrow OS(\{E_1, \ldots E_i, \ldots\})$ with form $OS \underset{Id}{\longrightarrow} \mathrm{Power}(Onset)$ (see list of mental forms in formula 6.69 in section 6.6). Suppose that we have a performance set map $p$ associating each $E_i$ with a physical onset $p(E_i) = e_i$ of a denotator $MyPhysOnsets : 0 \rightsquigarrow PhysOS(\{e_1, \ldots e_i, \ldots\})$ in the form $PhysOS \underset{Id}{\longrightarrow} \mathrm{Power}(PhysOnset)$ over the physical onset form $PhysOnset \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R})$ (see 6.6). Suppose that the tempo indications are given on $MyOnsets$ by the values $T_i$ at the onsets $E_i$. This data defines a tangent morphism $Tp : MyOnsets \to MyPhysOnsets$ whose linear fibers over $E_i$ are the linear maps $E \mapsto T_i.E$. For the other onsets, tempo is not declared. This is the usual situation in music scores. However, it is not clear what the tempo should be in between the indicated onsets. The point is that one would like to turn the tangent morphism into a differential morphism, at least into a piecewise differential morphism, i.e., one that extends to a piecewise differentiable function having the derivatives $T_i$ as required at the arguments $E_i$. Some musicologists do not even agree with this extensibility requirement, but maintain that tempo is not defined except at the given arguments. We have already countered that position. However, if tempo has to be extended in the sense that we ask for a differentiable morphism giving rise to the present tangent morphism, there are many ways to do so. For example, tempo could be set to a piecewise constant function, giving rise to a step tempo curve which integrates to a piecewise linear map $P$ that extends $p$. As to the mental contents of a score, this question could be viewed as secondary, but if we agree with Paul Valéry in saying that c'est l'exécution du poème qui est le poème (see the catchword of this chapter), then the total content of a composition must include the extension of $p$ to $P$. It should also be stressed that the European understanding of musical time is a type of negative account: time is only interesting if it is over, i.e., the time between two successive notes is non-existent, or, at least, of no existential relevance. Tempo does not exist between two notes since there is no time feeling; performed music has no time except when a note onset intervenes. This is a severe lack of understanding of what musically happens. Such a music understanding is poor and proves a negative, immature relation to time.
Positively stated, European time is more a kind of trigger, a Turing machine unit slot transporting the logical processes (typically in harmony).
It is the merit of performance research, in particular the Swedish school of Johan Sundberg and his collaborators, to have pointed at this delicate time question. We shall come back to the extension type problem for tangent morphisms in chapter 36 on expressive performance.
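One of the many possible extensions mentioned in Example 51, the step tempo curve integrating to a piecewise linear map $P$, can be sketched as follows. The convention used here is an assumption: tempo $T_i$ is read as $dE/de$ in quarters per minute and held constant from $E_i$ up to the next indicated onset, so that the slope of $P$ on that segment is $1/T_i$. The function name and the sample data are hypothetical.

```python
from bisect import bisect_right


def step_tempo_performance(onsets, tempi, e0=0.0):
    """Build P(E): physical time in minutes for a mental onset E in quarters.

    onsets: increasing mental onsets E_i carrying a tempo indication.
    tempi:  tempo T_i (quarters per minute) valid from E_i to the next indicated onset;
            the last tempo is continued beyond the last indicated onset.
    """
    phys = [e0]
    for (E_a, E_b), T in zip(zip(onsets, onsets[1:]), tempi):
        phys.append(phys[-1] + (E_b - E_a) / T)  # constant tempo gives a linear segment

    def P(E):
        i = min(max(bisect_right(onsets, E) - 1, 0), len(tempi) - 1)
        return phys[i] + (E - onsets[i]) / tempi[i]

    return P


# Hypothetical score: 60 quarters per minute for four quarters, then 120.
P = step_tempo_performance([0.0, 4.0, 8.0], [60.0, 120.0, 120.0])
for E in (0.0, 2.0, 4.0, 6.0, 8.0):
    print(E, P(E))
```

The resulting $P$ is continuous but only piecewise differentiable: its slope jumps at $E = 4$, which is exactly the kind of step behaviour that a $C^1$ extension would smooth out.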
Exercise 71 Let $Tf : TK \to TL$ be a tangent morphism of $\mathrm{Tan}^1_{\mathbb{R}}$ over a basis $K = \{x, y\}$ of cardinality 2 in $\mathbb{R}$. Then $Tf$ is extensible by exactly one 1-fold differential morphism whose coordinate functions $P_i(t)$ are polynomials in $t$ of degree three. Moreover, these polynomials are rational functions in $x, y, f_i(x), f_i(y), t$ which are polynomials for fixed $x, y$. If $Tf_x = Tf_y = 0$, we have $\min(f_i(x), f_i(y)) \le P_i(t) \le \max(f_i(x), f_i(y))$ for $x \le t \le y$.
Exercise 72 Let $p : K \to L$ be a set map of local compositions with $card(K) = m + 1$, then there is a polynomial morphism extension $P$ of $p$ whose coordinate functions have degree $\le m$. (Hint: Find a straight line through the origin and not parallel to any of the finitely many lines connecting pairs of points of $K$. Take the projection $q$ of $K$ into the orthogonal space to this line. This is a bijection onto the image $q(K)$. Within this orthogonal space, repeat the same procedure until you have projected $K$ into a one-dimensional subspace, call $K^*$ the image of $K$ in this subspace. Then, looking at the map $p^* : K^* \to L$ induced by $p$ on this one-dimensional projection of $K$, the claim is a classical result of polynomial interpolation.)
Exercise 73 If the points of $K$ in exercise 72 are in general position, then $p$ is a morphism of local compositions and therefore automatically $t$-fold differentiable for every $t$.
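The cubic of Exercise 71 can be written down with the classical cubic Hermite basis. The following sketch is one standard way to do so for a single coordinate function; the function name and argument order are assumptions, and the code is an illustration rather than a solution in the form intended by the exercise.

```python
def hermite(x, y, fx, fy, dx, dy, t):
    """Cubic P with P(x) = fx, P(y) = fy, P'(x) = dx, P'(y) = dy, evaluated at t."""
    h = y - x
    s = (t - x) / h
    h00 = 2 * s ** 3 - 3 * s ** 2 + 1
    h10 = s ** 3 - 2 * s ** 2 + s
    h01 = -2 * s ** 3 + 3 * s ** 2
    h11 = s ** 3 - s ** 2
    return h00 * fx + h10 * h * dx + h01 * fy + h11 * h * dy


# Zero fibers at both ends: the cubic stays between the two prescribed values.
samples = [hermite(0.0, 2.0, 1.0, 3.0, 0.0, 0.0, 2.0 * i / 100) for i in range(101)]
print(min(samples), max(samples))
```

For fixed $x, y$ the expression is a polynomial in $f_i(x)$, $f_i(y)$, the slopes, and $t$, while $x$ and $y$ enter through the division by $y - x$. With vanishing slopes the interpolant reduces to $f(x) + (f(y) - f(x))(3s^2 - 2s^3)$ with $s = (t - x)/(y - x)$, which makes the bound between the minimum and the maximum of the two end values visible.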
32.3.2.1
A Recursive Interpolation Algorithm
The next result is related to what we shall call analytical weights, a key structure in the theory of performance operators. But it pertains to the theory of differential morphisms, so we present it right here. Since the essential statement is a recursive algorithm which is of essential use in programming performance theory, we do not present the result as a lemma but as a construction. The situation is this: We are given a finite local composition in a simple ambient space $S \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R}^n)$, consisting of points $K = \{x_1, \ldots x_s\}$, and contained in the open $n$-cube $C^n = ]u_1, v_1[ \times \ldots \times ]u_n, v_n[$ of $\mathbb{R}^n$. For $m = 1, \ldots n$, we denote by $C^m = ]u_1, v_1[ \times \ldots \times ]u_m, v_m[$ the projection onto the first $m$ coordinates, and by $C_m = ]u_m, v_m[$ the projection onto the $m$-th coordinate. On $K$, we are given a tangent morphism $Tf : TK \to TL$, where the codomain tangent composition $TL$ sits over a local composition $L$ in an ambient space $W \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R})$ of "weights", and where the linear components $Tf_k$ vanish for every $k \in K$. We want to find a $C^1$ extension of $Tf$ which adds no extra extrema to the extrema required per definition on the $K$ points, and which evaluates to the constant value 1 on the complement of the frame cube $C^n$. The meaning of this requirement is that we want to extend the "discrete weight" $f$ to a function $F$ on the entire space $S$ such that the extension is $C^1$, normalizes to the constant value 1 when the arguments tend to infinity, and has values very close to the given values in a small neighborhood of $K$ without producing new extremal values not defined on $K$. For the following construction, we concentrate on the module $\mathbb{R}^n$ and forget about the underlying space forms. The construction is a recursive one, and we start with $n = 1$. In this case, we suppose that the $s$ points of $K$ are ordered by size, i.e., we have these $s + 2$ points
on the real axis: $u_1 < x_1 < \ldots x_s < v_1$. Extending $f$ to the frame arguments $u_1, v_1$, these arguments are mapped to the values $1 = f(u_1), f(x_1), \ldots f(x_s), 1 = f(v_1)$. From exercise 71 we have a (unique) cubic polynomial extension of the tangent map on every pair of successive points $\{u_1, x_1\}, \{x_1, x_2\}, \ldots \{x_{s-1}, x_s\}, \{x_s, v_1\}$ with zero fibers $Tf_{u_1} = Tf_{x_1} = \ldots Tf_{x_s} = Tf_{v_1} = 0$. We now consider the spline function defined by gluing all the polynomial extensions at the common points of the pairs, i.e., at the elements of $K$. Outside the frame, we extend this spline function by the constant function of value 1. This function is $C^1$ and adds no extra extremal values to the values on $K$ inside the frame.
Suppose now that, by induction, we have succeeded in constructing the extension in dimension $n - 1$, in the following setting: We are given the decomposition $x_i = (z_i, w_i)$, $z_i \in \mathbb{R}^{n-1}$, $w_i \in \mathbb{R}$. Let $p_n(K) = \{w_{i_1} < w_{i_2} < \ldots w_{i_r}\}$ be the n-th projection of $K$ with the different values in increasing order. For every hyperplane $H_x = p_n^{-1}(x)$, $x \in \{u_n, w_{i_1}, w_{i_2}, \ldots w_{i_r}, v_n\}$, we are given either the recursively defined $C^1$-functions $P_x : H_x \to \mathbb{R}$, or the constant function 1 for the frame points $x = u_n, v_n$. For any point $v = (z, w) \in \mathbb{R}^{n-1} \times \mathbb{R}$, the value $F(v)$ is defined as follows. If $w \notin [u_n, v_n[$, we set $F(v) = 1$. Else, there is exactly one interval $[a, b]$ from the successor pairs $(u_n, w_{i_1}), (w_{i_1}, w_{i_2}), \ldots (w_{i_{r-1}}, w_{i_r}), (w_{i_r}, v_n)$ such that $a \le w < b$. We then evaluate to the value $F(v) = P_{a,b,P_a(z),P_b(z)}(w)$ of the rational function $P$ defined in exercise 71. Since $a, b$ are fixed here, the function is a polynomial in the arguments $P_a(z)$, $P_b(z)$, and $w$. So the function is a $C^1$-function in $v = (z, w)$. On each hyperplane $H_x$, the derivatives of neighboring functions coincide, and the entire function is $C^1$. Finally, since the values are constant 1 outside the frames on the hyperplanes, the constant 1 value is guaranteed outside the n-cube $C^n$ and we are done.
This construction has the disadvantage of depending on the order of the coordinates used in the recursion. For every permutation $\pi$ of the $n$ coordinates, we have such a function, call it $F_\pi$. Each such function extends one and the same original tangent function and is 1 outside the cube $C^n$. So we can symmetrize the construction by the weighted sum $F_{sym} = \frac{1}{n!} \sum_\pi F_\pi$. However, for programming tasks, such a symmetrization is very time-consuming, since it requires one recursive construction per permutation.
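The recursion described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the author's implementation: the names `hermite_flat`, `make_weight`, and `make_symmetric_weight` are hypothetical, points are plain tuples, and the cubic of exercise 71 with vanishing tangents is written in its standard form $f_a + (f_b - f_a)(3s^2 - 2s^3)$ with $s = (t - a)/(b - a)$. The recursion is started in dimension 0 (a single point carrying its value), so that the case $n = 1$ of the text is the first recursion step.

```python
from bisect import bisect_right
from itertools import permutations


def hermite_flat(a, b, fa, fb, t):
    """Cubic on [a, b] with values fa, fb and zero slope at both ends."""
    s = (t - a) / (b - a)
    return fa + (fb - fa) * s * s * (3.0 - 2.0 * s)


def make_weight(points, values, frame):
    """Recursive extension F of the discrete weight points[i] -> values[i].

    points: distinct tuples in R^n, strictly inside the frame cube
    frame:  list of n pairs (u_m, v_m); F is 1 outside the cube
    """
    if not frame:
        # Dimension 0: exactly one point is left, and F is its value.
        (value,) = values
        return lambda v: value
    u, v_top = frame[-1]
    # Group the points by their last coordinate w (the hyperplanes H_w).
    slices = {}
    for x, fx in zip(points, values):
        slices.setdefault(x[-1], []).append((x[:-1], fx))
    levels = sorted(slices)
    knots = [u] + levels + [v_top]
    one = lambda z: 1.0  # constant function on the two frame hyperplanes
    layer = [one]
    for w in levels:
        zs = [z for z, _ in slices[w]]
        fs = [fz for _, fz in slices[w]]
        layer.append(make_weight(zs, fs, frame[:-1]))
    layer.append(one)

    def F(v):
        z, w = tuple(v[:-1]), v[-1]
        if not (u <= w < v_top):
            return 1.0
        i = bisect_right(knots, w) - 1  # knots[i] <= w < knots[i + 1]
        return hermite_flat(knots[i], knots[i + 1],
                            layer[i](z), layer[i + 1](z), w)

    return F


def make_symmetric_weight(points, values, frame):
    """Average of the n! functions obtained from all coordinate orders."""
    parts = []
    for perm in permutations(range(len(frame))):
        pts = [tuple(x[i] for i in perm) for x in points]
        frm = [frame[i] for i in perm]
        parts.append((perm, make_weight(pts, values, frm)))

    def F_sym(v):
        total = sum(F(tuple(v[i] for i in perm)) for perm, F in parts)
        return total / len(parts)

    return F_sym
```

For instance, `make_weight([(1.0,), (2.0,)], [0.5, 2.0], [(0.0, 3.0)])` returns a function that takes the prescribed values at 1.0 and 2.0 and the value 1 outside the frame. The symmetrized variant builds one recursive function per permutation, which makes the cost mentioned in the text visible.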
32.4
Global Structure
Summary. The global structure of a performance transformation is a patchwork of local performance transformations. The combination and gluing data of local units expresses the syntax of performance. In turn, this performance syntax defines an interpretation of what has been recognized on the analytical level. This "interpretation of the interpretation" obeys its own rules and constitutes a relatively autonomous rhetorical shaping of the given "text" and its analytical comprehension.
Evidently, the local performance structures described above are far from sufficient to grasp realistic performance situations. Basically, there are four reasons for this insufficiency:
• Instrumental variety
The events of a score are not always in one and the same parameter space, especially if we deal with compositions for different instruments. We typically work in a colimit space
$ScoreOrchestra \xrightarrow[Id]{} Colimit(ScoreInstr_1, \ldots ScoreInstr_k)$
of instrumental spaces
$ScoreInstr_i \xrightarrow[Id]{} Simple(\mathbb{R}^{n_i})$
with individual coordinator modules. For example, the piano notes have a four-dimensional space associated with pitch, onset, loudness, and duration, whereas the violin space adds crescendo and glissando to the piano space. Ditto for the physical codomain spaces
$PhysInstr_i \xrightarrow[Id]{} Simple(\mathbb{R}^{m_i})$
where the mental instrumental events are mapped. But here, the codomains may be very different in dimension according to the real physical instrument which is addressed on the mental score. The typical colimit space over the physical instruments is
$PhysOrchestra \xrightarrow[Id]{} Colimit(PhysInstr_1, \ldots PhysInstr_k)$.
This means that performance starts from a local composition $K$ in the ambient space ScoreOrchestra and maps into the local composition of physical events $L$ in the ambient space PhysOrchestra. The local compositions $K, L$ are disjoint unions of the subcompositions $K_i, L_i$, $i = 1, \ldots k$ corresponding to the instrumental cofactors in both $K$ and $L$, and the performance splits into a coproduct of $k$ individual local instrumental performances $P_i : K_i \to L_i$. Even if the mental score level composition $K$ is not a disjoint union of instrumental subcompositions, i.e., if it is a proper colimit, it can be lifted to the coproduct of its cofactors to obtain a disjoint family of performances, so our hypothesis is not restrictive. On the other side, the proper colimits on the physical level are superfluous since it is always (technically) possible to realize the disjoint union of instrumental voices if necessary. It follows that the above splitting is necessary and sufficient for the orchestral globalization of local performances.
• Gluing of local extension strategies
The local approach is insufficient when sudden, non-continuous changes of the performance map happen. For example, if we start with tempo M.M. quarter = 120 per minute and then perform a chain of accelerandi and rallentandi, the composer may ask us to reset the tempo after these tempo variations by the command "istesso tempo", meaning to return to the original M.M. quarter = 120 per minute. It would be artificial to construct a continuous tempo curve of transition to the reset value. In this case, it is natural to split the performance map into two contiguous domains with their own tempi. Or else, it may happen that the extension category Cat is rather strict, allowing only polynomial maps, say. Such a restriction may intervene for reasons of cognitive nature, or because there are hypotheses about the dynamics of performance, such as mechanical models (see the work of McAngus Todd [532], for example). Then it is better to glue the performance map from parts which are conformal with Cat and coincide on the intersection of charts of the atlas patchwork. This latter strategy is nothing else than the well-known spline approach to the extension of discrete map data, see [499,500] for a typical reference.
• Special roles of selected parameters of a given local performance map (hierarchy)
This type of globalization effect is perhaps the most interesting. We stated that in performance theory, we are only interested in the performance map on the mental events which will effectively be played, and not in abstract objects such as bar-lines or pauses. But abstract objects may emerge from effectively played events in the following sense: Suppose that a local composition $K$ in some space $S$ has to be performed, for example a space parametrizing onset $E$, duration $D$, and pitch $H$, and suppose that we are given a space $S'$ of selected parameters of $S$, for example onset and duration. We then have the projection $\pi_{S'} : S \to S'$ and the associated projection $K \to \pi_{S'}(K)$. In a great number of performances, it happens that the performance of $K$ factorizes through $\pi_{S'}$, which means this: We look at the performance map $p : K \to L$, where $L$ lives in the corresponding physical space $PhS$ with the corresponding projection $\pi_{PhS'} : PhS \to PhS'$ and the induced projection $L \to \pi_{PhS'}(L)$. Factorization means that we have a performance map $p_{S'} : \pi_{S'}(K) \to \pi_{PhS'}(L)$ which commutes with the projections, i.e., $\pi_{PhS'} \circ p = p_{S'} \circ \pi_{S'}$. Here, the time events in $S'$ are not really played, but their performance determines the time performance of the really played events. Hence, it may be reasonable to add the projected mental event set $\pi_{S'}(K)$ to the real set $K$ (plus some projection and commutativity conditions as shown above) in order to describe the overall situation. This idea is the basis of the so-called performance hierarchies which control the special roles of parameters.
• Stemmatic deployment of performance
Performance is never realized on the spot or by means of a unit process which grasps the analytical data and presents their "rhetorical" shaping at once. Humans have to rehearse again and again, continuously refining their results until they reach the final (or provisionally final) performance. One might think that this stemmatic deployment process from the sight reading (prima vista) performance to the artistically well-devised presentation is just a human learning process which a machine can achieve at once if it is able to learn the performative substance in abstracto. But this is erroneous, since the logic of performance, the anatomy of the shaping process, is a stemmatic one, a multilayered unfolding of deformations of the mental score symbols. Rather than a learning process, rehearsal is a meditation on the refinement of understanding. So performance is also global in the sense of a multilayered time-dynamic process. Each stemmatic layer is a logical step in the understanding of the rhetorical expressivity.
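The factorization condition $\pi_{PhS'} \circ p = p_{S'} \circ \pi_{S'}$ from the hierarchy item above can be tested on finite data. The following Python sketch is only an illustration under stated assumptions: events are hypothetical triples (onset, duration, pitch), the function names and both performance maps are invented for the example, and the induced map $p_{S'}$ is represented as a dictionary.

```python
def induced_map(K, p, proj, phys_proj):
    """Return p' with phys_proj(p(x)) == p'[proj(x)] for all x in K, or None.

    None means that p does not factorize through the projection.
    """
    induced = {}
    for x in K:
        key, image = proj(x), phys_proj(p(x))
        if induced.setdefault(key, image) != image:
            return None
    return induced


# Hypothetical score events (onset E, duration D, pitch H).
K = [(0.0, 1.0, 60), (0.0, 1.0, 64), (1.0, 1.0, 67)]
time_part = lambda event: event[:2]  # projection onto (onset, duration)


def strict(event):
    """Time is performed at a constant tempo, independently of pitch."""
    E, D, H = event
    return (0.5 * E, 0.5 * D, float(H))


def arpeggiated(event):
    """Higher notes are slightly delayed: the onset now depends on pitch."""
    E, D, H = event
    return (0.5 * E + 0.01 * (H - 60), 0.5 * D, float(H))


hierarchy = induced_map(K, strict, time_part, time_part)
no_hierarchy = induced_map(K, arpeggiated, time_part, time_part)
```

In the first case the time performance is a well-defined map on the projected events, so the projected set can be added as a "mother" of the played events. In the second case two events with the same onset and duration receive different physical onsets, and no such map exists.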
32.4.1
Modeling Performance Syntax
Summary. We review the syntactical mechanism of global performance transformation: orchestration, contiguity, hierarchy, and stemmatic layers.
As was shown above, performance syntax is a bundle of four streams of completely different nature. Contiguity is perhaps the most obvious stream. It reminds us of the syntactic juxtaposition of units within the stream of ordinary language. The orchestration stream is much more difficult to understand since it conveys a layering of the performative mapping which creates the interplay of voices. The structure of this interplay must be viewed in a multidimensional space of geometric parameters, such as onset, duration, pitch, loudness, of sound color parameters for envelopes, Fourier, FM, or wavelet coefficients and the like, and of gestural coefficients for curves of body movements. The hierarchical stream is of high cognitive relevance since it describes the leading and slave parameters of performance. Such a hierarchy is likely to produce orientation in the perception of a performance. But it is also of a purely structural relevance since hierarchies are the turning point and key to the stemmatic layering logic. In fact, the successive refinement process of performance is often managed via deformations of hierarchies, i.e., adjustments of relations and functional dependencies among the members within a given hierarchy. Whereas the three preceding streams are "in praesentia" as semioticians would say, the fourth: stemmatic layering, is of completely different nature. It is not, and this is decisive, of paradigmatic nature, since it describes a development of logical enchainment, i.e., of a juxtaposition of logical stages. But this type of syntagm is not unfolded in time, it is a kind of encapsulated history whose presence cannot be unveiled without a huge amount of ambiguity and uncertainty (see chapter 47 on inverse performance theory), its presence is virtual. 
The complexity of global performance is not only due to the four-fold stream of global structures; it is reinforced by the completely divergent extension morphisms underlying any two different local charts within a particular stream, and even more dramatically when these charts are distributed over different streams.
32.4.2
The Formal Setup
Summary. This section introduces the formal performance transformation setup via the category $GlTan^t_R$ of global tangent compositions with t-fold tangent morphisms and, for $R = \mathbb{R}$, with the associated subcategories $GlDiff$ and $GlC^t$ of $GlTan^t_{\mathbb{R}}$.
From the four types of globalization phenomena, instrumental variety, gluing of local parts, hierarchy of parameters, and stemmatic deployment, the first three will be covered by the following categorical description, while the last one, being of more processual nature, will be treated in chapter 38. Here is the definition, which is modeled on definition 36 of a global objective composition.
Definition 100 For a positive integer t, a global t-fold tangent composition over a commutative ring R is defined by the following data:
(i) a set $G$ and a finite, non-empty covering $I$ of $G$,
(ii) a family $(T^t K_u, S_u)_{u \in U}$ of local t-fold tangent compositions $T^t K_u$ over $R$, with bases $K_u$,
(iii) a surjection $h : U \to I : u \mapsto I_u$,
(iv) a bijection [...] for each $u \in U$,
(v) for each couple $u, v \in U$, the induced subcomposition [...] equals the tangent subcomposition of $T^t K_u$ induced on the basis $K_{u,v} = p_{T^t K_u}(T^t K_{u,v})$,
(vi) for each couple $u, v \in U$, the induced bijection [...] is an isomorphism of local tangent compositions.
The data (ii) - (vi) are called an atlas for the covering $I$ of $G$. Two atlases $\Phi, \Psi$ for the covering $I$ of $G$ are called equivalent iff their disjoint union is also an atlas for the covering $I$ of $G$. A global tangent composition over $R$ is a covering $I$ of $G$, together with an equivalence class of atlases, a fact which we abbreviate by the symbol $G^I$ or even by $G$ if the atlases or the covering, respectively, are clear from the context. If two global tangent compositions $G^I, H^J$ over $R$ are given, a t-fold tangent morphism from $G^I$ to $H^J$ is a couple $(f, \iota)$ where
1. $f : G \to H$ is a set map,
2. $\iota : I \to J$ is a set map such that $f(i) \subset \iota(i)$ for all covering sets $i \in I$,
3. for any atlases $(T^t K_u, S_u)_u$ for $G$ and $(L_v, M_v)_v$ for $H$, if we take the chart isomorphisms
Example 52 Let $G^I$ be a zero-addressed objective global composition which is interpretable by a simple ambient space $S$ of module $M$ over the commutative ring $R$. WLOG we have $G \subset M$, and the atlas is $I = \{G_1, \ldots G_n\}$. The transition morphisms [...] theorem can be applied. Call $TG^I$ the tangent interpretation associated with $G^I$. This is an interpretation with ambient space $S \times S$, and the projection $p : TG^I \to G^I$ is a morphism of global compositions. Since the transition morphisms are the identities here, one may also refine the tangent construction to one of differentiable nature.
Figure 32.1: The seven-element global composition of which the three charts are shown here is a Möbius bottle. It may not be extended to a global tangent composition by the obvious colimit of its tangent charts and their intersections since, for example, a tangent vector $t$ is equivalent to its negative $-t$ under the colimit construction.
Exercise 74 For a non-interpretable composition the colimit does not yield a reasonable global tangent composition. The following example illustrates this fact, see also figure 32.1. We consider a global composition $G^I$ whose support
$G = \{g_0, g_1, g_2, g_3, g_4, g_5, g_6\}$ consists of seven different points, and whose covering consists of three 5-element charts $G_i$ having the common point $g_0$:
$G_1 = \{g_0, g_1, g_2, g_5, g_6\}$, $G_2 = \{g_0, g_1, g_2, g_3, g_4\}$, $G_3 = \{g_0, g_3, g_4, g_5, g_6\}$.
They are in bijection with these local compositions $K_i$ in $\mathbb{R}^3$:
$\phi_1 : G_1 \to K_1 : g_0 \mapsto (0,0,0), g_1 \mapsto (0,1,0), g_2 \mapsto (1,1,0), g_5 \mapsto (0,0,1), g_6 \mapsto (1,0,1)$
$\phi_2 : G_2 \to K_2 : g_0 \mapsto (0,0,0), g_1 \mapsto (0,1,0), g_2 \mapsto (1,1,0), g_3 \mapsto (0,0,-1), g_4 \mapsto (1,0,-1)$
$\phi_3 : G_3 \to K_3 : g_0 \mapsto (0,0,0), g_3 \mapsto (1,0,-1), g_4 \mapsto (0,0,-1), g_5 \mapsto (0,-1,0), g_6 \mapsto (1,-1,0)$
The transition isomorphisms are these:
$\phi_{12} = Id$
$\phi_{23} : g_0 \mapsto g_0, (0,0,-1) \mapsto (1,0,-1), (1,0,-1) \mapsto (0,0,-1)$
$\phi_{13} : g_0 \mapsto g_0, (0,0,1) \mapsto (0,-1,0), (1,0,1) \mapsto (1,-1,0)$
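The effect of these transitions on tangent directions can be checked mechanically. The following Python sketch is a hedged illustration, not part of the original text: it assumes the chart coordinates $\phi_1, \phi_2, \phi_3$ as read from the example above, uses the hypothetical helper names `edge` and `transport`, and relies on the fact that the linear part of an affine transition maps the difference of two shared points to the difference of their images.

```python
# Chart coordinates of the seven points, as read from the example above.
K1 = {"g0": (0, 0, 0), "g1": (0, 1, 0), "g2": (1, 1, 0),
      "g5": (0, 0, 1), "g6": (1, 0, 1)}
K2 = {"g0": (0, 0, 0), "g1": (0, 1, 0), "g2": (1, 1, 0),
      "g3": (0, 0, -1), "g4": (1, 0, -1)}
K3 = {"g0": (0, 0, 0), "g3": (1, 0, -1), "g4": (0, 0, -1),
      "g5": (0, -1, 0), "g6": (1, -1, 0)}


def edge(chart, a, b):
    """Direction from point a to point b, seen in the given chart."""
    return tuple(q - p for p, q in zip(chart[a], chart[b]))


def transport(v, src, dst, a, b):
    """Image of the direction v under the transition src -> dst.

    Only valid when v is the edge a -> b between two shared points.
    """
    if v != edge(src, a, b):
        raise ValueError("v is not the shared edge in the source chart")
    return edge(dst, a, b)


t = (1, 0, 0)  # direction of the tangent at g0
# K1 -> K2 along the shared pair (g1, g2), then K2 -> K3 along (g3, g4).
via_K2 = transport(transport(t, K1, K2, "g1", "g2"), K2, K3, "g3", "g4")
# K1 -> K3 directly along the shared pair (g5, g6).
direct = transport(t, K1, K3, "g5", "g6")
```

Here `via_K2` is `(-1, 0, 0)` while `direct` is `(1, 0, 0)`, so the two routes from the first to the third chart disagree by a sign on the same tangent direction.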
In the colimit of $TK_i, TK_{ij}$, the tangent $t = (g_0, (1,0,0)) \in TK_1$ identifies to $T\phi_{12}(t) = t$, and then to $T\phi_{23}(t) = -t \in TK_{32}$, whereas $T\phi_{13}(t) = t \in TK_{32}$, which means $t \sim -t$, i.e., the canonical maps $TK_i \to colim(TK_i, TK_{ij})$ are not injective: gluing tangent spaces is not feasible. Clearly, this phenomenon is due to the non-interpretability of $G^I$; in fact, this is a kind of Möbius bottle on which any global affine function must identify the values of the pairs $(g_1, g_2), (g_3, g_4), (g_5, g_6)$.
Exercise 75 However, if any global composition is such that its nerve is one-dimensional, the colimit construction yields a global tangent composition with the expected chart injections. Give a proof of this fact.
Let us discuss the reason why the three globalization perspectives are included in the above definitions. Clearly, the disjoint union of the interpretation morphisms on different instrumental parts is represented by a disjoint union of morphisms on local (or global, if they exist) tangent compositions on each of the instrumental parameter spaces. Let us give an example of a global gluing operation between local tangent compositions. To keep the discussion simple, let us consider the performance map on the onset axis. Suppose that $G = \{E_0, E_1, E_2\}$ is a sequence of three increasing onsets $E_0 < E_1 < E_2$ which are mapped onto the set of the three physical onsets $g = \{e_0, e_1, e_2\}$, $e_i = P(E_i)$, under the performance map $P : G \to g$. Suppose that the musical conditions are such that one starts with an initial velocity $de/dE|_{E_0} = v_0$, whereas the velocity is $de/dE|_{E_1} = v_1$ when we arrive at onset $E_1$. But the successive path should be started with velocity $de/dE|_{E_1} = v_1^+$, and end on velocity $de/dE|_{E_2} = v_2$. This apparent incompatibility at onset $E_1$ can be resolved by the coverings $I = \{G_1 = \{E_0, E_1\}, G_2 = \{E_1, E_2\}\}$, $i = \{g_1 = \{e_0, e_1\}, g_2 = \{e_1, e_2\}\}$ of $G$, $g$, respectively.
Consider the tangent compositions $TG^I, Tg^i$ of the two interpretable compositions $G^I, g^i$, respectively, see figure 32.2. Consider an extension of the global morphism $TP : TG^I \to Tg^i$ associated with the map $P$ and the above velocity conditions as linear maps on the four tangent spaces $TG_{1,E_0}, TG_{1,E_1}, TG_{2,E_1}, TG_{2,E_2}$ to a given category of maps on the real axis, $C^1$, say. Then we have a "gluing" of two velocities, one from below, one from above, at onset $E_1$.
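A small numerical sketch may make this gluing concrete. It is an illustration under assumptions that are not in the text: each chart is extended by a cubic Hermite polynomial (one possible choice of a $C^1$ extension per chart), and the onsets, physical onsets, and velocities are invented numbers. The name `hermite_chart` is hypothetical.

```python
def hermite_chart(E0, E1, e0, e1, v0, v1):
    """Cubic onset map on [E0, E1] with values e0, e1 and slopes v0, v1."""
    h = E1 - E0

    def value(E):
        s = (E - E0) / h
        return ((2 * s**3 - 3 * s**2 + 1) * e0
                + (s**3 - 2 * s**2 + s) * h * v0
                + (-2 * s**3 + 3 * s**2) * e1
                + (s**3 - s**2) * h * v1)

    def slope(E):
        s = (E - E0) / h
        return ((6 * s**2 - 6 * s) * (e0 - e1) / h
                + (3 * s**2 - 4 * s + 1) * v0
                + (3 * s**2 - 2 * s) * v1)

    return value, slope


# Invented data: onsets E0 < E1 < E2 and their physical onsets.
E = (0.0, 1.0, 2.0)
e = (0.0, 0.5, 1.2)
# Chart G1 arrives at E1 with velocity 0.4, chart G2 leaves E1 with 0.8.
value1, slope1 = hermite_chart(E[0], E[1], e[0], e[1], 0.5, 0.4)
value2, slope2 = hermite_chart(E[1], E[2], e[1], e[2], 0.8, 0.7)
```

Both charts agree on the value at the common onset, but `slope1(E[1])` and `slope2(E[1])` differ, so the glued map is continuous without being $C^1$ at $E_1$. Positivity of the slopes inside each chart is not automatic for cubic Hermite pieces and has to be checked for the data at hand.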
Figure 32.2: The global tangent composition built from two charts $TG_1, TG_2$ of successive tempo regions.
32.4.3
Performance qua Interpretation of Interpretation
Summary. We finally compare the analytical interpretation in the framework of categories Glob of global compositions with associated global performance transformations.
Recalling the analytical interpretation of a given score denotator, this one a priori regards a variety of involved object-types, such as bar-lines, notes, macros, rests, whatever is needed. Such an interpretation is guided by the analytical approach, such as the topological interpretation using local meters, or motives, or chords, as exposed in the respective chapters. In contrast, the performance-guided interpretation of the given score is firstly restricted to special event types. Usually, for example, bar-lines are not subjected to performance transformations. Secondly, the interpretation within a performable event-type is not primarily guided by analytical considerations; rather, it is related to considerations of coherence in the performative process. For example, the agogical variations in a tempo curve, say, can be shaped by complex weight functions as defined via the maximal meter topology or the nerve topology on the composition's nerve. Such a tempo curve will not refer to the complex covering by maximal meters, but define a unique tangent morphism on a single tangent chart for performance. So the interpretations of analytical and performative nature can and usually will be very different. Nonetheless, the latter interpretations are related in a complex way to the analytical background interpretations. We shall devote the following chapters to the explication of these relations which belong, in our belief, to the most fascinating challenges in mathematical music theory.
Chapter 33
Performance Fields
La musique mathématiquement discontinue peut donner les sensations les plus continues. (Mathematically discontinuous music can give the most continuous sensations.) Paul Valéry [538, I]
Summary. Performance fields are the core of an in-depth theory of performance structure. They are a distinguished type of vector fields which give an infinitely precise, i.e., infinitesimal, account of the 'shaping forces' of a given local performance transformation. Although performance fields are not recognized as such in musicology and traditional performance research, they arise in a completely natural way in the traditional context of tempo, intonation, and dynamics. We give a careful account of this basic fact. A closer look at articulation and further sound parameters (apart from onset, pitch and loudness which are used for tempo, intonation, and dynamics) reveals that performance fields should be viewed within a fairly general approach. We define the formal setup. In order to provide a deeper understanding of the semiotic signification process of performance fields, we review the performance philosophies of Theodor W. Adorno, Walter Benjamin, and Diana Raffman.
33.1
Classics: Tempo, Intonation, and Dynamics
Summary. This section recapitulates and analyzes the concepts of tempo, intonation, and dynamics. We take the opportunity to make the point of blurred concepts in musicology, in this case in the sense of a fascinating quest for expressive precision, paired with denial of formal explicitness.
33.1.1
Tempo
Summary. Tempo is one of the best 'known' features of performance. However, its concept is blurred and far from standardized among musicians and performance scientists. We analyze the state of the art and its deficiencies. It is shown that tempo is a local concept which is charged with a large amount of semantics. We proceed in filtering out the semantics from the structural data. After this, a precise definition of tempo as a one-dimensional performance field in the onset axis is given.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
Tempo is the Italian word for "velocity". In musical notation and performance, tempo refers to the pace at which the mental events, typically written on a score or imagined by the performer, are projected into physical reality. It is described in one of two ways. Either one describes the relative velocity or the velocity change with respect to a given absolute tempo situation, typically indicated by verbal annotations such as "accelerando" and "rallentando", or by corresponding pictorial signs. Or else, the tempo is described with an absolute meaning. The prototype of this second situation is Mälzel's Metronome (M.M.), quantified by a quotient such as "quarter = 120", i.e., a quarter note is given the physical duration of 0.5 seconds, resulting in 120 quarters per minute. In contrast to this precise indication, a relative tempo sign within a Mälzel context does not define precisely how much the tempo should change, and in which way this should happen, e.g., by a linear curve or in a quadratic way, etc. The relative tempo signs are massively ambiguous. We shall have to spend quite some time in order to set up a reliable handling of such blurred signs in algorithmic contexts. Absolute tempo is also indicated by more poetic indications, such as "andantino", "prestissimo", "maestoso". Clearly, such wordings are loaded with a good portion of connotation which exceeds the mere tempo and targets emotional refinement, expressed via other parameters, such as articulation (legati, staccati), dynamics (loudness variations) or intonational deformations (on violins or for the human voice). Whatever the specific expression, tempo relates to the transformation of mental onsets to physical onsets. Note that this does not imply that such a transformation is independent of the other event parameters. We shall learn that the performance of onsets may involve all other parameters.
A simple example of such a relation is given by the so-called "Chopin rubato" which lets the left hand perform a "mother" tempo while the right hand performs local deviations of the left-hand tempo in order to generate the effect of temporal tension ("daughter" tempi). In this case, the common onset of left and right hand notes may lead to different images in physical time, as a function of the interpretation of the total score by the left-hand and right-hand charts. Formally, local tempo is related to a tangent morphism $Tf : TK \to Tk$ of local tangent compositions $TK, Tk$ derived from compositions $K \subset Onset$, $k \subset PhysOnset$ in onset spaces. Tempo is then defined iff the linear fiber maps $Tf_X$, $X \in K$ are invertible, and there we define $Tempo_X = (Tf_X)^{-1}$, the inverse slope at $X$. Moreover, tempo is always supposed to be positive, while negative tempi are without evident musical meaning. In such a situation, tangent morphisms are extended to differentiable morphisms with positive, continuous derivatives at every point. The tempo curves associated with such differentiable morphisms are the continuous, positive functions whose values are the inverse derivatives at all points in an open interval containing the points of $K$. Usually, the tangent morphisms are not of immediate interest in performance theory; this means that one is given differentiable morphisms, i.e., the morphisms $F$ inducing tempo curves as their inverse derivatives $Tempo_X = (TF(X))^{-1}$. However, it may then also be required that these functions be differentiable extensions of tangent morphisms, but this is not the mandatory situation.
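Since tempo is the inverse derivative of the onset map, the map can be recovered from a tempo curve and one initial value by integrating the inverse tempo. The following Python sketch illustrates both directions numerically. It is a minimal sketch under assumptions: onsets are measured in quarter notes, physical onsets in seconds, so that M.M. quarter = 120 corresponds to the constant tempo 2; the function names, the Simpson rule, and the central difference quotient are illustrative choices, not taken from the text.

```python
def integrate(g, a, b, n=200):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * g(a + k * h)
    return total * h / 3


def performance_from_tempo(tempo, X0, x0):
    """Onset map F with F(X0) = x0 and derivative 1 / tempo."""
    def F(X):
        return x0 + integrate(lambda Y: 1.0 / tempo(Y), X0, X)
    return F


def tempo_from_performance(F, dX=1e-6):
    """Tempo as inverse slope of F, by a central difference quotient."""
    def tempo(X):
        return 2 * dX / (F(X + dX) - F(X - dX))
    return tempo


# M.M. quarter = 120: two quarters per second, constant over the piece.
mm120 = lambda X: 2.0
F = performance_from_tempo(mm120, X0=0.0, x0=0.0)
four_quarters = F(4.0)            # physical onset of the fifth quarter
recovered = tempo_from_performance(F)(1.0)
```

With the constant tempo 2, four quarters last two seconds, in agreement with the duration of 0.5 seconds per quarter stated above. Any other positive tempo function can be passed in the same way.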
Nonetheless, the music(ologic)al point of view which negates differentiable extensions is not correct: tempo is also present in between the points of $K$, and its structure is essential information about the performance map. Formally speaking, we are dealing with a local composition $K \subset Onset$, a local composition $k \subset PhysOnset$, two closed intervals $OFrame = [A, B] \subset Onset$, $POframe = [a, b] \subset PhysOnset$ such that $K \subset OFrame$, $k \subset POframe$, and a $C^1$ diffeomorphism $F : OFrame \xrightarrow{\sim} POframe$ with positive derivative $TF(X)$ at each point $X \in OFrame$, and such that $F(K) = k$ (and possibly $F$ extending a given tangent morphism $Tf : TK \xrightarrow{\sim} Tk$ if that is required). The tempo field of $F$ on $OFrame$ is the continuous field $Tempo$ with values $Tempo_X = (TF(X))^{-1}$, $X \in OFrame$. By construction, if the performance map $x_0 = F(X_0)$ is defined on any point $X_0 \in OFrame$, the performance $F(X)$ on $X \in K$ is defined by the integral$^1$
$$F(X) = x_0 + \int_{X_0}^{X} \frac{1}{Tempo} \qquad (33.1)$$
of the inverse tempo function. Musically, the "initial value" $x_0 = F(X_0)$ means that the "conductor" defines a starting time $x_0$ of performance from which the remaining performance onsets can be deduced by means of the given tempo curve.
Exercise 76 With the above notation, calculate the onset function $F$ for these tempo types: (1) $Tempo_X = q_0 + q_1 X$, (2) $Tempo_X = q_0 + q_1 X + q_2 X^2$, (3) $Tempo_X = e^X$, (4) $Tempo_X = 2 + \sin(X)$. Discuss the possibilities of coping with these tempo types with tangent conditions.
In general, the tempo must also cope with more than one given performance value, $x_0 = F(X_0), x_1 = F(X_1), \ldots x_t = F(X_t)$, and then, we have to ask for the conditions
$$x_j - x_i = \int_{X_i}^{X_j} \frac{1}{Tempo} \qquad (33.2)$$
for $i, j = 0, 1, \ldots t$ to guarantee compatibility of the tempo integral with the given values.
33.1.2
Intonation
Summary. Intonation deals with pitch just as tempo deals with onset time. The definition of intonation as a one-dimensional performance field in the pitch axis is given.
Intonation, including the specialization of tuning, deals with the relation between mental pitch and physical pitch. It is analogous to tempo insofar as it suggests a map from the pitch space $Pitch$ to the pitch space $PhysPitch$. But the situation is not so easy if we observe the common pitch spaces in music theory. Recall that symbolic pitch is usually represented in the Euler module space $EulerModule$, and that we have a space morphism $E2M : EulerModule \to MathPitch$ defined in equation (6.28). This is an injective $\mathbb{Q}$-linear map on the supporting modules $\mathbb{Q}^3$, and $\mathbb{R}$, respectively. If we want to apply calculus to this situation, we have to extend the scalars to real numbers and then consider the induced map $E2M \otimes \mathbb{R} : Euler\mathbb{R}Module \to Pitch$ with underlying vector spaces $\mathbb{R}^3$, and $\mathbb{R}$, respectively. This latter morphism is no longer injective (in fact: surjective with two-dimensional kernel, see formula (6.27)). If we consider the performance map composed from the given performance of pitch, $p : Pitch \to PhysPitch$, and the above $E2M \otimes \mathbb{R}$, we are confronted with a performance map $p \circ E2M \otimes \mathbb{R}$ from Euler space to physical pitch which is far from being a diffeomorphism. This cannot be repaired since there is no physical space corresponding to the Euler construction, at least not in the simple physical pitch dimension. The solution could consist in a construction of a direct performance map from $Euler\mathbb{R}Module$ to a physical "pitch" space of higher dimension. This one would evidently not restrict to mere physical pitch (logarithm of frequency, see appendix A.2.3), but include other parameters, such as sound color or the like. This is however an open problem in performance research: How should we perform Euler space pitch? We will stick to the state of the art and build pitch performance upon the maps of type $F : Pitch \to PhysPitch$. Such maps should be the $C^1$-extensions of tangent maps $Tf : TS \to Ts$ over bijective basis maps $f : S \to s$. Here, $S \subset Pitch$, $s \subset PhysPitch$ are a number of pitches and corresponding physical pitches. This situation is formally equivalent to the tempo situation. We may indeed take over mutatis mutandis the statements and formulas established for tempo as follows.
1 We follow the Douady notation and do not write the old-fashioned infinitesimal "dY" in the integrand if this one is clear.
We are given a mental pitch frame $PFrame = [U, V]$, a physical pitch frame $PPframe = [u, v]$, inclusions $S \subset PFrame$, $s \subset PPframe$, and a $C^1$-diffeomorphism $G : PFrame \xrightarrow{\sim} PPframe$, the intonation curve, with positive derivative $TG(X)$ for every $X \in PFrame$ (and possibly extending a given tangent morphism $Tg : TS \xrightarrow{\sim} Ts$ if that is required). The intonation field of $G$ on $PFrame$ is the continuous field $Intonation$ with values $Intonation_X = (TG(X))^{-1}$, $X \in PFrame$. By construction, if the performance map $x_0 = G(X_0)$ is defined on any point $X_0 \in PFrame$, the performance $G(X)$ on $X \in S$ is defined by the integral
$$G(X) = x_0 + \int_{X_0}^{X} \frac{1}{Intonation} \qquad (33.3)$$
of the inverse intonation function. Musically, the "initial value" $x_0 = G(X_0)$ means that the piece is played from a starting pitch, i.e., the "chamber pitch" $x_0$ of performance from which the remaining performance pitches can be deduced by means of the given intonation curve. In general, the intonation must also cope with more than one given performance value, $x_0 = G(X_0), x_1 = G(X_1), \ldots x_t = G(X_t)$, and then, we have to ask for the conditions
$$x_j - x_i = \int_{X_i}^{X_j} \frac{1}{Intonation} \qquad (33.4)$$
for $i, j = 0, 1, \ldots t$ to guarantee compatibility of the intonation integral with the given values. This is, in particular, the case when we are given a specific intonation between semitones in a fixed tuning mode, such as well-tempered or just tempered tunings. Again, musicological approaches only deal with intonation values on the given tangent composition $TS$, while the intermediate values of the intonation curve are neglected (or, in the worst case, even negated). However, if glissando effects are present, discrete intonation fails to give information about the intermediate values, much like the intermediate tempo is necessary to perform glissandi in their development along the time axis!
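The compatibility conditions between prescribed performance values and a given field can be checked numerically. The sketch below is a hedged illustration, not the author's procedure: mental pitch is counted in semitone steps and physical pitch in cents (a logarithmic frequency unit), the constant field and the two sets of prescribed values are invented examples, and `compatible` is a hypothetical helper. Because the integral is additive, checking successive pairs implies the condition for all pairs $i, j$.

```python
import math


def integrate(g, a, b, n=200):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * g(a + k * h)
    return total * h / 3


def compatible(field, X, x, tol=1e-6):
    """Check x[i+1] - x[i] == integral of 1/field over [X[i], X[i+1]].

    X: mental values, x: prescribed physical values, in the same order.
    """
    return all(
        abs((x[i + 1] - x[i])
            - integrate(lambda Y: 1.0 / field(Y), X[i], X[i + 1])) <= tol
        for i in range(len(X) - 1)
    )


# Constant field: every semitone step is performed as 100 cents.
tempered = lambda X: 0.01
X = [0.0, 7.0, 12.0]                      # tonic, fifth, octave
equal_fifth = [0.0, 700.0, 1200.0]        # equal-tempered values
just_fifth = [0.0, 1200.0 * math.log2(1.5), 1200.0]  # pure fifth 3:2

fits_equal = compatible(tempered, X, equal_fifth)
fits_just = compatible(tempered, X, just_fifth)
```

The constant field is compatible with the equal-tempered values but not with the pure fifth, which lies about two cents higher; a non-constant intonation curve between the prescribed pitches would be needed in that case.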
33.1.3
Dynamics
Summary. Dynamics deals with loudness just as tempo deals with onset time, and intonation deals with pitch. The definition of dynamics as a one-dimensional performance field in the loudness axis is given.
Dynamics is the physical shaping of loudness symbols such as jJ, m1, ppp, mp, sf and words such as sforzato, meno forte, diminuendo, crescendo. In this classical setup of score notation, the scope is much less quantified than with tempo or intonation. There is no such norm as Miilzel's metronome or the chamber pitch in dynamics. Also, the shaping of dynamics is dramatically different, i.e., refined, with respect to the written prescriptions. It may happen that a section is written in mezzo forte, but within this section, the performance of mezzo forte is quite variable, within a certain tolerance bandwidth of what mezzo forte can be felt. However, just for these reasons, it is much more accepted by musicologists that dynamics is a continuous phenomenon when compared to tempo and intonation. Sticking again to the state of the art, we build loudness performance upon the maps of type F : Loudness ----> PhysLoudness. Such maps should be the Cl-extensions of tangent maps Tf : TL ----> Tl over bijective basis maps f : L ----> l. Here, L c Loudness,l C PhysLoudness are a number of loudness values and corresponding physical loudness (logarithms of pressure amplitude, see appendix A.2.2). This situation is also formally equivalent to the tempo situation. We may indeed take over mutatis mutandis the statements and formulas established for tempo as follows. We are given a mental loudness frame LFrame = [U, V], a physical loudness frame PLframe = [u, v], inclusions L C LFrame, l C PLframe, and a Cl-diffeomorphism H : LFrame ~ PLframe, the dynamics curve, with positive derivative TH(X) for every X E LFrame (and possibly extending to a given tangent morphism Th : TL ~ Tl if that is required). The dynamics field of H on LFrame is the continuous field Dynamics with values Dynamicsx = TH(X)-l,X E LFrame. By construction, if the performance map Xo = H(Xo) is defined on any point Xo E LFrame, the performance H(X) on X E L is defined by the integral
\[ H(X) = x_0 + \int_{X_0}^{X} \frac{1}{\mathit{Dynamics}} \tag{33.5} \]
of the inverse dynamics function. Musically, the "initial value" $x_0 = H(X_0)$ means that the piece is played from a starting dynamics, i.e., the "mezzo forte" $x_0$ of performance from which the remaining performance dynamics can be deduced by means of the given dynamics curve. In general, the dynamics must also cope with more than one given performance value, $x_0 = H(X_0), x_1 = H(X_1), \ldots, x_t = H(X_t)$, and then we have to require the conditions
\[ x_j - x_i = \int_{X_i}^{X_j} \frac{1}{\mathit{Dynamics}} \tag{33.6} \]
for $i, j = 0, 1, \ldots, t$ to guarantee compatibility of the dynamics integral with the given values. This is, in particular, the case when a specific dynamics range between successive symbols, such as ppp, mpp, pp, mp, p, mf, f, mff, fff, is required for acoustical reasons. In the precise technical
sense this is a rare situation which occurs more in studio situations than in live performances. But also in common human performance (whatever "common" means in the age of complex man-machine interfaces), it may occur that one musician or instrument dictates the dynamical ranges by an individual approach, and that the other musicians in the orchestra have to cope with these constraints. For example, the enhanced dynamics of a hard-hitting drummer can dramatically drive the dynamical range of a whole jazz band.
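As a minimal numerical sketch of formulas (33.5) and (33.6), the following Python fragment integrates the inverse of a dynamics field and tests whether several prescribed loudness values are compatible with it. The concrete field `dynamics`, the frame [0, 10], the anchor values, and all function names are assumptions chosen for illustration only; they are not taken from the text.

```python
import math

def integrate(f, a, b, n=2000):
    """Composite Simpson rule for the integral of f from a to b (n must be even)."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def perform_loudness(dynamics, X0, x0, X):
    """Formula (33.5): H(X) = x0 + integral from X0 to X of 1/Dynamics."""
    return x0 + integrate(lambda s: 1.0 / dynamics(s), X0, X)

def compatible(dynamics, anchors, tol=1e-6):
    """Condition (33.6) for every pair of prescribed values (X_i, x_i)."""
    return all(
        abs((xj - xi) - integrate(lambda s: 1.0 / dynamics(s), Xi, Xj)) < tol
        for Xi, xi in anchors
        for Xj, xj in anchors
    )

# Hypothetical dynamics field: positive and continuous on LFrame = [0, 10].
def dynamics(X):
    return 1.0 / (2.0 + math.cos(X))

X0, x0 = 0.0, 40.0  # illustrative anchor, e.g. a "mezzo forte" at 40 (arbitrary units)
x5 = perform_loudness(dynamics, X0, x0, 5.0)
good = compatible(dynamics, [(X0, x0), (5.0, x5)])        # second anchor lies on the curve
bad = compatible(dynamics, [(X0, x0), (5.0, x5 + 3.0)])   # second anchor is off the curve
```

For this particular field the integral has the closed form $H(X) = 40 + 2X + \sin X$, which gives a simple check of the numerical result. A second anchor that does not lie on this curve violates (33.6), so the field would have to be reshaped before both values can be respected.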
33.2
Genesis of the General Formalism
Summary. Tempo, intonation and dynamic fields are one-dimensional special cases of performance fields. We discuss the a priori two-dimensional performance field of tempo and articulation. From this special case, a general type of performance fields is deduced and motivated by a set of representative examples. We conclude the section with a rigorous definition of the concept of a performance field.
If we consider a local composition $L \subset \mathit{Onset} \oplus \mathit{Pitch}$ (see section 6.4.1 for this space), the performance of such a composition is a bijection $h : L \xrightarrow{\sim} l$ with codomain $l \subset \mathit{PhysOnset} \oplus \mathit{PhysPitch}$. Suppose that this bijection is induced by bijections on the onset and pitch axes as described above. With the above notation, we consider the projections $p_{\mathit{Onset}} : \mathit{Onset} \oplus \mathit{Pitch} \to \mathit{Onset}$, $p_{\mathit{Pitch}} : \mathit{Onset} \oplus \mathit{Pitch} \to \mathit{Pitch}$. Consider the two projections $L \to K = p_{\mathit{Onset}}(L)$ and $L \to S = p_{\mathit{Pitch}}(L)$. Suppose that these projections are performed conforming to the above rules, i.e., there are frames $\mathit{OFrame}$, $\mathit{PFrame}$ with $K \subset \mathit{OFrame}$ and $S \subset \mathit{PFrame}$. Suppose also that we have diffeomorphisms $F$, $G$ as above. Let our performance $h$ be defined by the projection morphisms, i.e., for $(X, Y) \in L$, $h(X, Y) = (F(X), G(Y))$. This means that we are given a product $C^1$-diffeomorphism $H = F \times G : \mathit{OFrame} \times \mathit{PFrame} \xrightarrow{\sim} \mathit{POframe} \times \mathit{PPframe}$ which restricts to $h$ on $L$. Evidently, the two one-dimensional vector fields $\mathit{Tempo}$, $\mathit{Intonation}$ induce a two-dimensional vector field $\mathit{Tempo} \times \mathit{Intonation}$ on the product frame $\mathit{OFrame} \times \mathit{PFrame}$, see also figure 33.1. Whereas the factor fields are derived from the tangent morphisms $TF$, $TG$, the product vector field is evidently not derived directly from the tangent morphism $TH$. In fact, the latter is a two-dimensional linear transformation
\[ TH_{X,Y} = TF_X \times TG_Y \]
and not a vector. We recognize however that the product field verifies this equation:
\[ \mathit{Tempo}_X \times \mathit{Intonation}_Y = (TH_{X,Y})^{-1}.\Delta \tag{33.7} \]
where $\Delta = (1, 1)$ is the diagonal unit vector in the tangent space at the $H$-image $(x, y)$ of $(X, Y)$. This means that the tempo-intonation field is derived from the diffeomorphism $H$, which extends the map $h$ on the underlying composition $L$, via the inverse image of the diagonal field $\Delta$ on the image product frame $\mathit{POframe} \times \mathit{PPframe}$.
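Equation (33.7) can be checked numerically. The following sketch assumes two illustrative diffeomorphisms `F` and `G` (not taken from the text), approximates the Jacobian of $H = F \times G$ by central differences, applies its inverse to the diagonal vector, and leaves the comparison with the pair of one-dimensional fields to the reader or to a test.

```python
import math

# Hypothetical one-dimensional performances with positive derivatives.
def F(X):
    return X + 0.3 * math.sin(X)        # F'(X) = 1 + 0.3 cos(X) > 0

def G(Y):
    return 2.0 * Y + 0.5 * math.cos(Y)  # G'(Y) = 2 - 0.5 sin(Y) > 0

def H(X, Y):
    return (F(X), G(Y))

def tempo(X):
    return 1.0 / (1.0 + 0.3 * math.cos(X))   # Tempo_X = TF_X^{-1}

def intonation(Y):
    return 1.0 / (2.0 - 0.5 * math.sin(Y))   # Intonation_Y = TG_Y^{-1}

def jacobian(func, X, Y, eps=1e-6):
    """Central-difference Jacobian of a map R^2 -> R^2, as rows [[a, b], [c, d]]."""
    fxp, fxm = func(X + eps, Y), func(X - eps, Y)
    fyp, fym = func(X, Y + eps), func(X, Y - eps)
    return [[(fxp[0] - fxm[0]) / (2 * eps), (fyp[0] - fym[0]) / (2 * eps)],
            [(fxp[1] - fxm[1]) / (2 * eps), (fyp[1] - fym[1]) / (2 * eps)]]

def field_from_map(func, X, Y):
    """Inverse Jacobian of func at (X, Y), applied to the diagonal vector (1, 1)."""
    (a, b), (c, d) = jacobian(func, X, Y)
    det = a * d - b * c
    # The inverse of [[a, b], [c, d]] is (1/det) * [[d, -b], [-c, a]].
    return ((d - b) / det, (a - c) / det)

u, v = field_from_map(H, 1.2, 0.7)   # compare with (tempo(1.2), intonation(0.7))
```

Since the Jacobian of a product map is diagonal, the result reduces to the pair of inverse derivatives. The same helper `field_from_map` applies unchanged to maps that are not products, which is the situation of the following section.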
Figure 33.1: The two one-dimensional vector fields Tempo, Intonation induce a two-dimensional vector field Tempo × Intonation on the product frame OFrame × PFrame. (Horizontal axis: Onset, carrying OFrame and the onset projection K; vertical axis: Pitch, carrying PFrame.)
33.2.1
The Question of Articulation
Summary. Special attention is given to the delicate relation between tempo and articulation. It reveals that one-dimensional performance fields are a priori insufficient to describe local performance transformations. Articulation requires a priori two-dimensional fields.
It may appear as if the two-dimensional tempo-intonation field were an artificial generalization of an essentially one-dimensional situation. This is however not the case, as we now show for the two-dimensional performance on the plane of onset and duration. The situation is as follows. Suppose that we have a local composition $K \subset \mathit{Onset} \oplus \mathit{Duration}$ which should be performed with respect to a tempo performance of the onset projection. More precisely, call such a performance $F(E, D)$, $(E, D) \in K$, and recall the sweeping alterator already used in counterpoint (and introduced in section 7.5):
\[ \alpha_+ : \mathit{Onset} \oplus \mathit{Duration} \to \mathit{Onset} : (E, D) \mapsto E + D \]
with the omission of the dual number formalism here because we do not need this algebraic enrichment of structure in performance theory. Suppose that the onset projection $K_O = p_{\mathit{Onset}}(K)$ of $K$ is performed according to a performance map $f : K_O \to k_O$ onto an image set $k_O$. Where should we map the duration components? The canonical recipe is to consider the "offsets" of our events, i.e., for $(E, D) \in K$, we take its alteration $\mathit{Off} = \alpha_+(E, D) = E + D$ and look for its image $f(\mathit{Off})$. We may then calculate the duration $d$ in the image $(e, d) = F(E, D)$ by the formula $d = f(\mathit{Off}) - f(E)$. This is however not well defined if we do not know the image of onsets outside $K_O$! This is another strong argument for the existence of tempo outside a composition's onsets. So we have to embed the onset performance $f$ in a $C^1$-differentiable extension, to be defined on an onset frame $\mathit{Frame}$ as above. Call this extension again $f$. Observe that $\mathit{Frame}$ must now also contain the local composition of alterates $\alpha_+(K)$, i.e.,
\[ K_O \cup \alpha_+(K) \subset \mathit{Frame}. \tag{33.8} \]
Figure 33.2: This parallel articulation field $\partial\mathit{Tempo}$ is derived from the tempo curve $\mathit{Tempo}_E = 1 + 0.4\sin(E)$. The horizontal axis is onset, the vertical axis is duration.

With these conventions, we may calculate the tangent morphism $TF$ associated with the map $F(E, D) = (e(E, D), d(E, D)) = (f(E), f(E + D) - f(E))$. The tangent map at $(E, D)$ is described by the Jacobian
\[ JF(E,D) = \begin{pmatrix} \partial_E e & \partial_D e \\ \partial_E d & \partial_D d \end{pmatrix} = \begin{pmatrix} 1/\mathit{Tempo}_E & 0 \\ 1/\mathit{Tempo}_{E+D} - 1/\mathit{Tempo}_E & 1/\mathit{Tempo}_{E+D} \end{pmatrix} \]
which yields the inverse value
\[ (JF(E,D))^{-1} = \begin{pmatrix} \mathit{Tempo}_E & 0 \\ \mathit{Tempo}_{E+D} - \mathit{Tempo}_E & \mathit{Tempo}_{E+D} \end{pmatrix} \]
whence the inverse image vector field of the constant diagonal field $\Delta$ on the physical plane:
\[ (JF(E,D))^{-1}.\Delta = \begin{pmatrix} \mathit{Tempo}_E \\ 2\,\mathit{Tempo}_{E+D} - \mathit{Tempo}_E \end{pmatrix} = \partial\mathit{Tempo}_{E,D} \tag{33.9} \]
which we call the parallel field $\partial\mathit{Tempo}$ of articulation since the duration component is calculated in parallel to the onset by use of offset values. Figure 33.2 shows such a parallel field. We see that the direction of such an articulation field ranges from $-\pi/4$ to $\pi/2$. From this example, we learn that the performance field of the parallel articulation map on the onset-duration plane is not a product of one-dimensional fields, even under the completely innocuous assumption of durations being induced by offset data.
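The parallel field of formula (33.9) is easy to evaluate for the tempo curve of figure 33.2. The following sketch computes it in two ways, once by the closed formula and once by solving the Jacobian system directly, and also returns the direction angle of the field. The function names are hypothetical; only the tempo curve $1 + 0.4\sin(E)$ is taken from the text.

```python
import math

def tempo(E):
    """Tempo curve of figure 33.2."""
    return 1.0 + 0.4 * math.sin(E)

def parallel_field(tempo, E, D):
    """Closed formula (33.9): (Tempo_E, 2*Tempo_{E+D} - Tempo_E)."""
    return (tempo(E), 2.0 * tempo(E + D) - tempo(E))

def field_via_jacobian(tempo, E, D):
    """Solve JF(E, D).(u, v) = (1, 1) for the lower triangular Jacobian of F."""
    a = 1.0 / tempo(E)                        # partial derivative of e in E
    c = 1.0 / tempo(E + D) - 1.0 / tempo(E)   # partial derivative of d in E
    d = 1.0 / tempo(E + D)                    # partial derivative of d in D
    u = 1.0 / a                               # first row:  a*u = 1
    v = (1.0 - c * u) / d                     # second row: c*u + d*v = 1
    return (u, v)

def direction(tempo, E, D):
    """Angle of the parallel field against the onset axis, in radians."""
    u, v = parallel_field(tempo, E, D)
    return math.atan2(v, u)
```

For zero duration the field is diagonal, with direction $\pi/4$. The lower bound $-\pi/4$ reflects that the sum of the two components equals $2\,\mathit{Tempo}_{E+D} > 0$, and the upper bound $\pi/2$ that the onset component $\mathit{Tempo}_E$ is positive.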
Exercise 77 Calculate the performance field of articulation if the performance map has a dilatation by $0 < \lambda$ in the duration component, i.e., $F(E, D) = (f(E), \lambda(f(E + D) - f(E)))$.

The example of a parallel field in articulation can be taken over to the pitch domain: Instead of duration we have to think of glissando. This is the proportion of pitch shift with respect to the pitch coordinate if the pitch at the end of an event must differ from the onset pitch. This generates the parallel glissando performance field $\partial\mathit{Intonation}$ with exactly the same formalism as for articulation. The same method can be applied to generate a parallel crescendo field $\partial\mathit{Dynamics}$ if $\mathit{Dynamics}$ is the field associated with loudness performance. The details are left to the reader.

Whenever we deal with basis and pianola spaces, we use this notation: If $B = B_1 \oplus \ldots \oplus B_k$ is a product space of basis parameters (such as onset, pitch, loudness), and if $P = P_1 \oplus \ldots \oplus P_k$ is the product space of corresponding pianola parameter spaces, we denote by $\alpha_+$ the alteration map $B \oplus P \to B$ defined by $(b, p) \mapsto b + p$ on each basis-pianola component, whereas $p_B$ is the first projection. So parallel performance maps and the corresponding parallel performance fields are defined by use of the alteration $\alpha_+$. If the basis field at the basis point $X$ is $Z_X$, the parallel field at the basis-pianola point $Q$ is
\[ \partial Z_Q = (Z_{p_B(Q)},\; 2 Z_{\alpha_+(Q)} - Z_{p_B(Q)}) \tag{33.10} \]
a linear operator in $Z$. We shall use this operator not only for basis-pianola couples, but in general situations of a direct product of isomorphic simple spaces $B$, $P$ with the associated alteration map.
33.2.2
The Formalism of Performance Fields
Summary. This section puts the previous considerations and special cases into a generic formalism, the concept of a performance field in general musical parameter spaces.
The general situation is this. We suppose that a local composition $K \subset S$ is contained in a closed rectangle $R = [a_1, b_1] \times \ldots \times [a_n, b_n]$ of the underlying real $n$-space. We also suppose that a bijective performance map $\wp : K \to \wp(K)$ onto a local composition $\wp(K) \subset PS$ in a simple physical space $PS$ of the same dimension $n$ is given, and that this map can be extended to a $C^1$-diffeomorphism $\wp : R \to \wp(R)$, i.e., a $C^1$-diffeomorphism which is defined on a neighborhood of $R$. The performance field of this diffeomorphism is a continuous vector field² $\mathit{Ts}$ on $R$ which is defined by the inverse Jacobian (33.11) applied to the diagonal unit vector $\Delta = (1, \ldots, 1)$, i.e., $\mathit{Ts}_\wp = (T\wp)^{-1}.\Delta.\wp$. If $x_0$ is a point in $PS$, the integral curve³ $\int_{x_0}\Delta$ of $\Delta$ through $x_0$ evaluates to $\int_{x_0}\Delta(t) = x_0 + t.\Delta$. For the existence and uniqueness of maximal integral curves of $\mathit{Ts}_\wp$, we suppose that the performance field $\mathit{Ts}_\wp$ is locally Lipschitz⁴. Then, evidently, integral curves of $\mathit{Ts}_\wp$ are transformed into integral curves of the diagonal field under $\wp$. Therefore, if $X \in R$, if $X_0 = \int_X \mathit{Ts}_\wp(t)$, and if the image $x_0 = \wp(X_0)$ is known, then we have
\[ \wp(X) = x_0 - t.\Delta. \tag{33.12} \]
This means that the performance map $\wp(X)$ can be calculated via the integral curve $\int_X \mathit{Ts}_\wp$ of the performance field if there is at least one point on such a curve for which the performance $x_0 = \wp(X_0)$ is known. This generalizes what we have already learned for tempo curves and initial performances on selected onsets. Therefore, performance can be calculated from a Lipschitz-continuous performance field on all points $X$ whose integral curves hit a set $\mathit{Initial} \subset R$ of points whose performance is known in advance. We may now forget about the performance map $\wp$ and start from the performance field and a 'good' initial set, defining the performance map via the integral curves and equation (33.12). This performance map will usually only be calculated on the local composition $K \subset R$, or at least for a selected set of points of that frame, and not for every point in $R$, but this is exactly what we want, see figure 33.3.
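The procedure just described, following the integral curve of the field from a point until it hits the initial set and then applying formula (33.12), can be sketched in a few lines of Python. Everything concrete here is an assumption made for illustration: the Runge-Kutta integrator, the description of the initial set as the zero set of a function `g`, the test map $\wp(E, L) = (E + L^2/2, L)$, and the requirement that the caller chooses the sign of the step `h` (forward or backward along the curve).

```python
import math

def rk4_step(field, X, h):
    """One classical Runge-Kutta step of size h along the vector field."""
    k1 = field(X)
    k2 = field([x + 0.5 * h * k for x, k in zip(X, k1)])
    k3 = field([x + 0.5 * h * k for x, k in zip(X, k2)])
    k4 = field([x + h * k for x, k in zip(X, k3)])
    return [x + h * (a + 2 * b + 2 * c + d) / 6.0
            for x, a, b, c, d in zip(X, k1, k2, k3, k4)]

def hit_initial_set(field, X, g, h=0.01, max_steps=100000):
    """Follow the integral curve through X until g changes sign; return (X0, t)."""
    t, cur = 0.0, list(X)
    if g(cur) == 0.0:
        return cur, t
    for _ in range(max_steps):
        nxt = rk4_step(field, cur, h)
        if g(cur) * g(nxt) <= 0.0:
            lo, hi = 0.0, h                 # bisect the length of the last step
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                if g(cur) * g(rk4_step(field, cur, mid)) <= 0.0:
                    hi = mid
                else:
                    lo = mid
            return rk4_step(field, cur, hi), t + hi
        cur, t = nxt, t + h
    raise RuntimeError("integral curve did not hit the initial set")

def perform(field, X, g, initial_performance, h=0.01):
    """Formula (33.12): the performance of X is x0 - t.Delta."""
    X0, t = hit_initial_set(field, X, g, h)
    x0 = initial_performance(X0)
    return [c - t for c in x0]

# Hypothetical example: the map (E, L) -> (E + L**2 / 2, L) has the field (1 - L, 1).
def field(X):
    E, L = X
    return (1.0 - L, 1.0)

def on_axis(X):
    return X[1]              # the initial set is the line L = 0

def initial_performance(X0):
    return (X0[0], 0.0)      # known performance on the initial set

x = perform(field, (1.0, 2.0), on_axis, initial_performance, h=-0.01)
```

In this example the curve through $(1, 2)$ reaches the initial set at the negative parameter $t = -2$, and (33.12) returns $(3, 2)$, which agrees with the test map. Points whose curves never meet the initial set raise an error, mirroring the restriction to points "whose integral curves hit" the set $\mathit{Initial}$.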
33.3
What Performance Fields Signify
Summary. The very complexity of performance fields parallels a strong impact on semantic layers of performance. We want to lay bare this crucial relation. In a first approach, we discuss the contributions of Adorno, Benjamin, and Raffman to the very ineffability of performance nuances. We then deduce the adequacy of performance fields to deal with this "ineffability" and to control it on the level of the sophisticated language of calculus. Finally, we deal with the tension between structural and performative parameters in music. We expose and discuss Helga de la Motte's thesis that, historically, there is an increasing number of performance parameters being transformed into structural score parameters.
2 Ts stands for German "Tempo-Stimmung" and is symbolized by the Hebrew letter "tsadeh".
3 For integral curves of vector fields, see appendix 1.2.3.
4 See appendix 1.2.1.
Figure 33.3: Performance $x = \wp(X)$ can be defined upon the performance field $\mathit{Ts}$ and on an initial set (left polygon) where the performance is known in advance.
33.3.1
Th.W. Adorno, W. Benjamin, and D. Raffman
Summary. Adorno and Benjamin [110] have associated performative adequacy with an activity of "infinitesimal precision". We make plausible that their language suggests the language of vector fields, though this is not explicitly stated by these authors. Diana Raffman's argument for ineffability of musical nuances in performance [432] is discussed. We relate this admission of ineffability to the search for a powerful language as an extension of the powerless common language.
The previous discussion of musicological approaches to performance might have given the impression that in general, musicologists share the tendency to oversimplify the complexity of performance, be it in discrete tempo concepts, be it in a correspondingly simplistic understanding of intonation. In fact, the usual understanding of articulation is no better than that. It is defective to the point of not realizing that performance of duration is related to the onset performance plus some deformation of the duration according to articulation rules. We have not encountered any such structural description to date; worse, discussions about these phenomena have been dominated by a complete ignorance of this kind of effect. However, on a non-quantitative level, very intelligent observations have been advanced by the most sophisticated theorists of performance: Theodor W. Adorno and Walter Benjamin in [7]. Here is their basic text, which introduces the "micrological procedure":
(Dieses mikrologische Verfahren) darf nicht als ein dem künstlerisch produktiven Entgegengesetztes verstanden werden. Walter Benjamin hat 'das Vermögen der Phantasie' als 'die Gabe, im unendlich Kleinen zu interpolieren' definiert. Das beleuchtet blitzhaft die wahre Interpretation. Der Forderung, Phantasie, als Medium des Lebens der Werke, und Genauigkeit als das ihrer Dauer, zu vereinen,
der Grundfrage, welcher der verantwortliche Interpret sich gegenüber sieht, wird genügt nur durch den gebannten und bannenden Blick auf den Notentext der Werke. In seinem dicht gewobenen Zusammenhang sind die minimalen Hohlräume zu entdecken, in denen sinnverleihende Interpretation ihre Zuflucht findet. (...) Das Medium künstlerischer Phantasie ist nicht ein Weniger an Genauigkeit sondern das noch Genauere.

As the wording is chosen, micrologic is a logic in the smallest dimensions of a composition and its performance. The text suggests that this procedure could be misunderstood as opposed to artistic fantasy. Adorno evokes Benjamin's observation that fantasy is involved in the infinitely small ("im unendlich Kleinen"), more precisely in the interpolation towards the infinitely small. For Adorno, this is a revealing insight into the ultimate, true performance. Adorno asks for the discovery and inspection of the innermost interspaces ("sind die minimalen Hohlräume zu entdecken"). Infinite interpolation is the tool to do so. And this is not contrary to artistic fantasy; it is, so to speak, the strongest microscopic instrument we have, and should use. Artistic fantasy is not pseudo-romantic blurredness, but a maximum of precision, of intensity and interplay of minimal movements and forces. This absolutely central insight of Adorno and Benjamin is not only astonishing in the musicological environment (though not as a category of Adorno's and Benjamin's discourse), it is also a very problematic approach insofar as the humanities, where their text belongs, do not have any means of making such allusions precise. The text is a kind of schizophrenic claim of non-mathematical experts in the words of mathematical concept frameworks: interpolation, infinitely small, etc. To the mathematically trained, the allusion to calculus is straightforward. No doubt, the language of the infinitely small is calculus. Is it this kind of language which Adorno and Benjamin were aiming at?
What is intriguing is that they are talking of infinite interpolation. Between what? The score is a radically discrete symbolism. The infinite interpolation is not a priori inscribed into the score structure. And what are those cavities ("Hohlräume") in terms of music parameters and processes? In between the discrete score events, there must be some infinitely divisible space which encompasses the cavities that Adorno and Benjamin are zooming in on and penetrating. A solution of this conceptual approach could in fact be the continuous and differentiable interpolation suggested in the previous considerations of extensions of local compositions and maps. We claim that our theory is the mathematically adequate concept framework for the Adorno-Benjamin approach. More precisely, performance fields which include infinitesimal information about the performance process "between the score units" seem to conceptualize this world of infinitely refined reading of what is happening in the cavities of time, pitch, loudness, articulation, glissandi, and crescendi. Without anticipating the expressive power of performance fields, it appears that performance fields in their very rich structure could englobe a deep semantic richness towards human expression of all the intentions which human performance commits. We have to imagine that, given a score with its discrete event set, the layer of a performance field which is superposed on that score adds an infinitely fine interpolation in the sense of Adorno-Benjamin. It is like an optical lens system which deforms the "mechanical, rigid" score data into a rhetorical expression of the interpreter's understanding.
Performance fields implement a powerful language of performative rhetorics which transcends the discrete vocabulary of common scores. This opens a substantial discussion about ineffability in music. Recall that musical performance is still a strong argument for the ineffability of musical reality: In [432], Diana Raffman has argued that ineffability is a characteristic feature of musical expression, and that this is related to the quale objects as defined by Clarence Irving Lewis in [302]. Qualia are those qualities of immediate human experience which cannot be conceptualized and are of private, individual, irreproducible and antilexical nature, such as colors, sounds, hunger, anger, sadness, or happiness. Raffman argues that musical experience is strongly related to qualia and therefore shares strongly ineffable characteristics. However, ineffability means escaping the power of language. And this is the critical point: which language is escaped and transcended, and what is the boundary to effability? Clearly, the infinitely small and the infinite interpolation are such ineffabilities to the common language. But they are not to the mathematical language of calculus. Ineffability is a relative concept and not a static verdict. This means that ineffability is a challenge for language extension: Are we able to find a richer language which captures phenomena which were hitherto ineffable?

Principle 24 We argue that performance fields are precisely such an extension of the music description language which turns ineffable instances of musical expressivity into regions which may be controlled by such a language enrichment.

After all, vector fields are a very romantic subject: The experience of wind and weather, of stormy rains, of water streams and lava breakouts is a valid metaphor of the forces and processes of our souls.
It is not miraculous that performance fields are the exact counterpart of musical expressivity in its most refined appearance as preconized by Adorno and Benjamin in their visionary text.
33.3.2
Towards Composition of Performance
Summary. Helga de la Motte has hypothesized [122] that, historically, there is an increasing number of performance parameters being transformed into structural score parameters. We discuss this argument and sketch its consequences for the composition of performance fields; a language reflecting such refined performance data must be radically different from the known "digital" language that is common in western score notation.
It is undeniable that from the early days of neumes to the present, or at least, to the classical European notation, an increasing number of music parameters can be observed in the score notation. For example, the bar-lines were only introduced around 1420, whereas the dynamic signs or the instrumental specification were not present in Bach's Art of Fugue, and the metronomic indications became standard only after Mälzel's invention (though sometimes in a problematic way, such as in Beethoven's "Hammerklavier" sonata). This fact must be seen in the context of the very concept of a score. The concept of a score is that of a mediator between musical ideas and their physical execution. As such, the score of a music piece cannot be narrowed to the level of a notated sign system. The score begins in the mental layers of the composer and musician and is only supported by, not identified with, the
schematic and digitized material form. The distribution between exterior and interior5 score depends on the specific music culture. In jazz, the major part of the score is present on its interior level, the rudimentary lead-sheet notation sketches only the most elementary ingredients such as harmonic progression and melodic core structures. This has deep implications on the concept of a composition. If one agrees that a composition is all that is fixed on the "neutral level" of the score, the stratification into inner and exterior score makes the composition concept a bit difficult: Where does the composition begin, where does it end? In fact, the interior part of the score may include mental sketches as well as schemes of performance or construction, such as realized in jazz improvisation or processual schemes in the sense of Cage. If we agree that the composition is everything that can be traced either on the exterior or the interior score, then composition may very well include performative instances if the interior score shapes the mapping from mental to physical reality by a well-defined concept. In this sense, Helga de la Motte's observation of an increasing shift of performative instances to the score level can be made more precise: It is a shift from inner score instances to those of the exterior score. Now, such a shift is a function of two factors: First, as in jazz, and even more in free jazz, it may not be the objective of a musical approach to aggregate exterior score signs, and then, no such shift is needed. Nonetheless, transcriptions of jazz improvisations may be desired (such as the famous transcriptions of John Coltrane soli by Andrew White), and also to a very refined degree. Second, the composer may want to give a precise description of instances of hitherto interior scores, such as happened with the performance signs of accelerando, crescendo, etc. 
In both cases, the shift can only be achieved by a refined language since the interior score may comprise ineffable spots, regions that cannot be conceptualized on the verbal level: Non-verbal concepts are a widespread phenomenon among musicians! Performance fields may be an extension of the exterior score language which helps shifting non-verbal concepts to the verbal level. But we do not insist on the verbal character of a vector field, at least not in the common sense of verbalization. Mathematical concepts are beyond common language. And they are also beyond quales, they are an effective extension of language which can help thinking things which in the past were completely out of reach to the human intellectual power. Evidently, the score concept which includes such sophisticated objects as vector fields will look completely different from the traditional discrete system. Perhaps it will also per default and as a mandatory condition be related to computer-aided representation and editing. In this sense, the medium computer may help to profile a message which musicians of all cultures have always dealt with and tried to communicate so desperately.
5 To my knowledge, the term "interior score" was introduced by the jazz theorist and musician Jacques Siron in his remarkable book "La partition intérieure" [487] on jazz theory.
Chapter 34
Initial Sets and Initial Performances
Jeder Anfang ist ein Ende.
Hermann Hesse (1877-1962)
Summary. Performance has to start somewhere. The theory of this initialization deals with initial sets and initial performance. Naively speaking, initial sets are the first notes of a performance. Semiotically, initial theory describes a turning point from lexicality to reality in music which is supported by shifters; we explain this rationale. Since music deals with many parameters, initialization has to be specified in all dimensions. We first comment on the classical initial sets in onset, pitch, and duration. On an initial set, the performance cannot be calculated from previous performance, an initial performance has to be defined. We discuss ways to do so. On a more technical level, we introduce the hit point theory, a mathematical account for the control of performance field flows as their curves approach initial sets. Strategies of guessing best approaches to initial sets are presented.
We now situate our investigations in the framework developed in section 33.2.2. This means that we are given
1. a frame $R = [a_1, b_1] \times \ldots \times [a_n, b_n]$ in the $n$-space $\mathbb{R}^n$,
2. a local composition¹ $K \subset R$,
3. a performance field $\mathit{Ts}$, i.e., a locally Lipschitz vector field defined² on $R$,
4. and an initial set $I \subset R$.
1 We are sloppy about the underlying forms here and just consider the coordinates and modules since the forms are not of primary interest here. It is however understood that everything happens within well-defined forms that may be evoked if necessary, e.g., if the mathematical methods need a justification via parameter forms beyond numerical coordinates.
2 By definition, this means that the field is defined in a neighborhood of $R$. But the portions of integral curves of this field contained within the frame $R$ are only a function of the field values within the frame.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
According to the previous theory, we shall suppose that the points $X_0$ of the initial set $I$ are given an initial performance $\wp_I(X_0)$, and that we may define the performance $\wp(X)$ for $X \in K$ by means of the integral curve $\int_X \mathit{Ts}$, if this curve hits a point $X_0 \in I$, via formula (33.12). In this chapter we shall first discuss the meaning of the initial set approach and then techniques to find initial points for polyhedral initial sets.
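The four pieces of data listed above, together with the initial performance, can be collected in a small data structure. The following sketch is only one possible encoding and rests on simplifying assumptions: the class name `PerformanceSetup` and its fields are hypothetical, the initial set is represented by finitely many sample points rather than a polyhedron, and the Lipschitz condition on the field is assumed, not checked.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, ...]

@dataclass
class PerformanceSetup:
    frame: List[Tuple[float, float]]               # R = [a_1, b_1] x ... x [a_n, b_n]
    composition: List[Point]                       # K, finitely many points of R
    field: Callable[[Point], Point]                # Ts, assumed locally Lipschitz
    initial_set: List[Point]                       # I, simplified to sample points
    initial_performance: Callable[[Point], Point]  # known performance on I

    def in_frame(self, X: Sequence[float]) -> bool:
        """True if X has the dimension of the frame and lies inside it."""
        return len(X) == len(self.frame) and all(
            a <= x <= b for x, (a, b) in zip(X, self.frame))

    def validate(self) -> None:
        """Check the inclusions K in R and I in R."""
        for name, points in (("composition", self.composition),
                             ("initial set", self.initial_set)):
            for X in points:
                if not self.in_frame(X):
                    raise ValueError(f"{name} point {X} lies outside the frame")

# Illustrative instance on the onset-pitch plane with the constant diagonal field.
setup = PerformanceSetup(
    frame=[(0.0, 4.0), (0.0, 1.0)],
    composition=[(1.0, 0.5), (2.0, 0.25)],
    field=lambda X: (1.0, 1.0),
    initial_set=[(0.0, 0.0)],
    initial_performance=lambda X: X,
)
setup.validate()
```

Keeping the inclusion checks in one place makes it harder to ask for the performance of a point outside the frame, where the field need not be defined.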
34.1
Taking off with a Shifter
Summary. When music performance takes off, a magic moment takes place. This well-known effect relates to shifter signs in the semiotic system of music. We make a point of this magic, in fact a moment with deep consequences for performance as a whole which is fully appreciated by the audience on an emotional level. This "juncture of fiction and reality" is singular but does in fact happen in several places of a composition's performance. The magic is distributed over the entire performance; we introduce this subject as an interface between semiotics and psychology.
Even before discussing the different parameter-specific initial sets, it is important to understand the deep meaning of the fact of initial performance. We have known performance as a transitional process (formally described by the performance map $\wp$) from mental to physical reality. On the level of documentation this is a transition from the score to the acoustical realization, to be archived on sound media such as a CD. Following the valid doctrine (as preconized by Valéry³ and Adorno⁴, for example), the performance is an integral part of the work of art, and this means that, in the sense of communication theory of art as described by Jean Molino (see our introduction in section 2.2), performance is part of the semiosis of the work; its meaning is not complete except when it is performed. Put the other way round: The mental score (interior as well as exterior) conveys a part but not the whole significate, and only via performance can we complete the work's semiosis. Performance englobes a kind of usage of the mental score sign by a performer, much like the usage of a sign in the pragmatic dimension is part of its semiosis. More specifically, those signs whose significates are not only instantiated by the user but substantially depend on the user are the well-known shifters. Performance of a mental score is such a shift from lexicality to full-fledged meaning, since the pure score is essentially less than the work of art. In other words, performance is a shifter characteristic of the score semantics. Production of full-fledged meaning is only possible by means of performance, and this adds a semantic aspect to the sign which is a non-trivial function of the performer(s). So the anchorage of a score in the physical performance is not only a transformation but also a completion of the score's meaning.
It is a completion of shifter type, i.e., adding a new, user-dependent value of the score sign, a value which turns the abstract score structure into a concrete, existential entity. Within this dramatic transformation process, which the conductor Sergiu Celibidache has so violently defended against the musical reproduction industry [476], there is a particularly dramatic moment of initiation of the existential kernel. This is what happens in the beginning of a performance. This beginning is when the conductor (or the soloist in the case of a solo performance) appears on stage, steps to the conductor's desk, takes the baton and freezes every movement in order to get off with the first onset gesture. The moment when the conductor lowers the baton and unfreezes the time process is the real magic of performance: Some moments ago, the work was still in its lexical potential state, everything could have happened. But now, we are getting off into reality. The first note puts an end to the potentiality, the shifter level has come to life. We do not know whether it is this switch of existentialities which so incredibly fascinates the audience (and the orchestra), but it is an objectively dramatic event. It might be compared to the difference between reading the article on the first person singular pronoun "I" in an encyclopedia and pronouncing the word "I" as a living person, with all the shifted meaning pointed at when you say and mean "I". The question of what really initiates when the performance gets off is not simple. The previous discussion possibly suggested that it is only a time initialization, but this is as wrong as it would be to claim that music reduces to onset time. In the initial moment of a musical performance, many different settings are instantiated, in fact as many as we have parameters to describe the sound events (and the gestural parameters in an extended performance theory) in our piece of music. In what follows, we want to investigate these initializations in more detail.
3 "C'est l'exécution du poème qui est le poème." [538]
4 "Die Idee der Interpretation gehört zur Musik selber und ist ihr nicht akzidentiell." [6]
34.2
Anchoring Onset
Summary. The most elementary and important initialization is that of onset. We give the overview of its structure and function, including the shifter nature of onset initialization. The problem of multiple onset initialization is discussed.
This section deals exclusively with the Onset space form and performance fields on this space, i.e., tempo curves. Musically speaking, this approach is a naive one: a more involved performance of time does not happen independently of other parameters and therefore cannot be described by tempo curves. An example is a performance where onset is a function of pitch, a situation which may happen in an arpeggio. This is viewed as a temporary onset distortion as a function of pitch. A typical such performance map $\wp$ is as follows:
$$\wp(E, H) = \bigl(E - 4e^{-(H-5)^2},\ H\bigr).$$
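As an illustration, the field belonging to this arpeggio map can be evaluated numerically. The sketch below assumes the usual definition of a performance field as the inverse Jacobian of the performance map applied to the constant diagonal vector (1, 1); that definition is not restated in this section, so it is an assumption here, and the function names are purely illustrative.

```python
import math


def performance_map(E, H):
    """Arpeggio map from the text: (E, H) -> (E - 4*exp(-(H-5)^2), H)."""
    return (E - 4.0 * math.exp(-(H - 5.0) ** 2), H)


def performance_field(E, H):
    """Assumed definition: inverse Jacobian of the map applied to (1, 1).

    The Jacobian is [[1, g], [0, 1]] with g the H-derivative of the onset
    component, so its inverse is [[1, -g], [0, 1]].
    """
    g = 8.0 * (H - 5.0) * math.exp(-(H - 5.0) ** 2)
    return (1.0 - g, 1.0)


if __name__ == "__main__":
    for H in (3.0, 5.0, 5.0 + 1.0 / math.sqrt(2.0), 7.0):
        print(H, performance_field(0.0, H))
```

Under this assumption the onset component of the field becomes negative slightly above H = 5, which is the kind of behaviour that no tempo curve alone can produce.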
The corresponding performance field $Ts$ is shown in figure 34.1. Observe that onset components of the field may also be negative, according to the retard of onset in the middle arpeggio region of pitch around $H = 5$ in this generic example.

Figure 34.1: The performance field on the plane of onset (horizontal axis) and pitch (vertical axis), corresponding to a prototypical arpeggio. This field cannot be built upon a tempo curve.

The data for our pure onset performance is as follows: We are given a frame interval $R = [a, b] \subset Onset$, a tempo "field", i.e., a continuous positive tempo curve $Ts(E) = T(E)$ defined on $R$, and a finite initial set $I = \{E_0 < E_1 < \ldots E_f\} \subset R$ within the frame. On this set, we are given performance data $\wp(E_i) = e_i$, $i = 0, \ldots f$, with $e_0 < e_1 < \ldots e_f$. The meaning of this data is that the composition $K$'s onset set $K_E = p_E(K)$ is also a subset of the frame, $K_E \subset R$, and that its performance is calculated by the performance formula (33.1). More precisely, in the common situation, the initial set is the singleton $I = \{E_0\}$ consisting of the composition's first referential onset. This could be the first element in $K_E$ or else the first bar-line's onset. In this case, no further discussion is required, the integral exists, and we can calculate all required onset performances. However, realistic situations are more involved. It may happen that one is given several initial points, for example if the left hand plays in constant tempo, whereas the right hand is allowed to vary locally against the left hand in a "Chopin rubato", i.e., in such a way that the onset performances coincide on each bar-line, but may differ locally. In this case, the initial points for the left hand would have a fixed performance whereas the right hand tempo curve would have to fit the left hand onset values. This will in fact happen in our discussion of tempo hierarchies, see section 38.2. This means that we have additional conditions: the integrals of inverse tempo must coincide with the given initial onset performances, i.e., for $0 \le i < j \le f$, we must have
$$e_j - e_i = \int_{E_i}^{E_j} \frac{1}{T}. \tag{34.1}$$
Under these conditions, the calculation of any onset performance $\wp(E)$ is clearly independent of the reference initial point $E_i \in I$. In this case we say that the tempo curve is adapted to the initial performance $\wp_I$ on the initial set $I$.

Proposition 56 With the above notation and hypotheses, there is always a unique continuous tempo curve $T(E)$ which is linear on each interval $[E_i, E_{i+1}]$, $i = 0, \ldots f - 1$, constant outside $[E_0, E_f]$, and prescribes an arbitrary positive tempo $T(E_i)$ for one index $0 \le i \le f$.
Proof. We prove the case of a fixed start tempo; the general case works in complete analogy. By induction on $f$, we may restrict the proof to $f = 1$. It is sufficient to show that we may find a final positive tempo value $x = T(E_1)$ such that the linear tempo $T_x(E) = (x - T(E_0))\frac{E - E_0}{E_1 - E_0} + T(E_0)$ fulfills $d = e_1 - e_0 = q(x) = \int_{E_0}^{E_1} \frac{1}{T_x}$ for any positive value $d$. But
$$q(x) = (E_1 - E_0)\,\frac{\log(x/T(E_0))}{x - T(E_0)},$$
which is a continuous, strictly decreasing function of $x$, converging to $\frac{E_1 - E_0}{T(E_0)}$ as $x \to T(E_0)$, and ranging from $\infty$ to $0$ for positive $x$. Therefore there is exactly one positive $x_1$ such that $d = q(x_1)$, and $T(E_1) = x_1$ solves our problem, QED.
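The existence argument in the proof can be turned into a small numerical sketch: since the duration of a linear tempo segment decreases from infinity to zero as the final tempo grows, a bisection search finds the final tempo for any prescribed physical duration. The function names and the bisection strategy are illustrative assumptions, not taken from the text.

```python
import math


def linear_tempo_duration(E0, E1, T0, x):
    """Integral of 1/T over [E0, E1] for a tempo linear from T0 (at E0) to x (at E1)."""
    if x == T0:
        return (E1 - E0) / T0
    # log(x / T0) written with log1p for accuracy when x is close to T0
    return (E1 - E0) * math.log1p((x - T0) / T0) / (x - T0)


def final_tempo(E0, E1, T0, d, steps=200):
    """Find the positive final tempo x whose linear tempo curve has duration d."""
    if d <= 0.0:
        raise ValueError("the physical duration d must be positive")
    lo = hi = T0
    # Duration decreases in x: move lo down until it is long enough,
    # and hi up until it is short enough.
    while linear_tempo_duration(E0, E1, T0, lo) < d:
        lo /= 2.0
    while linear_tempo_duration(E0, E1, T0, hi) > d:
        hi *= 2.0
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if linear_tempo_duration(E0, E1, T0, mid) > d:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Start tempo 1 on the symbolic interval [0, 1], prescribed duration log(2)
    print(final_tempo(0.0, 1.0, 1.0, math.log(2.0)))
```

With start tempo 1 on a unit interval, the duration log(2) corresponds to a final tempo of 2, which gives a quick sanity check of the closed formula for q.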
Corollary 21 Suppose that for the increasing sequence $E_0 < E_1 < \ldots E_f$ of symbolic onsets, we are given positive tempi $T_i = T(E_i)$, $i = 0, \ldots f$, and that the tempo curve $T(E)$ on the interval $[E_0, E_f]$ is the polygon through these values. Let $\Delta = \int_{E_0}^{E_f} \frac{1}{T}$, and select a positive real scalar $a$. Then there is a positive scalar $\tau$ such that the polygonal tempo curve $T_\tau$ for the vertex values $T_\tau(E_0) = T_0$, $T_\tau(E_f) = T_f$, $T_\tau(E_i) = \tau T_i$, $0 < i < f$, has $\int_{E_0}^{E_f} \frac{1}{T_\tau} = a.\Delta$.
Proof. The integral $\int_{E_1}^{E_{f-1}} \frac{1}{T_\tau} = \frac{1}{\tau} \int_{E_1}^{E_{f-1}} \frac{1}{T}$ assumes any positive value if $\tau$ varies. Further, proposition 56 guarantees that the initial and terminal integrals $\int_{E_0}^{E_1} \frac{1}{T_\tau}$, $\int_{E_{f-1}}^{E_f} \frac{1}{T_\tau}$ can also be adapted to any positive value with varying $\tau$, so we are done.

This means that given a tangent morphism $T\wp$ on the initial values $E_0, E_f$, and a polygonal tempo curve $T$ which extends $T\wp$, we can "deform this curve" to a new polygonal curve $T_\tau$ without altering the tangent data $T\wp_{E_0}$, $T\wp_{E_f}$, but such that the total duration is stretched by any positive factor $a$. In practical situations, such as the composition software presto®'s AgoLogic module (see also chapter 49), this has the following application: We are given a polygonal tempo curve as in the corollary, with duration $\Delta = \int_{E_0}^{E_f} \frac{1}{T}$. By graphically interactive editing, the curve may be altered in that the vertical position (the tempo coordinate) of one inner vertex of the polygon is augmented or diminished. This changes the duration from $\Delta$ to $\rho.\Delta$, but we do want to conserve duration. This can be achieved by a deformation of the graphically altered polygon as in the corollary, setting $a = 1/\rho$, and we recover the original duration. So the new shape of the polygon can be conserved as far as possible.
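The duration-preserving deformation described above can be sketched numerically. The code assumes a polygonal tempo curve given by vertex lists and searches the scalar for the inner vertices by bisection on a logarithmic scale, using the fact that the total duration decreases when the inner tempi are scaled up. All names are hypothetical, and nothing here is meant to describe how the presto® software actually implements the step.

```python
import math


def segment_duration(E0, E1, T0, T1):
    """Integral of 1/T over [E0, E1] for a tempo linear from T0 to T1."""
    if T0 == T1:
        return (E1 - E0) / T0
    return (E1 - E0) * math.log1p((T1 - T0) / T0) / (T1 - T0)


def duration(onsets, tempi):
    """Total physical duration of a polygonal tempo curve."""
    return sum(
        segment_duration(onsets[i], onsets[i + 1], tempi[i], tempi[i + 1])
        for i in range(len(onsets) - 1)
    )


def scale_inner(tempi, tau):
    """Scale all inner vertex tempi by tau, keeping the two boundary tempi."""
    return [tempi[0]] + [tau * t for t in tempi[1:-1]] + [tempi[-1]]


def find_tau(onsets, tempi, target, steps=200):
    """Find tau > 0 such that the deformed polygon has duration `target`."""
    if len(tempi) < 3:
        raise ValueError("at least one inner vertex is needed")
    if target <= 0.0:
        raise ValueError("the target duration must be positive")
    lo = hi = 1.0
    while duration(onsets, scale_inner(tempi, lo)) < target:
        lo /= 2.0
    while duration(onsets, scale_inner(tempi, hi)) > target:
        hi *= 2.0
    for _ in range(steps):
        mid = math.sqrt(lo * hi)
        if duration(onsets, scale_inner(tempi, mid)) > target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


if __name__ == "__main__":
    onsets = [0.0, 1.0, 2.0, 3.0]
    tempi = [1.0, 2.0, 1.5, 1.0]
    delta = duration(onsets, tempi)
    edited = [1.0, 3.0, 1.5, 1.0]      # one inner vertex raised by the user
    tau = find_tau(onsets, edited, delta)
    print(tau, duration(onsets, scale_inner(edited, tau)), delta)
```

In the usage example, an inner vertex is raised, which shortens the duration; the search then rescales the inner vertices so that the original duration is recovered while the edited shape is kept up to the common factor.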
34.3
The Concert Pitch
Summary. In the pitch dimension, initialization deals with concert pitch, i.e., initial intonation or tuning. We compare this initialization with the onset's take-off and discuss the specific difference in deicticity: Concert pitch is dominated by a lexical dimension.
In this first approach, we shall view pitch performance as a parallel situation to onset performance as discussed in the previous section. This means that pitch performance is viewed independently of other parameters, in particular, independently of time. As with tempo curves, this is a musically naive, but basic approach. In fact, pitch performance may be a function of onset, according to the tonal system of the given composition, for example in modulatory parts. So we are given a frame $R = [a, b] \subset Pitch$, an intonation field, i.e., a continuous intonation curve (footnote 5) $Ts(H) = I(H)$ on $R$, and a finite initial set $I = \{H_0 < H_1 < \ldots H_f\} \subset R$. On this set, we are given performance data $\wp(H_i) = h_i$, $i = 0, \ldots f$, with $h_0 < h_1 < \ldots h_f$. In the standard case, this initial set is not as variable as for tempo. In fact, there are several constraints of tuning which are above all given by the orchestral instrumentation. For the piano, to begin with a common reference for orchestral tuning, we are given all semitone pitches $I = \{H_0 < H_1 < \ldots H_f\}$ of the chromatic 88 keys (footnote 6), together with their rigid performance pitches which are $\wp(H_i) = h_i = h_0 + i \cdot \log(2)/12$ in the common 12-tempered tuning. The underlying intonation curve must cope with all 88 values. We have the following result which replicates proposition 56 for intonation:

Proposition 57 With the above notation and hypotheses, there is always a unique continuous intonation curve $I(H)$ which is linear on each interval $[H_i, H_{i+1}]$, $i = 0, \ldots f - 1$, constant outside $[H_0, H_f]$, and prescribes an arbitrary positive intonation slope $I(H_i)$ for one index $0 \le i \le f$.
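The rigid piano tuning described above can be made concrete with a few lines of code. The sketch assumes that physical pitch is the logarithm of frequency, that the 88 keys are indexed from 0 at the lowest key, and that the key with index 48 is the a' tuned to 440 Hz; these indexing conventions and all names are assumptions made for illustration.

```python
import math

SEMITONE = math.log(2.0) / 12.0   # step of physical (log-frequency) pitch per key
A_PRIME_KEY = 48                   # assumed index of a' when the lowest key is 0
A_PRIME_HZ = 440.0


def performed_pitch(i):
    """h_i = h_0 + i * log(2) / 12, anchored so that a' sounds at 440 Hz."""
    h0 = math.log(A_PRIME_HZ) - A_PRIME_KEY * SEMITONE
    return h0 + i * SEMITONE


def frequency(i):
    """Frequency in Hz of key i in 12-tempered tuning."""
    return math.exp(performed_pitch(i))


# A constant intonation curve I = 12 / log(2) reproduces this tuning:
# integrating 1/I over twelve semitones yields log(2), i.e., one octave.
INTONATION = 1.0 / SEMITONE

if __name__ == "__main__":
    for key in (0, 48, 87):
        print(key, round(frequency(key), 3))
    print(12.0 / INTONATION, math.log(2.0))
```

The last line checks the octave condition for the constant intonation curve: twelve semitone steps add up to log(2) in physical pitch.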
The less rigid case is the tuning of the (continental European) chamber pitch $a'$ together with the octave periodicity condition on the integral, i.e.,
$$a' \mapsto 440\ \mathrm{Hz}, \qquad \int_H^{H+12} \frac{1}{I} = \log(2)$$
for all pitches $H$ and the semitone encoding of pitch. Such a tuning is independent of local variations of intonation due to intonation specificities for just fifths and the like which violinists and singers may prefer. It is also open to glissandi which include all pitch values in a determined real interval. There is, however, a qualitative difference between intonation initialization and onset initialization. The latter is a "magic shifter" which by its very construction instantiates the fictitious music score time in real time. The former correspondingly instantiates fictitious score pitch in physical pitch, but its value is not a question of individual construction, it is lexicalized on the standards of music culture and tradition. It is also lexicalized by the absolute pitch perception of a number of musicians, for example Herbert von Karajan, which makes it virtually impossible to alter this conventional initialization on a shifter basis of individual, spontaneous usage. The usage of intonation and its initialization is therefore much more restricted than the usage of onset initialization. One may distinguish the local initialization within a fixed tonal context, between two successive modulations in just tuning of octaves and selected fifths (tonic, dominant for example), say, and the more individual shaping of the individual intonation of less prominent intervals within one such tonal context.

5 We are sorry for the homonymous symbol $I$ for initial sets and intonation fields; a confusion is however very unlikely.
6 For the added keys on Bösendorfer's Imperial model, extend the key numbers by -1, -2, ....
It is clear that such a one-dimensional intonation field component is as artificial as the one-dimensional tempo field. It is not artificial in the sense of a useless academic exercise, but in the sense of a first approximation to artistic performance (something like a zero-state of performance) which needs refinement. This is an important observation insofar as it suggests an investigation of the problem of refining performance, of seeking paradigms of unfolding performance shaping. The theory of performance stemmata in chapter 38 will deal with this approach.
34.4
Dynamical Anchors
Summary. In the loudness dimension, initialization deals with reference dynamics. This is a dramatic and completely shifting phenomenon which stands in contrast to the onset's take-off and the lexical pitch initialization.
Instead of repeating the propositions presented in the previous sections, which are, of course, still valid mutatis mutandis, we should rather focus on the specific character of dynamical initialization. Above all, the performance of loudness in its physical expression is a complete shifter: There is no lexical normalization since every concert determines its dynamical initialization as a function of the concert hall, the orchestra, its disposition, and the public. Whereas the intonation curve may remain more or less the same over different performances of a determined orchestra, the initial anchoring will vary considerably, though not as fundamentally as with onset, because onset will be an existentially different one in each performance. There is another shifting character of dynamics initialization: the reproduction of a performance or the live broadcast via electronic media, such as a radio, TV or internet concert broadcast or the simple playing of a CD on the private sound equipment. In these cases, the individual user's preferences may initialize a very specific dynamic anchorage. And, more dramatically: During the ongoing performance, the initialization may be reset according to hearing dispositions and temporary irritations from disturbing environmental noise, including the abrupt lowering of dynamics while a phone call or a verbal intervention of another person happens. Similar to onset initialization, dynamics initialization is quite strong and shifting, in contrast to pitch initialization. But it is also more existential, together with onset initialization: It reflects the human condition of when and how and why music is enjoyed. In this shifter process, evidently, the lexical musical content seems not to suffer, it is more the anchorage in human life which is profiled.
Whereas absolute pitch seems to play a certain role for the understanding of the musical message, even to the vast majority of non-absolute pitch listeners, absolute onset and dynamics play the contrary role: They give the listeners their coordinates of existence where they want to meet this particular music.
34.5
Initializing Articulation
Summary. The initial theory becomes less trivial when applied to articulation. Initial articulation reveals the complex recursive structure within initial data, i.e., sets and performance. We explicate different approaches to initial articulation, as based upon initial onsets.
Whereas initialization on one-dimensional performance fields is a question of selecting finite point sets, initialization on higher-dimensional frames is dramatically more complex. Let us have a look at the most elementary two-dimensional situation: the articulation field, see figure 34.2.
Figure 34.2: The integral curves of an articulation field on the frame $R$ (horizontal axis: onset, vertical axis: duration) may typically hit boundary points of the given frame on different positions: the horizontal lower boundary line or the vertical left one. This entails dramatically more complex initialization data than for the one-dimensional case of tempo and intonation.

In this example, we are given a frame $R$ defined by its low left vertex $(E_a, D_a)$ and the high right vertex $(E_b, D_b)$. (In this figure, we even suppose that $D_a = 0$, but this is not the general case.) Let us first look at the right integral curve (terminating at the point on its arrow-head to be performed) which hits the frame boundary on $X_h = (E_h, D_a)$. Initialization can be decomposed in two partial problems: Initialization of onset can be reduced to the one-dimensional case since we have a tempo curve here. This means that we may consider the projection of the articulation field onto the tempo curve and accordingly the projection of the integral curve onto its onset component. Suppose further for simplicity that the onset initialization is defined on the frame boundary value $E_a$. Then we are done with the onset performance, and we may proceed with duration. Let us look at initial duration in $X_h$. Here, we have the onset performance and should define performance of duration $D_a$. If this value vanishes, it seems natural to perform it to physical duration zero, too. But if the lower bound $D_a$ is positive, it is not clear what should be done. The canonical idea would be to identify $D_a$ as a difference $D_a = (E_h + D_a) - E_h = p_E(\alpha_+(X_h)) - p_E(X_h)$. We could then apply the well-known construction of duration performance as a difference of onset performances, i.e.,
$$\wp_D(X_h) = \wp_E(\alpha_+(X_h)) - \wp_E(X_h), \tag{34.2}$$
which is built upon the onset performance $\wp_E$ via the tempo curve and the initial performance on $E_a$. This is not mandatory though: Initial duration performance need not be without articulation, be it legato, be it staccato. In any case, we have to be sure that not only the onset component of $X_h$ is on the frame; the alteration component $\alpha_+(X_h)$ must also be within the frame's onset bounds! This implies that score points to be performed on this basis must stay to the left of the descending diagonal through the lower right corner of the frame. Other points must be given different initial performance data. If, on the other hand, the frame is hit in a point $X_v = (E_a, D_v)$ on the left boundary line, its (initial) performance can be settled by the same formula as above,
$$\wp_D(X_v) = \wp_E(\alpha_+(X_v)) - \wp_E(X_v), \tag{34.3}$$
but now, the initial duration may be arbitrarily long, not just the lowest admitted duration of our frame. In this setup, the initial performance of the left boundary line is a function of the onset performance of the whole onset interval $[E_a, E_a + (D_b - D_a)]$, and not only of the initial onset performance at $E_a$. This is quite dramatic as a contrast to the formula (34.2) which is local on the onset of $X_h$ and the slightly shifted onset $\alpha_+(X_h)$.
Here, we have to know a lot about the future onset performance for the initial values at the beginning of the frame. An even more dramatic effect happens if the frame has an upper boundary line which forces integral curves to hit the upper boundary when reaching the rectangle's boundary $\partial R$. In our example, this could happen if the frame ended at half height (with the same field). There, the initial performance of duration can fail to be controllable by onset performance alone, and we have to design new initial performance strategies. Difficult initial performance problems can also happen if the field doesn't have a positive onset component, as in figure 34.1. Here, initial performance must be defined on the right boundary hyperplane of the frame, i.e., for the last, not for the first events. And if the performance configuration is such that initial values happen to be positioned anywhere within the frame, initial performance has to be defined by use of general strategies which work for any initial set configuration. We shall review this topic in chapter 38 after the discussion of the basic problem: How can we effectively know where an integral curve hits the initial set?
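The construction of initial duration performance as a difference of onset performances can be sketched as follows. The example assumes a tempo curve that is linear over the frame's onset interval, so that the onset performance has a closed form; the function names and the choice of a linear tempo are illustrative assumptions. The check on the offset mirrors the requirement that the shifted onset also stays within the frame's onset bounds.

```python
import math


def make_onset_performance(E_a, e_a, T_a, slope):
    """Onset performance for the tempo T(E) = T_a + slope * (E - E_a).

    The performed onset is e_a plus the integral of 1/T from E_a to E.
    """
    def onset_performance(E):
        if T_a + slope * (E - E_a) <= 0.0:
            raise ValueError("tempo must stay positive")
        if slope == 0.0:
            return e_a + (E - E_a) / T_a
        return e_a + math.log1p(slope * (E - E_a) / T_a) / slope
    return onset_performance


def duration_performance(onset_performance, E, D, E_a, E_b):
    """Performed duration as the difference of performed offset and onset."""
    if not (E_a <= E and E + D <= E_b):
        raise ValueError("onset and offset must lie within the frame's onset bounds")
    return onset_performance(E + D) - onset_performance(E)


if __name__ == "__main__":
    perf = make_onset_performance(E_a=0.0, e_a=0.0, T_a=1.0, slope=1.0)
    # A point on the lower boundary (short duration) and one on the left boundary
    print(duration_performance(perf, 1.0, 0.25, 0.0, 4.0))
    print(duration_performance(perf, 0.0, 3.0, 0.0, 4.0))
```

The second call illustrates the remark about the left boundary line: the performed duration of a long initial event depends on the tempo over the whole interval it spans, not only on the initial onset value.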
34.6
Hit Point Theory
Summary. Hit point theory deals with the control of access to initial sets along the integral curves of performance fields. These curves describe the flow associated with performance fields. In general, they can be quite "wild" so that a generic strategy of seeking hit points of such curves with initial sets is required.
The general setup for initial sets is a (locally Lipschitz) performance field $Ts$ defined on (a neighborhood of) a (closed) frame $R$ in a parameter space which, up to isomorphism of functors, is defined by $\mathbb{R}^n$. In many situations, the initial set for the performance $\wp$ is not any set, but will be a polyhedron defined as a union $I = \bigcup_i S_i$ of a finite family $(S_i)_i$ of simplexes $S_i \subset R$. Each simplex $S_i$ is given by a sequence $s^i_\bullet = (s^i_0, s^i_1, \ldots s^i_{n(i)})$ of (pairwise different) points in $R$. The points need not be in general position, i.e., $\dim(\mathbb{R}s^i_\bullet) < n(i)$ is admitted (footnote 7). If we view this data as being the vertexes of the simplex $S_i = \{\sum_j \lambda_j s^i_j \mid 0 \le \lambda_j \le 1, \sum_j \lambda_j = 1\}$, this means that we admit degenerate simplexes as constituents of initial sets. Nor is it required that the simplexes build a simplicial complex; their intersections can be arbitrary. We also do not require that these simplexes have $n + 1$ vertexes: it is allowed to have a number of isolated points and straight lines in three-space, for example. Clearly, this type of initial sets allows virtually every shape, up to approximations by triangulations. The previous examples are: a series of zero simplexes in $\mathbb{R}$ for tempo and intonation curves, or the sequence $(\partial_E R, \partial_D R)$ of left and lower sides of $R$ in onset-duration space. More generally, given a coordinate $Y$ (or its index $j$ if $Y$ is indexed with respect to $\mathbb{R}^n$) we shall denote by $\partial_Y R$ (or $\partial_j R$) the simplex defined by the vertexes of $R$ which have the lowest $Y$-coordinate (or $j$-th coordinate). Correspondingly, highest $Y$-coordinates define the simplexes denoted by $\partial^Y R$ (or $\partial^j R$). Observe that these simplexes are degenerate for $n > 2$. The hit point problem is this:

Problem 1 Given an integral curve, we have to decide whether and for which curve parameter value the curve will hit the initial set $I$.

More precisely, since an integral curve $\int_X Ts$ is defined starting from an event $X \in R$, we ask for the smallest parameter $|t|$ such that $\int_X Ts(t) \in I$, i.e., $\int_X Ts(t) \in S_i$, one of the initial set's simplexes.
34.6.1
Distances
Summary. This section is dedicated to the calculus of distances between points on integral curves and polyhedral initial sets.
Given a polyhedral initial set $I$ which is defined by a family $(S_i)_i$ of simplexes, and any point $X \in R$, the distance $d(X, I)$ is the minimum of the distances $d(X, S_i)$; therefore distances of points $X$ and simplexes $S$ must be calculated. Suppose that $S$ is given by the sequence $(s_0, s_1, \ldots s_m)$ of points in $\mathbb{R}^n$. If $S$ is degenerate, it is the union of its non-degenerate sub-simplexes. This follows from the fact that it is a projection of a non-degenerate simplex, and that any point of this projection stems from a point of a side of the non-degenerate pre-image. So we are left with the calculation of the distance $d(X, S)$ for a non-degenerate $S$. If $\dim(S) = n$, clearly, either $X$ is in $S$ or the distance is achieved on a point of $\partial S$. So in the latter case, we are in a recursive situation (the zero-dimensional simplex being trivial). To decide upon the position of $X$ relative to $S$, we consider the unique representation
$$X - s_0 = (X - s_0)^\perp + \sum_{j=1}^{m} \lambda_j.(s_j - s_0) \tag{34.4}$$
of the difference $X - s_0$ as a linear combination of the basis vectors $s_j - s_0$ and the vector $(X - s_0)^\perp$ of the (Euclidean) orthogonal space $(\mathbb{R}s_\bullet)^\perp$. Considering the non-singular symmetric quadratic form of scalar products
$$Q = \bigl((s_j - s_0) \cdot (s_i - s_0)\bigr)_{i,j = 1, \ldots m}$$
and the vectors $u = ((X - s_0) \cdot (s_i - s_0))_{i = 1, \ldots m}$, $\lambda = (\lambda_j)_{j = 1, \ldots m}$, we have
$$\lambda = Q^{-1}.u.$$
With this, the barycentric coordinates are defined by adding the coefficient $\lambda_0 = 1 - \sum_{i = 1, \ldots m} \lambda_i$; we have
$$X = (X - s_0)^\perp + \sum_{i=0}^{m} \lambda_i.s_i,$$
and the component $y = \sum_{i=0}^{m} \lambda_i.s_i$ in the affine space spanned by $S$ is in $S$ iff $0 \le \lambda_i$ for all $i = 0, \ldots m$. Since $(X - s_0)^\perp$ is orthogonal to the affine space spanned by $S$, we now have $d(X, S)^2 = \|(X - s_0)^\perp\|^2 + d(y, S)^2$. If the second distance is not zero ($y \notin S$), we can proceed with the recursive calculation of $d(y, \partial S)$, and we are done. Observe that this algorithm also gives us the coordinates of a point of $S$ which has minimal distance to $X$. In computer programming practice, however, it is not reasonable to check for vanishing of distances $d(X, S)$ because of rounding and number representation errors. We therefore should prefer to calculate whether $d(X, S) < \varepsilon$ for a selected positive neighborhood variable $\varepsilon$. The corresponding routines are obvious. Denote the resulting point in $S$, which is found by this algorithm and which has the shortest distance to $X$ among the points of $S$, by $Min(X, S)$, whereas its barycentric coordinates are denoted by $Min(X, S)_i$, $i = 0, 1, \ldots m$, i.e.,
$$Min(X, S) = \sum_{i=0}^{m} Min(X, S)_i.s_i. \tag{34.5}$$

7 The vector space $\mathbb{R}s^i_\bullet$ is the module associated with the local composition defined by the vertexes $s^i_\bullet$ of $S_i$.
As a consequence, one can use this algorithm to calculate the distance of a straight line $L \subset \mathbb{R}^n$ to a simplex $S$ as follows: Take the projection $p(S)$ of $S$ on an affine hyperplane $H_L$ orthogonal to $L$. Then $p(S)$ is defined by the projected points $p(s_i)$, and our algorithm applies to the singleton $Y$ of the projection $p(L) = \{Y\}$ and to $p(S)$, giving $d(L, S) = d(Y, p(S)) = d(Y, Min(Y, p(S)))$. Moreover the coordinates $Min(Y, p(S))_i$ give us a point
$$Min(L, S) = \sum_{i=0}^{m} Min(Y, p(S))_i.s_i, \tag{34.6}$$
which evidently lives in $S$ and has minimal distance to $L$. In our applications for integral curves, the line $L$ is parametrized by a point $X$ and a directional vector $D$, i.e., $L(t) = X + t.D$. Then the parameter $\tau$ on $L$ such that we have $d(L(\tau), Min(L, S)) = d(L, S)$ is denoted by $\tau(X, D, S)$.
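The recursive point-to-simplex distance computation of this section can be sketched in plain Python. The code assumes a non-degenerate simplex given by a list of vertex coordinate lists, solves the Gram system for the barycentric coordinates by Gaussian elimination, and recurses on the facets when a coordinate is negative. Recursing with the point X itself instead of its projection y is an implementation shortcut that selects the same nearest point; the function names are hypothetical.

```python
import math


def sub(a, b):
    return [x - y for x, y in zip(a, b)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def dist(a, b):
    d = sub(a, b)
    return math.sqrt(dot(d, d))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x


def nearest_point(X, verts, eps=1e-12):
    """Point of the non-degenerate simplex `verts` nearest to X."""
    if len(verts) == 1:
        return list(verts[0])
    s0 = verts[0]
    basis = [sub(s, s0) for s in verts[1:]]
    Q = [[dot(a, b) for b in basis] for a in basis]      # Gram matrix
    u = [dot(sub(X, s0), a) for a in basis]
    lam = solve(Q, u)
    bary = [1.0 - sum(lam)] + lam                        # barycentric coordinates
    if all(l >= -eps for l in bary):
        # The orthogonal projection y lies in the simplex.
        return [sum(l * s[k] for l, s in zip(bary, verts)) for k in range(len(X))]
    best = None
    for k in range(len(verts)):                          # recurse on the facets
        cand = nearest_point(X, verts[:k] + verts[k + 1:], eps)
        if best is None or dist(X, cand) < dist(X, best):
            best = cand
    return best


def distance(X, verts):
    return dist(X, nearest_point(X, verts))


if __name__ == "__main__":
    triangle = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    print(nearest_point([1.0, 1.0], triangle), distance([1.0, 1.0], triangle))
```

The facet recursion is exponential in the number of vertexes, which is acceptable for the low-dimensional simplexes considered here; a degenerate simplex would make the Gram matrix singular and is not handled by this sketch.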
34.6.2
Flow Interpolation
Summary. This section deals with selection algorithms for searching points on an integral curve of the performance field which are successively approximated to initial sets.
After splitting the approximation to an initial set $I$ to the approximation of one of its simplexes, the problem is this: We are given a positive $\varepsilon$, a point $X$, and a simplex $S$ in $\mathbb{R}^n$, and we have to decide whether, and in which parameter value, the integral curve $\int_X Ts$ hits the $\varepsilon$-neighborhood (footnote 8) $U_\varepsilon S$ of $S$. Theoretically, one could just calculate the integral curve and search for a parameter that does the job, if there is any. But in computer programs, such a procedure is illusory. Numerical integration of ordinary differential equations is a time-consuming, expensive task. One cannot afford to calculate all curve points for any score event of a normal composition, which usually contains $10^4$ to $10^5$ events. We shall see further in section 39.4.4 that fields of any complexity may occur by chains of successive reshapings of given fields via arbitrary performance operators; the same holds for the complexity of initial sets. Also, the performance fields and the initial sets are of a completely arbitrary relative position. So the situation is this: We are given a point $X$ and a simplex $S$. We know in what direction, namely $Ts_X$, the integral curve starts (with positive or negative curve parameter values). Nothing more. So we have to guess where the curve could approach $S$ as well as possible. We have to evaluate the guess, and then start with another guess, etc. Eventually, we find a curve point in $U_\varepsilon S$ and we are happy, or else we will have to give up the search and decide that the curve did not hit $U_\varepsilon S$. This could be a wrong decision, but (calculation) time runs out and we have to resign. Intuitively, the search is best described by the following scenario: We have a paper dragon being suspended at a fixed position in the air. We also have a nervous fly, flying around in its random zig-zag (however differentiable) manner where you never know which will be the next turn. This scenario is traced on a video recorder, and the video really shows whether and when the fly hits the dragon.
But we are not in the state of viewing the entire film. Rather, we are given a determined sequence $VF(0) = (VideoFrame(t_{i_0}), VideoFrame(t_{i_0+1}))$ of two successive frames. This gives us the fly's position and its velocity vector at time $t_{i_0}$ (supposing that the video is binocular ...). We now have to guess which video frame sequence could be the next best that shows the fly as near as possible to the dragon. We then forward the video to this next frame sequence $VF(1)$ and judge the new situation, and so on, until we find a hit point time or else we run out of time. The first action is to guess a good curve time from the starting point $X$ and the field vector $Ts_X$. Denote by $x(t)$ the integral curve $\int_X Ts(t)$. As nothing is known about the curve's future directions, we draw the straight line $L_X(t) = X + t.Ts_X$ and look for the parameter value $t_1 = \tau(X, Ts_X, S)$ defined above, which gives the nearest point to $S$ on $L_X$, see figure 34.3. We now have to compare $d(X_1, S)$ and $d(X, S)$, where $X_1 = x(t_1)$. If the former is smaller than the latter, we may proceed; if not, we are in a bad position. Of course, this linear first approximation is not necessarily the best one to get off, since the field may be rather circular than linear in this region. We could therefore try other first approximations, such as a circle in the plane (footnote 9) spanned by the barycenter of $S$, $X$, and $Ts_X$, for example the circle through the barycenter of $S$ and $X$, and tangent to $Ts_X$. We could follow this alternative, but we refrain from this because it does not demonstrate qualitatively new problems.

Figure 34.3: The first approximation to a hit point on a simplex $S$ when starting from the point $X$ of the given composition is found by a linear approximation $L_X$.

For the following interpolations, recall exercise 71 in section 32.3.2 on cubic splines. This tells us that, given any two points $x(s), x(t)$ on our integral curve, there is exactly one cubic interpolation function $P_{s,t} : [s, t] \to \mathbb{R}^n$ with $P(s) = x(s)$, $P(t) = x(t)$, and tangents $TP_s = Ts_{x(s)}$, $TP_t = Ts_{x(t)}$. We shall use these approximation curves to guess optimal points since we cannot calculate all curve points for an interval $[s, t]$. We now proceed as follows. We are given the curve parameter $t_1$ whose point $X_1 = x(t_1)$ is nearer to $S$ than $X_0$. We now repeat our linear approximation procedure with the line $L_{X_1}$ and get a new curve parameter $t^*$. We also calculate the cubic interpolation point (this is a cheap calculation effort) for $P_{t_0,t_1}$ at parameter $(t_0 + t_1)/2$. If $x(t^*)$ is nearer to $S$ than $X_1$ and $P_{t_0,t_1}((t_0 + t_1)/2)$, we set $t_2 = t^*$; else, if $P_{t_0,t_1}((t_0 + t_1)/2)$ is nearer than $X_1$, we calculate $x((t_0 + t_1)/2)$, and, if this latter is still nearer than $X_1$, we set $t_2 = (t_0 + t_1)/2$. Else, we are stuck and have to quit. The general situation is this: We have calculated a sequence $X = X_0, X_1, \ldots X_k$ of successively nearer points with curve parameters $t_i$, i.e., $x(t_i) = X_i$. Essentially we now have to check what happens to the left and to the right (footnote 10) of the best point $X_k$. Suppose that $t_r, t_l$ are the right and left neighbor times to $t_k$. We now calculate the interpolation points $P_{t_k,t_r}((t_k + t_r)/2)$ and $P_{t_l,t_k}((t_l + t_k)/2)$. If one of them, with parameter $t^*$, is better than $X_k$, we calculate the nearer one's curve point $x(t^*)$. If this is still nearer than $X_k$, we have found $t_{k+1} = t^*$. If not, we are stuck and give up.

8 This is $U_\varepsilon S = \{x \mid d(x, S) < \varepsilon\}$.
9 Since the linear approximation failed, this must be a plane!
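The cheap interpolation step used in this search can be illustrated with the standard cubic Hermite formula, which is the unique cubic matching two points and their tangents. The sketch evaluates such a cubic at an arbitrary parameter, in particular at the midpoint; the function name and the sample curve are illustrative assumptions.

```python
def hermite(s, t, x_s, x_t, v_s, v_t, r):
    """Cubic through x_s at s and x_t at t with tangents v_s, v_t, evaluated at r."""
    h = t - s
    u = (r - s) / h
    h00 = 2 * u**3 - 3 * u**2 + 1      # weight of x_s
    h10 = u**3 - 2 * u**2 + u          # weight of h * v_s
    h01 = -2 * u**3 + 3 * u**2         # weight of x_t
    h11 = u**3 - u**2                  # weight of h * v_t
    return [
        h00 * a + h10 * h * va + h01 * b + h11 * h * vb
        for a, b, va, vb in zip(x_s, x_t, v_s, v_t)
    ]


if __name__ == "__main__":
    # Sample curve x(t) = (t, t^2) with velocity (1, 2t), sampled at t = 0 and t = 2
    midpoint_guess = hermite(0.0, 2.0, [0.0, 0.0], [2.0, 4.0], [1.0, 0.0], [1.0, 4.0], 1.0)
    print(midpoint_guess)
```

For the sample curve the midpoint guess coincides with the true curve point, since a cubic reproduces a quadratic curve exactly; for a general integral curve the guess only indicates whether computing the true curve point at that parameter is worthwhile.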
This procedure will be repeated until a maximal admitted number $a$ of steps, and as long as the points $X_k$, $k \le a$, are not in $U_\varepsilon S$. The procedure stops if we run out of steps or if we eventually hit the given neighborhood of the simplex. This algorithm has been implemented in the PerformanceRUBETTE® of the RUBATO® software, see section 41.4. It seems that the general situation of performing points $X$ meets serious problems for points whose integral curve does not hit any initial point (or, at least, a point in a small neighborhood of the initial set $I$). This is however not so tragic since in reality, performance is not a one-step process: Stemma theory (chapter 38) will show us that any performance is unfolded from a previous, less artistic performance, starting on the quasi-mechanical reproduction of the score. Therefore, if at a given stage, a point $X$ cannot be performed, its performance which is already defined on a less artistic level can always be called in order to save the performability of the given score. A final technical remark about the above algorithm: We implicitly supposed that the space where point $X$ and simplex $S$ live is the space of a performance field $Ts$. However, we used only an integral curve through $X$, and not the performance field itself. This means that we only needed a curve $x(t)$ through a point $X$, and no information about the curve's genealogy. So the algorithm is also valid for any such "isolated" curve. This will be used later in our discussion of hierarchies of local performance scores in chapter 35. In the next chapter, we shall use this notation: If $X$ is in the given frame $R$, we denote by $\int_X^- Ts$ ($\int_X^+ Ts$) the restriction of $\int_X Ts$ to the maximal interval contained in the interval $]-\infty, 0]$ ($[0, +\infty[$) such that all values of the curve on this interval are contained in $R$. Given a performance field $Ts$, $\varepsilon \in \{+, -\}$, and an initial set $I$, we set:
$$I \int^{\varepsilon} Ts = \Bigl\{X \in R \Bigm| \int_X^{\varepsilon} Ts(t) \in I \text{ for some } t\Bigr\}. \tag{34.7}$$
Here is a sorite concerning this symbol:

10 It could happen that $X_k$ is the right or left extremal one. Then, we have to make linear interpolation on that side, but this situation has been explained for the construction of the third point $X_2$ above.
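Sorite 11 In the following statements, direct products of performance fields $\mathit{Ts}_1, \mathit{Ts}_2$, initial sets $I_1, I_2$, and frames $R_1, R_2$ refer to the product (limit) space $SS$ built from the given factors $SS_1, SS_2$.
(i) The operator $I \mapsto I \int^{\epsilon} \mathit{Ts}$ conserves inclusions, commutes with unions and is idempotent.
(ii) Setting $I_1 \int^{\epsilon} I_2 = I_1 \times (I_2 \int^{\epsilon} \mathit{Ts}_2) \cup (I_1 \int^{\epsilon} \mathit{Ts}_1) \times I_2$, we have
$$(I_1 \textstyle\int^{\epsilon} I_2) \int^{\epsilon} \mathit{Ts}_1 \times \mathit{Ts}_2 = I_1 \int^{\epsilon} \mathit{Ts}_1 \times I_2 \int^{\epsilon} \mathit{Ts}_2.$$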
Proof. Statement (i) is straightforward by the uniqueness of integral curves. We show statement (ii) for $\epsilon = -$; the other case is analogous. For the inclusion "$\subset$", suppose WLOG that we have $(X, Y) \in (I_1 \times (I_2 \int^{-} \mathit{Ts}_2)) \int^{-} \mathit{Ts}_1 \times \mathit{Ts}_2$. This means that we can reach a point of $I_1 \times (I_2 \int^{-} \mathit{Ts}_2)$ from $(X, Y)$ on the integral curve $\int_{(X,Y)}^{-} \mathit{Ts}_1 \times \mathit{Ts}_2$. Since integral curves of a product of vector fields project to integral curves of their factors, and by the idempotency of $\int^{-} \mathit{Ts}_2$, this implies $(X, Y) \in I_1 \int^{-} \mathit{Ts}_1 \times I_2 \int^{-} \mathit{Ts}_2$. As to the inclusion "$\supset$", suppose that $(X, Y)$ is such that $\int_X \mathit{Ts}_1(s) \in I_1$, $\int_Y \mathit{Ts}_2(t) \in I_2$ and WLOG $s \le t \le 0$. Setting $W = \int_X \mathit{Ts}_1(t)$, $V = \int_Y \mathit{Ts}_2(t)$, we conclude $(W, V) \in I_1 \int^{-} \mathit{Ts}_1 \times I_2$. By the following lemma, we can reach $(X, Y)$ from $(W, V)$ on an integral curve of $\mathit{Ts}_1 \times \mathit{Ts}_2$ from $t$ to $0$, and we conclude $(X, Y) \in (I_1 \int^{-} I_2) \int^{-} \mathit{Ts}_1 \times \mathit{Ts}_2$, QED.
Lemma 50 If we are given two locally Lipschitz vector fields $\mathit{Ts}_1, \mathit{Ts}_2$ on respective domains $D_1, D_2$ and two (maximal) integral curves $x_1 : J_1(0) \to D_1$, $x_2 : J_2(0) \to D_2$, the diagonal curve $x : J_1(0) \cap J_2(0) \to D_1 \times D_2 : t \mapsto (x_1(t), x_2(t))$ is the (maximal) integral curve of $\mathit{Ts}_1 \times \mathit{Ts}_2$ on $D_1 \times D_2$ through the couple $x(0) = (x_1(0), x_2(0))$ of initial points.
Proof. Clearly, the diagonal is an integral curve since differentiation goes factorwise. On the other hand, if we had a proper extension of $x$ in the product space, its projections $p_1 \circ x$, $p_2 \circ x$ would yield two integral curves, one of which would have an extended domain, which contradicts the choice of $x_1, x_2$, QED.
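The statement of lemma 50 can be checked numerically on a toy case. The following is only a sketch under stated assumptions: the two one-dimensional fields, the step size, and the helper name `rk4` are illustrative choices and do not come from the text. It integrates each factor field separately, then the product field jointly, and compares the endpoints.

```python
import math

def rk4(field, state, h, steps):
    """Integrate d(state)/dt = field(state) with classical Runge-Kutta steps."""
    for _ in range(steps):
        k1 = field(state)
        k2 = field(tuple(s + 0.5 * h * k for s, k in zip(state, k1)))
        k3 = field(tuple(s + 0.5 * h * k for s, k in zip(state, k2)))
        k4 = field(tuple(s + h * k for s, k in zip(state, k3)))
        state = tuple(
            s + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
    return state

# Two hypothetical one-dimensional fields (illustrative only).
def field_1(state):
    (x,) = state
    return (x,)             # dx/dt = x, exact solution x0 * exp(t)

def field_2(state):
    (y,) = state
    return (-2.0 * y,)      # dy/dt = -2y, exact solution y0 * exp(-2t)

def product_field(state):
    """Product field on the product domain: it acts factorwise."""
    x, y = state
    return field_1((x,)) + field_2((y,))   # tuple concatenation

h, steps = 0.001, 1000      # integrate from t = 0 to t = 1
(x_end,) = rk4(field_1, (1.0,), h, steps)
(y_end,) = rk4(field_2, (3.0,), h, steps)
diagonal_end = rk4(product_field, (1.0, 3.0), h, steps)
```

Because the product field acts on each coordinate independently, the jointly integrated curve coincides with the pair of separately integrated curves, which is the "differentiation goes factorwise" argument of the proof in numerical form.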
Chapter 35
Hierarchies and Performance Scores
On trouve toujours l'homogène à un certain degré de division. (One always finds the homogeneous at a certain degree of division.)
Paul Valéry [538, I, p.209]
Summary. As a synthesis of the structural parts described in chapter 32 through chapter 34 we establish the overall structure of performance. The objects of performance structure are understood as being an additional score type, called performance score, layered over the given "symbolic" score like a system of optical lenses which 'deform' the rigid configuration of note symbols. The performance score is a global object built from an atlas of local performance scores. A local performance score is built from a hierarchy of performance cells, the very core of performance structures. We first describe the category of performance cells. Local performance scores are defined by a hierarchical construction principle: They are particular diagrams in the category of performance cells. The conceptual and musical background of such local hierarchies is evidenced through a series of examples, including the piano and violin hierarchies. We end up with the definition and exemplification of the concept of a (global) performance score.
35.1
Performance Cells
Summary. Performance cells are the very local data of performance. They comprise the cell's frame (a domain of definition), the symbolic kernel (a set of prima vista objects), the performance field, the initial set, and the initial performance.
The innermost local structure of performance is the performance cell. We already know all of its ingredients and shall now set up the formal definition of such a cell.
711 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
Definition 101 Given $\epsilon \in \{+, -\}$, a performance cell is a five-tuple
$$C = (K, R, \mathit{Ts}, I, \wp_I)$$
consisting of the following objects:
1. a local composition $K \subset SS$ whose space is simple of underlying module $\mathbb{R}^n$; $K$ is called the symbolic kernel of the cell;
2. a closed frame $R = [a_1, b_1] \times \ldots [a_n, b_n]$ contained in $\mathbb{R}^n$ and containing $K$;
3. a locally Lipschitz-continuous performance field $\mathit{Ts}$ which is defined¹ on $R$;
4. an initial set $I$ such that $K \subset I \int^{\epsilon} \mathit{Ts}$ (see equation (34.7));
5. an initial performance $\wp_I : I \to PS$, with codomain a physical space $PS$ with module $\mathbb{R}^n$, and such that for any point $X \in K \cap I \int^{\epsilon} \mathit{Ts}$ and any two points $a = \int_X^{\epsilon} \mathit{Ts}(\alpha)$, $b = \int_X^{\epsilon} \mathit{Ts}(\beta)$ in $I$, we have $\wp_I(b) - \wp_I(a) = (\beta - \alpha).\Delta$, where $\Delta = (1, \ldots 1)$ is the diagonal vector in $PS$'s $\mathbb{R}^n$.
Often, $I$ is given as a union of a finite family $(S_i)_i$ of possibly degenerate simplexes $S_i$, but this is not the general case. A performance cell is visualized by a tetrahedron as shown in figure 35.1. We therefore also call $(R, \mathit{Ts}, I, \wp_I)$ a performance body for $K$.

Figure 35.1: Visualization of a performance cell by a tetrahedron; the inner ball symbolizes the cell's symbolic kernel, whereas the 4-tuple of the tetrahedron's vertexes is called the performance body for the symbolic kernel.

Given a performance cell $C$, we automatically have a well-defined performance map $\wp_C : K \to PS$ which we shall always refer to when talking about the performance $\wp$ associated with $C$. Unless stated otherwise, all our cells will be $-$-cells; the theory for $\epsilon = +$ is the same. We shall only occasionally consider mixed signatures.

1 Recall that this means that $\mathit{Ts}$ is defined on an open neighborhood of $R$, but we only identify the field on $R$.
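To make the performance map of a cell concrete, here is a minimal one-dimensional sketch for the signature $\epsilon = -$. It rests on several assumptions that are not in the text: the field is positive on the frame, the initial set is a single point `x0` lying below the event `x`, and the names `rk4_step` and `perform_1d` are hypothetical. The curve through `x` is followed backward until it meets `x0` at curve parameter $t \le 0$, and the performance is then the initial performance minus $t$, as used in the proof of lemma 51.

```python
import math

def rk4_step(field, x, h):
    """One classical Runge-Kutta step for dx/dt = field(x)."""
    k1 = field(x)
    k2 = field(x + 0.5 * h * k1)
    k3 = field(x + 0.5 * h * k2)
    k4 = field(x + h * k3)
    return x + (h / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)

def perform_1d(field, x, x0, p0, h=1e-3, max_steps=1_000_000):
    """Perform the event x of a one-dimensional cell.

    Assumes field > 0 and x0 <= x, so the backward curve decreases toward x0.
    Returns p0 - t, where t <= 0 is the curve parameter at which x0 is hit.
    """
    if x < x0:
        raise ValueError("this sketch only follows the curve backward")
    t = 0.0
    for _ in range(max_steps):
        nxt = rk4_step(field, x, -h)
        if nxt <= x0:
            # Linear interpolation inside the last step to locate the hit.
            frac = (x - x0) / (x - nxt)
            return p0 - (t - frac * h)
        x, t = nxt, t - h
    raise RuntimeError("initial point not reached within max_steps")

# Constant field 2: the event at 4.0 is performed at 4.0 / 2 = 2.0.
constant_case = perform_1d(lambda e: 2.0, 4.0, 0.0, 0.0)
# Field 1 + e: the closed form of the performance of x is log(1 + x).
varying_case = perform_1d(lambda e: 1.0 + e, 1.0, 0.0, 0.0)
```

For a tempo field on onset, this is the familiar integral of the inverse tempo; the sketch only replaces the closed-form integral by following the integral curve to its hit point.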
35.2
The Category of Performance Cells
Summary. Performance cells constitute the objects of the category PerCell of performance cells, the morphisms being the technical expression of relations between performance data on different parameter spaces.
In nuce, we have already seen phenomena of related performance cells in the discussion of parallel fields as they arise in the $Onset \oplus Duration$ space. There, we had the projection $p_{Onset} : Onset \oplus Duration \to Onset$ which was compatible with the parallel field $\partial Tempo$ and the tempo curve $Tempo$. We shall now set up the formal statement behind those incipits. What we want is a concept of a morphism $C_1 \to C_2$ which defines the category PerCell of performance cells such that the associated performances are compatible with this morphism.

Definition 102 (Recall that we have fixed the signature $\epsilon = -$ in the following discussion!) Let $C_1 = (K_1, R_1, \mathit{Ts}_1, I_1, \wp_{I_1})$ and $C_2 = (K_2, R_2, \mathit{Ts}_2, I_2, \wp_{I_2})$ be two performance cells living in spaces $SS_1$ and $SS_2$. Suppose further that we have a projection $p : SS_1 \to SS_2$ of the parameter space $SS_1$ onto the parameter space $SS_2$ such that the underlying projection $p : \mathbb{R}^{n_1} \to \mathbb{R}^{n_2}$ is the projection onto a subset of coordinates. Then $p$ is a morphism of performance cells $p : C_1 \to C_2$ iff the following conditions are verified:
1. $p(K_1) \subset K_2$ (in other words: $p : K_1 \to K_2$ is a morphism of local compositions);
3. $Tp \circ \mathit{Ts}_1 = \mathit{Ts}_2 \circ p$, i.e., a morphism of vector fields $p : \mathit{Ts}_1 \to \mathit{Ts}_2$;
5. $p \circ \wp_{I_1} = \wp_2 \circ p|_{I_1}$ (here, $p : PS_1 \to PS_2$ denotes the corresponding projection on the physical spaces).
Lemma 51 With the above notation, if $p : C_1 \to C_2$ is a morphism, then we have
$$p \circ \wp_1 = \wp_2 \circ p \qquad (35.1)$$
with the homonymous $p$ symbol for the mental and physical projections.

Proof. Let $X \in K_1$, and suppose that $\int_X \mathit{Ts}_1$ hits $I_1$ at the point $Y = \int_X \mathit{Ts}_1(t)$. (By axiom 5 of the definition 101 of a performance cell, it does not matter which hit point we are selecting.) Then, $\wp_1(X) = \wp_{I_1}(Y) - t.\Delta_1$, and by linearity of $p$, $p \circ \wp_1(X) = p \circ \wp_{I_1}(Y) - t.\Delta_2$. But $p \circ \wp_{I_1}(Y) = \wp_2(p(Y)) = \wp_{I_2}(Z) - s.\Delta_2$, if $Z$ is the hit point for $p(Y)$ in $I_2$ and $s$ is the curve parameter for this hit point. Hence, $p \circ \wp_1(X) = \wp_{I_2}(Z) - (s + t).\Delta_2$. But integral curves of $\mathit{Ts}_1$ are projected into integral curves of $\mathit{Ts}_2$ by axiom 3 of definition 102 and by the uniqueness of integral curves (see the fundamental theorem of ODE 78, appendix 1.2.2). Therefore, the curve parameter $s + t$ is also the parameter of the integral curve $\int_{p(X)} \mathit{Ts}_2$ where it hits $Z$, and this means that $\wp_{I_2}(Z) - (s + t).\Delta_2 = \wp_2(p(X))$, QED.
This means that performances of performance cells behave in the sense of natural transformations with respect to the category PerCell.

Exercise 78 Work out the example of a parallel articulation field projection and its associated performance body data, as discussed in section 34.5, to obtain a morphism of performance cells.

The next proposition gives us means to construct product performance bodies and product cells from two given cells.

Proposition 58 Suppose that we are given two performance cells
$$C_1 = (K_1, R_1, \mathit{Ts}_1, I_1, \wp_{I_1}), \quad C_2 = (K_2, R_2, \mathit{Ts}_2, I_2, \wp_{I_2})$$
living in spaces $SS_1$ and $SS_2$. Let $SS$ be the product of $SS_1$ and $SS_2$, with projections $p_i : SS \to SS_i$, $i = 1, 2$. Then the following data defines a performance cell $C$ on $SS$:
1. the frame is $R = R_1 \times R_2$;
2. the field is $\mathit{Ts} = \mathit{Ts}_1 \times \mathit{Ts}_2$;
3. the symbolic kernel is any sub-composition $K \subset K_1 \times K_2$;
4. the initial set is $I = I_1 \int^{\epsilon} I_2$;
5. on $I$, the initial performance $\wp_I$ is defined as follows: if $(x, y) \in I_1 \times I_2 \int^{\epsilon} \mathit{Ts}_2$, then we set $\wp_I(x, y) = (\wp_{I_1}(x), \wp_2(y))$, and symmetrically for the other case $(x, y) \in I_1 \int^{\epsilon} \mathit{Ts}_1 \times I_2$.
With this data, we have two morphisms $p_i : C \to C_i$, $i = 1, 2$ of performance cells.

Proof. As to the construction of $C$, we only have to check the relation $K \subset I \int^{\epsilon} \mathit{Ts}_1 \times \mathit{Ts}_2$. From sorite 11, we know that $I \int^{\epsilon} \mathit{Ts}_1 \times \mathit{Ts}_2 = I_1 \int^{\epsilon} \mathit{Ts}_1 \times I_2 \int^{\epsilon} \mathit{Ts}_2$. But the latter evidently contains $K_1 \times K_2$, and a fortiori $K$, whence the claim. The morphisms $p_i$ are now straightforward by construction, QED.

The performance cell $C$, together with the two morphisms $p_1, p_2$, will be called the product of the cells $C_1, C_2$ and denoted by $C_1 \times C_2|K$, or simply $C_1 \times C_2$ if $K = K_1 \times K_2$. Although the initial set $I_1 \int^{\epsilon} \mathit{Ts}_1 \times I_2 \int^{\epsilon} \mathit{Ts}_2$ of the product performance cell is not a union of simplexes in general, the product formula of sorite 11 guarantees that the calculation method for hit points on simplexes discussed in section 34.6.2 may be applied to each component of an event $X = (X_1, X_2)$ if the factor cells have initial sets which are built from simplexes.
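The product construction and the compatibility (35.1) with the projections can be illustrated on the simplest possible case. The sketch below is not the author's implementation; it assumes two one-dimensional cells with constant positive fields and one-point initial sets, for which integral curves are straight lines, and the names `ConstantCell` and `perform_product` are hypothetical. The product event is carried backward along the diagonal curve until the first factor reaches its initial point; the initial performance there is assembled as in item 5 of proposition 58.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ConstantCell:
    """One-dimensional cell with a constant positive field and one initial point."""
    speed: float   # constant value of the performance field
    x0: float      # the initial set is the single point {x0}
    p0: float      # initial performance at x0

    def hit_parameter(self, x: float) -> float:
        # The curve through x is t -> x + speed * t; it meets x0 at this t.
        t = (self.x0 - x) / self.speed
        if t > 0:
            raise ValueError("x0 is not reached on the backward curve")
        return t

    def perform(self, x: float) -> float:
        # Initial performance minus the curve parameter of the hit point.
        return self.p0 - self.hit_parameter(x)

def perform_product(c1: ConstantCell, c2: ConstantCell, x: float, y: float):
    """Perform (x, y) in the product cell via its first backward hit point."""
    t1, t2 = c1.hit_parameter(x), c2.hit_parameter(y)
    t = max(t1, t2)                      # first hit when moving backward
    hit_x, hit_y = x + c1.speed * t, y + c2.speed * t
    if t == t1:
        # Hit point lies in I1 x (I2 followed along Ts2).
        initial = (c1.p0, c2.perform(hit_y))
    else:
        # Symmetric case: (I1 followed along Ts1) x I2.
        initial = (c1.perform(hit_x), c2.p0)
    return (initial[0] - t, initial[1] - t)

onset_cell = ConstantCell(speed=2.0, x0=0.0, p0=0.0)
pitch_cell = ConstantCell(speed=1.0, x0=0.0, p0=0.0)
product_performance = perform_product(onset_cell, pitch_cell, 4.0, 3.0)
```

In this toy case each coordinate of the product performance equals the performance of the projected event in the corresponding factor cell, which is what (35.1) asserts for the two projections of the product.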
35.3
Hierarchies
Summary. Hierarchies are space diagrams arising in local performances. They trace the functional organization among the arguments of performance fields.
In order to obtain more concrete results, we shall first consider a particular system of symbolic and physical spaces. We suppose that we are given a series $B_i$, $i = 1, 2, \ldots$ of pairwise different simple spaces, called basis spaces, as well as an equipollent series $P_i$, $i = 1, 2, \ldots$ of spaces, called pianola spaces, with $P_i \underset{Id}{\longrightarrow} \mathrm{Syn}(B_i)$, while $B_i \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R}^{n_i})$. Often, we shall consider finite products $B_{i_1} \times B_{i_2} \times \ldots B_{i_k} \times P_{j_1} \times \ldots P_{j_t}$ of such spaces, always ordered with basis spaces first and pianola spaces second, and each space type ordered by increasing index (no repetitions!). This means that we in fact parametrize such products by finite subsets of the name set $BP = B \cup P$, $B = \{B_i \mid i = 1, 2, \ldots\}$, $P = \{P_i \mid i = 1, 2, \ldots\}$. We shall then identify these product spaces with the simple space $B_{i_1} \oplus B_{i_2} \oplus \ldots B_{i_k} \oplus P_{j_1} \oplus \ldots P_{j_t} \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R}^N)$ associated with the direct sum $\mathbb{R}^N$ of all involved copies of real vector spaces. If the defining sequence $B_{i_1}, \ldots B_{i_k}, P_{j_1}, \ldots P_{j_t}$ is denoted by $U$, we shall also rename the space $B_{i_1} \oplus B_{i_2} \oplus \ldots B_{i_k} \oplus P_{j_1} \oplus \ldots P_{j_t}$ by $\oplus U$. By definition, the space associated with the empty sequence $\emptyset$ is the simple space $\oplus\emptyset \underset{Id}{\longrightarrow} \mathrm{Simple}(\mathbb{R}^0)$ of the zero module.

This generalizes the nomenclature introduced in the description of standard spaces (see equations (6.69) ff.), such as the piano space $Onset \oplus Pitch \oplus Loudness \oplus Duration \cong Onset \times Pitch \times Loudness \times Duration$ referring to (symbolic) onset, pitch, loudness, and duration, and represented by $\mathbb{R}^4$. For any subsequence of symbols $V = B_{u_1}, B_{u_2}, \ldots B_{u_r}, P_{v_1}, \ldots P_{v_s}$ of $U = B_{i_1}, B_{i_2}, \ldots B_{i_k}, P_{j_1}, \ldots P_{j_t}$, we have a canonical projection
$$p_{U,V} : \oplus U \to \oplus V$$
of such standard spaces, also denoted by $p_V$ or $p$ if no ambiguities are possible. For any two such sequences $V, W$, we denote by $V \cup W$ ($V \cap W$) the sequence defined by the union (intersection) of the basis and pianola symbols in $BP$. Similarly for other set-theoretic operations, such as the complement in $BP$, differences $V - W$, etc. This means that we consider the Boolean algebra $\mathrm{Sub}(BP)$. When considering the spaces $\oplus U$ associated with such sequences $U$, we often speak loosely about the sequences and mean the spaces; for example, we speak of the "union of spaces" $\oplus U, \oplus V$ and mean the space $\oplus(U \cup V) = (\oplus U) \oplus (\oplus V)$, etc., but no confusion should occur.
Definition 103 Given a finite space collection $BP$, a space hierarchy in $BP$ is a non-empty sublattice $H \subset \mathrm{Sub}(BP)$ (closed under finite unions and finite intersections), with maximal element (the top space) $\mathrm{Top}(H)$; $H$ is viewed as a category with $U \to V$ iff $V \subset U$. The minimal nonempty elements of $H$ are called fundamental spaces; their set is denoted by $\mathrm{Fund}(H)$. For any non-empty space $U \in \mathrm{Sub}(BP)$ which is contained in $\mathrm{Top}(H)$, we denote by $Cl_H(U)$ the unique smallest space in $H$ which contains $U$, and call it the hierarchy closure of $U$. A hierarchy space $U \in H$ is called indecomposable if it is not the union of two disjoint non-empty subspaces of $H$.

The motivation for the hierarchy concept is that we want to consider systems of performance cells which are related by morphisms, such as the parallel field morphisms or the product cells and their projections. Moreover, we have to group performance cells which are dominated by a root cell, since all score events will be performed in one big space whose projections are however compatible with the root space.
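The notions of definition 103 are purely combinatorial and can be sketched with finite sets of space names. The following is an illustrative model only: the generator family, the single-letter names E, H, L, D, and all function names are assumptions made here, not data from the text.

```python
from itertools import combinations

def generate_hierarchy(generators):
    """Close a family of finite name sets under union and intersection."""
    spaces = {frozenset(g) for g in generators}
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so that new spaces can be added safely.
        for u, v in combinations(list(spaces), 2):
            for w in (u | v, u & v):
                if w not in spaces:
                    spaces.add(w)
                    changed = True
    return spaces

def top(hierarchy):
    """Top space: the largest element, i.e. the union of all spaces."""
    return max(hierarchy, key=len)

def fundamental_spaces(hierarchy):
    """Minimal non-empty elements of the hierarchy."""
    nonempty = [u for u in hierarchy if u]
    return {u for u in nonempty if not any(v < u for v in nonempty)}

def closure(hierarchy, u):
    """Smallest hierarchy space containing the name set u."""
    u = frozenset(u)
    candidates = [v for v in hierarchy if u <= v]
    if not candidates:
        raise ValueError("u is not contained in the top space")
    return min(candidates, key=len)

def has_morphism(u, v):
    """In the category H there is an arrow U -> V iff V is contained in U."""
    return frozenset(v) <= frozenset(u)

# Hypothetical example: onset E, pitch H, loudness L, and onset with duration ED.
example = generate_hierarchy([{"E"}, {"H"}, {"L"}, {"E", "D"}])
```

In this example the duration name D never occurs alone, so its hierarchy closure is the space ED, mirroring the situation where duration is only performed together with onset.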
Lemma 52 Given a space hierarchy $H$, let $\mathrm{Fund}(H) = \{F_1, \ldots F_k\}$ be the set of fundamental spaces of $H$. Then
(i) Each $F_i$ is contained in a unique maximal indecomposable space $T(F_i)$ of $H$.
(ii) Any two different maximal indecomposable spaces in $H$ are disjoint.
(iii) The top space $\mathrm{Top}(H)$ is the disjoint union of its maximal indecomposable subspaces.

Proof. The union of two indecomposable superspaces of each $F_i$ in $H$ is indecomposable, therefore there is a unique maximal indecomposable superspace $T(F_i)$ for each $F_i$. Evidently, the maximal indecomposable spaces are mutually disjoint, i.e., for $T(F_i) \neq T(F_j)$, $T(F_i) \cap T(F_j) = \oplus\emptyset$. The existence of a decomposition of $\mathrm{Top}(H)$ as a product of maximal indecomposable subspaces follows by induction on the cardinality of $\mathrm{Top}(H)$, QED.

The sets of fundamental spaces contained in one maximal indecomposable space of $H$ are called the blocks of the fundament. Hence the blocks are in bijection with the maximal indecomposable subspaces.

Definition 104 If an indecomposable space contains a unique (proper) maximal subspace (including the zero space), it is called irreducible.
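Lemma 52 and definition 104 can be checked mechanically on small hierarchies. The sketch below uses two hypothetical hierarchies built from single-letter space names; the helper names and the convention that the zero space is excluded from the indecomposable spaces are assumptions of this illustration.

```python
from itertools import product

def is_indecomposable(hierarchy, u):
    """True if u is not a union of two disjoint non-empty subspaces of H."""
    if not u:
        return False   # convention assumed here: the zero space is excluded
    parts = [v for v in hierarchy if v and v < u]
    return not any(a | b == u and not (a & b) for a in parts for b in parts)

def maximal_indecomposables(hierarchy):
    ind = [u for u in hierarchy if is_indecomposable(hierarchy, u)]
    return {u for u in ind if not any(u < v for v in ind)}

def is_irreducible(hierarchy, u):
    """Indecomposable with exactly one maximal proper subspace (zero space allowed)."""
    if not is_indecomposable(hierarchy, u):
        return False
    proper = [v for v in hierarchy if v < u]
    maximal = [v for v in proper if not any(v < w for w in proper)]
    return len(maximal) == 1

# Hypothetical hierarchy 1: spaces are unions of one of (zero, E, ED)
# with a subset of {H, L}.
bases = [frozenset(), frozenset("E"), frozenset("ED")]
extras = [frozenset(), frozenset("H"), frozenset("L"), frozenset("HL")]
split_hierarchy = {b | x for b, x in product(bases, extras)}

# Hypothetical hierarchy 2: E below EH and EL, with top EHL.
chain_hierarchy = {frozenset(s) for s in ("", "E", "EH", "EL", "EHL")}

blocks = maximal_indecomposables(split_hierarchy)
```

In the first hierarchy the top space EHLD splits into the disjoint maximal indecomposable spaces ED, H, and L. In the second, the top space EHL is indecomposable but not irreducible, since it is the union of its two maximal subspaces EH and EL.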
Hence, every indecomposable space is either irreducible or it is the union of its maximal subspaces. Fundamental spaces are irreducible, the zero space is not. With this we may now proceed to define the performance hierarchies. To this end, we consider the zero performance cell $C_{\emptyset}$ which is defined by a frame over the zero space $\oplus\emptyset$ and has everything trivial: unique zero field, trivial frame, one-point initial set, zero initial performance. Here is the relation of space hierarchies to performance theory (see also figure 35.2):
Definition 105 Given a space hierarchy $H$, a cellular hierarchy is a diagram (in fact, a functor on the category $H$) $\mathfrak{h} : H \to \mathrm{PerCell}$ with values in the category of cells PerCell such that for each $U \in H$, $\mathfrak{h}(U)$ is a cell in the space $\oplus U$, with morphisms being denoted by $\mathfrak{h}(p_{U,V})$, $p_{U,V}$, $p_V$, or $p$ if no ambiguities are likely. The domain $H$ of $\mathfrak{h}$ is called the type of the cellular hierarchy.
The meaning of a cellular hierarchy is that we are given a performance on the kernel of the top cell $\mathrm{Top}(\mathfrak{h}) = \mathfrak{h}(\mathrm{Top}(H))$, which is compatible with the performances on all other cells $\mathfrak{h}(U)$ of $\mathfrak{h}$. So the parameters of top kernel events which are also grouped on lower hierarchy spaces can be performed independently of the other parameters. Correspondingly, the performance field components grouped on a lower hierarchy space are independent of the other space parameters, a situation already encountered for the articulation hierarchy $Onset \oplus Duration \to Onset$, for example. Moreover, if we have two cells $\mathfrak{h}(U), \mathfrak{h}(V)$, their performance fields $\mathit{Ts}_U, \mathit{Ts}_V$ evidently define performance fields $\mathit{Ts}_{U \cup V}, \mathit{Ts}_{U \cap V}$ on the arguments of the union and intersection of their spaces, and this is met by the lattice structure of $H$. The signification of a space being irreducible now also becomes more evident: If a hierarchy space is the union of two proper subspaces, any coordinate of the performance of an event may be calculated via projection of the event into an appropriate subspace. However, for an irreducible
space, we have to make an extra effort to know how its kernel events are performed since no subspace will give us full information. The parallel field construction from section 33.2.1 is a standard example for this: The duration component of the articulation field is not reducible to the cell of a subspace, whereas the onset component is. So the hierarchy does not tell us how to compute the duration component. The parallel construction is one possibility among an infinity of others to deduce the duration component from the fundamental tempo field by means of a special "formula" which in more generality has been explicated in formula (33.10).

Figure 35.2: A cellular hierarchy is shown, together with the initial sets (blue simplexes), the performance fields, the projections, and the symbolic kernels (red points).

If we are given a space hierarchy $H$ with $\mathrm{Top}(H) = \oplus BP = \oplus B \oplus P$, we shall henceforth assume that for each pianola component $P_i$ of $P$, the corresponding basis component $B_i$ is contained in the basis $B$, whereas a basis component $B_i$ may happen to live alone without its pianola counterpart in $P$. Such hierarchies are called standard if we have to distinguish them from other, non-standard hierarchies. For a standard space hierarchy, we therefore always have
the alteration projections $\alpha_{\pm} : \oplus BP \to \oplus B$. It is the usual alteration on the couples $\oplus B_i \oplus P_i$, and the identity on the unparalleled basis components.
35.3.1
Operations on Hierarchies
Summary. Hierarchies can be altered and recombined according to a set of standard operations. Such operations intervene in practical calculations of performance transformations in software algorithms.
Given a cellular hierarchy $\mathfrak{h}$, if $U$ is any space of the underlying space hierarchy $H$, we may consider the sublattice $H|U$ of $H$ whose top space is $U$, and restrict the cellular hierarchy $\mathfrak{h}$ to this sublattice. The restriction is denoted by $\mathfrak{h}|U$. If we are given two cellular hierarchies $\mathfrak{h}, \mathfrak{k}$ with space hierarchies $H, K$, whose spaces pertain to the given space collection $BP$, and such that their top spaces are disjoint, then we have the product cellular hierarchy $\mathfrak{h} \times \mathfrak{k}$ whose domain is the space lattice with spaces $U \cup V$, $U \in H$, $V \in K$. Clearly, in this product space hierarchy, a pair $U \cup V$ extends $U' \cup V'$ iff each component does so in the respective space hierarchy $H$, $K$. So for each such relation and the associated morphism $p : U \cup V \to U' \cup V'$, we have the corresponding morphism of performance cells
$$p : \mathfrak{h}(U) \times \mathfrak{k}(V) \to \mathfrak{h}(U') \times \mathfrak{k}(V').$$
The only non-trivial statement within this fact concerns property 4 of definition 102 and follows from the idempotency of the operator $\int^{\epsilon} \mathit{Ts}$ (sorite 11, (ii)).

Given a space $U$ in the name space $B$ of basis spaces, we call a parallel space to $U$ and denote by $\partial U$ the space $U \cup P|U$ consisting of $U$ and the space $P|U$ of all the corresponding pianola space components. If a cellular hierarchy $\mathfrak{h}$ is such that for each projection $\partial U \to U$ of spaces in its space hierarchy $H$, the performance field over $\partial U$ is the parallel field $\partial\mathit{Ts}$ to the field $\mathit{Ts}$ of $U$ (i.e., of the cell $\mathfrak{h}(U)$) in the sense defined in formula (33.10), then we say that $\mathfrak{h}$ is a parallel hierarchy. Parallel hierarchies are the default hierarchies where performance is initiated.
35.3.2
Classification Issues
Summary. For small sets of parameters, frame structures of cellular hierarchies are completely classified. We describe the classification for hierarchies involving tempo, intonation, and dynamics.
We do not claim a classification of the full-fledged cellular hierarchies; this is a much too difficult task, and it is not of primary interest. It is easier and perhaps more relevant for practical reasons to classify "frame structures" for concrete cellular hierarchies. Such a frame structure is the hierarchical organization of the hierarchy's performance fields. We shall present a complete classification of hierarchies sitting over the set $B = \{E, H, L\}$ of the three usual basis spaces $E = Onset$, $H = Pitch$, $L = Loudness$. For each hierarchy space $U = E, H, L, EH, EL, HL, EHL$, we denote the corresponding field as built from the symbols $T$ = Tempo, $I$ = Intonation, $D$ = Dynamics. Hence, $T$ is the field over $E$, $I$ over $H$, $D$ over $L$, whereas $TD$ denotes the field over $EL$, etc., and $TID$ the field over $EHL$. We write $TI \times D$ for a product field corresponding to the space $EHL$ which is decomposable into the subspaces $EH$ and $L$ in the given hierarchy $\mathfrak{h}$. Figure 35.3 shows the classification Hasse diagram, with a straight line from every hierarchy to its next specializations. Here, specialization means that field components become independent of certain parameters as the space hierarchies become increasingly split.
Figure 35.3: The complete classification Hasse diagram of basis hierarchy frame structures for onset ($E$), pitch ($H$), and loudness ($L$) in terms of corresponding fields, including specialization (straight lines) to more split fields according to functional independency of parameters.

For a performance field $\mathit{Ts} = \mathit{Ts}_U$, $U \in H$ of a cellular hierarchy $\mathfrak{h}$ with space hierarchy $H$, we may ask for its functional dependence within $\mathfrak{h}$. Let $\underline{\mathit{Ts}} = \mathit{Ts}_{\underline{U}}$ be the field of the maximal subspace $\underline{U}$ of $U$ in $H$. This is called the territory of $\mathit{Ts}$, and describes the portion of $\mathit{Ts}$ which is completely determined by lower hierarchy data. By the extraterritorial part of $\mathit{Ts}$, denoted by $\mathit{Ts}_{ex}$, we mean the projection of $\mathit{Ts}$ to the complementary space $U - \underline{U}$, so that we have $\mathit{Ts} = (\underline{\mathit{Ts}}, \mathit{Ts}_{ex})$. In general, the extraterritorial part of $\mathit{Ts}$ is a function of its territory and of a system $Para$ of parameters which are external to the hierarchy; we write $\mathit{Ts}_{ex} = \mathit{Ts}_{ex}(\underline{\mathit{Ts}}, Para)$ to indicate this dependency. So the performance fields of the fundamental spaces, the fundamental fields, play a primordial role in the construction of the hierarchical architecture, which is enriched by external parameters in the case of irreducible spaces (for others, the parameters are non-existent).

Example 53 Suppose we are given a parallel hierarchy with just one parallel projection $\partial\mathit{Ts} \to \mathit{Ts}$ over the space projection $\partial U \to U$, according to the formula $\partial\mathit{Ts} = (\mathit{Ts} \circ p_U, 2.\mathit{Ts} \circ \alpha_{+} - \mathit{Ts} \circ p_U)$ (33.10). Here, we have the empty parameter set $Para = \emptyset$, and $\underline{\partial\mathit{Ts}} = \mathit{Ts}$, $\partial\mathit{Ts}_{ex} = 2.\mathit{Ts} \circ \alpha_{+} - \mathit{Ts}$. If we generalize this configuration by the extraterritorial part, then
$$\partial\mathit{Ts}_{\lambda,ex} = \frac{1 + \lambda}{\lambda}.\mathit{Ts} \circ \alpha_{+} - \mathit{Ts}.$$
The parameter set is $Para = \{\lambda\}$, and we recover the old situation by a specialization $\lambda \to 1$. We shall see in the discussion of performance operators that this parametrization is a natural one in the context of operators for articulation (legato, staccato).

These concepts apply to the standard situations while performing special musical effects which we list here:
1. Pianola deformations² of parallel pianola fields of tempo:
$$\mathit{Ts} = (T, Z_{ex}(T, Para))$$
with specialization to $\partial T$. This is used for articulation as discussed above, and for local performance cells of ornaments and ties.
2. Deformation of dynamics over tempo:
$$TD = (T, TD_{ex}(T, Para))$$
with specialization to $T \times D$. This is used for ondeggiando (bow vibrato) effects and prima vista dynamics accentuation following bar-lines and time signatures.
3. Deformations of tempo over intonation:
$$TI = (I, TI_{ex}(I, Para))$$
with specialization to $T \times I$. This is used for arpeggios if we agree to quit simultaneity of arpeggio events and to change their physical onsets.

2 Deformation and specialization are reciprocal processes. The first is the embedding of a particular structure in a topologically dominant set of variants (usually in some Zariski topology of algebraic geometry), the second is the restriction of an irreducible set of variants to a special, closed subset, or even a single point, usually by specialization of parameters, see also Appendix F.
The last two deformation types are shown in the lower part of figure 35.3. More systematically, this classification shows these types of specialization (we refer to figure 35.3 in this list):
1. Type: shown on the lowest line, no extraterritorial part, only the one-dimensional fundamental fields $T, I, D$.
2. Type: shown on the second lowest line, on top no extraterritorial part, in the middle of the hierarchy one two-dimensional field (e.g., $ID = (D, ID_{ex}(D, Para))$ in the left extreme hierarchy) with a one-dimensional territory (in the example: $D$), and two fundamental spaces (in the example: $T, D$).
3. Type: shown on line three from below. One dimension is extraterritorial and we have two fundamental one-dimensional fields (e.g., $D$ in the left extreme hierarchy, with³ $TID = (T, I, TID_{ex}(T \times I, Para))$ and the fundamental fields $T, I$).
4. Type: line three from above. Only one fundamental field, above which a one-dimensional extraterritorial field component is built, and the same for the top space: it is a one-dimensional extension of its two-dimensional territory (e.g., the left extreme hierarchy with $TID = (TI, TID_{ex}(TI, Para))$, $TI = (T, TI_{ex}(T, Para))$).
5. Type: second line from top, right half. The fundament is two-dimensional and the top field has a one-dimensional extraterritorial part (e.g., the right extreme hierarchy with $TID = (TD, TID_{ex}(TD, Para))$).
6. Type: second line from top, left half. The fundament is a one-dimensional field, and the top field has a two-dimensional extraterritorial part (e.g., the left extreme hierarchy with $TID = (T, TID_{ex}(T, Para))$).
7. Type: top line. Here we are left with the single total field $TID$ without any subspaces; it is at the same time its own irreducible territory and fundamental field.

Nonetheless, complete classification of cellular hierarchies should be a major issue of future research, although a difficult one. To this end, one notices that a morphism $\mathfrak{h} \to \mathfrak{k}$ between cellular hierarchies should be defined as a natural transformation of these functors which, by definition, start from one and the same space hierarchy. However, the "horizontal" natural morphisms of performance cells should be generalized beyond simple projections of parameter spaces, since it is natural to say that a performance cell is isomorphic to another cell if the second is generated by any reasonable⁴ diffeomorphism between their frames. So cellular hierarchies have only projections as morphisms, while the general concept of a morphism between performance cells is a more general one. We cannot digress on this issue since nothing non-trivial is known to date. Let us nonetheless denote the category of cellular hierarchies by $\mathfrak{H}$, although the morphisms are not made precise here, whereas the objects are.

3 We just write down the whole sequence of territorial fields in this formula.
35.3.3
Example: The Piano and Violin Hierarchies
Summary. We describe the default hierarchy associated with piano and with violin scores.
Figure 35.4: The default piano cellular hierarchy. (Diagram labels: Top, with field $\partial T \times I \times D$; Fundament.)

Figure 35.4 shows the default hierarchy for piano music. This is the hierarchy which one has to start with when shaping performance. We see that pianola parameters are not given, except for duration, since on the piano, glissando or crescendo parameters for single notes are not feasible. The basis fields are also completely independent (no coupling), and we have no parameters for extraterritorial parts, since the parallel field is determined by the underlying territory $T$. Figure 35.5 shows the default hierarchy for violin music. It is first characterized by a double parameter extension: on the one hand, we now have the crescendo extension, on the other, we have the glissando extension. Second, the fundament is reduced to onset $E$ and pitch $H$. This system also includes two primavista parameters $\lambda, \Lambda$. The point here is that there are primavista predicates for violin, such as "ondeggiando", an $E$-periodic change of loudness, which ask for refined parametrization already in primavista hierarchies. The same holds for the articulation field $\partial T_{\lambda}$. Details of these performance operators will be explained in section 44.7.
4 For example, an affine deformation of the frames.
Figure 35.5: The default violin cellular hierarchy. (Diagram labels: Root, Piano Root, Crescendo Extension, Glissando Extension, Fundament.)
35.4
Local Performance Scores
Summary. Cellular hierarchies are the core ingredient of the local performance units, but we are not yet in a position to control all the locally relevant parameters. Moreover, we have to prepare for the future unfolding processes (stemma theory). Local performance scores are the complete local structures for performance; we give the technicalities in the language of the space form LocPerfScore of local performance scores.
So far, a cellular hierarchy lacks several specifications which are mandatory in order to be able to perform on an instrumental basis, and in order to hook a single hierarchy into a chain of unfolding performance stages. We also do not have any reference to parameters which might contribute to the specific performance hierarchy. In what follows, we shall for several reasons set up such an environment in the language of denotators. First, we ought to englobe the language of performance theory in the general denotator concept framework, homework which we have not done to this point. This is particularly important for any software developments of performance tools since the denotator language is the lingua franca of all our theoretical perspectives when they are implemented on the software level. Second, it turns out that the very definition of a local performance score is circular and therefore cannot be expressed in usual terms of mathematical theory. It can be easily expressed on the level of object-oriented programming, and this is one reason why denotators are so useful: Their formalism fits perfectly in the object-oriented paradigm.
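The circularity mentioned above, where a local performance score refers to other local performance scores as mother and daughters, is easy to state in an object-oriented language. The following is a hypothetical sketch only: the class and attribute names are assumptions modeled on the six components named in the definition that follows, the component types are left open, and it is not the RUBATO® data model.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Any, List, Union

@dataclass
class Score:
    """Stand-in for a denotator of some score form (hypothetical)."""
    name: str

@dataclass(eq=False)
class LocalPerformanceScore:
    """Recursive record: the mother is either a score or another LPS."""
    mother: Union[Score, LocalPerformanceScore]
    cell_hierarchy: Any = None
    instrument: Any = None
    operator: Any = None
    weight_list: List[Any] = field(default_factory=list)
    daughters: List[LocalPerformanceScore] = field(default_factory=list)

    def unfold(self, operator: Any = None, weight_list=None) -> LocalPerformanceScore:
        """Derive a daughter that inherits hierarchy and instrument from self."""
        daughter = LocalPerformanceScore(
            mother=self,
            cell_hierarchy=self.cell_hierarchy,
            instrument=self.instrument,
            operator=operator,
            weight_list=list(weight_list or []),
        )
        self.daughters.append(daughter)
        return daughter

    def primary_score(self) -> Score:
        """Walk up the mother references until a plain score is reached."""
        node = self
        while isinstance(node.mother, LocalPerformanceScore):
            node = node.mother
        return node.mother

symbolic = Score("prima vista score")
primary_mother = LocalPerformanceScore(mother=symbolic)
first_daughter = primary_mother.unfold(operator="hypothetical tempo operator")
granddaughter = first_daughter.unfold()
```

The self-reference in the type of `mother` and `daughters` is exactly what a non-circular mathematical definition cannot express directly, while a class definition handles it without difficulty.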
The following definition of the performance score space will be given top-down in the sense that it has certain spaces in its ramification tree which are not yet made fully explicit. See figure 35.6 for its visualization by a double tetrahedron. This will be completed in subsequent chapters, but this is the right place to introduce this tree structure since the role of its components will be clearer if we have presented them in their functionality rather than in a full-size space definition.
Figure 35.6: The visualization of the six instances of a local performance score (LPS) on a double tetrahedron (vertex labels recoverable from the figure: Mother, Instrument, Operator, Daughters).

We shall henceforth abbreviate LPS = Local Performance Score. Here is the space for LPS. We still suppose that we are working in a double sequence B, P of basis and pianola spaces, as introduced in section 35.3, including this notation.
\[
\mathit{LocPerfScore}_{BP} \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(\mathbb{D}), \quad \mathbb{D} = \mathit{Mother}, \mathit{Daughters}, \mathit{CellHierarchy}_{BP}, \mathit{Instrument}, \mathit{Operator}, \mathit{WeightList}_{BP}. \tag{35.2}
\]
Here is the meaning of these factors:
Mother. The mother form is the reference to another LPS, from which the given one may inherit a number of properties. It may also happen that there is no mother, i.e., our LPS is already a "primary mother", in which case a denotator of a score form ScoreForm, such as the common piano score form defined in [378], will be set instead of the referenced mother, i.e.,
\[
\mathit{Mother} \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(\mathit{ScoreForm}, \mathit{LocPerfScore}), \qquad \mathit{ScoreForm} \xrightarrow[\mathit{Id}]{} \ldots(\ldots) \tag{35.3}
\]
(any adequate score form).
Daughters. The present LPS may be related to a finite set of other LPS which are derived from this LPS. The members of such a set are termed daughters of this LPS; the corresponding form is that of finite local compositions over LocPerfScore:
\[
\mathit{Daughters} \xrightarrow[\mathrm{Fin}(F)\,\rightarrowtail\,\Omega^{F}]{} \mathrm{Power}(\mathit{LocPerfScore}) \quad \text{with } F = \mathrm{Fun}(\mathit{LocPerfScore}). \tag{35.4}
\]
$\mathit{CellHierarchy}_{BP}$. This space parametrizes cellular hierarchies. A cellular hierarchy may be parametrized as a set of performance cell denotators $\mathfrak{h}(U)$, $U \in H$, $H$ the space hierarchy of $\mathfrak{h}$, such that the space of any such denotator also identifies the space name $U$. Since space names are supposed to be unique for spaces, we may just retain the space names $U$ to denote these spaces. In this nomenclature, we shall also denote by $PU$ the physical space associated with $U$ within our total space system $BP$. So cellular hierarchies are modeled by
\[
\mathit{CellHierarchy}_{BP} \xrightarrow[\mathrm{Fin}(F)\,\rightarrowtail\,\Omega^{F}]{} \mathrm{Power}(\mathit{Cell}_{BP}) \quad \text{with } F = \mathrm{Fun}(\mathit{Cell}_{BP}) \tag{35.5}
\]
and
\[
\mathit{Cell}_{BP} \xrightarrow[\mathit{Id}]{} \mathrm{Colimit}(\mathbb{D}) \tag{35.6}
\]
with $\mathbb{D} = (\mathit{Cell}_U)_{U \in BP}$. So we are left with the space $\mathit{Cell}_U$ of performance cells of space $U$. By definition, we have this structure:
\[
\mathit{Cell}_U \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(\mathit{InitSet}_U, \mathit{Frame}_U, \mathit{Kernel}_U, \mathit{Field}_U, \mathit{InitPerf}_U). \tag{35.7}
\]
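The initial set space $\mathit{InitSet}_U$ parametrizes either sets of simplexes or just any local composition in $U$, i.e.,
\[
\mathit{InitSet}_U \xrightarrow[\mathit{Id}]{} \mathrm{Colimit}(I_U, \mathit{Simplexes}_U) \tag{35.8}
\]
\[
I_U \xrightarrow[2^{\mathrm{Fun}(U)}\,\rightarrowtail\,\Omega^{\mathrm{Fun}(U)}]{} \mathrm{Power}(U) \tag{35.9}
\]
\[
\mathit{Simplexes}_U \xrightarrow[\mathrm{Fin}(F)\,\rightarrowtail\,\Omega^{F}]{} \mathrm{Power}(\mathit{Simplex}_U) \quad \text{with } F = \mathrm{Fun}(\mathit{Simplex}_U) \tag{35.10}
\]
\[
\mathit{Simplex}_U \xrightarrow[\mathrm{Fin}(G)\,\rightarrowtail\,\Omega^{G}]{} \mathrm{Power}(U) \quad \text{with } G = \mathrm{Fun}(U). \tag{35.11}
\]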
The frame is just a pair of points in $U$, i.e.,
\[
\mathit{Frame}_U \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(U, U), \tag{35.12}
\]
designating the lower and upper extremal points $R_{\min}, R_{\max}$ of the cell frame.
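The field is something rather mathematical which we leave in its encapsulated form as an element of the vector space⁵ $\mathrm{Der}(R)$ of vector fields over the functions on the frame $R$ of the cell, i.e., we have this space form:
\[
\mathit{Field}_U \xrightarrow[\mathit{Id}]{} \mathrm{Colimit}(\mathbb{D}) \tag{35.13}
\]
with $\mathbb{D} = (\mathit{Field}_{U,R})_{\text{frame } R \subset U}$ and
\[
\mathit{Field}_{U,R} \xrightarrow[\mathit{Id}]{} \mathrm{Simple}(\mathrm{Der}(R)). \tag{35.14}
\]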
The kernel is a local composition in $U$, i.e., we have this space for kernels:
\[
\mathit{Kernel}_U \xrightarrow[\mathrm{Fin}(\mathrm{Fun}(U))\,\rightarrowtail\,\Omega^{\mathrm{Fun}(U)}]{} \mathrm{Power}(U). \tag{35.15}
\]
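The initial performance is given by its graph, i.e., as a (usually infinite) local composition in the product space of $U$ and $PU$:
\[
\mathit{InitPerf}_U \xrightarrow[2^{F}\,\rightarrowtail\,\Omega^{F}]{} \mathrm{Power}(U \oplus PU) \quad \text{with } F = \mathrm{Fun}(U \oplus PU), \tag{35.16}
\]
\[
U \oplus PU \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(U, PU). \tag{35.17}
\]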
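As a reading aid, the five factors of a performance cell in (35.7) can be mirrored by a small record type. The following Python sketch is only an illustration under simplifying assumptions: points of U are modeled as tuples of floats, the field and the initial performance are modeled as plain functions rather than as denotators, and all names (Point, Frame, Cell, kernel_in_frame) as well as the sample values are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, Tuple

# Assumption: a point of the symbolic space U is a tuple of real coordinates,
# here (onset, pitch).
Point = Tuple[float, ...]


@dataclass(frozen=True)
class Frame:
    """Pair of extremal points (R_min, R_max) of the cell frame, cf. (35.12)."""
    r_min: Point
    r_max: Point

    def contains(self, x: Point) -> bool:
        # A point lies in the frame if every coordinate is within its bounds.
        return all(lo <= xi <= hi for lo, xi, hi in zip(self.r_min, x, self.r_max))


@dataclass(frozen=True)
class Cell:
    """One performance cell over U, mirroring the five factors of (35.7)."""
    init_set: FrozenSet[Point]            # initial set, here a finite local composition
    frame: Frame                          # lower and upper extremal points
    kernel: FrozenSet[Point]              # finite local composition in U
    field: Callable[[Point], Point]       # stand-in for a vector field on the frame
    init_perf: Callable[[Point], Point]   # stand-in for the graph of U -> PU


def kernel_in_frame(cell: Cell) -> bool:
    """Check that all kernel events lie inside the frame of the cell."""
    return all(cell.frame.contains(k) for k in cell.kernel)


# Hypothetical example values, chosen only to exercise the structure.
cell = Cell(
    init_set=frozenset({(0.0, 60.0)}),
    frame=Frame((0.0, 60.0), (4.0, 72.0)),
    kernel=frozenset({(0.0, 60.0), (1.0, 64.0), (2.0, 67.0)}),
    field=lambda x: (1.0, 0.0),               # arbitrary constant placeholder field
    init_perf=lambda x: (0.5 * x[0], x[1]),   # onset scaled, pitch unchanged
)
```

The record deliberately keeps the field opaque, in line with the remark above that the field is left in its encapsulated form.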
5 The space of derivations on the functions on R is identified with the space of vector fields, see Appendix 1.2.4.

Instrument. The space for instruments is not specified in this general setup. However, if a concrete instrumental specification is needed, we shall make this component more precise. For instance, the output could be intended to be piped to a MIDI device, a MusicN family member, a physical synthesis device, or a real physical instrument like a piano, a violin, etc.

$\mathit{WeightList}_{BP}$. The weight space $\mathit{WeightList}_{BP}$ has as denotators finite lists of weights on symbolic spaces $U \in BP$. A weight is not really something new; it is only a special kind of textual predicate of the type discussed in chapter 18. The relation is this: Given a predicate $E$ and a denotator $x$, we associate a truth value $x/E$ in $\mathbb{T}_I$. This truth value can be any abstract truth-oriented data, but the generality of the truth modules also includes fuzzy and similar evaluation. In particular, if we select $I = \mathbb{R}$, the truth value may be any subset of $\mathbb{R}$ (for the zero-address), and, more specifically, any open interval $]-\infty, a[$. If we identify the latter with the upper bound $a$, this just means that the predicate is a real-valued weighting of denotators. This is the interpretation of the weight concept here. But we do include predicates in their truth-oriented meaning. For example, one may define a set of notes $K$ in the space $U$, i.e., a local composition which is the symbolic kernel of some cellular hierarchy. This can be achieved by a characteristic function $\chi_K : U \to \mathbb{T}_{\mathbb{R}}$ which takes the value $\chi_K(k) = \emptyset$ iff $k \notin K$, and $\chi_K(k) = \mathbb{R}$ else. But this also opens the path to weights which also have relevant values between the notes where they originated. Then we have
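\[
\mathit{WeightList}_{BP} \xrightarrow[\mathit{Id}]{} \mathrm{List}(\mathit{Weight}_{BP}) \tag{35.18}
\]
\[
\mathit{Weight}_{BP} \xrightarrow[\mathit{Id}]{} \mathrm{Colimit}(\mathit{Weight}(U),\ U \in BP) \tag{35.19}
\]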
with the space macro List for finite lists over a given space. The space Weight(U) is this:
\[
\mathit{Weight}(U) \xrightarrow[\mathit{Id}]{} \mathrm{Colimit}(\mathit{Weight}_n(U),\ n = 1, 2, 3, \ldots) \tag{35.20}
\]
with the indexed spaces
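\[
\mathit{Weight}_n(U) \xrightarrow[\mathit{Id}]{} \mathrm{Power}(\mathit{WP}_n(U, \mathbb{T}_{\mathbb{R}})), \tag{35.21}
\]
\[
\mathit{WP}_n(U, \mathbb{T}_{\mathbb{R}}) \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(U_n, \mathbb{T}_{\mathbb{R}}), \tag{35.22}
\]
\[
U_{n+1} \xrightarrow[\mathit{Id}]{} \mathrm{Limit}(U_n, U_n'), \quad \text{with } U_1 = U, \tag{35.23}
\]
\[
U_n' \xrightarrow[\mathit{Id}]{} \mathrm{Power}(U_n), \tag{35.24}
\]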
where values of weights are given by truth denotators on $\mathbb{T}_{\mathbb{R}}$, and the weight arguments live in the ambient space $U$ or in one of its powers $U_n$. So the weight predicate is evaluated on single objects in the parameter space $U$, or in one of its mixed powers: local compositions in $U$, local compositions of local compositions, mixed with points in $U$, etc. Usually, weights reflect structures stemming from rhythmical, melodic, harmonic and similar music analysis, see section 44.7 for details.

Operator. Operators are new in this setup. The point is that the cellular hierarchy as such, together with the instrumental data, completely describes the performance. But we have not dealt with the problem of generating such cellular hierarchies from system data. The operator instance has precisely this functionality: to define the cellular hierarchy. We shall deal with this very complex component in section 44.7. Mathematically, a performance operator $\Omega$ is a map
\[
\Omega : \mathfrak{H} \times \mathfrak{W} \to \mathfrak{H}, \tag{35.25}
\]
where $\mathfrak{W}$ is the space of weight lists and $\mathfrak{H}$ the space of cellular hierarchies. The first argument, a cellular hierarchy, will be taken from the mother's data and passed on to the actual LPS by the operator $\Omega$. The second argument, a selected list of weights, is usually conceived as a contribution of given musical analyses to the shaping of the present cellular hierarchy. This subject will also be dealt with in section 44.7. But it is also conceived as a source of symbolic kernels when we need to access them via their characteristic function, see section 38.3.2 for this approach.
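The remark that the circular definition of a local performance score is easily expressed in object-oriented terms can be illustrated by a minimal Python sketch. It is not the implementation used in any performance software; it only assumes that an LPS holds a reference to its mother (a score or another LPS), a list of daughters, a hierarchy, an instrument, an operator, and a weight list. Weights are modeled as real-valued functions (the identification of the interval ]-∞, a[ with its bound a), a cellular hierarchy is reduced to a dictionary from space names to kernel events, and all names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, FrozenSet, List, Optional, Tuple, Union

Point = Tuple[float, ...]
Weight = Callable[[Point], float]            # real-valued weight on a symbolic space
Hierarchy = Dict[str, FrozenSet[Point]]      # toy stand-in: space name -> kernel events
Operator = Callable[[Hierarchy, List[Weight]], Hierarchy]


def characteristic_weight(kernel: FrozenSet[Point]) -> Weight:
    """Numeric stand-in for chi_K: 1.0 on points of K, 0.0 elsewhere."""
    return lambda x: 1.0 if x in kernel else 0.0


@dataclass
class Score:
    """Placeholder for a denotator of some score form (a 'primary mother')."""
    name: str


@dataclass(eq=False)  # identity-based equality avoids recursing through the cycle
class LPS:
    mother: Union[Score, "LPS"]
    hierarchy: Hierarchy
    instrument: str
    operator: Operator
    weights: List[Weight] = field(default_factory=list)
    daughters: List["LPS"] = field(default_factory=list)

    def spawn(self, operator: Operator, weights: List[Weight],
              instrument: Optional[str] = None) -> "LPS":
        """Derive a daughter: the operator reshapes the mother's hierarchy."""
        child = LPS(mother=self,
                    hierarchy=operator(self.hierarchy, weights),
                    instrument=instrument or self.instrument,
                    operator=operator,
                    weights=list(weights))
        self.daughters.append(child)
        return child


def select_by_weight(h: Hierarchy, weights: List[Weight]) -> Hierarchy:
    """Hypothetical operator: keep the events whose summed weight is positive."""
    return {name: frozenset(x for x in events if sum(w(x) for w in weights) > 0)
            for name, events in h.items()}


root = LPS(mother=Score("piano score"),
           hierarchy={"EH": frozenset({(0.0, 60.0), (1.0, 64.0), (2.0, 67.0)})},
           instrument="piano",
           operator=lambda h, ws: h)
chi = characteristic_weight(frozenset({(0.0, 60.0), (2.0, 67.0)}))
child = root.spawn(select_by_weight, [chi])
```

The mutual references between mother and daughters are what make a purely set-theoretic definition circular; with object references the cycle is unproblematic.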
35.5
Global Performance Scores
Summary. Global performance scores are atlases of local performance scores, defined for reasons of performance syntax, such as, for example, instrumentation. We describe this global approach.
We have already stressed in section 32.4 that performance is a four-fold global phenomenon: There is instrumental variety, gluing of local charts, hierarchies of parameter sets, and stemmatic inheritance. The concept of a cellular hierarchy meets the hierarchical aspect, whereas the broader LPS concept also meets the inheritance aspect by the instances of mother and daughters, and the operator together with its weights. So we are left with the local character regarding a) instrumentation, b) local charts and strategies. With respect to these two local aspects, a global performance score should be a finite local composition of LPS, i.e., a finite set of LPS which cover different instrumental specifications as well as local charts of a global composition which is to be performed. So we may state formally the space of global performance scores:
\[
\mathit{GlobPerfScore}_{BP} \xrightarrow[\mathrm{Fin}(F)\,\rightarrowtail\,\Omega^{F}]{} \mathrm{Power}(\mathit{LocPerfScore}_{BP}) \tag{35.26}
\]
with $F = \mathrm{Fun}(\mathit{LocPerfScore}_{BP})$. Henceforth, we abbreviate "global performance score" by "GPS". The moral of this construction is that it serves to perform all the kernel events within each top space in the respective cellular hierarchies of the LPS.
35.5.1
Instrumental Fibers
Summary. If we have the same instrument appearing for several LPS, limit constructions are necessary.
GPS denotators are just sets of LPS without any specific instrumental relations. If, however, we want to group several LPS around one and the same instrument, we have to look for limit tools. In fact, if we have $k$ LPS with a common instrument, this is controlled by the projection $p_{\mathit{Instrument}} : \mathit{LocPerfScore}_{BP} \to \mathit{Instrument}$ onto the instrumental form. We then have to take the $k$-fold fiber product
\[
\mathit{LPS}^{BP,k}_{\mathit{Instrument}} = \prod\nolimits^{k}_{\mathit{Instrument}} \mathit{LocPerfScore}_{BP}.
\]
The coproduct
\[
\mathit{LPS}^{BP}_{\mathit{Instrument}} = \coprod_{k=1,2,\ldots} \mathit{LPS}^{BP,k}_{\mathit{Instrument}}
\]
of all these $k$-fold fiber products gives us the possibility to build global performance scores with instrumental grouping specifications on the space type $BP$. If we accumulate all space types which are of interest, in a sequence $BP. = BP_1, \ldots, BP_n$, say, we get the more general coproduct
\[
\mathit{LPS}^{BP.}_{\mathit{Instrument}} = \coprod_{k=1,2,\ldots,\ i=1,\ldots,n} \mathit{LPS}^{BP_i,k}_{\mathit{Instrument}}
\]
and finally the global performance score space
\[
\mathit{GlobPerfScore}_{BP.} \xrightarrow[\mathrm{Fin}(F)\,\rightarrowtail\,\Omega^{F}]{} \mathrm{Power}(\mathit{LPS}^{BP.}_{\mathit{Instrument}}) \quad \text{with } F = \mathrm{Fun}(\mathit{LPS}^{BP.}_{\mathit{Instrument}}). \tag{35.27}
\]
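The k-fold fiber product over the instrument projection can be made concrete on a finite toy example: it consists of exactly those k-tuples of LPS whose instruments coincide. The following Python sketch assumes that an LPS is reduced to a name and an instrument label; the type LPS and the function fiber_power are hypothetical helpers for illustration only.

```python
from itertools import product
from typing import List, NamedTuple, Tuple


class LPS(NamedTuple):
    """Toy LPS: only the data needed for the instrument projection."""
    name: str
    instrument: str


def fiber_power(scores: List[LPS], k: int) -> List[Tuple[LPS, ...]]:
    """All k-tuples of LPS that project onto one and the same instrument."""
    return [t for t in product(scores, repeat=k)
            if len({s.instrument for s in t}) == 1]


scores = [LPS("right hand", "piano"), LPS("left hand", "piano"), LPS("solo", "violin")]

# The coproduct over k collects the fiber powers for k = 1, 2, ...;
# here it is truncated at k = 3 for the finite example.
coproduct = {k: fiber_power(scores, k) for k in (1, 2, 3)}
```

A global performance score with instrumental grouping is then a finite set of such tuples, in analogy to (35.27).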
Chapter 36
Taxonomy of Expressive Performance

This last album is not titled as a memorial album or as an album in tribute because it was titled by Coltrane himself the Friday before his death on Monday, July 17, 1967. He and Bob Thiele were considering words that might apply to the sense of this album, and finally Coltrane said, "Expression. That's what it is."
Nat Hentoff [219]

Summary. Performance structure describes a semiotic fact: expression of meaning by shaping of score data. These expressive semantics are classified according to the three layers of reality captured by the topographic cube: psychic, physical, and mental (see section 2.4). The first one means that performance expresses emotions, the second one deals with expression of gestural contents, and the third one, the musicologically most interesting one, aims at giving one's understanding of the musical text a rhetoric expression. In a realistic performance, all three expressive semantics will participate; however, a theory of expressive semantics must first of all deal with the "pure" types which are, each in its own way, difficult subjects of ongoing research. We will not deal with the psychological, cognitive or neurophysiological esthesic aspect of performance since this is part of music psychology and would exceed our subject.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002

To the common music lover, it is by no means clear whether and in which way performance should express contents, and what kind of contents could be addressed. The most widespread belief is that music expresses emotions, or even that "music is" emotions. The latter approach is sometimes contended by music psychologists (see 36.1). This is due to the common usage of music as a carrier for external contents. Music often just ornaments events, ceremonies, feasts, and as such it is not meant to be asked for any content of its own. Perhaps the most interesting such phenomenon is film music. It is a common saying among film music experts [118] that the best film music is the one of which the spectator does not even take notice while watching the movie. But this is a very superficial judgment, since it contradicts the fact that muting the music channel in a movie virtually destroys the movie. Without music, most of its message vanishes. This is not only the case where, in a criminal story of one of those cheap TV productions, music has to announce that something dangerous is imminent. Omitting music evaporates the very atmosphere, the perfume of the best film! So music is quasi-absent but essential to the movie. The point of this apparent contradiction is that the common expert saying does not really observe what we experience while watching a movie: why are we really sitting in a movie show? First of all, we replace everyday reality by an artificial, virtual reality which is projected into the dark environment of the classical camera obscura. Such a reality switch needs a strong booster to work, and this booster is music. Music is well known as the exemplary environment for a counter-world, a force that lets us forget about the common things and transports us to lost, hidden, and subconscious layers of existence, see also our comments on the Depth-EEG experiments in counterpoint theory in section 30.2.4. In reality, the core of a movie is its music, not its visual process and textual story; the latter are only the pretext of what is being communicated. Pretext in its literal sense: ante textum, not irrelevant, but not the kernel either. One could even require from a good film that as a work of art it should transfigure the visual process and textual story into music. In short, this plea for music states that musical expression in performance is expression of something, not pure, self-sufficient artistry. If this is acceptable, performance as a rhetoric category should take care of how to express the contents. Therefore, on the one hand, the question must be posed about which contents can be conveyed in performance, and, on the other, about the equilibrium between contents and the medium of performance. The following discussion deals with the basic strategies of creating performance of something.
It does not claim exhaustivity, but sketches prototypical approaches to contents of expression in order to position our research approach in the field of the young science of performance. For a more extended discussion of performance research history and also psychological streams, see Reinhard Kopiez' excellent survey in [272].
36.1
Feelings: Emotional Semantics
Summary. Several authors, such as Susan Langer, Manfred Clynes, Alf Gabrielsson, Jörg Langner, and Reinhard Kopiez, have also focused their research on a relation between emotions and performance. Whereas Gabrielsson [177] contends with Langer [288] that there is an isomorphism between musical structure and emotion, Langner and Kopiez [270] develop a theory of oscillating systems that is supposed to find a physiological counterpart on the neurological level.
In [288, p.27], Susan Langer states that "music is a tonal analogue to emotive life", a statement which is interpreted by Alf Gabrielsson in [177, p.35] as the basic idea of "an isomorphism between the structure of music and the structure of feelings". This is also the doctrine which Gabrielsson adopts: "In summary, we may consider emotion, motion, and music as being isomorphic." This latter statement also includes motion as one of the isomorphic structures. This is related to Manfred Clynes's emphasis on emotion and its expression as an integrated system where motion, i.e., the gestural dimension, plays a crucial role [91]: Emotion, which he calls a "sentic state", may be expressed by "gestures, tone of voice, facial expression, a dance step, musical phrase, etc." While this conjecture may please psychologists, it is completely useless to scientific investigation. In fact, such an isomorphism is a piece of poetic literature as long as the components, emotions, motion (gestures), and music, are not described in a way that makes this claim verifiable. Presently, there is no hope for a realistic and exhaustive description of emotions. The same holds for gestures (see next section 36.2), and as to music, the mathematical categories of local and global musical objects are so incredibly complicated that the claim sounds far-out, see also figure 36.1 for an attribution of emotions to articulatory ambitus. For example, the number of isomorphism classes of 72-element motives in pitch and onset (modulo octave and onset period) is $\approx 2.23 \cdot 10^{36}$ as we have seen in section 11.4.1.3. How could the claimed isomorphism fit in this virtually infinite arsenal?

Figure 36.1: Attribution of emotions to the articulation ambitus [177, p.42] for the song "Oh, my darling Clementine". (Four stacked panels, each with a vertical axis from 0 to 120; the top panel is labeled "Happy".)

Such a terrible simplification however does not contradict the generic insight that expression in performance may be motivated and grounded in emotional categories. But we should be careful on this point: Our present discussion is not about emotional effects of music in the listener's psyche; we are talking about emotional rationales for expressive performance, i.e., the question of how performance could be induced by emotions. This aspect is typically addressed when the score annotations require performance actions in the mood of "amoroso", "languissant", "beklemmt". Here, the performing artist has to play in and through such a mood. To be honest, to play in such a mood is not a concrete way of telling a performer what to do. The mechanism very probably works by a feedback: The performer plays and hears his/her performance so that the output may be adapted to the received impression in the artist's own ears compared to what the artist conceives as being an impression of the given type. Evidently, such a rationale is extremely difficult to handle on a scientific level. And it is also difficult to understand a relation to the given score which transgresses the explicit annotations mentioned above. Since general feelings as a motor for performance are no good point of departure, one should (truly in the spirit of Langer, Clynes, and Gabrielsson) at least try to relate the score's contents (not the annotations, the structural facts) to emotions, which, in turn, could then be used to shape performance via the above feedback mechanism. But this is a very complex task. How should a determined emotion (and which?) be incited by a given structural fact, a cadence, a melody, a harmonic configuration, a rhythmic process? So the emotional rationale for performance splits into the
• association problem between emotions and performative shaping;
• and the association between non-emotive score contents and emotions.
This story has not yet been written.
A less totalitarian and more quantified approach than that of Langer, Clynes, and Gabrielsson to emotional rationales for performance has been proposed by Jörg Langner and Reinhard Kopiez [270, 271, 272] with what they call TOS (= Theory of Oscillating Systems). The TOS postulates a system of 120 oscillators with a determined frequency each, reaching from 8 Hz to 0.008 Hz, and distributed in logarithmic steps. In a kind of Fourier analysis (the authors have not to date published the precise formulas), the dynamical curve of a recorded piece of music is decomposed and shows the contributions of these oscillators to the given curve. The TOS is not only meant as a formal description, but the authors argue that the musical progression really triggers a series of oscillators in the cognitive stratum of the human brain. It is contended that the TOS spectrum is stored in human memory and that comparison of performances is enabled by comparison of such spectral data. The authors conclude [271, p.33] that "production and perception of expressiveness in music are essentially one entity." No reference is however made to the score as such, i.e., TOS just measures the performance output and represents the spectral development in time by graphical means (called the oscillogram, see figure 36.2). So TOS does not measure the performance map as such, only the image of the map. It is evidently implied in the TOS that the spectral decomposition bears a semantics of time processes which, independently of the hidden score, transports the neural activities of the musical brain. So expressivity is correlated to a neuronal oscillator system (expressed by firing rates of neuron populations). This is a type of rationale which refers to a score-independent instance which encodes expressivity. This is a dramatic turn since semantics, i.e., expressivity of something, is uncoupled from the text. This rationale is a kind of "pure expressivity", not a result of reflection or analysis, nor a result of gestural structures. It must therefore be an emotional rationale, although the authors are not precise on the cognitive category and the topographic origin of such neuronal oscillators (it could be that the limbic system is meant), and an experimental verification of the existence of such oscillators in the human brain is still outstanding.

Figure 36.2: Oscillogram (above) and loudness curve (below) belonging to quarter notes played in a 4/4-time signature by a drum computer with an additional accelerando at the end. This accelerando leads to a parallel upward movement of the dark bands in the oscillogram, which means that the activation changes to higher oscillator frequencies. (Both panels share a horizontal time axis in seconds.)
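The frequency grid postulated by the TOS can be written down directly. The following Python sketch only illustrates the statement "120 oscillators from 8 Hz to 0.008 Hz in logarithmic steps"; it assumes a geometric progression that includes both endpoints, which is one natural reading of the text. Since the precise TOS formulas are stated above to be unpublished, nothing beyond the grid itself is modeled, and the function name is hypothetical.

```python
import math


def tos_frequencies(n: int = 120, f_high: float = 8.0, f_low: float = 0.008) -> list:
    """Return n frequencies from f_high down to f_low with a constant ratio
    between neighbours (equal steps on a logarithmic axis)."""
    ratio = (f_low / f_high) ** (1.0 / (n - 1))
    return [f_high * ratio ** i for i in range(n)]


freqs = tos_frequencies()

# The grid spans a factor of 1000, i.e., three decades (about ten octaves).
span_in_octaves = math.log2(freqs[0] / freqs[-1])
```

Under this assumption, neighbouring oscillators differ by a constant factor, so each decade of frequency carries roughly the same number of oscillators.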
36.2
Motion: Gestural Semantics
Summary. Besides emotions, but connected to them, motion, as mediated by gestural paradigms, is a widespread rationale of expressive semantics. Neil McAgnus Todd [532], then Johan Sundberg, Violet Verillo [518], and Ulf Kronman [280], and also David Epstein, Jacob Feldman, and Whitman Richards [146] have proposed physically motivated descriptions of retards at group endings. Richard Parncutt, Jan Sloboda, and Eric Clarke [412] have added the concrete aspect of anatomical constraints in piano performance. We describe and analyze these approaches. We briefly account for the work by Shuji Hashimoto and Hideyuki Sawada [210] concerning the Japanese kansei processing research in the musical domain of gestures.
The emotional argument of performance is often paralleled by the argument that gestures, "motion", are also present as a parallel phenomenon to emotion; we have already seen this in Gabrielsson's isomorphism statement in section 36.1. It is also stressed by Clynes' sentic concept which measures the human input by a gestural device of joystick-character, and by Kopiez-Langner [271, p.32]. Citing François Delalande [115], they state that "body movements and gesture are in close relation to musical timing. Our neuro-psychological oscillation model implies that body movements can be triggered by musical events." And of course vice versa: Body movements trigger expressive performance. Also, in the framework of Japanese "kansei information processing"¹, Shuji Hashimoto and Hideyuki Sawada stress that "gesticulation is often employed in musical performance to express the performer's emotion." Using the dataglove, they have implemented applications which transform gestures into "MIDI control units to improve the performance in real-time" [210]. This regards expression of crescendi, vibrati, or pianissimo, for example. Richard Parncutt has investigated the intuitively evident fact that expressive performance is strongly conditioned and induced by physical constraints from fingering [411, 412]: "good fingering is a crucial ingredient in the preparation of performances that are both technically reliable and appropriately expressive." In [410], he has also maintained the thesis that "the most important sounds conditioning the perception of rhythm may be the sounds associated with the heartbeat and walking movements of a mother, as heard by her unborn child." So good performance should be adapted to this motion trigger. In accordance with these general observations, several performance scientists have proposed models of performance which derive performance fields from mechanical principles of accelerated motion. In particular, this has been set forth regarding tempo curves.
1 This is the term coined to stress a special application of information technology to implementation of emotional contents against the classical Artificial Intelligence and other, more logically and mathematically oriented approaches.

Analyzing the first experimental studies of final retard phenomena [518] by Sundberg and Verillo, Kronman and Sundberg [280] propose a model of final retard which is completely derived from the mechanical analogy of constant deceleration of tempo, as if it were induced by the action of a constant force upon a physical mass. However, tempo $T$ is seen here as a kind of velocity that is a function of physical time, not of symbolic time, as is standard. So what is constant is the force as a function of physical time. We then have $T(e) = dE/de = c \cdot e$, $c = \text{const}$. The resulting formula is $T(E) = T_0\sqrt{1 - E/E_0}$, where the tempo $T(E)$ at onset $E$ is related to the starting tempo $T_0$ and to the total onset interval $E_0$ until total stop (which is not included in the really ongoing music, but has to be set as an ideal endpoint of motion). If such a formula is derived, by the same reasoning, one could also derive formulas with a different, not necessarily constant force function; this has also been observed by these authors. However, there is no indication of which force should act and why. So in principle, any tempo curve can be constructed by an adequate acceleration and therefore force function. More generally, it is not clear why the supposed mass should be constant. As with special relativity, the mass could vary as a function of tempo. It seems that the mechanical motion model is only the construction of an intermediate layer to the real question: What are the basic forces which shape tempo? If the straightforward mechanics (constant mass, constant deceleration) are maintained, however, this explanation completely standardizes the final retard phenomenon and uncouples it from the underlying score and composition. This type of approach has also been proposed by Jacob Feldman, David Epstein, and Whitman Richards [146]. Their paper models tempo $T(E)$ as a velocity function of symbolic time $E$, and its derivative is meant to be determined by a quadratic force function $F(E) \sim E^2$. The Newtonian equation $F(E) = m \cdot T'(E)$ yields a cubic polynomial function $T(E) = a \cdot E^3 + \ldots$ for the tempo. Of course, this is a completely different mechanical situation; here the force really acts on the symbolic level instead of the physical action described by Kronman and Sundberg. While that one means $T(E) \sim E^{1/2}$, this one yields $T(E) \sim E^3$. Unfortunately, the latter approach is not congruent with the examples shown in [146]: They refer to the reciprocal value $1/T(E)$ instead of $T(E)$! So we should have $T(E) \sim e^{1/3}$, but that requires another mechanical situation. In terms of Kronman and Sundberg, this requires a force which, as a function of physical time $e$, is proportional to $e^{1/2}$.

A less simplistic approach which also includes structural analysis (see also below in section 36.3) and not only mechanical generalities is presented in Neil McAgnus Todd's paper [532].
He rightly observes that the final retard is only a very special agogical situation, and therefore models his tempo curves according to a superposition of accelerando/ritardando units which are defined by a triangular sink potential $V$. Accordingly, tempo is defined as a velocity $v$, and the total energy of the system $E = \frac{1}{2}mv^2 + V$, supposed to be constant (why?), gives the velocity formula $v = \sqrt{2(E - V)/m}$. Todd further supposes that there is an intensity variable $I$ for loudness, with a relation $I = K \cdot v^2$ that is common to many physical systems. This yields the relation $I = 2K(E - V)/m$ and sums up to an aggregated formula $I = \sum_l 2K(E - V_l)/m_l$ if the grouping of the piece is taken into account. Thus, the idea is that there is a physical energy and intensity parameter system that controls the "surface" of the tempo (= velocity) via classical energy formulas. So the background structure is an energetic one, i.e., the tempo curve and loudness are an expression of mechanical dynamics. The author comments on his method as follows [532, p.3549]:
The model of musical dynamics presented in this paper was based on two basic principles. First, that musical expression has its origins in simple motor actions and that the performance and perception of tempo/musical dynamics is based on an internal sense of motion. Second, that this internal movement is organized in a hierarchical manner corresponding to how the grouping of phrase structure is organized in the performer's memory.

The author also suggests a physiological correlate of this model (loc. cit.):
... it may be the case that expressive sounds can induce a percept of self-motion in the listener and that the internal sense of motion referred to above may have its origin in the central vestibular system. Thus, according to this theory, the reason why expression based on the equation of elementary mechanics sounds natural is that the vestibular system evolved to deal with precisely these kinds of motions.
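The mechanical tempo models discussed above lend themselves to a quick numerical check. The Python sketch below is an illustration, not a reproduction of the cited authors' computations. For the final retard it assumes that physical time e is measured forward from the start of the retard, so that the tempo is T(e) = T0 - c*e with constant deceleration c; integrating dE/de = T then gives the stopping onset E0 = T0^2/(2c), and the simulated pairs (E, T) can be compared with T(E) = T0*sqrt(1 - E/E0). For Todd's model it evaluates v = sqrt(2(E - V)/m) and I = K*v^2 on a triangular sink potential whose exact shape is a hypothetical choice. All function names are assumptions.

```python
import math


def final_retard_tempo(E: float, T0: float, E0: float) -> float:
    """Closed form T(E) = T0 * sqrt(1 - E/E0) for 0 <= E <= E0."""
    return T0 * math.sqrt(max(0.0, 1.0 - E / E0))


def simulate_constant_deceleration(T0: float, c: float, steps: int = 1000):
    """Integrate dE/de = T(e) = T0 - c*e up to the stop at e = T0/c.

    The trapezoidal rule is exact for a linear tempo, up to rounding.
    Returns a list of (symbolic onset E, tempo T) samples.
    """
    de = (T0 / c) / steps
    E = 0.0
    samples = [(E, T0)]
    for i in range(steps):
        T_a = T0 - c * (i * de)
        T_b = T0 - c * ((i + 1) * de)
        E += 0.5 * (T_a + T_b) * de
        samples.append((E, T_b))
    return samples


def triangular_sink(x: float, center: float, half_width: float, depth: float) -> float:
    """Hypothetical triangular potential: -depth at the center, 0 at the edges."""
    return -depth * max(0.0, 1.0 - abs(x - center) / half_width)


def todd_velocity(E_total: float, V: float, m: float = 1.0) -> float:
    """Tempo as velocity from E = m*v^2/2 + V."""
    return math.sqrt(2.0 * (E_total - V) / m)


def todd_intensity(E_total: float, V: float, m: float = 1.0, K: float = 1.0) -> float:
    """Intensity I = K*v^2 = 2K(E - V)/m."""
    return K * todd_velocity(E_total, V, m) ** 2


T0, c = 2.0, 0.5
E0 = T0 ** 2 / (2.0 * c)
samples = simulate_constant_deceleration(T0, c)
max_error = max(abs(T - final_retard_tempo(E, T0, E0)) for E, T in samples)

v_center = todd_velocity(1.0, triangular_sink(0.5, 0.5, 0.5, 1.0))
v_edge = todd_velocity(1.0, triangular_sink(0.0, 0.5, 0.5, 1.0))
```

In the sink potential the velocity, and with it the intensity, is largest at the center of a unit and falls off towards its boundaries, which is the accelerando/ritardando shape described in the text.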
Todd refers to insights of neurophysiologists according to which the vestibular system is also sensitive to vibrational phenomena. So the musical expressivity is understood as an effect of transformed neurophysiological motion. The drawback of this approach is that finer musical structures are not involved in the structuring of the energy which shapes tempo/intensity. And even if that could be done, there is an essential kernel of this shaping method which should be based upon paradigms of motion. These paradigms do however not appear clearly in the above approach. More precisely: The complex motion dynamics of the vestibular system cannot easily be mapped onto the structures of performative expressivity. What is the operator which transforms whatever structures of motion into expression parameters? If music were isomorphic to motion, no such isomorphism could be recognized from Todd's clever approach. Beyond general motion paradigms there is the more visible level of gestural structures which can be implemented in operators for musical expressivity. Unfortunately, the existent classification of gestures is anything but detailed. Classification of gestures is only settled on a prototypical basis. But this is precisely not what music needs so urgently. For example, the extremely refined gestures of Glenn Gould's performance², the movements of his hands and arms, his head and thorax: this is beyond any scope of present classification.
Figure 36.3: Glenn Gould while performing.

It is the same with the incredibly refined movements of Herbert von Karajan's hands, and especially their single fingers: It is known that these gestural directives to the members of the orchestra were observed and followed with extreme attention. To measure such details is not what we can control at present. To be clear: We are not contending that gestural and motion information is irrelevant to performance. On the contrary, this is an essential contribution, but it is too difficult for scientific research as long as classification of gestures is so far from being settled. And that alone does not solve the problem, since operators for shaping performance parameters must be defined from the information provided by gestural input data. All this seems to be a bit easier than the far-out emotional rationales, but it is still a subject of advanced research.

2 The reader should be careful on this point: In [115], François Delalande has recognized the expressive role of Gould's gestures (analyzing a film record of Gould's performance of Bach's Kunst der Fuge, see figure 36.3) with respect to musical structure. This is an extension of the performance parameters from sound to gestural parameters, but this is not what we are discussing here. We are discussing the role of gestures as a cause for musical performance, not a medium thereof. Evidently, a pianist's (and a fortiori a violinist's) performance is strongly and essentially driven by gestural shapes which are not only mediators of structural facts.
36.3
Understanding: Rational Semantics
Summary. As opposed to "low level" emotional and gestural expressivity, rational semantics deals with the expression of rational interpretations of the score structure. This means that the text is analyzed from different points of view, such as harmony, rhythmics, and motivic content. These analyses are used as an input to shape the performance structure. This aspect is dealt with in research by the group of Anders Askenfelt, Anders Friberg, Lars Fryden, and Johan Sundberg [23, 163, 164, 166], then by Neil McAgnus Todd [530], Gerhard Widmer [567, 568], then by Jan Beran, Guerino Mazzola, Joachim Stange-Elbe, and Oliver Zahorka [346, 347, 348, 349, 350, 357, 360].
Already Hugo Riemann [452] had stressed that the scope of rehearsal should be to support the communication of the motives' comprehension. And it is a fact of music psychology (see for example [87] or the excellent overview [272]) that performance as an expression of the score's structure is better understood than performance which disregards structure. Also, in Theodor W. Adorno's theory of performance [6, 7], the analytical point of view, i.e., the purpose of performance to transmit analytical insights, is prominent. In fact, the most explicit starting point of any performance is the given score. This is a text that abounds with structure that must be shaped in a physical performance space. The reference to this structure as a rationale for performance is a straightforward logic which is completely standard in literature: Interpretation of a text is one of the most recognized and widely practised methods in text performance, especially in the actors' interpretation of dramas. Besides emotion and motion it is therefore logical to refer to the rational text analysis in order to shape performance. Probably the first explicit and quantitatively stated contemporary approach in this vein is the "analysis-by-synthesis" method of Sundberg and his collaborators which was first presented in [517]. Analysis-by-synthesis means that a set of performance rules for the shaping of different parameters is defined in a software environment, and then applied to the production of a synthetic performance (on a MIDI instrument, say). The result is then analyzed by an expert (in Sundberg's group this was the professional violinist Lars Fryden) who proposes alterations of the given rules and/or new ones to the programmer. In this experimental cycle, the rules are always of a general character which is based on structural data, not on direct (emotionally or gesturally driven) interventions on the performance data of the individual composition in question.
This approach has been implemented on the basis of eleven rules
CHAPTER 36. TAXONOMY OF EXPRESSIVE PERFORMANCE
in the computer application Rulle, later renamed Director Musices [166], with an extended repertory of rules. The rules of this approach are grouped into three categories:
• the differentiation rules: Duration contrast, High sharp, Double duration, High loud, Accents, Melodic charge, Melodic intonation;
• the grouping rules: Punctuation, Phrase arch, Leap articulation, Phrase final note, Leap tone duration, Harmonic charge, Faster uphill, Chromatic charge, Amplitude smoothing, Final retard, Inégales, Repetition articulation;
• the ensemble rules: Melodic synchronization, Bar synchronization, Mixed intonation, Harmonic intonation.
We shall discuss these rules in section 37.1.1 of chapter 37 on performance grammars. Let us just give two examples here: The rule "Faster uphill" requires, in a melodic context, that if a note is followed by a note of higher pitch, its physical duration be shortened by 2·k ms, where k is a system variable which gives the rules a variable strength; the default is k = 1. A second example is "Melodic charge", which depends on a non-negative numerical weight attributed to every pitch as a function of its position in the selected tonality. The weights are roughly proportional to the distance of the pitch from the central pitch of the tonic (whose weight is zero). Proportionally to the weight, amplitude (in decibels), duration (in percent), vibrato amplitude (in percent), and unevenness smoothing are affected, again multiplied by the omnipresent strength factor k. These rules are all very elementary in their mathematical as well as in their music-theoretic approach, but they are completely concrete, and this makes this early attempt so precious. However, the present presentation and formal statement of these rules lacks a clear-cut distinction between symbolic and physical reality.
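The "Faster uphill" rule quoted above is concrete enough to be sketched in a few lines. The following Python fragment is only an illustration under stated assumptions: the `Note` type, the function name `faster_uphill`, and the MIDI-style pitch numbers are hypothetical and are not taken from Rulle or Director Musices.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class Note:
    pitch: int          # assumed MIDI-style pitch number (hypothetical encoding)
    duration_ms: float  # physical duration in milliseconds


def faster_uphill(notes: List[Note], k: float = 1.0) -> List[Note]:
    """Shorten every note that is followed by a higher note by 2*k ms."""
    shaped = []
    for i, note in enumerate(notes):
        rises = i + 1 < len(notes) and notes[i + 1].pitch > note.pitch
        delta = 2.0 * k if rises else 0.0
        shaped.append(replace(note, duration_ms=note.duration_ms - delta))
    return shaped


melody = [Note(60, 500.0), Note(62, 500.0), Note(64, 500.0), Note(62, 500.0)]
print([n.duration_ms for n in faster_uphill(melody, k=1.0)])
# [498.0, 498.0, 500.0, 500.0]
```

The correction is a fixed number of milliseconds: it does not depend on the duration the note already has, only on the strength factor k.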
For example, the above mentioned rule "Faster uphill" does not ask for the absolute duration of these notes; the change of duration is independent of the physical data, and produces either too large or too small duration changes in extremal tempi. And this is the delicate point here: A distinction between symbolic and physical reality must rely on absolute tempi. Tempi are however not mentioned in the entire machinery, although in his PhD thesis, Anders Friberg makes a comment on tempi. These rules do not act in the sense that they define a map from the symbolic reality into the physical one; rather, they act on an already given physical image of the symbolic reality. This seems to be a kind of "prima vista" performance, but no information on this is given. There is no performance of a score which lives outside some tempo specification (see footnote 3); there is only performance of performance. Moreover, the lack of agogical shaping operators is also manifest from the absence of onset in the tone parameters. The shaping of onset as such (independently of articulatory shaping) is not defined. The example of a motion-triggered performance model by Todd which was discussed in section 36.2 above is a realization of Todd's generic approach to rational semantics in performance [530] which we shall now describe. The background structure of that motion paradigm in fact relies on structural data (of grouping), as we have already mentioned above. Todd's generic performance model is designed upon a bidirectional transformation pairing from a score representation $\Psi$ to a performance $P$ and backwards by means of:
1. a performance procedure $\Pi$ acting on $\Psi$ and an encoding function $\gamma$:
$P = \Pi(\Psi, \gamma)$,
2. a listening procedure $\Lambda$ acting on $P$ and a decoding function $\delta$:
$\Psi = \Lambda(P, \delta)$.
Footnote 3: Or something of this type if simple tempo curves are not possible in complex hierarchies.
In this generality, "the theory ... is sufficiently general to cover any variable of expression. At the same time, it is agnostic as to what is being communicated, be it structure, emotion, or extramusical reference" [530, p.407]. The generic character of Todd's approach hides an asymmetry of the transformation pairing which is due to its semiotic background; see also [361] for a modern survey on music semiotics. In fact, performance is a poietic process issued by the performer from the composer's score. In other words, a performance is caused by its creators and must be understood by the listener, not vice versa. Hence, the performance transformation has to be specified as a semiotic mechanism. This is the difficult part of the business. Without entering into details here (see chapter 46 for a detailed discussion) it can be said that the critical subject of performance theory (a problem which Todd thematizes in the spirit of cognitive science) is a reconstruction problem: Given a performance $P$, how many representations $\Psi$ and encoding functions $\gamma$ can you find such that $P = \Pi(\Psi, \gamma)$? In mathematical terms, we are looking for the fiber $\Pi^{-1}(P)$ over $P$. This is the so-called inverse image of $P$, and therefore, this branch of performance theory is called inverse performance theory. The listening procedure in [530] is just a formal setup for a section $\Lambda$ to $\Pi$, i.e., the selection of an element in the fiber over $P$ as a function of the decoding data $\delta$. Clearly, the fiber cannot be described in effective mathematical terms if one does not assume a well-defined transformation model. And even for very special models, the so-called locally linear performance grammars (see [352] and section 46.2), fibers turn out to be high-dimensional algebraic varieties. Further, the encoding function must be meaningful enough to reflect the score's structure and its relations to the above categories of expressive semantics. Otherwise, performance cannot claim to interpret the selected score.
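To make the notion of a fiber tangible, the following Python sketch computes the inverse image of a performance for a deliberately trivial model. Everything in it is an assumption made for illustration: the score is reduced to a tuple of symbolic onsets, the encoding function to a single constant tempo, and the search runs over a small finite list of candidates. It is not Todd's model and not a locally linear performance grammar.

```python
from itertools import product
from math import isclose
from typing import List, Optional, Sequence, Tuple

Score = Tuple[float, ...]        # symbolic onsets in beats (toy stand-in for the score)
Performance = Tuple[float, ...]  # physical onsets in seconds (toy stand-in for P)


def perform(score: Score, tempo_bpm: float) -> Performance:
    """Toy performance procedure: constant tempo, onsets only."""
    return tuple(onset * 60.0 / tempo_bpm for onset in score)


def fiber(target: Performance, scores: Sequence[Score],
          tempi: Sequence[float]) -> List[Tuple[Score, float]]:
    """All candidate (score, tempo) pairs whose performance equals target."""
    return [
        (s, t)
        for s, t in product(scores, tempi)
        if len(s) == len(target)
        and all(isclose(a, b, abs_tol=1e-9) for a, b in zip(perform(s, t), target))
    ]


def listen(target: Performance, scores: Sequence[Score],
           tempi: Sequence[float], assumed_tempo: float) -> Optional[Score]:
    """Toy listening procedure: the decoding datum selects one fiber element."""
    for s, t in fiber(target, scores, tempi):
        if t == assumed_tempo:
            return s
    return None


scores = [(0.0, 1.0, 2.0), (0.0, 0.5, 1.0), (0.0, 2.0, 4.0)]
tempi = [60.0, 120.0, 240.0]
P = (0.0, 0.5, 1.0)
print(len(fiber(P, scores, tempi)))     # 3
print(listen(P, scores, tempi, 60.0))   # (0.0, 0.5, 1.0)
```

Even in this toy setting the same physical onsets are explained by three different (score, tempo) pairs, so the listening procedure needs extra decoding data to pick one of them.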
In other words, the big problem of performance theory is to propose models of adequate generality that cope with expressive semantics. In the single example illustrating his theory, Todd restricts himself to hierarchical grouping data for the shaping of duration. Commenting on the inverse problem of the listening procedure, he states that "the durations used in the calculations are from only one metrical level. Much information about tempo is given at metrical levels below the tactus and in the durations of actual notes. The representation needs to be extended downwards to include note timing, which would mean that a rubato handler would have to work in cooperation with a metrical parser, one feeding the other. Clearly, a lot of work is needed in this area." Concluding, he notes that "the known algorithms make no reference to any tonal function. Therefore, a rubato handler could be a vital component of any theory of grouping in the perception of atonal music. A complete theory must of course include dynamics, articulation and timbre." Methodologically, this approach is tightly bound to cognitive science in that any algorithm is first of all tested for its immediate fit with human perception mechanisms, within real-time constraints, say. We believe this approach is too narrow, for two reasons: First of all, the investigation of general structural facts must be carried out before any relevance to human perception is taken into account. There is the general problem of getting an overview of possible models and their classification. Second, the cognitive knowledge is anything but settled, more
precisely: We do not know by what processes cognition of performative expression is handled in the human brain. It could happen that a rather abstract invariant of the geometric structure of a mathematically complex fiber $\Pi^{-1}(P)$ can easily be detected by the cognitive machinery, but that this invariant would not have been detected if we were only permitting fibers which allow an immediate access by the cognitive capacities. For example, the mathematical structure of a Möbius strip shaped fiber may be too complex to be grasped by the cognitive machinery, whereas its lack of orientation may be an easy task to be tackled by a small test routine built on a neuronal basis. Gerhard Widmer's work (e.g., [567, 568]) is based on the machine learning method. In this context, structural features of given scores, such as chords or small motives, are correlated to the performance data of a given human performance. This data is then used to shape a second score. Here, the performance rule is found by an investigation of a given performance. However, there is no background theory to this; it is purely imitative with respect to human competence. We cannot see any deeper value of such an approach since nothing is really learned beyond parrot-style imitation, although such an imitation may sound quite attractive. A completely orthogonal approach, which is more akin to Todd's intelligent setup, was undertaken by the Zurich school (see footnote 4) of the author and his collaborators in [50, 51, 52, 346, 347, 348, 349, 350, 357, 360]. In our setup, the structure theory of performance in its full-fledged concept framework as developed in chapter 35 is used as an output level for performance, whereas the LPS operators are designed to implement different output data from musical analyses, such as metrics/rhythmics, motives/melodies, harmony, counterpoint, or grouping.
This approach was first sketched in [341], then presented at the SMAC in [345], at the ICMC in [347], in a paper [346] concerning tempo hierarchies, and above all in the SNSF reports "Geometry and Logic of Musical Performance I, II, III" (1993-1995) [348]. This approach is completely general in the sense of Todd's scheme [530] described above. However, we have stressed a specific approach to the communication of analytical facts to performance operators which eases many methodological questions concerning the interplay and concerted action of an entire collection of analytical results. This is the method of weights, i.e., it is required that the analysis of the music-theoretic procedure $A$ be delivered in the form of weight functions $W_A : K \to \mathbb{R}$, where $K$ is a local composition in some parameter space associated with the score, and more directly with the underlying cellular hierarchy of the LPS which implements a performance operator. For example, a metrical weight function $W_M : K_E \to \mathbb{R}$ typically associates a non-negative weight $W_M(E)$ with each onset $E$ in a given local composition $K_E$ of (symbolic) onsets. At first sight, this seems to be utterly restrictive, but the explanation is this: Ultimately, performance has to be defined via numerical indications of how to specify the instrumental parameters. So even if the analysis is of a more symbolic character, it has to be filtered or transformed into numerical values sooner or later. Of course, this could be done in the innards of a specific operator, and there is no obstruction to defining such operators. However, if we want to combine different analyses to be used as nurture for a determined operator, the problem of uniformity of such an input combination arises. But if the input is a priori a weight, the analytical arguments of an operator can be designed in a much more weight-independent format.
For example, one may then feed a linear combination of a number of given weights into an operator, without being concerned about where these weights stem from.
Footnote 4: The term was coined by Thomas Noll.
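The method of weights can be sketched as follows. This is a minimal illustration, assuming that a weight is simply a real-valued function on symbolic onsets; the helper names `tabulated_weight`, `combine`, and `intensity_operator`, as well as all numerical values, are hypothetical and do not describe the RUBATO implementation.

```python
from typing import Callable, Dict, Sequence

Weight = Callable[[float], float]  # maps a symbolic onset E to a real weight


def tabulated_weight(table: Dict[float, float]) -> Weight:
    """Weight given by a lookup table on finitely many onsets; 0 elsewhere."""
    return lambda onset: table.get(onset, 0.0)


def combine(weights: Sequence[Weight], coefficients: Sequence[float]) -> Weight:
    """Linear combination of weights, which is again a weight."""
    if len(weights) != len(coefficients):
        raise ValueError("need exactly one coefficient per weight")
    return lambda onset: sum(c * w(onset) for c, w in zip(coefficients, weights))


def intensity_operator(weight: Weight, base_db: float,
                       depth_db: float) -> Callable[[float], float]:
    """Hypothetical operator: louder where the weight is larger."""
    return lambda onset: base_db + depth_db * weight(onset)


metrical = tabulated_weight({0.0: 3.0, 1.0: 1.0, 2.0: 2.0, 3.0: 1.0})
harmonic = tabulated_weight({0.0: 1.0, 2.0: 4.0})
mixed = combine([metrical, harmonic], [0.5, 0.25])
loudness = intensity_operator(mixed, base_db=60.0, depth_db=2.0)
print(mixed(0.0), mixed(2.0), loudness(2.0))  # 1.75 2.0 64.0
```

The operator only ever sees a function from onsets to real numbers, so it does not need to know whether that function came from a metrical analysis, a harmonic analysis, or a mixture of both.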
Apart from these analytical input strategies, the approach of the Zurich school also stresses the generative nature of performance. Its differentia specifica with respect to other approaches is that the unfolding of performance has been thoroughly formalized in the concept of a performance stemma, the genealogical tree of performance rehearsal and development history (see chapter 38 for details). So Todd's hierarchical examples of refined grouping are not only extended to a cascade of LPS; their hierarchical nodes are also turned into autonomous agents of performance shaping processes which trace and group the entire performance unfolding as a compound historical and logical process in the large. This refined input policy, extending the elementary grouping rationales as exposed by Todd, yields a more sophisticated option to test correlations between measured performances and analytical insights. In this vein, different investigations have been carried out with good success by Beran and Mazzola [50, 51, 52] on the level of statistical methods. The point of this approach is that performance is so complex that it requires a faithful representation of the analytical structure of a score, and that this analytical structure must be an essential input qua significant function of the given individual score. We argue that principles of performance shaping which are completely unspecific towards the concrete score structure cannot provide us with relevant performance directions.
36.4
Cross-semantical Relations
Summary. Since music has a communicative dimension between poiesis and esthesis, each of the three above semantical directions on the poietic level may influence the esthesic level in one or several of the other semantical directions. For example, a rational expressivity may produce a gestural understanding or vice versa. We give an account of these "cross-semantical" phenomena.
As was already observed above, it may (and will in many practical situations) very well happen that one type of semantic rationale may act only indirectly upon the shaping of performance. For example, an emotional rationale may first "shape" a gestural object which in turn will shape musical performance. There are no restrictions to that. It is only important that there be one semantical modality at least which is able to convey its contents to the effective shaping of musical parameters. The rest may be arbitrarily complicated: For instance, a rational instance (score analysis, say) may produce an emotional object as its consequence, and the emotional object may produce in turn a gestural object which may evoke a second emotional object, completely different from the first! And then it may happen that this second emotional instance acts directly upon the shaping of performance. It is not clear at present, how much and on which basis the interrelation of different types of performance rationale could be implemented as computer applications, since the cross-modal assignment procedures (e.g., emotion from ratio, ratio from motion) may be hard to realize. Undoubtedly, this question is very interesting, be it for Japanese kansei research (as discussed in Hashimoto's approach in section 36.2), be it for the general problem of harmonizing divergent semantic directions in music, above all: harmonizing the emotional direction with the rational semantics of the text.
Chapter 37
Performance Grammars
Why care for grammar as long as we are good?
Artemus Ward (Charles Farrar Browne) (1834-1867)
Summary. The idea of basing performance on rules in analogy with linguistic grammar goes back to Mathis Lussy [311]. In modern performance research, this terminology was recovered by Johan Sundberg and his school [520]. We discuss the principles for a grammar of performance and give an overview of representative approaches to this theory.
To our knowledge, the term performance grammar was coined by Johan Sundberg on the occasion of a performance theory conference in Aarhus [520]. The reason for such a conceptualization is that the specialists became aware that performance can be shaped in various ways, but not from an amorphous design rationale. On the contrary: It became evident that there are entire organisms for shaping performance from the given score, its different semantic approaches, and the way these approaches are transformed into concrete performance instructions. The idea that performance should be executed along certain regular patterns that remind us of a language structure comes from the fact that performed music is viewed as a rhetoric vehicle of contents, and that these contents are, by the very nature of musical semantics, hidden, difficult and ambiguous. In other words, the way they are expressed is an essential condition for their communication. For a number of expressive methodologies, their architecture in fact resembles a language although, at present, only very elementary grammatical patterns are known. In [519], Sundberg proposed the creation of a dictionary of expressive rules where the patterns of the performance language can be looked up. We distinguish rule-based approaches from rule learning procedures (and we shall not deal with the chaotic ad hoc performance being taught in the vast majority of music conservatories!). Representative research is reviewed and classified according to the semantical perspective, as discussed in chapter 36. But the very need for a performance "language" also has a deeper explanation. If we listen to a performance, it is not just the concrete piece being performed and the concrete way of performing, without further context, which are perceived and judged. In fact, one cannot understand the expression of a coherent and extended text without having access to the
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
method of shaping contents. Understanding performance really means that we gain access to a system which is applied in this concrete expressive work, and this system is a language with its grammatical rules which shape the message. In other words, understanding the performance of a musical composition amounts to understanding its system, the language and, in particular, the grammar which guides performance.
37.1
Rule-based Grammars
Summary. The generic scheme of rule-based grammars and three representative directions in rule-based grammars are presented: the KTH school, Neil Todd's approach, and the Zurich school.
The known rule-based performance languages share a generic grammatical structure which can be described as follows. Basically, there are three components which shape the performance transformation $p : K \to p(K)$ of a given local composition $K$, i.e., the kernel of the top cell in a hierarchy, see figure 37.1.
Figure 37.1: The three components of rule-based performance grammar: the rationale Rat, the argument Arg, and the operator Op. They contribute to the construction of the performance transformation p of the kernel K. The action of an operator on the performance map can be either symbolic (sb), morphic (mr), or physical (ph). [Diagram labels: Rat (Ratio, Emotion, Motion), Arg, Op, K, p, p(K), mr.]
The first component, which we call the rationale Rat, yields the raw material which is intended to act on p. As we have seen in chapter 36, Rat is a complex organism which may include emotional, motional/gestural, and rational (analytical) agents which may interact and result in a final statement to be delivered to the subsequent shaping actors. The output of these operations is the second component, call it Arg, the argument of performance. For example, in the analytical approach, this may be a weight function. In the motional situation, it could be an object which parametrizes a physical movement, and in the
emotional setup, this may be a verbal description of a feeling, for example. The third component is an operator Op which "understands" the Arg and which, when fed with this argument, yields a determined function Op(Arg) which defines the performance transformation p = p(Op(Arg)). If we view this situation in the rich context of LPS theory, the argument may of course include the mother LPS and thereby determine p not only from the LPS' proper rationale, but also from the LPS' inherited data from previous shaping activities as they are traced on its mother LPS. The operator's action may typically be targeted to one of the three ingredients of p: its domain K, its codomain p(K), or the map (the functional expression) as such. The first type is called a symbolic operator because it alters the domain. This is a very strong action since the original notes, i.e., the score's genuine structure, are changed. Symbolic operators create a new composition such that performance is defined on new input data. Whether this type is really a case of performance seems somewhat critical. But suppose we are given the portion of a score which is written in fortissimo, and suppose that this dynamical attribute is a part of the kernel's specification. Then, if an additional dynamical annotation, such as diminuendo, is inserted in the score text, this may be seen as a performative prescription: Change the dynamics of the specified group of notes by a successive lowering of fortissimo dynamics in some determined range. In this case, a symbolic operator would do the job since it is an action which is required before any artistic shaping begins. We shall call this a primavista operator in the operator theory of chapter 44.7. The second operator type acts directly upon the given physical output p(K), this is why it is termed physical operator. It may deform the physical data without altering the kernel K or the function p. 
We have to explain this seemingly contradictory argument, because the operator effectively changes p. We have in mind that a physical operator is a subsequent map $ph : p(K) \to ph(p(K))$, i.e., the original map as such remains what it is, but it is composed with the physical operator's action and yields $ph \circ p$. The third operator type alters the functional description of p into Op(p) and therefore is termed morphic. This is typically the case if p is defined via a performance field as it is implemented in the cellular hierarchy of an LPS. Summarizing, we may restate this principle for a rule-based performance grammar: Principle 25 The described scheme encompasses the generic framework wherein the performance language is structured. The contents which are shaped by this grammatical structure are cast in the argument Arg, whereas the grammatical structure is centered around the operator instance Op. The operator is the rhetoric element; it tells how the shaping works, whereas the argument Arg is the codified message to be conveyed after an encapsulated process of semantical elaboration in Rat.
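The three operator types can be contrasted in a small Python sketch. It assumes, purely for illustration, that the kernel K is a list of symbolic onsets and that the performance map p sends each onset to a physical time; the concrete actions inside the three functions are arbitrary placeholders and are not operators of the LPS theory.

```python
from typing import Callable, List

Kernel = List[float]                       # toy kernel K: symbolic onsets
PerformanceMap = Callable[[float], float]  # toy p: symbolic onset -> seconds


def perform(p: PerformanceMap, kernel: Kernel) -> List[float]:
    """Evaluate the performance map on every element of the kernel."""
    return [p(x) for x in kernel]


def symbolic_operator(kernel: Kernel) -> Kernel:
    """Acts on the domain: the notes themselves are changed before p applies."""
    return [x for x in kernel if x != 1.5]  # placeholder edit: drop one onset


def physical_operator(output: List[float]) -> List[float]:
    """Acts on the codomain: post-processes p(K), i.e. realizes ph after p."""
    return [y + 0.25 for y in output]       # placeholder: constant delay


def morphic_operator(p: PerformanceMap) -> PerformanceMap:
    """Acts on the map itself: returns a new functional expression Op(p)."""
    return lambda x: p(x) * 1.5             # placeholder: stretch time


K = [0.0, 1.0, 1.5, 2.0]
p = lambda x: 0.5 * x                       # assumed constant tempo of 120 bpm

print(perform(p, symbolic_operator(K)))     # [0.0, 0.5, 1.0]
print(physical_operator(perform(p, K)))     # [0.25, 0.75, 1.0, 1.25]
print(perform(morphic_operator(p), K))      # [0.0, 0.75, 1.125, 1.5]
```

In the first call K changes while p is untouched, in the second both K and p are untouched and only the output is post-processed, and in the third p itself is replaced.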
37.1.1
The KTH School
Summary. The KTH school's system is dominated by local rules which are based on low-level structural analysis of the text. Semantics of this analysis are rational and, to a lesser degree, gestural. They are found by the characteristic empirical "analysis-by-synthesis" method.
The grammatical structure of the KTH school has been described very clearly in [165], see figure 37.2 for a reproduction of that grammatical scheme.
Figure 37.2: The KTH scheme following [165] shows a clear congruence with the general scheme which was described above in 37.1. [Diagram labels: input score; surface level: music performance grammar, with outputs duration, amplitude, pitch, etc.; underlying level: phrase analysis, harmonic analysis, melodic gesture analysis; melodic charge, harmonic charge, chromatic charge; transformation rules; k-values.]
According to this scheme, the surface level shows the transition from the symbolic score to the physical performance (horizontal arrow). This transition (in fact: the performance map p) is shaped by an analytical rationale (left lower group in the underlying level) which comprises phrase, harmony, and melody. The output of this rationale is given in the charges for melody, harmony, and chromatics. These charges are then fed into the operator unit, the transformation rules (right lower group). The performance transformation (horizontal arrow) is visibly factorized via the transformation rules, i.e., these rules define the entire performance map. It is however not clear from this scheme which kind of action (symbolic, morphic, or physical) is taken in the concrete cases. As was already mentioned in the first discussion of the KTH system, the original transformation of symbolic data into physical data is not made explicit in these rules. It is supposed that by
some underlying procedure, there is already a physical image of the score symbols before the explicit rules are activated. So the explicit rules seem to be physical operators. They act on a "primavista performance" which is implicitly assumed. The rationales are purely analytical in their majority, but melodic rules also refer to gestural rationales. For example, "the leap tone duration modifies duration of tones in singular leaps" ([163, Rule GMI IB]) in the sense of an expression of a gestural constraint of hand and arm movements when playing a keyboard. In fact, the duration of a high tone after a low one (or vice versa, same rule) is shortened by $\Delta DR = 4.2\sqrt{\Delta N}\cdot k$ msec, with $\Delta N$ the absolute pitch difference of the leap, and k the system constant. The structural penetration of these rules is however quite poor. For example, the harmonic analysis does not take into account longer syntagmatic units of harmony. Only values of isolated chords within a predefined tonality are evaluated. Cadences or modulations are not considered. Moreover, rhythmic structures are completely neglected as a rationale for dynamic accents, for example. The same holds for contrapuntal structures. Further, these rules are not inductive in the sense that they are not built to shape already given performances. They just act on the physical output and do not take into consideration the special character of the already given data. And an interaction between different arguments in combined rules is not developed. A hierarchical perspective is also not envisaged in the KTH approach. But this grammar is nonetheless a very clear scientific method that can be verified/falsified upon the audible quality of its output.
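The leap tone duration formula stated above can be evaluated directly. The following Python sketch assumes that the pitch difference is counted in semitones and that pitches are given as integers; both are assumptions of this example, and the function name `leap_shortening_ms` is hypothetical.

```python
from math import sqrt


def leap_shortening_ms(pitch_a: int, pitch_b: int, k: float = 1.0) -> float:
    """Amount of shortening 4.2 * sqrt(delta_N) * k, in milliseconds.

    delta_N is the absolute pitch difference of the leap (assumed here to be
    measured in semitones); k is the system constant of the rule system.
    """
    delta_n = abs(pitch_b - pitch_a)
    return 4.2 * sqrt(delta_n) * k


print(leap_shortening_ms(60, 64))          # 8.4  (leap of 4 semitones)
print(leap_shortening_ms(76, 60, k=1.0))   # 16.8 (leap of 16 semitones, downward)
```

Since the formula depends only on the absolute pitch difference, upward and downward leaps of the same size give the same shortening, in line with the "or vice versa, same rule" remark.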
37.1.2
Neil P. McAgnus Todd
Summary. Todd's approach is backed by a systematic formalism of performance as a function of structure and specific grammatical arguments. It relates simple structural data, such as grouping boundaries, to expression by means of physically oriented transformation rules.
As we have seen in section 36.3, Todd's approach to performance is a symmetric one using a performance procedure $\Pi$ which acts on the input $\Psi$ and an encoding function $\gamma$, whereas its inverse is a listening procedure $\Lambda$ which acts on a performance output $P$ and a decoding function $\delta$. The latter is in fact meant to be a section which determines the parameters of the encoding function that lead to the given encoding values. This distinguishes Todd's approach from the KTH approach since the requirement of a listening procedure which accompanies the performance procedure is a strong restriction to the entire theory. However, this restriction is not really needed, i.e., one may also investigate the performance procedure without knowing whether there is an inverse solution. This is also the way Todd has viewed his scheme in [532]. As an example of an analytical rationale, Todd has described a rubato encoding formula
at time t, and with parameters
1. [...],
2. [...] = (rubato) amplitude,
3. [...] = length of phrase,
4. [...],
5. [...].
37.1.3
The Zurich School
Summary. We give an overview of the approach of the Zurich school. The details will be discussed in the subsequent chapters.
Essentially this approach is centered around the key concept of weights. These are numerical functions that encode analyses and serve as an input to the core of grammatical instances: stemmata and operators. Accordingly, a performance is generated by the stemma, a genealogical tree of nodes representing local performance scores of successively refined performance quality. The generation of such node "daughters" from antecedent "mothers" involves performance operators in the role of "fathers". The latter are charged with weights and realize grammatical rules of different flavors. The nature of these rules is not further specified, and may include any of the systems proposed by other approaches as long as they are based upon weights (in particular the KTH and Todd proposals). The qualitative difference to the KTH and Todd systems is that a clear primavista performance is defined as a starting point of successively refined performances, and therefore, the initial transition from symbolic score data to physical data is anchored. Further, the weight system is conceived in such a way that a combination of weights may define new weights to be fed into an operator, thereby allowing a simultaneous combination of different analyses to act on performance. Further, the LPS approach is so rich that any performance situation can
be dealt with: tempo hierarchies, abolition of tempo in arpeggio and rubato effects, combined deformation of parameters, and also the treatment of gestural output (beyond physical sound parameters). Finally, the stemmatic genealogy with sexual propagation from mother LPS and father operator guarantees an in-depth simulation of the process of rehearsal, where the spiritual unfolding of understanding a score may be modeled. All this has been implemented in the RUBATO® workstation, see chapter 40.
37.2
Remarks on Learning Grammars
Summary. This section gives a very short remark about grammatical patterns generated by machine-based learning from empirical performance data. -Σ-
For this section, we refer to [520]. We do not in fact believe that machine-delegated statistical methods such as neural networks or proper machine learning algorithms for rule learning are of proper scientific value, since when machines learn, we do not. Of course, this is an ideological point of view, but we cannot follow methods which delegate decisions to structured ignorance: understanding cannot be delegated to engineered devices. For example, Gerhard Widmer's approach starts with a relatively detailed structural analysis of the score, including motives, groupings etc. It then correlates these structures to empirical performance data, such as dynamics or articulation, in order to apply machine learning algorithms for extrapolation to other scores.
Chapter 38
Stemma Theory
O matre pulchra filia pulchrior.
Horace (65-8 B.C.)
Summary. The stemma theory is introduced from its musicological and practical motivation. Semiotically speaking, performance is a result of a diachronic process. This is traced on the structure of a genealogical tree, the stemma of a performance. The stemma formalizes the diachronic process of rehearsal and practising. We describe the structure of stemmata as "family trees of performance", together with the corresponding genetic and environmental principles. -Σ-
When we compare the performance grammars developed by the KTH school and Todd to what happens when a musician learns to perform a new composition or when a conductor rehearses a composition with his/her orchestra, there is a tremendous difference of procedures. In the KTH system, the analysis-by-synthesis makes this particularly evident: The rules are not given a priori, but have to be derived via human criticisms and successive revision. This diachronic process is however not part of the grammar, it is a meta-theoretic construct. There is no trace of this successive improvement of rules in the system. History is annihilated by the uncontrollable criticism of a human expert (the violinist Lars Frydén in the KTH methodology). No trace of how the improved rules are produced from the old ones and their result is retained. The KTH methodology is loaded with a meta-theoretical historical dimension in the analysis-by-synthesis loop. This dimension is not present in Todd's approach, although his hierarchical technique suggests a construction of the surface level (single beats in his prototypical example) from hidden levels. These hierarchies are more of a generative nature from global to more local structures, and not historically guided, though. If one studies the way a musician rehearses a performance, it seems that a decisive component of this process is the successive improvement of performance which is built upon an added value with respect to the respective previous stages of perfection.
The formal theory of such a diachronic process is the following stemma theory. This theory is already prefixed in the definition of the LPS spaces, since the variables "mother" and "daughters" create the connection to inheritance structures. This theory cannot grasp the
755 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
totality of diachronic strategies, though. While a stemma comprises genealogical LPS trees, the selection strategies for the construction of determined LPS are not formalized in the stemma concept, only the result is. Also, the arsenal of performance operators is not under control in this setup, it is supposed that a number of operators is available, but their systematic construction is not dealt with in this framework. The classification of performance operators is far from settled, see chapter 44.7 for this subject. We are also aware that the present stemma structure is far from general with respect to feedback options, although the circularity of the LPS definition includes quite a lot of feedback techniques. But there is a serious limit to generalization: If one is going to implement denotators for stemmata by way of LPS denotators, their finite character must be assured in order to create performance outputs in a finite calculation time.
38.1
Motivation from Practising and Rehearsing
Summary. This section analyzes the development of an artistic performance through practising and rehearsal. We exhibit structural ramification and hierarchy, together with shaping mechanisms. -Σ-
When a performance is realized, this is never the result of a one-step process. Performance results from a development of successively improved intermediate performances. To understand performance, we need to understand its genealogy. A number of experiments have been undertaken in order to analyze the process of performance preparation via practising and rehearsal. For example, Kacper Mikaleszewski has investigated this process via video recordings of the preparation of the XII Prelude from the Second Book of Preludes by Claude Debussy [375] by a professional pianist. Mikaleszewski introduces his investigations by the remark that pianists do not necessarily separate clear stages in their preparatory work on new compositions. In an investigation of A.A. Wicinski [566], ten famous pianists¹ were interviewed on their strategies. Seven of them comprise the first group of pianists, who distinguish separate stages in their work. The second group of three pianists were not able to separate any such stages. This does not demonstrate that the pianists of the second group do not really follow unconscious strategies. The incapability of verbal description of such strategies is in fact a problem known among musicians: Very often, they are not able to verbalize their activities. The first observation about this experiment is that "the divisions of the musical material introduced by the subject (the pianist) agree with the basic formal units of the composition, here related tightly to its texture. This characteristic agrees with earlier notions about the role of the structure of music in performance, and the tendency to practise longer compositions divided into shorter units (...). At the same time, more complex textures led to the selection of shorter fragments for separate practice, and to making more divisions of the musical material. (...) we may say that what he (the pianist) was been doing is to prepare effective sub-routines of a more complex programme which in turn would make him able to perform the musical composition at a satisfactory level of proficiency." [566]
¹ Among them were: Sviatoslav Richter, Emil Gilels, and Harry Neuhaus.
Although the general picture is somewhat ambiguous, it can be deduced from these findings that the subdivision of the given score structure, together with a development of local strategies of performance, is crucial. This can be viewed as a strategy for the acquisition of a global "performance plan". The analysis is that "the majority of comments concerning the text of the composition, fingering, hesitations, error corrections, and memorization seem to be in agreement with the general objective of the first stage of work mentioned by Wicinski: to work out a general idea of the composition and to become able to perform it with sketchy interpretation." In other words, the strategy is not to start working out details and local performance, but to go top-down from the overall picture (as sketchy as it might be) to more and more detailed aspects in the fragments of the subdivision of the given score. So one exhibits a hierarchy of performance development which starts with the global sketch and successively ramifies to sub-routines of local aspects.
38.1.1
Does Reproducibility of Performances Help Understanding?
Summary. Psychologically, the structure and function of performance generation is far from being inscribed in conscious memory. We discuss the value of an explication and memorization of such a process: Why is reproducibility of performance processes of scientific interest? Memorization relates to the question of identity of a performance (process). This leads to the question whether human precision is different from "machine" or "mathematical" precision. -Σ-
As we have already mentioned above, a number of excellent musicians cannot (or do not want to) control their performance generation, they just rehearse by some instinctive activity and do not care about strategies and conscious plans. So why try to make such processes explicit, since a good number of artists just do not care. The question really is whether there can be a culture of performance without reflection of the conditions of good performance, good in the sense of Adorno: expression of contents that are discovered via analyses of the underlying text. Now, if this goal is accepted, we need to know about the unfolding of such a performance along the nerve of the inner logic of an artist's elaboration-supposing that such an inner logic subsists. We insist that the absence of such a logic would result in a random walk to performance, an agnosticism driven by blind admiration of an artist's instinct, genius, call it as you like. But then, understanding performance would reduce to plain admiration of a miraculous phenomenon which does not meet our concept of a performance culture. The scientific treatment of performance culture seems to face still another objection, i.e., the problem of objectivization of performance structures: The uniqueness and magic of artistic performance is sometimes viewed as being in contradiction to objective description, of conceptualization in the framework of scientific experimentation where reproducibility of objectively given conditions is mandatory. This skepticism culminates in the claim that objective, "mathematical" precision misses the precision of a human artist, that the latter precision is of another nature. This is to say that you may draw a faithful trace of a performance and miss the essence, because a human may reproduce the performance in another way, however maintaining the core of "human precision" as opposed to machine precision. 
This is however not an argument against "mathematical" precision since its claim is an invariance argument: Although mathematical
changes in a performance may occur, the "human precision" is maintained. This means that we would have to search for invariants within the scientifically precise variety of performances. So this is not in contradiction to "mathematical" precision, it simply states that the latter may be too precise. But as long as we do not know what the "human precision" is about, we better stick to the "mathematical" precision which englobes a faithful trace of performance. Another argument against scientific performance analysis is that its very concept of performance is illdefined. But this means that something has been forgotten in this conceptualization, i.e., one ought to include other components of a performance to grasp its identity. This could be: the audience, the space (concert hall, studio, etc.) of performance, the individual conditions under which a performance is perceived (listening in a bad mood, listening repeatedly to a recording, etc.), and the general historical and cultural background of such an experience. It is evident that the effect of a performance depends on these factors. And that therefore, a complete image of the phenomenon must include these factors. But this does not change the problem of giving a precise description and pursuing an in-depth analysis of the performance in the sense developed so far. This is a fundamental objection we make against the methodology of the humanities: that they refuse to investigate parts of a phenomenon because they are related to other parts. We do not contend that understanding the whole can be reduced to the understanding of its parts, we contend that understanding the whole cannot refrain from understanding its parts, and that this latter task is the first step of any scientific procedure.
38.2
Tempo Curves Are Inadequate
Summary. We discuss the conceptual and technical inadequacy of "flat" tempo structure for performance construction and derive the ramified hierarchical tempo trees as realized on the presto® software. -Σ-
It is a commonplace in performance research that a single tempo curve cannot control non-trivial tempo configurations. We therefore implemented a module for hierarchical tempo configurations, called AgoLogic, within the composition software presto® which was developed for the Atari® computer from 1988 to 1994 [335, 338, 340]. Tempo hierarchies² are a preliminary version of stemmata in the onset domain. In order to make clear the scope of tempo hierarchies, we first give three examples of tempo hierarchies which are of practical significance in the historical context. After that, we shall give a formal definition of a tempo hierarchy in terms of corresponding denotator spaces.
Example 54 The first example is an exercise in tempo curves taken from Carl Czerny's "Pianoforte Schule" [98], see also figure 38.1. Czerny's exercise proposes to play this short composition with different tempo curves: first without any tempo change, second with an accelerando in the middle, third with a strong rallentando at the end. We have simulated these proposals on presto®, and it turned out that the result is quite disappointing: No really relevant tempo experience results. In order to construct a less poor tempo structure, we have split the tempo levels in order to achieve the so-called Chopin rubato³. The three tempo setups as well as the Chopin rubato version can be heard in the first four samples on the audio-file Czerny on the book's CD-ROM, see page xxx. This is a very classical technique of tempo shaping. In presto®, this works following a hierarchical construction. In our example, we want the left hand to play a constant master tempo $T$. The right hand is the slave in tempo; we ask that the right hand tempo may vary anyhow under the condition that both hands coincide on each bar-line. To do so in presto®'s AgoLogic module, we may split the onset domain $I = [a_0, a_n[$ of the right hand into a sequence of one-bar portions $I_1 = [a_0, a_1[, I_2 = [a_1, a_2[, \ldots, I_n = [a_{n-1}, a_n[$. Then we have a hierarchy $I \to I_1, I \to I_2, \ldots, I \to I_n$. On each bar portion $I_i$ of the right hand, the user may reshape tempo via graphically interactive editing. The user can define any polygonal tempo curve $T_i$ within the limits $a_{i-1}, a_i$ of this interval, provided the integral is conserved, i.e., $\int_{I_i} \frac{1}{T_i} = \int_{I_i} \frac{1}{T}$, see figure 38.2. The routine taking care of the boundary condition of invariance of the above integrals was presented in corollary 21 of section 34.2. Of course, the mother tempo of the left hand need not be constant; any polygonal tempo curve can be produced for the left hand, and the daughter intervals $I_i$ can be given corresponding polygonal variations with the above boundary condition $\int_{I_i} \frac{1}{T_i} = \int_{I_i} \frac{1}{T}$.
Figure 38.1: Czerny's exercise for tempo curve testing. [Score with tempo diagram; tempo axis from 60 to 120, bars 1 to 4.]
² This terminology is antiquated now, since we reserve hierarchies for cell hierarchies, and what we call a tempo hierarchy here is in fact a tempo stemma in the present terminology. We nonetheless conserve this antiquated terminology in this special discussion of tempo in the presto® software.
³ Sometimes also called "bound rubato" since one hand is playing a trigger tempo, whereas the other is bound to cope with the master every bar-line onset, say. In contrast to this concept, "free rubato" means that both hands play rubato, but exactly the same, and this frees them from following a master tempo.
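The boundary condition of Example 54 can be illustrated numerically. The following is a minimal sketch, not the routine of corollary 21 in section 34.2: it assumes that tempo is measured in symbolic onset units per physical time unit, so that the physical duration of an onset interval is the integral of 1/T, and it enforces the condition by the simplest possible means, a uniform rescaling of the daughter polygon. The function names `duration` and `fit_to_mother` and the sample curves are hypothetical.

```python
import math


def duration(curve):
    """Physical duration of a polygonal tempo curve.

    curve: list of (onset, tempo) vertices with strictly increasing onsets
    and positive tempi. Returns the integral of 1/T over the onset interval.
    """
    total = 0.0
    for (e0, t0), (e1, t1) in zip(curve, curve[1:]):
        de = e1 - e0
        if math.isclose(t0, t1):
            total += de / t0
        else:
            # T is linear on the segment, so 1/T integrates to a logarithm.
            total += de * math.log(t1 / t0) / (t1 - t0)
    return total


def fit_to_mother(daughter, mother_duration):
    """Rescale a daughter polygon so that its duration equals the mother's.

    Multiplying every tempo by c divides the duration by c (assumed,
    simplest way to satisfy the boundary condition; shape is preserved
    up to scale, but the tempo values at the bar-lines change).
    """
    c = duration(daughter) / mother_duration
    return [(e, c * t) for e, t in daughter]


# One bar of four quarter notes; tempo in quarter notes per minute.
mother = [(0.0, 60.0), (4.0, 60.0)]
daughter = [(0.0, 60.0), (2.0, 90.0), (4.0, 50.0)]
fitted = fit_to_mother(daughter, duration(mother))
```

After the rescaling, both hands again need the same physical time for the bar, so they coincide on the bar-lines, while the right hand keeps its accelerando and rallentando inside the bar.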
[Figure 38.2 panel labels: tempo for left hand (chords); rubato for melody line; tempo before refinement of time granularity. Axes: M.M. against bar number.]
Figure 38.2: Czerny's example with a depth 1 hierarchy of tempo curves where each right hand bar is a daughter tempo polygon of the right hand tempo which is constant here.
Example 55 The second example is taken from the Chopin Impromptu op.29, refer to figure 38.3, bars No. 78-80. This situation shows a series of trills and an arpeggio. The tempo of bars SS is confronted with the tempi of the trills and the arpeggio. This is not the situation of a bound⁴ Chopin rubato, since the trill ornament is only one portion of a number of notes to be played by the right hand; it is more of a hierarchy in a scenic arrangement where the ornamental notes add a supplementary level of structure. The trill notes are not even explicitly denoted; the artist has to fill up the trill sign in the spirit of Chopin style tradition. The second auxiliary structure in this example is the arpeggio. This is also an incomplete notation insofar as you have an anchor note and several "satellite" notes, i.e., the chord's notes attached to the anchor note. These must be played in a temporal succession. The temporal succession and the anchor note are not always clear: It could be that the arpeggio succession is read top-down in pitch, or vice versa. Also, the onset of the last or that of the first note could be the anchorage onset. These things being selected, the speed and shape of temporal succession in the arpeggio are not well defined. As with the trill notes' temporal distribution, the arpeggio development is a "satellite" phenomenon in the note hierarchy. We have chosen this hierarchy in presto®'s AgoLogic module (see again figure 38.3): The mother tempo is present on the top level represented by the half notes in the graphic below the score. The top level has two daughter tempi: the trill daughter (to the right below the half notes) for the trill tempo, as well as the arpeggio daughter (to the right below the half notes). The latter controls the tempo of the descending interval notes. It has a daughter tempo for the expression of the arpeggio tempi. This hierarchy does not determine the concrete tempo curves on the mother level, on the trill, and on the arpeggios; to this end, the graphically interactive input by the user is needed. This means in particular that the same tempo hierarchy can express very different ways of performing a piece: from the beginner to a virtuoso. On the book's CD-ROM (see page xxx), four samples of performances of this tempo hierarchy are traced. They are heard on the last four samples (after the Czerny samples) of the audio-file Czerny.
Figure 38.3: Chopin's Impromptu op.29, bars No. 78-80 shows a series of trills and arpeggio.
⁴ A bound Chopin rubato is one where one hand plays a trigger tempo whereas the other hand's tempo may vary locally, as a tempo slave, but coping with the right hand trigger tempo on a number of master events. A free Chopin rubato is one where the rubato is played synchronically for both hands.
Example 56 The third example illustrates the tempo hierarchies as they are needed in the performance of a large orchestra with special instrumental groups. Our example is a large orchestra which is controlled by a conductor. Within this orchestra, we suppose given a string group (violins, say) which has to obey the conductor's indications, but within two time windows (the curva1 and the curva2 windows) may follow an individual tempo curve (as indicated by the concert master, say).
Within each such slice, on a small time window (curvetta3, curvetta4), there is a soloist part (played by a first violin, say) which may realize a cadence-like small expression, but has to cope with the string group (curva1 for curvetta4, curva2 for curvetta3, respectively). The tempo window of presto® shows the individual curves with their local deviations which, according to the implemented algorithm, yield the same total physical durations as their respective mothers.
Figure 38.4: The tempo hierarchy of an orchestra is shown. The mother tempo curve (mamma) is ramified into two daughters (curva1, curva2) which in turn have one daughter each (curvetta4, curvetta3, respectively). This interdependence of local and global tempi is used to differentiate roles in orchestral time control.
38.3
The Stemma Concept
Summary. The formalism of performance stemmata is introduced. A stemma is a rooted directed graph which carries on its nodes local performance scores. The generative principles as well as the entire structure are modeled after a matrilineal scheme: There are only mothers and daughters on the stemma, starting with the primary mother. But the proliferation of these families is sexual: Fathers, formally represented by performance operators, do contribute to their daughters together with their mother. The detailed "sexual behavior" is described and turns out to be quite similar to the veritable biological/sociological behavior in life.
-Σ-
This section terminates the chapter with a formal construction of stemmata. This is a remarkable subject for three reasons: First, the stemma concept is probably one of the first to grasp historical processes on the formal level in the mathematical sense, where rather than "formal" we should say: "precise", as contrasted to the notoriously poor conceptualization in the humanities. This has tremendous consequences for the experimental modeling of historical processes: One can now perform historical developments which never have taken place, more concretely: rebuild real and simulate fictitious performance history of the rehearsals of a pianist, compare these directions and draw conclusions on the quality of the real history against the way of fiction: Are we, and why are we, living in the best of possible worlds? Second, this formalization is not only formal in the mathematical sense, but also on the level of implementation in computer software. In fact, we have seen in the course of the intricate definition of an LPS in section 35.4 that this concept can only be defined by means of a circular form LocPerfScoreBP, since a standard mathematical definition would not allow circularity, circularity being only an accepted technique in implicit equations, not in conceptualization, although, as we have seen in chapter 9, set theory (probably unconsciously) conceptualizes sets in a circular form. But this nature perfectly fits the nomenclature of object-oriented programming languages: An instance variable can very well be an object of the same class as the object whose instance variable it declares. This observation once more evidences a turning point between mathematics and object-oriented programming, where the denotator and form concepts have been derived. Third, the very nature of this formalization of historicity appears to fit with the biological inheritance principle of sexual propagation.
This is not only a happy coincidence; rather, it is a mandatory direction if one wants to model learning: By sheer life experience, inheritance and evolution are the best proven models of successful learning. Although it is possible from the preset concepts, we do not diverge on global performance score constructions for the stemma theory and leave this segment to future research. All the LPS denotators will be situated at the zero address as long as we do not stress the contrary.
38.3.1
The General Setup of Matrilineal Sexual Propagation
Summary. This section describes the overall mechanism of stemma construction. In particular, we discuss the reason for the matrilineal approach. -Σ-
Given a sequence of basis and pianola spaces B, P, recall the definition of the LPS space (35.2) with the six factors Mother $\mu$, Daughters $\Delta$, CellHierarchy$_{BP}$ $\mathfrak{h}$, Instrument $\iota$, Operator $\Omega$, Weight $w$.
Definition 106 If $A : 0 \rightsquigarrow LocPerfScore_{BP}(\mu, \Delta, \mathfrak{h}, \iota, \Omega, w)$ is a local performance score, let $A\!\downarrow$ (resp. $A\!\uparrow$) be the directed graph of all LPS that can be reached by finite descent from A to the daughters, to their daughters, etc. (resp. the set of all LPS that can be reached by finite ascent from A to the mother, to its mother, etc.), together with the mother-daughter arrows. Denote by $A\!\updownarrow$ the union of the graphs $A\!\downarrow$ and $A\!\uparrow$. Call $A^{\infty}$ the directed graph of all LPS that can be reached from A by finite ascent and descent, together with the mother-daughter arrows.
Definition 107 A local performance score $A : 0 \rightsquigarrow LocPerfScore_{BP}(\mu, \Delta, \mathfrak{h}, \iota, \Omega, w)$ is called a stemma⁵ if its graph $A\!\updownarrow$ is finite and defines an undirected tree (no undirected cycles), and if it is not the daughter of its mother, if that mother exists⁶. The leaves of $A\!\updownarrow$ (which are also the leaves of $A\!\downarrow$ in this case) are called the leaves of A, whereas A is called the primary mother of the stemma. If A has no mother, it is called a prime stemma. The set of leaves of A is denoted by A.
Intuitively speaking, a stemma is a primary mother, together with its daughters, its granddaughters, etc., until we reach the leaves which are the output LPS that will eventually yield the data to be performed. It is important to have the mother of a stemma being disconnected from its daughter, i.e., not pointing at the latter in its daughter list. The primary mother of a stemma is meant to define a new tree of unfolding LPS which cannot be accessed by any other stemma. Although it can access another stemma, it is invisible to this latter. We have seen a stemmatic structure in the previous discussion of "tempo hierarchies" which are, in the present terminology, a kind of tempo field stemma, although usually there is more in a stemma than just onset-related fields and onset kernels.
Exercise 79 Restate the "tempo hierarchies" in terms of stemmata with $B = \{Onset\}, P = \emptyset$.
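The graph conditions of definition 107 can be made concrete in an object-oriented sketch, in the spirit of the remark that an instance variable may be an object of its own class. The following is only an illustration under simplifying assumptions: the class `LPS` keeps just the mother and daughters coordinates (hierarchy, instrument, operator, and weight are omitted), and the names `spawn`, `descent`, `ascent`, `is_stemma`, and `leaves` are hypothetical, not taken from the RUBATO® implementation.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False, repr=False)  # identity-based equality and hashing
class LPS:
    name: str
    mother: Optional["LPS"] = None  # circular reference to the same class
    daughters: List["LPS"] = field(default_factory=list)


def spawn(mother, name):
    """Create a daughter that is also registered in the mother's list."""
    daughter = LPS(name, mother=mother)
    mother.daughters.append(daughter)
    return daughter


def descent(a):
    """All LPS reachable from a by finite descent, a included."""
    order, seen, stack = [], set(), [a]
    while stack:
        node = stack.pop()
        if node in seen:
            continue
        seen.add(node)
        order.append(node)
        stack.extend(reversed(node.daughters))
    return order


def ascent(a):
    """All LPS reachable from a by finite ascent, a excluded."""
    chain, node = [], a.mother
    while node is not None and node is not a and node not in chain:
        chain.append(node)
        node = node.mother
    return chain


def is_stemma(a):
    """Check the conditions of definition 107 on the union graph."""
    if a.mother is not None and a in a.mother.daughters:
        return False  # a must be invisible to its mother
    nodes = set(descent(a)) | set(ascent(a))
    edges = set()
    for n in nodes:
        for d in n.daughters:
            if d in nodes:
                edges.add(frozenset((id(n), id(d))))
        if n.mother in nodes:
            edges.add(frozenset((id(n), id(n.mother))))
    # The graph is connected by construction, so it is an undirected
    # tree exactly when it has one edge less than it has nodes.
    return len(edges) == len(nodes) - 1


def leaves(a):
    return [n for n in descent(a) if not n.daughters]


root = LPS("Mother")
lh, rh = spawn(root, "LH"), spawn(root, "RH")
l1, r1 = spawn(lh, "L1"), spawn(rh, "R1")
successor = LPS("A'", mother=root)  # refers to root, unseen by root
```

Here `root` is a prime stemma with leaves L1 and R1, `successor` is a stemma whose primary mother points back to the old one, and `lh` is not a stemma since it appears in the daughter list of its mother.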
The matrilineal terminology is not really the whole truth here. In fact, each LPS contains its operator, the patrilineal component which is responsible for the generation of this LPS. But the result, more precisely the hierarchy $\mathfrak{h}$, returns the output information for performance. And it is this result which will also be used to produce further daughters, and not the operator. This justifies the matrilineal terminology, whereas the operator is only a hidden generative instance.
Let us look at a historical example of a stemma: the stemma for the composition Kuriose Geschichte, the second Kinderszene in Robert Schumann's collection of that name, op.15 [482]. This stemma was constructed on the performance platform RUBATO® in 1996 at the Staatliche Hochschule für Musik, Karlsruhe by the author, Oliver Zahorka, and Joachim Stange-Elbe. It took us three days to realize the whole setup and performance on a Bösendorfer MIDI grand. The performance of the piece is documented on a CD, see [360], and in a broadcast of the Austrian TV [161]. Although the stemma is quite primitive, the shaping results were satisfactory and taught us a lot about the empirical aspects of computer-assisted performance research. The stemma is visualized in figure 38.5. Although we see that each single refinement layer is controlled by one and the same operator (horizontal arrow), the layer did not form a grouping in the technical sense to be discussed in section 38.3.3: Each daughter had to be performed as an isolated instance, since no grouping methods were implemented at that stage. The construction of this stemma first follows the splitting of right (RH) and left hands (LH), then, after the shaping of primavista dynamics and agogics, global agogics is constructed on these two LH and RH symbolic kernels. The splitting for operators $\Omega_5, \Omega_6, \Omega_7$ regards a small number of bars which have to undergo a more differentiated rubato. The final shaping regards fine "tuning" of dynamics and articulation in all leaves.
Figure 38.5: The stemma of the first performance of Schumann's second Kinderszene: Kuriose Geschichte that was constructed and performed in 1996 on the Bösendorfer grand piano at the Staatliche Hochschule für Musik in Karlsruhe. [Operator labels shown in the figure: separation of LH from RH; primavista dynamics; primavista agogics; shaping global agogics; shaping "Rubato" parts; fine shaping of dynamics; fine shaping of articulation.]
⁵ "Stemma" is synonymous to "genealogical tree".
⁶ This is a slightly irritating subtlety of our conceptualization.
38.3.2
The Primary Mother-Taking Off
Summary. The primary mother represents the performance score which is deduced from the score data as they are inscribed on the predicate level. We make the deduction process explicit, together with the set of prima vista parameters.
The primary mother A of the stemma is the starting point of a stemmatic evolution process; it is used to derive all the LPS in $A\!\updownarrow$, and eventually its set of leaves. There are two situations for such a primary mother: Either it has a mother LPS $A_0$, or else it is a prime mother with the score form mother denotator $S : 0 \rightsquigarrow ScoreForm(\sigma)$.
If it has a mother LPS, this means the following: The stemma which it defines is not an autonomous structure, but it is derived from another stemma $A_0$. This other mother stemma can however not access A; the latter is invisible to $A_0$. The idea of this asymmetry is that we want to separate stemmatic processes in their information flow. A stemma is a closed unit of unfolding performance stages. It can be used to induce a new stemma, but it is not related by a daughter pointer to the new one. This prevents mixtures of stemmatic unfolding: Once a stemma is defined, it can help another stemma to build refined performances, but it does not refer to this one, while vice versa, a new stemma may refer to the old one via the mother pointer of its primary mother. This technique creates a series of successively improved stemmata $A_0, A, A', A'', \ldots$
If A has just the score denotator S, there is no reference to other previous stemmata, and we are in the properly termed primavista situation: There is a score, which is formally expressed in terms of the denotator S, and this score is the only reference to create the first performance. This is a complex operator which we shall now discuss. The construction of all data of this prime mother is the scope of a special operator which we call the PrimavistaOperator⁷. The PrimavistaOperator is a macro-operator since it has to deal with all possible score signs incorporated in S, but this is more a question of software implementation than a mathematical problem. The problem is this: recall that in the KTH theory, there was no such thing as a primavista setup. This prime mother construction must be dealt with somewhere in the performance process. In the unfolding of a performance, it is the first action to be taken: to establish a first version of a performance which is uniquely based upon score data. It could be argued that the score data can be supposed to be introduced in advance via the symbolic kernel. This could even be admitted, but then, where should all the other data, such as rallentandi, fermatas, slurs, etc., be piped in order to contribute to a primavista performance? For example, a fermata sign must be taken into account while it defines a sensible tempo sink, and this must be done before any refined shaping activity of performance is set forth. Of course, it is not sufficient to just know that this fermata sign is situated at a determined place, we also have to provide the data for the exact shape of this specific tempo sink. So, first of all, the input must list all the possible relevant signs, i.e., the primavista predicates in the sense of section 18.3.3. The input data are all based on the first information about the events of the score S in the different parameter spaces of BP that will participate in different special predicates.
These may be ordinary notes, bar-lines, pauses, and the like. We need to produce a set of events that copes with the hierarchy induced by condition 1 in definition 102, i.e., that in a cellular hierarchy, kernels project into kernels. This means that the symbolic kernel event set $Evts(U)$ associated with a space $U \in BP$ must project into $Evts(V)$ if $V \subset U$. We shall here speak of "events $x$ in space $U$" in the sense of zero-addressed denotators $x : 0 \rightsquigarrow U(\xi)$. This induces the Boolean predicate $Evts(U)$, $U \in BP$, via

Definition 108 For $U \in BP$ and an event $x$ in $U$, we set $x/Evts(U) = \top$ iff $x : 0 \rightsquigarrow U(\xi)$ is such that there is an event $y : 0 \rightsquigarrow W(\eta)$ in the score denotator $S$, living in a (not necessarily strict) superspace $W$ of $U$ in $BP$, such that $p_U(y) = x$ (i.e., $p_U(\eta) = \xi$), else we set $x/Evts(U) = \bot$.

⁷ In accordance with code naming conventions in object-oriented programming, the nomenclature is this: every operator is named by a special name Specialname, directly followed by the postfix "Operator", yielding "SpecialnameOperator".
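Definition 108 can be illustrated by a small sketch. The encoding is purely an assumption made for illustration: a space is a set of parameter names, an event is a pair of a space and its coordinates, and the helper names `make_event`, `project` and `evts` are hypothetical, not part of the denotator formalism.

```python
from typing import FrozenSet, List, Tuple

# Hypothetical encoding: a space is a frozenset of parameter names,
# an event is a (space, coordinates) pair with one value per parameter.
Space = FrozenSet[str]
Event = Tuple[Space, Tuple[Tuple[str, float], ...]]


def make_event(**coords: float) -> Event:
    """Build an event living in the space spanned by the given parameters."""
    return frozenset(coords), tuple(sorted(coords.items()))


def project(event: Event, target: Space) -> Event:
    """Projection p_U: forget every coordinate that is not in the target space."""
    space, items = event
    if not target <= space:
        raise ValueError("target must be a subspace of the event's space")
    kept = tuple((name, value) for name, value in items if name in target)
    return target, kept


def evts(x: Event, score_events: List[Event]) -> bool:
    """x/Evts(U) is true iff a score event y in a superspace W of U has p_U(y) = x."""
    u = x[0]
    return any(u <= y[0] and project(y, u) == x for y in score_events)


# A toy score: one note in EHLD and one pause in ED.
score = [make_event(E=0.0, H=60.0, L=70.0, D=1.0), make_event(E=1.0, D=0.5)]
print(evts(make_event(E=0.0, H=60.0), score))
print(evts(make_event(E=2.0), score))
```

Because every true instance of the predicate is a projection of a score event, projecting it further again yields a true instance, which is the "kernels project into kernels" condition in this toy setting.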
This presupposes that the events of the score denotator $S$ have been identified a priori. These are denotators in spaces of $BP$ that can be found upon inspection of $S$. This is a knowledge which ultimately exceeds the formalized knowledge base we are dealing with in our mathematical framework: it needs an instance which can create a score denotator $S$ from the given score. This could be an optical character recognition (OCR) software for scores, or any machine that collects events from MIDI files, for example, or just a human expert in score reading. In any case, we may suppose that the score is transformed into a score denotator $S$ in a score form, and that the events are deduced from $S$. The latter is a standard task in logical and geometric motivation predicates.

To define the prime mother LPS, we have to define all its constituents: instrument, weights, operator, and hierarchy. As to mother and daughters, the first is $S$, and the latter will be added when the stemma is made explicit in a later stage of the historical process; presently, it is empty. The most important data is the hierarchy. We have to construct it by use of the PrimavistaOperator and the given weights. The hierarchy also defines the performance map on its top space, and we suppose that the instrumental specifications are sufficient to transform the physical parameter vectors of the image of the performance map $p$ into sound objects. So we may concentrate on the hierarchy construction here.

In the hierarchy $\mathfrak{h}$ to be constructed, we have to define the diagram of cells. This first of all means defining the projections of kernels. We instantiate all the event sets as the first bunch of predicates $Evts(U)$ in the weight list of our prime mother. For any space $U \in BP$, we select the kernel $K_U = supp(Evts(U))$; by definition of the predicates $Evts(U)$, the kernels map into each other. Also, the frames $R_U$ may be pre-defined as the smallest cubes containing all the predicate supports $K_U$.
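The choice of frames as smallest cubes around the kernels can be sketched as follows. This is a minimal illustration under the assumption that a kernel is given as a finite set of coordinate tuples; the function names `smallest_frame` and `project_kernel` are hypothetical.

```python
def smallest_frame(kernel):
    """Return [(a_1, b_1), ..., (a_n, b_n)], the smallest axis-parallel cube containing the kernel."""
    points = list(kernel)
    if not points:
        raise ValueError("the kernel must not be empty")
    # zip(*points) regroups the points coordinate by coordinate.
    return [(min(column), max(column)) for column in zip(*points)]


def project_kernel(kernel, axes):
    """Project every kernel point onto the coordinates listed in axes."""
    return {tuple(point[i] for i in axes) for point in kernel}


# Toy kernel in a two-dimensional space (onset, pitch).
kernel_eh = {(0.0, 60.0), (1.0, 64.0), (2.0, 62.0)}
frame_eh = smallest_frame(kernel_eh)
frame_e = smallest_frame(project_kernel(kernel_eh, (0,)))
print(frame_eh, frame_e)
```

In this toy setting the frame of the projected kernel is the projection of the frame, so frames defined in this way are compatible with the kernel projections.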
Since we are starting with an instrumentally well-defined local situation (only one instrument), we may also suppose that there is a top space $Top_S$ in $BP$ where the kernel $K_{Top_S}$ is not empty. To define the space hierarchy, start by taking all spaces with non-empty kernel; these are just all subspaces of $Top_S$. This hierarchy is much too large, in general. We first have to restrict to the standard hierarchy requirement described in section 35.3, requiring that for any pianola space within a hierarchy space $U$, the corresponding basis space must also be in $U$. For example, in piano music, duration cannot be a reasonable hierarchy space. One further has to restrict this hierarchy to a subhierarchy which is reasonable for the given instrument. There is no general algorithm for such a procedure; one has to observe the following points, however:
• the hierarchy must be standard with respect to basis and pianola spaces;
• if a $K_U$ contains events that are not proper projections of other events, this space must be retained in $\mathfrak{h}$;
• there are default space hierarchies for specific instruments, such as the piano and violin hierarchies described in section 35.3.3.
With respect to these constraints, one will choose as space hierarchy the smallest standard subhierarchy $\mathfrak{h}$ of the total hierarchy of $Top_S$ such that it contains the default space hierarchy and the spaces containing events that are not proper projections. For piano music this means
taking notes in $EHLD$, pauses, slurs, accelerandi and similar events in $ED$, bar-lines, time and key signatures in $E$; but observe that these event types are all contained in one of the spaces of the default piano hierarchy. The selection of such boundary conditions for hierarchies is a typical system parameter in the PrimavistaOperator, a parameter which, together with the weights, creates concrete hierarchies.

A further set of system parameters, the selection of initial sets, performance fields, and initial performance, is a bit more delicate. The performance field to start with depends on the physical values attributed to the symbolic ones before any refined performance takes place. Typically, this can be solved by an affine isomorphism if we agree that on the physical level, we have the common space of pitch, corresponding to the logarithm of frequency, loudness, corresponding to the logarithm of intensity, time, corresponding to onset units, etc.; see appendix A for such common spaces. Then the performance field is a constant one (the derivative of an affine isomorphism being constant, i.e., its linear part). Since in common situations, the linear part of $p$ is also diagonal (just some positive gauging constants), the prime mother performance field is constant with positive coordinates. This setup evidently guarantees that the projections commute with the constant performance fields $Ƭ_U$ on the spaces $U$ (condition 3 in definition 102 of performance cell morphisms), which we have to define in their respective coordinates in each space $U$ of the defined space hierarchy. This is what we now assume. Consequently, the entire frame volume can be reached on these constant prime mother performance fields from the left "bottom walls" of the frames $R = [a_1, b_1] \times \ldots \times [a_n, b_n]$. There are $n$ bottom walls for $R$, i.e., the $(2^{n-1} - 1)$-simplexes (38.1) for $i = 1, \ldots, n$; they are degenerate for $n > 2$.
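The reachability statement can be made concrete with a short sketch. It rests on an assumption that is not spelled out in this passage: for an affine performance map with diagonal linear part given by positive gauging constants $g_i$, the constant field is taken to have the coordinates $1/g_i$. The function names are hypothetical.

```python
def constant_field(gauges):
    """Constant field for a diagonal linear part diag(gauges); T_i = 1 / g_i is an assumption."""
    if any(g <= 0 for g in gauges):
        raise ValueError("gauging constants must be positive")
    return tuple(1.0 / g for g in gauges)


def trace_back(x, frame, field):
    """Follow the constant field backwards from x until a bottom wall x_i = a_i is hit.

    Returns the starting point on the wall, the index i of that wall,
    and the curve parameter needed to travel from the wall to x.
    """
    times = [(xi - a) / ti for xi, (a, _b), ti in zip(x, frame, field)]
    t = min(times)
    start = tuple(xi - t * ti for xi, ti in zip(x, field))
    return start, times.index(t), t


# Toy frame in (onset, pitch) with gauging constants 2 and 1.
frame = [(0.0, 4.0), (60.0, 72.0)]
field = constant_field((2.0, 1.0))
print(trace_back((2.0, 65.0), frame, field))
```

Since all coordinates of the field are positive, moving backwards from any point of the frame decreases every coordinate, so one of the bottom walls is always reached before the curve leaves the frame.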
The initial set $I$ on an $n$-dimensional space $U$ with frame $R$ is the family of the bottom-wall simplexes (38.1). Since we have $\int_I^{\varepsilon} Ƭ_U = R$ (with $\varepsilon = +$), condition 4 of the definition of a morphism of performance cells is also fulfilled. We are left with the initial performance condition 5 in the definition of a morphism of performance cells. This initial performance can be defined by the same data which we have used to define the performance fields. Therefore, condition 5 is automatically fulfilled.

So we are left with the construction of the weight system and the PrimavistaOperator that acts on the defined hierarchy in order to integrate the information from the primavista score predicates. For events with $x/Evts(U) = \top$, or for local compositions of such events, or for local compositions of such local compositions, etc., as required by the weight definition of an LPS, we now look for special predicates according to the musical notation conventions⁸. We have already listed the common primavista predicates in section 18.3.3. Here, we give a selection to show how these predicates can be restated in terms of weights. If needed, such weights are added to the weight list of the LPS.
Slurs. There are two types of slurs: normal legato slurs and articulation slurs. Both are Boolean predicates which are evaluated on sets of events in a space $U$. Call the legato predicate $LegatoSlur_U$, such that $x/LegatoSlur_U = \top$ iff $x$ is a local composition in $U$ which is embraced by a legato slur. As we are specifically interested in $x$ being also a local composition stemming from $S$, we set $LegatoSlur_{U,S} = LegatoSlur_U \,\&\, Evts(U)$. We shall henceforth abbreviate the logical combination $? \,\&\, Evts(U)$ by an added index $?_S$; for example, the articulation slur is denoted by $ArtiSlur_{U,S}$.

⁸ By that we mean the conventions of European tradition. For other traditions, special predicates and parameter spaces have to be introduced, but the procedure is the same, though not always an easy one, as the Japanese Noh nomenclature may illustrate (see the Noh example in 18.3.3.2).
Articulation. The following predicates are self-explanatory by their names and all relate to evaluation on single events of $U$ in $S$: $Staccato_{U,S}$, $Staccatissimo_{U,S}$, $Marcato_{U,S}$, $Tenuto_{U,S}$, $Accent_{U,S}$.

Fermata. A fermata lives in the space $U = E$ of onset; no duration of the fermata is explicitly defined. The predicate weight is denoted by $Fermata_{E,S}$, and an onset $x$ is a fermata iff $x/Fermata_{E,S} = \top$.

Value Change. A value change $p/q = r/s$ is a change in the time signature on a special barline, from $p/q = 2/4$ to $r/s = 3/8$, say. So we first need a time signature predicate for time signature $p/q$. This is an onset-located predicate, call it $TimeSig(p/q)_S$, and then a predicate for two-element onset sets $x = \{a, b\}$ of time signature predicates, i.e.,

$x/ValCh(p/q = r/s)_S = \top$ iff $card(x) = 2$ & $\exists a \in x,\ a/TimeSig(p/q)_S = \top$ & $\exists b \in x,\ b/TimeSig(r/s)_S = \top$,

a predicate that is motivated by a mathematical predicate and the already given time signature predicate. Observe that the structure of the mixed powerset spaces $U$ enables us to use ordered pairs of any points of $U$ as they are defined in classical set theory, see appendix C, definition 114. So it is also possible to redefine the above predicate via ordered pairs $a, b$ of objects with $a/TimeSig(p/q)_S = \top$, $b/TimeSig(r/s)_S = \top$. The details are left as an exercise.

So the weights in our weight list of the prime mother LPS are a) the symbolic kernels, and b) the appended list of all primavista predicates, as discussed above, with their restatement as weights with truth values in $\mathbb{R}$. The PrimavistaOperator now has to add operations on the given hierarchy $\mathfrak{h}$ which stem from a paratextual meaning of the weight predicates, i.e., an interpretation beyond the abstract textual trace which the score denotator has induced. For example, a fermata predicate has to induce a tempo sink. This and all other actions of the PrimavistaOperator will be discussed in chapter 44.7 about operator theory.
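The value change predicate lends itself to a direct sketch. The data layout is an assumption made only for illustration: time signatures are stored in a dictionary from onsets to pairs $(p, q)$, and the names `time_sig` and `val_ch` are hypothetical.

```python
def time_sig(signatures, p, q):
    """Return the onset predicate TimeSig(p/q) for a dict mapping onsets to (p, q)."""
    return lambda onset: signatures.get(onset) == (p, q)


def val_ch(x, signatures, old, new):
    """x/ValCh(old = new): x has two elements, one carrying old and one carrying new."""
    x = set(x)
    has_old = any(time_sig(signatures, *old)(a) for a in x)
    has_new = any(time_sig(signatures, *new)(b) for b in x)
    return len(x) == 2 and has_old and has_new


# Toy score: 2/4 at onset 0, change to 3/8 at onset 16.
signatures = {0.0: (2, 4), 16.0: (3, 8)}
print(val_ch({0.0, 16.0}, signatures, (2, 4), (3, 8)))
print(val_ch({0.0, 16.0}, signatures, (3, 8), (2, 4)))
```

Both calls evaluate to true: on unordered two-element sets the predicate cannot tell which signature comes first, which is one reason why the reformulation via ordered pairs mentioned above may be preferable.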
38.3.3
Mono- and Polygamy - Local and Global Actions
Summary. The typology of actions that operators may take in order to unfold the stemma includes monogamic coupling with one mother and production of one or several daughters, or else polygamic coupling with simultaneously several mothers. We give the formal description and its justification in terms of practising and rehearsal.
A performance operator $\Omega$ of an LPS $\lambda : 0 \rightsquigarrow LocPerfScore_{BP}(\mu, \Delta, I, \mathfrak{h}, t, \Omega, w)$ has the function of defining the cellular hierarchy and the initial performance $t$ of $\lambda$. This data is calculated by use of the weight input $w$ and, except for the primary mother, by the already calculated mother LPS $\mu$. This is also represented by the equation

$\Omega \propto \mu = \lambda$.

In general, a mother may have several daughters with the same father operator, i.e.,

$\Omega \propto \mu = \lambda_i$,

which seems to be an inconsistent notation since the index is absent in the left part of the equation. In fact, we have to be more precise here. The operator for daughter $\lambda_i$ must have a system parameter $i$ which specifies that daughter from one and the same mother $\mu$. So the correct notation would be $\Omega(i) \propto \mu = \lambda_i$, where the system parameter is specified. This is also what is realized in software design: for each parameter $i$, the corresponding daughter is instantiated as a function of that operator with the $i$-value in its instance variables.

This ordinary Christian family life is however not the most economical and reasonable in many performance situations. For example, it may happen that one has already developed the stemma to a strongly ramified tree, and that one wants to apply the same refinement procedure to all or a large number of leaves $\lambda_k$, $k = 1, \ldots, K$. One could of course apply the same operator $\Omega$ to each $\lambda_k$ independently of the other leaves. But then, changing some parameters of this operator would require us to go through each leaf and alter the data step by step. In order to avoid this "copy and paste" process, it is better to let the leaves know that it is one and the same operator and not just a set of clones $\Omega^*$ of $\Omega$ which have to produce daughters $\Omega^* \propto \lambda_k$. The problem here lies in the definition of a denotator's identity. In fact, if we assign to all leaves $\lambda_k$ the same denotator $\Omega$ as their operator, then any change in this operator is simultaneously carried out on all these leaves. But this is not so trivial: if we had a situation as it is known from object-oriented programming, i.e., if the operator were an instance variable of an LPS class, then its change would be an automatic change in every instance $\lambda_k$. In the context of denotators, however, we do not have object-oriented structures, and the change has to be declared explicitly according to some kind of concept surgery, as it was discussed in section 6.9.
The surgical intervention to be carried out here would be this: search for all LPS $\lambda_k$ in the given stemma, or else in some more specific hierarchical position, such that their operator is named $\Omega$. Then define new LPS $\lambda'_k$ by replacement of the coordinate (named) $\Omega$ by a new coordinate (named) $\Omega^*$. This only works if names identify denotators; otherwise, we have to search for other keys to retrieve the wanted denotators. One could denote this intervention by the symbol $\lambda'_k = \lambda_k/\Omega \mapsto \Omega^*$. A global notation of a grouping of LPS within a stemma is to write $G/\Omega$ or simply $G$ for a set $G$ of LPS which is operated by one and the same performance operator $\Omega$. We call such sets stemmatic groupings; they are evident denotators in the powerset space of the present LPS space. Accordingly, we write $G/\Omega \mapsto \Omega^*$ for the replacement of $\Omega$ by $\Omega^*$ in the grouping $G$. Clearly, any two groupings within a given stemma are disjoint. Observe also that the mothers of a grouping do not automatically define a grouping for this reason! Moreover, we ask for the following transitivity (collective responsibility) axiom:
Axiom 4 If $G$ is a stemmatic grouping, and if a daughter $\delta$ of a mother $\mu \in G$ is a member of a group $H$, then every daughter of a mother in $G$ is also a member of the group $H$.

For example, if $G$ is grouped by a tempo operator which imposes a new tempo curve on each member of $G$, and if we create a group $H$ of daughters with a refinement of their tempo curves, then, if a daughter $\delta \in H$ has its mother $\mu \in G$, it is reasonable to have the same refinement of the tempo curve for all daughters of all members of $G$, because the shared tempo curve from $G$ should inherit its refinement.

Axiom 5 It is always possible to resolve a grouping in the sense that any further changes to the stemma only affect the grouping's former members instead of the entire grouping. However, the descendants of a grouping's members are not affected in their grouping memberships.

This is a typically historical process in the stemmatic construction: the final stemma is a sequence of intermediate stemmata, i.e., we really have the time parameter entering the world of denotators, or, rather (and more precisely), the world of predicates.
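The grouping by operator name, the explicit replacement of an operator, and the check of Axiom 4 can be sketched as follows. Everything in this block is a hypothetical illustration: the classes `Operator` and `LPS` are drastically simplified stand-ins for the corresponding denotators, and immutable records are used to mimic the fact that a change has to be declared explicitly rather than propagated through shared objects.

```python
from dataclasses import dataclass, replace
from typing import List, Optional


@dataclass(frozen=True)
class Operator:
    name: str
    strength: float  # stands in for the operator's system parameters


@dataclass(frozen=True)
class LPS:
    name: str
    mother: Optional[str]  # name of the mother LPS, None for the prime mother
    operator: Operator


def grouping(stemma: List[LPS], operator_name: str) -> List[LPS]:
    """All LPS of the stemma that are operated by the operator of the given name."""
    return [lps for lps in stemma if lps.operator.name == operator_name]


def replace_operator(stemma: List[LPS], old_name: str, new_operator: Operator) -> List[LPS]:
    """Surgery by name: build a new stemma where the named operator is replaced."""
    return [
        replace(lps, operator=new_operator) if lps.operator.name == old_name else lps
        for lps in stemma
    ]


def satisfies_axiom_4(stemma: List[LPS], g_name: str, h_name: str) -> bool:
    """If one daughter of a member of G lies in H, then all such daughters must."""
    mothers = {lps.name for lps in grouping(stemma, g_name)}
    flags = [lps.operator.name == h_name for lps in stemma if lps.mother in mothers]
    return all(flags) or not any(flags)


tempo = Operator("tempo", 1.0)
refine = Operator("refine", 0.5)
stemma = [
    LPS("m1", "root", tempo),
    LPS("m2", "root", tempo),
    LPS("d1", "m1", refine),
    LPS("d2", "m2", refine),
]
renamed = replace_operator(stemma, "tempo", Operator("tempo*", 2.0))
print([lps.name for lps in grouping(renamed, "tempo*")])
print(satisfies_axiom_4(stemma, "tempo", "refine"))
```

The original list `stemma` is left untouched by `replace_operator`, so the intermediate stemmata of the historical process remain available as separate values.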
38.3.4
Family Life - Cross-Correlations
Summary. Apart from genetic interaction as described by mono- and polygamic propagation, the shape of a single daughter can also be determined by cross-correlations with its sisters or other relatives. We again give the formal description and its justification in terms of practising and rehearsal.
Suppose that a mother $\mu$ in a stemma $\Lambda$ has a number $\delta_1, \ldots, \delta_m$ of daughters. What is the common situation for such a family? We have seen in the example of the composition Kuriose Geschichte (figure 38.5) that such a set of daughters can occur if we have to split the whole composition or a part of it into mutually disjoint subcompositions, such as left hand and right hand, which must be treated in individual ways. Or if some bars require a different performance shaping than the other bars, as is the case for the splitting of left and right hand, respectively, in that stemma. This procedure might also be applied in a more systematic way with respect to natural grouping structures of a composition: for example, in Schumann's Kinderszene 7: Träumerei, we have four periods $A_1$, $A_2$ (= the repetition of $A_1$), $B$, $A_3$ (= recapitulation of $A_1$), eight bars each. This could be the basis of an operator which splits the entire composition into these four daughters. In each period, we have the grouping into the eight bars, and that can provide us with granddaughters $A_{1,1}, A_{1,2}, \ldots, A_{1,8}$, $A_{2,1}, A_{2,2}, \ldots, A_{2,8}$, $B_1, B_2, \ldots, B_8$, $A_{3,1}, A_{3,2}, \ldots, A_{3,8}$.

This situation is however not what is really relevant in a realistic performance. The point is that we need operators that deal with relations between different sisters and not only between daughters and mothers. Musically speaking, this means that an artist must take into account what will be played in future bars and what has been played in previous bars of the present period, when shaping a particular bar performance. One must also take into account what is the performance of future and of previous periods qua periods. So we have to face operators which take into account more distant relatives in this large family of stemmatic nodes.
In fact, from the artistic point of view, a performance must testify to a sophisticated coherence in the large and also within more restricted neighborhoods of musical events. This is a commonplace in musicological performance theory and in feuilletonistic music criticism. But such a general insight is very difficult to make precise for several reasons. First, it is not clear which performance structures should be involved when being set into a coherent whole. Must we think of coherence among local tempo curve segments, or, more generally, coherence among local parts of performance fields? Or should we separate coherence questions along the strata of cellular hierarchies? And if we are searching for such a coherence, how should the analytical background, more precisely the weights which feed operators, be screened for coherence? In other words: what is a coherent analysis? Evidently, without such a concept, coherent performance fields can be defined, but they risk failing in their rational task: to reflect the analytical background. Can they be coherent just with respect to themselves without expressing coherence of analyses?

Second, the mathematical variety (in the non-technical sense of the word) of coherence structures is virtually infinite: one could imagine linear, polynomial, differentiable, analytical, and, alas, statistical relations, or any kind of non-linear relations that would be involved to define coherence. The influence of other family members on a determined LPS could also come from its sisters, from cousins, from more distant relatives; anything is imaginable. We shall make explicit models of stemmatic coherence in chapters 44 and 46. These models are essentially linear models on every set of sisters, but in their combination among the whole stemmatic inheritance, they accumulate non-linear phenomena which lead to non-trivial algebro-geometric phenomena, and phenomena pertaining to second-order differential operators.
These perspectives should make clear that we scarcely understand the genuine concepts of performative coherence in their musical phenomenology and accordingly cannot construct mathematical models on the basis of such a blurred phenomenology. Adorno and Benjamin have given us the catchwords for a deeper investigation of performance, catchwords which we could very well transform into adequate mathematical concepts. But these theorists did not elaborate their conceptual germs to a degree of differentiation that could help in describing and understanding the concrete artistic shaping of performance. We can hardly understand, and even less forgive, the tremendous lack of musicological conceptualization and knowledge about performance in view of the overly fluffy and too often ridiculously blasé music criticism in the feuilletons of our newspapers.
Part X
RUBATO®
Chapter 39
Operator Theory

If I chance to talk a little wild, forgive me; I had it from my father.
William Shakespeare, Henry VIII.

Summary. Operators are the substance of shaping performance. They refer to mothers, but they are the only instances capable of altering, refining, or ruining what has been achieved. They are also the pipes where exterior information, be it from score predicates, from analytical data, or from general system parameters, can be channeled and transformed into performance structures. We describe and motivate the concept of a weight (function). This is the turning point between "exterior" and "interior" strata to performance. We discuss different exigencies for performance operators to cope with 'primavista' and 'analytical' data. A series of common primavista and analytical weights is discussed. We then expose a taxonomy of operators, followed by some special examples of the existing realizations, regarding tempo and articulation, as well as theoretically founded generalizations which are based upon Lie derivatives. The chapter concludes with a discussion of more "social" types of operators which correspond to the "family life" correlations introduced in section 38.3.4. The final subject is a prospective to 'continuous stemmata', i.e., generalized stemmata based upon infinitely small coupled space portions.
Whereas the previous theory culminated in the matrilinear stemma theory, we now have to face the masculine contribution to sexual propagation of performance shaping. This part is driven by the performance operators $\Omega$ which are a coordinate of each LPS $\delta$ and generate this LPS from its mother $\mu$: $\delta = \Omega \propto \mu$. The information which is used to feed an operator is a list of weights in the sense of the space form $WeightList_{BP}$ of section 35.4. Recall that a weight is a real-valued predicate on an iterated powerset of local compositions derived from the kernel in the hierarchy spaces. We have to justify this approach and will do that in section 39.1. But from our discussion of expressive performance in chapter 36 it is clear that operators have to use some rationale to shape their LPS' performance maps. In our following examples, we shall restrict such rationales to rational operators, i.e., such operators which use exclusively score-related primavista and analytical information.

G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
Despite this restriction, we are far from understanding the general nature of rational operators. Clearly, the primavista operators are not the problem: they are straightforward (though not always easily formalized) translations of musical score competence into mathematically valid operations. The problem is rather on the side of general formalisms which could encompass the processing modes of analytical weights for producing reasonable (expressive) performance deformations of a given mother LPS data. A coarse qualitative classification of operators is nevertheless possible according to where the operator intervenes in the entire transformation process from the mental (symbolic) kernel, through the performance field (and the associated performance transformation), to the physical output data. It is also possible to show that a number of special formulas for rational operators, such as the tempo operator, or some types of articulation operators, are special cases of the so-called Lie-type operators (section 39.7). "Lie-type" means that the Lie derivative of a weight function along a performance field is responsible for the field deformation of the daughter with respect to its mother. These Lie operators then specialize to basis operators and pianola operators according to their action being on basis or pianola parameters, respectively.

The chapter concludes with a subject that extends the discrete character of a stemma to continuous parameters for the description of families of daughters. It is in the same vein as the introduction of continuous performance maps to replace discrete maps on discrete sets of notes: although the material is discrete, the mental construction behind the shaping of this material is more than that. Human cognitive activities can operate on a continuous (or differentiable, etc.) paradigm when generating discrete effects. In fact, we could state this as a very principle of musical activity:
Principle 26 Music notation and also the very expression of musical material is essentially a trace of infinitesimal forces on a discrete reduction¹.

This is what Valéry was alluding to when he stated²: "La musique mathématiquement discontinue peut donner les sensations les plus continues." In our understanding, this is also valid in the sense of a poietic principle, the words "donner les sensations" being replaced by "provenir des forces".
39.1
Why Weights?
Summary. Within the transformation process from abstract and conceptual analysis and representation of score data, weights are an inevitable instance charged with the production of numerical performance data.
Usually, in musicology, analysis does not end up with numerical data. A harmonic analysis, for example, yields a sequence of harmonic functions which take their values in abstract symbols, such as D F , Am7. Or a motivic analysis ends up with a verbal description of certain important motives. However, when an artist has to perform a score, the abstract description level is of no direct use: only the numerically quantized information can immediately help in defining shaping processes of instrumental parameters. The final condensation of any analytical facts in performance has a numerical appearance. Our presentation of weights is in some sense an ideal compromise between abstract predicates, such as harmonic symbols, and the naive numerical evaluation of analytical properties. We understand that weights are predicates, i.e., truth-valued functions on local compositions and their powerset constructions, but the truth module is at the same time a numerical one, or at least a truth value can be associated with a numerical value if required.

¹ This reduction is in fact, among others, a reduction of originally gestural neumatic signs!
² See the catchword of chapter 33.
39.1.1
Discrete and Continuous Weights
Summary. A priori, weights are functions on discrete sets of points within determined parameter spaces. In order to insert weights into general tasks of performance, in particular those referring to infinitesimal or continuous character, one has to consider extrapolation methods.
We recall that we have defined a cubic interpolation formula of class $C^1$ for a real-valued (discrete) weight $w$ with zero derivative on local compositions $K$ in an $n$-dimensional simple form space $S$ over the reals, according to definition 99 of section 32.3.2. We also supposed that $K$ is contained in the $n$-cube $C^n$ and that the interpolation is constantly equal to 1 outside $C^n$. In our present situation, we want to make a slight generalization to general weights with truth values in $T_{\mathbb{R}}$. We suppose that the truth values $w(x)$, $x \in K$, are all non-empty sets, i.e., we do not have the value $\bot$, and that $K$ is still in the cube $C^n$. Usually, this is the case since we are given weights which are open intervals $]-\infty, a[$ with non-negative upper bound $a$. Suppose that $w_x \in w(x)$ is a selection of values from all truth sets $w(x)$, $x \in K$. Supposing that an interpolation function $F$ has been chosen (e.g., a cubic interpolation with respect to a specific permutation of coordinates, or the weighted sum of all such functions, or still another adequate candidate), we have an interpolation function $F_{w_\bullet}$ for each such selection $w_\bullet$, which is constant with value 1 outside $C^n$ and has zero derivatives in all points of $K$. Then we obtain a predicate on every element $s \in S$ by the definition

$$F_w(s) = \Big\{ F_{w_\bullet}(s) \;\Big|\; w_\bullet \in \prod_{x \in K} w(x) \Big\}. \qquad (39.1)$$
In case the truth values are intervals $w(x) = \,]-\infty, a(x)[$, the interpolation yields the value intervals associated with the interpolation values stemming from the upper bounds $a(x)$. The fact that such a generalized interpolation is not defined for higher powerset arguments is not really a problem. Some operators in fact take these weights and boil their information down to more common weights as discussed above. We may then apply interpolation formula (39.1) to these boiled-down weights.

Behind such an interpolation procedure there is the non-trivial question about the justification of the usage of continuous weights in performance operators. There are two reasons we can find for this. The first is a very practical one: calculation precision. Suppose that we are given a local composition $K$ and that we want to calculate the weight $w(x)$ at a point $x \in K$. In the context of computer programs, it is often not clear whether we can really catch $x$ in its numerical identification, for example because of rounding effects in numerical calculation. Therefore, it would be good to have the weight being a non-discrete function which varies little around a given argument. The second argument is that the continuous character of performance fields suggests continuous methods to shape such fields. For example, the commonly accepted continuous character of tempo would ask for a continuous weight function in order to determine a tempo change for all values where the tempo curve is defined. The inherent idea is that analytical facts that are calculated on discrete events of a given score should have their presence in a neighborhood of such events, in the sense of a field action of such predicates.
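The idea of a continuous weight that varies little around each event can be sketched in one dimension. The following is not the formula of definition 99, which is not reproduced in this section; it is a stand-in that shares the properties quoted above: it is of class $C^1$, takes the given values on the events, has zero derivative there, and equals 1 outside the cube, here the interval $[lo, hi]$. The function names are hypothetical.

```python
from bisect import bisect_right


def smoothstep(u):
    """Cubic 3u^2 - 2u^3: rises from 0 to 1 with zero slope at both ends."""
    return u * u * (3.0 - 2.0 * u)


def interpolate(nodes, s, lo=0.0, hi=1.0):
    """Evaluate a C^1 extension of a discrete weight at s.

    nodes maps event positions (strictly between lo and hi) to weight values.
    The extension is 1 outside [lo, hi] and has zero derivative at every event.
    """
    if any(not lo < x < hi for x in nodes):
        raise ValueError("event positions must lie strictly inside the cube")
    if s <= lo or s >= hi:
        return 1.0
    positions = sorted(nodes)
    xs = [lo] + positions + [hi]
    ys = [1.0] + [nodes[x] for x in positions] + [1.0]
    i = bisect_right(xs, s) - 1  # index of the segment [xs[i], xs[i+1]) containing s
    u = (s - xs[i]) / (xs[i + 1] - xs[i])
    return ys[i] + (ys[i + 1] - ys[i]) * smoothstep(u)


# Upper bounds a(x) of two interval-valued weights ]-inf, a(x)[.
bounds = {0.25: 2.0, 0.75: 0.5}
print(interpolate(bounds, 0.25), interpolate(bounds, 0.5), interpolate(bounds, 1.5))
```

Since each value of this extension is a convex combination of the two neighboring node values, interpolating the upper bounds $a(x)$ gives the upper bound of the interpolated interval, in line with the remark on interval-valued weights above.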
39.1.2
Weight Recombination
Summary. We describe and motivate a set of procedures for building new weights from given ones.
A further advantage of the numerical truth value set $W \xrightarrow[\;Id\;]{} Simple(\mathbb{R})$ for weight predicates is that one may recombine such weights by standard operations from linear algebra. This type of building new predicates from given ones is a special item in the methodology of predicate calculus as exposed in section 18.3.4, i.e., it is a special logical motivation since it operates on the codomain of predicates. Given a scalar $\lambda \in \mathbb{R}$ and a weight $w$ on a local composition $K$, or its differentiable extension $F_w$ by a cubic interpolation formula as described above, we can define

$$\lambda.w(x) = \{\lambda.\nu \mid \nu \in w(x)\}, \qquad (39.2)$$
$$\lambda.F_w(x) = \{\lambda.\nu \mid \nu \in F_w(x)\}, \qquad (39.3)$$

and it is clear that $\lambda.F_w = F_{\lambda.w}$, and $(\mu\lambda).F_w = \mu.(\lambda.F_w)$, $(\mu\lambda).w = \mu.(\lambda.w)$, respectively. If $\lambda$ is positive, and if $w$ is a weight whose values are intervals $]-\infty, a[$, then $\lambda.w$ is still of this type, viz., its intervals are the rescaled intervals $]-\infty, \lambda.a[$. If we are given two weights $v, w$, then we can define

$$(v + w)(x) = \{\mu + \nu \mid \mu \in v(x), \nu \in w(x)\}, \qquad (39.4)$$
$$(F_v + F_w)(x) = \{\mu + \nu \mid \mu \in F_v(x), \nu \in F_w(x)\}; \qquad (39.5)$$
$$(v.w)(x) = \{\mu.\nu \mid \mu \in v(x), \nu \in w(x)\}, \qquad (39.6)$$
$$(F_v.F_w)(x) = \{\mu.\nu \mid \mu \in F_v(x), \nu \in F_w(x)\}, \qquad (39.7)$$

and we also have $F_v + F_w = F_{v+w}$, $F_v.F_w = F_{v.w}$. Sum and product are associative, but $(\mu + \lambda).F_w \neq \mu.F_w + \lambda.F_w$, in general. If both $v$ and $w$ (and therefore also the differentiable extensions) have intervals $]-\infty, a[$ as values, then so does their sum. The philosophy of these combinations is that the action of weights may be better if their influence is mixed and weighted by scalars: as with cooking, the delicate dosage can be controlled.

A more general type of recombination is a non-linear deformation of a weight according to a deformation function $\delta : \mathbb{R} \to \mathbb{R}$. With the previous notation, we define a scalar multiplication
by
$$\delta.w(x) = \{\delta(\nu) \mid \nu \in w(x)\}, \tag{39.8}$$
$$\delta.F_w(x) = \{\delta(\nu) \mid \nu \in F_w(x)\}, \tag{39.9}$$
and it is clear that for two such functions $\delta, \epsilon$ we have $(\delta \circ \epsilon).F_w = \delta.(\epsilon.F_w)$. In practical cases, we often have the weight $w$ being defined by intervals $w(x) = ]-\infty, a(x)[$ for $x \in K$, and therefore also $F_w(x) = ]-\infty, a(x)[$ for any $x \in S$. Since $K$ is supposed to be finite, and since by construction $a(x) = 1$ outside the defining cube, the image set $a(x), x \in S$ is a finite interval $[a_{min}, a_{max}]$. We then consider a continuous deformation function $\delta(a_{min}, a_{max}, \tau)$ such that $\delta(a_{min}, a_{max}, \tau)(t) = t$ for $t \notin [a_{min}, a_{max}]$. For $t \in [a_{min}, a_{max}]$, the deformation parameter $\tau$ describes a one-parameter family of continuous, monotonically increasing deformations of the interval $[a_{min}, a_{max}]$ with $\delta(a_{min}, a_{max}, 0)$ being the identity. Typically, the deformations for $\tau$ and $-\tau$ are related to each other by a reflection of their graph at the main diagonal in $\mathbb{R}^2$. For example, we may take affine images of hyperbolas $y = -1/x$ on intervals $[-u, -1/u]$, where $u = e^{\tau}$, yielding the typical formula $\delta(0,1,\tau)(t) = \frac{t}{e^{2\tau} - t(e^{2\tau} - 1)}$ for non-negative $\tau$, and the symmetric (diagonal reflection) formula for negative $\tau$. The philosophy of non-linear deformations is that often, the action of an analytical weight on performance is qualitatively correct, but its quantitative influence should be distorted in order to yield a good perception. This effect can then be achieved via non-linear deformation functions.
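The hyperbolic deformation can be tried out numerically. The following Python sketch assumes that the normalized formula on $[0,1]$ is $t / (e^{2\tau} - t(e^{2\tau} - 1))$ (the affine image of $y = -1/x$ on $[-u, -1/u]$ with $u = e^{\tau}$) and that a general interval $[a_{min}, a_{max}]$ is handled by affine rescaling. The function names `deform` and `deform_weight` are hypothetical and not taken from any RUBATO® implementation.

```python
import math

def deform(t, tau, a_min=0.0, a_max=1.0):
    """Sketch of delta(a_min, a_max, tau)(t): identity outside [a_min, a_max],
    hyperbolic deformation inside (assumed normalization, see lead-in)."""
    if a_max <= a_min or t < a_min or t > a_max:
        return t
    s = (t - a_min) / (a_max - a_min)      # rescale to [0, 1]
    k = math.exp(2.0 * tau)                # k = u^2 with u = e^tau
    s_def = s / (k - s * (k - 1.0))        # fixes 0 and 1, monotone increasing
    return a_min + s_def * (a_max - a_min)

def deform_weight(bounds, tau):
    """Apply the deformation to the upper bounds a(x) of a weight whose
    values are the intervals ]-inf, a(x)[ (bounds: dict event -> a(x))."""
    a_min, a_max = min(bounds.values()), max(bounds.values())
    return {x: deform(a, tau, a_min, a_max) for x, a in bounds.items()}
```

With this normalization the same expression also covers negative $\tau$: `deform(deform(t, tau), -tau)` returns `t` up to rounding, which is the reflection at the main diagonal mentioned above.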
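The linear recombinations (39.2) to (39.7) can be sketched in the same spirit. The code below is only an illustration under the assumption that a weight is modeled as a Python function returning a finite set of real values; the names `scale`, `add` and `mul` are hypothetical. Finite sets are used because they make the failure of distributivity visible, which interval-valued weights $]-\infty, a[$ with positive scalars would hide.

```python
def scale(lam, w):
    """lambda.w: multiply every value of w(x) by the scalar lam."""
    return lambda x: {lam * v for v in w(x)}

def add(v, w):
    """(v + w)(x): all sums of one value of v(x) and one value of w(x)."""
    return lambda x: {m + n for m in v(x) for n in w(x)}

def mul(v, w):
    """(v.w)(x): all products of one value of v(x) and one value of w(x)."""
    return lambda x: {m * n for m in v(x) for n in w(x)}

# Toy weight with a two-element value set on every event (an assumption).
w = lambda x: {1.0, 2.0}

assoc_left = scale(2 * 3, w)("e")                 # (mu lambda).w
assoc_right = scale(2, scale(3, w))("e")          # mu.(lambda.w)
distr_left = scale(2 + 3, w)("e")                 # (mu + lambda).w
distr_right = add(scale(2, w), scale(3, w))("e")  # mu.w + lambda.w
```

Here `assoc_left` and `assoc_right` coincide, whereas `distr_left` is a proper subset of `distr_right`, because the sum on the right may pair different elements of `w(x)`.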
39.2
Primavista Weights
Summary. Many of the traditional score data are not codified in numerical values. Numerical quantification is, however, a conditio sine qua non for any performing artist. We give an overview of common transformation procedures for non-numerical score parameters, including their extrapolation to continuous weights.
39.2.1
Dynamics
Summary. We discuss the quantification and syntax of absolute and relative dynamics.
Here, we need some preliminary remarks concerning the parametric interpretation of dynamical score signs since these are very coarse and of different types. We distinguish three types:
• Absolute dynamical signs such as ppp, mf, fff, sempre pp, etc. They give information at a determined onset.
• Relative punctual signs such as frz, sf, etc. They indicate a momentary change of dynamics as a function of the momentary dynamical level.
• Relative local signs such as the crescendo signs, giving verbal or pictorial indications on their domain of validity or having the shape of short wedges.
To begin with, absolute dynamic signs are ambiguous verbal descriptions of intended loudness. The first step towards a performance-adequate representation of such symbols is to assign them numerical values in a symbolic loudness scale. The good thing to do here is to assign them values which are in an affine relation to physical (Cents) units (see footnote 3). The precise value relations are however not codified and must be left to the free decision of the formalizing instance (in instrumental practice: the performer, the conductor; in computer systems: the user). So we start with the usually sufficient setup of an increasing sequence
$$\gamma(ppppp) < \gamma(mpppp) < \gamma(pppp) < \ldots < \gamma(mp) < \gamma(p) < \gamma(mf) < \gamma(f) < \ldots < \gamma(ffff) < \gamma(mfffff) < \gamma(fffff) \tag{39.10}$$
of 19 real numbers in the Loudness space. The naive setting would be an equidistant series, such as suggested by MIDI and other technological codes. If we could limit loudness quantification to such absolute signs, this would be the end of the story: We would just define a parser for verbal dynamics signs into the numerical format of the space Loudness. However, the meaning of the absolute signs is more than the above gauging convention. In fact, when we are given a sequence of absolute dynamic signs, we are also facing relative dynamic signs which have extensions, i.e., onsets and "offsets" within the range of the given absolute signs. For example, we may see a crescendo wedge, followed by a second such wedge, and then by a decrescendo wedge. What is the loudness curve for this situation? The basic data is a sequence of absolute loudness symbols which are specified by their onsets and the symbols, i.e., a sequence of denotators in the limit space
$$AbsDynamicEvents \xrightarrow{\mathrm{Id}} \mathrm{Limit}(AbsDyn, Onset, Duration), \qquad AbsDyn \xrightarrow{\mathrm{Id}} \mathrm{Simple}(\mathbb{Z}\langle UNICODE \rangle). \tag{39.11}$$
The duration of such an event is the difference between the onset of the next event and its own onset, except for the last event, whose duration defines the end of the given composition. So we have the sequence $AbsDynSequ =$
$$Evt_1 : 0 \rightsquigarrow AbsDynamicEvents(Dyn_1, E_1, D_1),$$
$$Evt_2 : 0 \rightsquigarrow AbsDynamicEvents(Dyn_2, E_2, D_2),$$
$$\ldots$$
$$Evt_m : 0 \rightsquigarrow AbsDynamicEvents(Dyn_m, E_m, D_m), \tag{39.12}$$
with $D_1 = E_2 - E_1, \ldots D_{m-1} = E_m - E_{m-1}$. This data induces the weight $w_{AbsDynSequ}$ on onset events $E$ with these values $w_{AbsDynSequ}(E) = ]-\infty, a(E)[$:
$$a(E) = \begin{cases} -\infty & \text{if } E < E_1 \text{ or } E_m + D_m \le E, \\ \gamma(Dyn_i) & \text{if } E \in [E_i, E_i + D_i[, \; i = 1, 2, \ldots m. \end{cases} \tag{39.13}$$
3 See appendix A.2.3 for this gauging question.
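The step function $a(E)$ of (39.13) can be sketched as follows. This is a minimal illustration, not the parser of any existing system: the gauging table `GAMMA` contains made-up numbers, and the function name `abs_dyn_bound` is hypothetical. Events are given as `(symbol, onset)` pairs sorted by onset, and only the last duration is passed explicitly, as in the text.

```python
import bisect
import math

# Hypothetical gauging gamma(Dyn); the actual values are left to the user.
GAMMA = {"pp": 40.0, "p": 50.0, "mf": 60.0, "f": 70.0}

def abs_dyn_bound(events, last_duration, gamma=GAMMA):
    """Return the function E -> a(E), the upper bound of the weight
    ]-inf, a(E)[ induced by a sequence of absolute dynamic events."""
    onsets = [onset for _, onset in events]
    end = onsets[-1] + last_duration       # E_m + D_m

    def a(E):
        if E < onsets[0] or E >= end:
            return -math.inf               # no absolute sign is valid here
        i = bisect.bisect_right(onsets, E) - 1   # index with E_i <= E < E_{i+1}
        return gamma[events[i][0]]

    return a

a = abs_dyn_bound([("p", 0.0), ("f", 4.0), ("pp", 6.0)], last_duration=2.0)
```

Each interval is half-open, so at the onset of a new sign the new value applies immediately.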
Over this absolute dynamics data, we now tilt the relative dynamic events: On each interval defined by event $Evt_i$, we are given a sequence of events of this form:
$$RelDynamicEvents \xrightarrow{\mathrm{Id}} \mathrm{Limit}(RelDyn, Onset, Duration), \qquad RelDyn \xrightarrow{\mathrm{Id}} \mathrm{Simple}(\mathbb{Z}\langle UNICODE \rangle). \tag{39.14}$$
The symbols in RelDyn are such as crescendo, molto crescendo, decrescendo, molto decrescendo, for example. Their onset and duration are visible from the position of the wedges or the dashed lines in the score notation. So we are given sequences $RelEvt_{i,.} =$
$$RelEvt_{i,j} : 0 \rightsquigarrow RelDynamicEvents(RelDyn_{i,j}, rE_{i,j}, rD_{i,j}), \tag{39.15}$$
$j = 1, \ldots m_i$, $0 \le m_i$, with $m_i = 0$ for empty sequences. This time, the conditions on the respective onsets and durations are
$$E_i \le rE_{i,1}, \qquad rE_{i,j} + rD_{i,j} \le rE_{i,j+1}, \; 1 \le j < m_i, \qquad rE_{i,m_i} + rD_{i,m_i} \le E_i + D_i. \tag{39.16}$$
This is the framework for redefining the finer dynamics according to the relative signs sequences. The problem here is that we do not know how much a crescendo may increase absolute dynamics in order to remain within the interval $[E_i, E_i + D_i[$ of the given absolute value $\gamma(Dyn_i)$ of the absolute event $Evt_i$. To this end, one needs other information which is implicit in the interpretation of a score: this is the tolerance of dynamical variation around $\gamma(Dyn_i)$ such that we still accept the label $Dyn_i$. So we have to provide each symbol $Dyn$ of absolute dynamics with a tolerance number $0 < \tau_{Dyn}$, and this means that all values in the half-open interval $[\gamma(Dyn) - \tau_{Dyn}, \gamma(Dyn) + \tau_{Dyn}[$ will be accepted as variations of the label $Dyn$. This does not exclude that these intervals may overlap, so the labels can be ambiguous and contradictory: a high pp value can be higher than a low mp value. Upon this tolerance system, we now have to define the meaning of relative dynamics in the sequences $RelEvt_{i,.}$. To this end, we have to fix a quotient $0 < \kappa_{RelDyn}$ of dynamical increase or decrease of each relative dynamical symbol. For crescendi, we suppose $1 < \kappa$, and for decrescendi, we want $\kappa < 1$. For each relative dynamical event $RelEvt_{i,j}$, we calculate the dynamical values (i.e., the upper bounds $b$ of the predicate's intervals $]-\infty, b[$) at $rE_{i,j}$ and $rE_{i,j} + rD_{i,j}$, and then, we interpolate linearly between these cornerstones, whereas the value remains constant in between two relative dynamics events. The first relative event $RelEvt_{i,1}$ has the starting value at $rE_{i,1}$ equal to the absolute value $\gamma(Dyn_i)$ of $Evt_i$. We now suppose inductively that the starting value $v_j$ of the relative event $RelEvt_{i,j}$ has been defined within the open interval $]\gamma(Dyn_i) - \tau_{Dyn_i}, \gamma(Dyn_i) + \tau_{Dyn_i}[$. This defines the quotient $\frac{v_j - (\gamma(Dyn_i) - \tau_{Dyn_i})}{(\gamma(Dyn_i) + \tau_{Dyn_i}) - v_j}$ which we want to increase by the factor $\kappa_{RelDyn_{i,j}}$ defined by the relative dynamical event at index $i, j$, i.e.,
$$\frac{v_{j+1} - (\gamma(Dyn_i) - \tau_{Dyn_i})}{(\gamma(Dyn_i) + \tau_{Dyn_i}) - v_{j+1}} = \kappa_{RelDyn_{i,j}} \cdot \frac{v_j - (\gamma(Dyn_i) - \tau_{Dyn_i})}{(\gamma(Dyn_i) + \tau_{Dyn_i}) - v_j} \tag{39.17}$$
defines the new dynamical value $v_{j+1}$ at the end of the $(i,j)$th relative sign. The new value is still in the open interval $]\gamma(Dyn_i) - \tau_{Dyn_i}, \gamma(Dyn_i) + \tau_{Dyn_i}[$, and we may go on inductively until
all relative dynamical signs are parsed. Observe that a succession of an increase factor $\kappa$ by its inverse neutralizes the dynamical value change. More generally, the succession of two factors $\kappa_1$ and then $\kappa_2$ results in a value change by factor $\kappa_1.\kappa_2$. So we can manage any succession of relative dynamical signs of this "crescendo/decrescendo" type without falling out of the prescribed tolerance interval. Denote the weight on all of the onset axis which is defined in this way by $w_{Evt_., RelEvt_{.,.}}$. So we are left with punctual relative dynamical signs $pct$ at a note $x$, such as $pct$ = accent or $pct$ = marcato. We may define the predicate
$$pct(x) = \begin{cases} \{1\} & \text{if } x \text{ is not marked by } pct, \\ \{\lambda(pct)\} & \text{else,} \end{cases} \tag{39.18}$$
where $\lambda(pct)$ is the increase factor for $pct$. Then, the punctual dynamical sign influences the already given absolute and relative dynamical weight $dyn$ by the product $pct.dyn$ of weights. Of course this implementation need not live forever, but it is one reasonable solution of a non-trivial problem of making blurred score signs precise. Observe that this kind of weight is not a performance map, it is just a weight on the symbolic (mental) events that must be used for operators to be defined later. Observe also that the above weights are not continuous functions of onset if several absolute dynamical signs are present.
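The inductive rule for relative dynamics can be made concrete with a short sketch. It assumes that (39.17) states that the quotient $(v - lo)/(hi - v)$, with $lo = \gamma - \tau$ and $hi = \gamma + \tau$, is multiplied by $\kappa$; solving for the new value gives $v_{j+1} = (lo + q\,hi)/(1 + q)$ with the new quotient $q$. All function names and the numbers in the example are hypothetical.

```python
def next_value(v, kappa, gamma, tau):
    """Solve (v' - lo)/(hi - v') = kappa * (v - lo)/(hi - v) for v'."""
    lo, hi = gamma - tau, gamma + tau
    q = kappa * (v - lo) / (hi - v)        # new quotient, positive for lo < v < hi
    return (lo + q * hi) / (1.0 + q)       # stays strictly inside ]lo, hi[

def rel_dyn_bound(gamma, tau, rel_events):
    """rel_events: sorted, non-overlapping (kappa, onset, duration) triples
    inside one absolute event. Returns E -> upper bound b(E)."""
    segments, v = [], gamma                # the first sign starts at gamma
    for kappa, onset, dur in rel_events:
        v_next = next_value(v, kappa, gamma, tau)
        segments.append((onset, onset + dur, v, v_next))
        v = v_next

    def b(E):
        value = gamma
        for start, end, v0, v1 in segments:
            if E < start:
                break                      # constant between two signs
            if E < end:                    # linear interpolation on the sign
                return v0 + (v1 - v0) * (E - start) / (end - start)
            value = v1
        return value

    return b

def apply_punctual(bound, factor, marked):
    """Product pct.dyn on the bound: multiply by lambda(pct) if marked."""
    return bound * factor if marked else bound
```

For example, with $\gamma = 60$, $\tau = 5$, a crescendo with $\kappa = 2$ followed by a decrescendo with $\kappa = 1/2$ returns to the value 60, which illustrates the neutralization by the inverse factor.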
39.2.2
Agogics
Summary. We discuss the quantification and syntax of absolute and relative agogical indications. Special attention is paid to general curve types for retards, fermatas, and general pauses.
These are the common agogical indications:
• Absolute tempo, such as Mälzel metronome, or anterior verbal indications of type andante, adagio, etc. Formally, they correspond to absolute dynamic signs.
• Relative punctual tempo signs such as fermatas and general pauses. Remarkably, there is no relative punctual acceleration sign corresponding to the fermata.
• Relative local tempo signs are the following:
1. coarse indications concerning agogics, e.g., ritardando, rallentando, accelerando, stringendo etc.,
2. notation of correspondence between two adjacent tempi, such as "2/4 = 3/8",
3. reset signs such as a tempo.
The essential difference to dynamical signs is that symbolic onset and duration are precisely codified, so agogical signs relate to proper performance transformations, and not to making blurred signs precise.
We have already discussed some of the agogical predicates (fermata, value change) in the presentation of weights for the prime mother LPS in stemma theory, section 38.3.2. Let us first look at the Mälzel metronome sign x quarter/Min. This is a predicate which takes its $\top$ value exactly on onsets $E$ and tempi $x$ where a Mälzel indication is x quarter/Min., and $\bot$ else. The reader may easily make the underlying forms precise. Verbal absolute tempo indications need a parsing instance to render them in numerical (Mälzel) terms.
Figure 39.1: The interpolation curve of a fermata weight. See the text for the explanation of the symbols. [Figure not reproduced; recoverable labels: onset E, onset range of fermata D, tempo shape, tempo levels 100%, a%, b%, onsets D.d and D.u.]
A fermata has only its onset $E$ made precise, the rest is blurred. We need several additional parameters to generate a viable weight. We first need a duration $D$, then a shape parametrization to describe the fermata's tempo sink. We may for example take this form:
$$Fermata \xrightarrow{\mathrm{Id}} \mathrm{Limit}(Onset, Duration, Bottom, Down, Up, After) \quad \text{with} \quad Bottom, Down, Up, After \xrightarrow{\mathrm{Id}} \mathrm{Simple}(\mathbb{R}). \tag{39.19}$$
A fermata $Ferm : 0 \rightsquigarrow Fermata(E, D, b, d, u, a)$ is parametrized by (1) the percentage number $b$ of maximal tempo lowering with respect to the given tempo, (2) the percentage $d$ of the duration $D$ from the beginning to reach the lowest tempo, (3) the percentage $u$ from the beginning to restart getting back to the following tempo, (4) the percentage $a$ of the original tempo which is resumed after getting back. This data is used to define a cubic spline interpolation as follows: We have four onsets: $E$, $E + d.D$, $E + u.D$, $E + D$ and corresponding relative tempo values
1, b, b, a. This defines a discrete weight w(Ferm) and a corresponding interpolation function Fw(Ferm) , see figure 39.1. Of course, many different curves are possible, but with this shape, the main quality of a fermata can be imitated. Weights for accelerando- or ritardando-typed predicates follow the building scheme of a fermata, except that the duration of these signs is defined. We leave this as an exercise to the reader.
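The fermata construction above can be sketched in a few lines. The text only says that a cubic interpolation is used; the sketch below assumes, as one possible choice, a piecewise cubic Hermite curve with horizontal tangents at the four nodes $(E, 1)$, $(E + d.D, b)$, $(E + u.D, b)$, $(E + D, a)$, with percentages given as fractions. The function name `fermata_tempo_factor` is hypothetical.

```python
def fermata_tempo_factor(E, D, b, d, u, a):
    """Return t -> relative tempo factor of a fermata with onset E and
    duration D (assumed Hermite interpolation with zero slopes at nodes)."""
    xs = [E, E + d * D, E + u * D, E + D]
    ys = [1.0, b, b, a]

    def f(t):
        if t <= xs[0]:
            return 1.0                     # original tempo before the fermata
        if t >= xs[-1]:
            return a                       # resumed tempo after the fermata
        for i in range(3):
            x0, x1 = xs[i], xs[i + 1]
            if x0 <= t <= x1:
                if x1 == x0:
                    return ys[i + 1]       # degenerate segment
                s = (t - x0) / (x1 - x0)
                h = s * s * (3.0 - 2.0 * s)    # cubic with zero end slopes
                return ys[i] + (ys[i + 1] - ys[i]) * h

    return f

# Example: tempo sinks to 40 %, stays there, and returns to 90 %.
f = fermata_tempo_factor(E=10.0, D=4.0, b=0.4, d=0.25, u=0.75, a=0.9)
```

An accelerando or ritardando could reuse the same interpolation with only two nodes, which is one way to approach the exercise that follows.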
Exercise 80 Give an explicit description of accelerandi and ritardandi in terms of denotators and associated weights.
The general pause (G.P.) predicate is also undetermined like the fermata. However, it does not imply a smooth recapitulation of the original tempo: After the general pause, the previous tempo is reset. We will have to deal with this in corresponding operators. The same phenomenon of a tempo reset is the case for the a tempo predicate.
39.2.3
Tuning and Intonation
Summary. We discuss the weights for pitch values.
Tuning and intonation is a delicate subject for the PrimavistaOperator since it is not clear from the beginning how much the settings depend on the instrument and how they depend on individual instruments. For piano, the situation is easy since we have a fixed tuning and very often, it is even well-tempered. For violins, this is much more complicated: Should the primavista (!) tuning be a just tuning for each tonality which is encountered in the score, or should it be just one "default" just tuning? It is wise to let a special operator, not necessarily the PrimavistaOperator, do the work of delicate tuning and to just operate the minimum on the first process level. The tuning information is twofold: We have the chamber pitch which is the initial set and initial performance, this is ok. And we have the tuning data for all pitch events of a sufficiently large chromatic scale (88 keys for common pianos). This is completely analogous to a step tempo function given by a number of absolute tempo settings. One may use the form
$$Tune \xrightarrow{\mathrm{Id}} \mathrm{Limit}(Pitch, StepTune) \quad \text{with} \quad StepTune \xrightarrow{\mathrm{Id}} \mathrm{Simple}(\mathbb{R}) \tag{39.20}$$
being the tuning quantity that measures the "pitch velocity" between two neighboring pitches. Examples for default tuning for some classical cases are found in appendix K. The common well-tempered case has the denotators
$x : 0 \rightsquigarrow Tune(x, 1/100 \; \mathrm{Semitone/Ct})$ for each pitch $x$ of the well-tempered chromatic scale.
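By analogy with a step tempo function, the physical pitch can be obtained by accumulating the inverse of the tuning quantity from the chamber pitch outward. The sketch below rests on that reading, which is an assumption, and on a hypothetical indexing of the 88 keys from 0 to 87 with key 48 as the reference; the function names are not from the source.

```python
def well_tempered_tune(n_keys=88):
    """One (pitch, StepTune) pair per key; StepTune in semitone per cent."""
    return [(key, 1.0 / 100.0) for key in range(n_keys)]

def physical_pitch_cents(tune, ref_key, ref_cents=0.0):
    """Accumulate cents from the reference key, using 1/StepTune as the
    number of cents per symbolic semitone on each step (assumed reading)."""
    keys = [k for k, _ in tune]
    step = dict(tune)
    i = keys.index(ref_key)
    cents = {ref_key: ref_cents}
    for j in range(i + 1, len(keys)):          # upward from the reference
        lower, upper = keys[j - 1], keys[j]
        cents[upper] = cents[lower] + (upper - lower) / step[lower]
    for j in range(i - 1, -1, -1):             # downward from the reference
        lower, upper = keys[j], keys[j + 1]
        cents[lower] = cents[upper] - (upper - lower) / step[lower]
    return cents

cents = physical_pitch_cents(well_tempered_tune(), ref_key=48)
```

In the well-tempered case every step contributes 100 cents, so the result is the familiar equal spacing; a non-constant `StepTune` would stretch or compress individual steps.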
39.2.4
Articulation
Summary. We present the parameters for articulation types, from molto staccato to molto legato.
On one hand, articulation is a predicate type which regards single notes, for example molto staccato, staccato, legato, or molto legato. All these predicate types can be encoded by weights on events $x$ of a space $U$ within the BP hierarchy which contain duration and take their values $]-\infty, a(x)[$ according to whether articulation stretches durations or compresses them. For staccato and other compressing signs, we take $0 < a(x) < 1$, whereas for legato and other stretching signs, we take $1 < a(x)$. On the other hand, we have articulation in the sense of grouping of a set $G$ of usually consecutive notes which are grouped by an articulation slur via form $ArtiSlur_{U,S}$ as introduced in section 38.3.2. Such a predicate may be made more precise by a weight which takes into account the group $G$ as well as the single notes $x$ with their position within $G$. On a space $U$, this is covered by the form U2 with the notation from formula (35.18). The weight $w_{GrpArti}(G, x) = ]-\infty, a(G, x)[$ is parametrized by the number $a(G, x)$ which tells how much the duration of the note $x$ is altered relative to its nominal duration.
39.2.5
Ornaments
Summary. Historically, ornaments form a complex set of constructions. We introduce a unified language for ornaments which is based on macro-events (see section 6.7).
Recall from section 6.7 that we had defined macro-events by the form $Makro_{Basic}$ which is based on an event form $Basic$. Here, we take $Basic = U$, one of the event spaces in the hierarchy of our stemma. An ornament (one of the many forms of a trill, for example [66]) of events in $U$ can be described by a special denotator of the form $Makro_U$. An ornament is first of all anchored at an event $a$ of $U$, this is the note which is ornamented by a specific sign, with a reference onset $a$, see figure 39.2. So we start our ornament denotator by a singleton: $Ornament : 0 \rightsquigarrow Makro_U(\{D\})$ with
$D : 0 \rightsquigarrow Knot_U(a, SmallAnchors)$. Here, the next ramification denotator $SmallAnchors$ encodes the three anchor events of the three structural units $O_1, O_2, O_3$, each possibly with repetitions within:
with macros $P_1, P_2, P_3$. Each of these macros is a set of knots:
with multiplicities $m_i$. Each such knot has the shape
where the macros $O_i$ are independent of the second index $j$: They signify a repetition of the same macro according to the shift quantities $d_{i,j}$. The terminal macros $O_i$ have this meaning: The first, $O_1$, is the starting "micro-motif" of the ornament, it is played before the reference onset of $a$ and ends with the beginning of the second micro-motif $O_2$ which usually starts from $a$. This inner motif is repeated several times, usually something like eight times, and very fast. The ornament terminates on the third micro-motif $O_3$ which is a tail to the ornament and may be played much slower than the middle sequence. If we apply successive flattening operations $Flatten^n$, $n = 0, 1, 2, 3$ to the Ornament denotator, we obtain a hierarchy of successively appearing $U$-events which gives rise to different refinement levels of tempo curves which an Ornament Operator in the PrimavistaOperator has to manage. This perspective suggests that the hierarchy spaces $U$ in BP should really be extended to the spaces $Knot_U$ in order to couple stemmatic refinement with hierarchies of sounds, but this theory is not developed so far. It transpires however that macro-events fit in the general weight scheme which is defined by successive powerset constructions from the spaces $U$ in BP. So we can assign to each of the involved macro-events or knots a weight in 11'IR in order to weight the performance role of the ornament's components.
Figure 39.2: The structure of an ornament, showing a start shape (sequence) $O_1$, a sequence of middle shapes $O_2$, and a final shape (sequence) $O_3$. [Figure not reproduced; recoverable labels: onset, reference onset, tempo, total duration.]
39.3
Analytical Weights
Summary. We give a generic view on analytical weights. Their concrete shape depends on the analysis which is available. Several types are realized on analytical RUBETTE® modules, see chapter 41.
Analytical weights are crucial to rational performance and intervene from analyses of the score $S$ relating to metrical, rhythmical, motivic, thematic, harmonic, contrapuntal, grouping and other structural perspectives. We shall sketch the construction for metrical, motivic, and harmonic weights. In all these descriptions, we shall abbreviate the nomenclature of predicates and simply write down the numerical values $a$ of such weights, the predicative statement being implicit, either in form of intervals $]-\infty, a[$ or in form of singleton predicates $\{a\}$; the information is the same, and the usage of the operators does not depend on which predicative encoding is chosen. Also we restrict to the discrete weights, the continuous associated weights being automatic from our previous discussions (subsection 39.1.1).
1. Metrical weights $w^S_{metro}$: We have a sober weight on the onset space $U = Onset$ for which we have prepared a formula in example 43. In that formula, the weight $nW(\sigma)$ is calculated on a simplex $\sigma$ of maximal local meters. For an onset $E$, we then take the simplex $\sigma_S(E)$ consisting of all maximal meters within the onset kernel $S_{Onset}$ of $S$ which contain $E$ and set $w^S_{metro}(E) = nW(\sigma_S(E))$ (with the evident singular value $nW(\emptyset) = \bot$ for onsets outside $S_{Onset}$).
2. Motivic weights $w^S_{motif}$: Following the preliminary discussion in section 22.9, we take an interpretation $S^{\mu}_{Onset \oplus Pitch}$ of the onset-pitch space of BP by motives. More precisely, $\mu$ may be chosen as the set of all motives within a range of cardinality and extent of onsets between the first and last event. We define the sober motif weight which is induced by the weight on motives from the atlas $\mu$. So we are left with the definition of the weight on motives $M \in \mu$. To this end, we consider a more precise motif theory framework as defined in chapter 22. We choose a shape type $t$ and a distance $d$, on which an equivariant, isometric group action of a paradigmatic group $P$ is defined, and such that we have the inheritance property fulfilled (for example, the elastic and diastematic types with Euclidean distance and counterpoint groups). We then have the epsilon topology $\mathcal{T} = \mathcal{T}_{t,P,d}$ and choose a neighborhood radius $0 < \epsilon$. For a motif $M \in \mu$, we consider the $\epsilon$-disk neighborhood $D_\epsilon(M)$ of $M$ in the relative topology $\mathcal{T}|_\mu$. We define the $\epsilon$-presence of $M$ by the weighted sum
$$\mathrm{presence}_\epsilon(M) = \sum_{N \in D_\epsilon(M)} 2^{card(M) - card(N)} \tag{39.21}$$
which visibly counts the elements of the $\epsilon$-neighborhood of $M$ with a weight according to the cardinality difference to $M$'s cardinality. This is roughly speaking (see footnote 4) the presence implemented in the MeloRUBETTE®.
4 In our implementation of this function, we have also taken into account the multiplicity of submotives $X$ of the $N$ in the neighborhood with $card(X) = card(M)$, which are in the distance less than $\epsilon$ to $M$.
We then also define an $\epsilon$-content of $M$ function which is the following weighted sum:
$$\mathrm{content}_\epsilon(M) = \sum_{N, \; M \in D_\epsilon(N)} 2^{card(N) - card(M)}, \tag{39.22}$$
and we therefrom get the weight of $M$, i.e., the product
$$\mathrm{weight}_\epsilon(M) = \mathrm{presence}_\epsilon(M) \cdot \mathrm{content}_\epsilon(M) \tag{39.23}$$
which (again roughly speaking, as also implemented in the MeloRUBETTE®) measures the presence of $M$ combined with the content of $M$. Intuitively, the weight of $M$ is a measure of where $M$ appears in other motives, up to $\epsilon$ distance, and how many motives are contained in $M$, up to $\epsilon$ distance. The coefficients are a weight for the cardinality distance from $card(M)$.
3. Harmonic weights $w^S_{harmo}$: This weight is also a predicate whose values are non-false only on the onset-pitch space $S_{Onset \oplus Pitch}$ of BP. To define it, we have to start from an interpretation $S^{\eta}_{Onset \oplus Pitch}$ by chord events. The construction of a chord atlas is however not uniquely determined since it is not clear what is a chord. We may just take those local compositions of all events having one and the same onset. But we may also consider all the onsets where some chord notes end and some others still last, and then take the still lasting notes as constituents for new chords at those ending onsets. Given such an interpretation $\eta$, we take its abstraction in form of a temporally defined sequence $a_. = a_0, a_1, \ldots a_k$ of length $k$ of chords, i.e., local compositions in the Pitch space. For any such sequence $a_.$, we have defined weight functions in the context of the Riemann algebra, as discussed in section 27.2. There, formula (27.13), with its interpretation as a truth denotator-valued weight, as explained in the example of section 27.2.2, gives us a criterion for finding a best path $p_{opt} = (v_0, f_0, a_0) \to (v_1, f_1, a_1) \to \ldots (v_k, f_k, a_k)$ in the Riemann quiver. We now use this best path to calculate the tension $t_{p_{opt}}(a_i)$ of each chord $a_i$ with respect to this path. The immediate tension function would be $t_{p_{opt}}(a_i) = n(p_{opt}|a_i)$, where we denote by $p_{opt}|a_i$ the optimal path from the beginning to (inclusively) chord $a_i$. However, the global tension is a problematic quantity as discussed in section 27.2.1. We therefore renormalize the function $t_{p_{opt}}(a_i)$ by the unique shearing of its graph in $\mathbb{R}^2$ such that the first point $(0, t_{p_{opt}}(a_0))$ goes to a predefined initial tension $(0, t_0)$ whereas the last point $(k, t_{p_{opt}}(a_k))$ goes to $(k, t_k)$. This has the advantage that the global tension can be cast to initial and final values which the local methods cannot predict and control. Call $t^{t_0,t_k}_{p_{opt}}$ this new tension function.
In order to calculate the weight of single notes $x$ within a chord $a_i$, we look at the Riemann matrix predicates $TF_{f,t}(a_i) = [0, \phi(a_i)[$ and $TF_{f,t}(a_i \setminus x) = [0, \phi(a_i \setminus x)[$ where $a_i \setminus x$ is the chord $a_i$ after omission of $x$. We know that $\phi(a_i)$ is not zero. We may then map the interval $[0, \phi(a_i)]$ onto the interval [IP, 1], 0 < IP < 1 by an affine map $Q$ for a normalization purpose. Then, we get the relative importance $rel(x, a_i)$ of $x$ within $a_i$ by the formula (39.24)
This gives us the final expression for the weight of the note $x$: We take the tension of the underlying chord $a_i$ and multiply it by the relative importance of $x$ within this chord, i.e.,
$$w^S_{harmo}(x) = rel(x, a_i).t^{t_0,t_k}_{p_{opt}}(a_i). \tag{39.25}$$
This is the harmonic weight which is implemented in the HarmoRUBETTE®. An important technique to produce new weights from analytical weights is the boiling down method: Often, a weight is given on a space where the operator at hand does not work. For example, the TempoOperator (to be discussed later in this chapter) needs a weight on the Onset space. If a weight lives in a space with more dimensions, we should be able to boil it down to a weight on Onset. Here is the procedure: We have two spaces $U, V$ in the system BP, and a projection $p_V : U \to V$. A weight $w$ is given on the kernel $S_U$, and we would like to get a boiled-down weight $BD_V(w)$. We make these two definitions:
$$BD_V(w)(x) = \sum_{y \in p_V^{-1}(x)} w(y), \tag{39.26}$$
$$BD_V^{mean}(w)(x) = \frac{1}{card(p_V^{-1}(x))} BD_V(w)(x), \tag{39.27}$$
where $x \in S_V$, and with the value $-\infty$ for empty fibers. In general, we have this situation: we would like to have a weight $w$ on a specific space $V$ of the hierarchy, but the given weight lives either in a larger space $U$ which projects onto $V$, or it lives in a smaller space $U$ onto which $V$ projects. The former case is solved by the boiled-down construction $BD_V(w)$, $BD_V^{mean}(w)$, whereas the latter is straightforward by the formula $w_U(x) = w(p_U(x)), x \in V$. In the future, whenever we use a weight which is possibly defined in the "wrong" space, these constructions are referred to, and we simply write $w$ instead of the above correct symbols if no ambiguity is likely.
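The two boiling-down definitions can be illustrated with a small sketch. It assumes that a weight is stored as a dictionary from kernel events to numerical values, that the projection is an ordinary function, and that the target points are listed explicitly; the name `boil_down` and the toy data are hypothetical.

```python
import math
from collections import defaultdict

def boil_down(weight, project, base_points):
    """Return (BD, BD_mean) as dictionaries on base_points.
    weight: dict event -> value on the kernel of U; project: U -> V."""
    fibers = defaultdict(list)
    for y, value in weight.items():
        fibers[project(y)].append(value)   # collect values over each fiber
    total, mean = {}, {}
    for x in base_points:
        values = fibers.get(x, [])
        if not values:                     # empty fiber
            total[x] = mean[x] = -math.inf
        else:
            total[x] = sum(values)
            mean[x] = total[x] / len(values)
    return total, mean

# Toy weight on (onset, pitch) events, boiled down to the Onset space.
w = {(0, 60): 1.0, (0, 64): 3.0, (1, 62): 2.0}
bd, bd_mean = boil_down(w, project=lambda ev: ev[0], base_points=[0, 1, 2])
```

The sum version favors onsets carrying many events, whereas the mean version is insensitive to the size of the fiber; which one is appropriate depends on the operator that consumes the weight.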
39.4
Taxonomy of Operators
Summary. Though a general description of operator principles is risky (since there is no general theory of how a stemma can be altered), we want to give a preliminary classification of the ways an operator may choose to intervene in the existing configuration. This taxonomy is guided by the generic description of how performance works: as a transformation from mental to physical reality, and by means of its description via performance fields given in section 33. This means that an operator can intervene on the level of mental or physical reality, on the frame of the local performance cell, see chapter 35.1, and on the performance field together with its initial set.
The basic data of a performance is encoded in the performance map $p : K \to p(K)$ on the given performance cell. An operator has to change any of the involved structures: $K$, $p$, or $p(K)$. This is understood in the sense that $K$ is a set of arguments upon which the prescription $p$ acts, and $p(K)$ is the set which is the output of that map.
The first operator type acts on K. It does this in two ways: First, named splitting operator, it just divides the kernel K into sub-kernels, nothing else. Second, it alters the kernel's events and submits these new events to the given prescription p. This one is termed symbolic operator. It is as if the artist would play in the same "mood" with a changed score. The third type takes the input K, the map p, and the output p(K) as given and now changes the output without regard to the previous process. This is the physical operator type. The fourth type takes the input kernel K for granted and just changes the map's "formula" . For example it changes the tempo curve or the intonation, etc. This is the so-called field operator type. This is the most complex operator type and has not yet been understood in its different outfits. Let us now have a closer look at these types.
39.4.1
Splitting Operators
Summary. A splitting operator is rather simple. It restricts to partitioning the performance score's kernel into sub-kernels without any further change on the performance score. Splitting may also be operated on a group of selected instruments in a global performance score.
The idea of a splitting operator is this: You are given a score with different groups of notes that you want to perform in an individual way. For example: right hand and left hand, or onset-driven grouping: split the score at a given parameter, such as onset or loudness or duration etc. This may happen at a grouping line such as the beginning and the end of an eight-bar period. Very often, this also happens if a special group needs a special performance procedure because of its inherent structure, such as a trill or another ornament. Whatever you will do to this group is irrelevant, you decide later. So the splitting operator is a propaedeutical operator intending to prepare more in-depth operator actions. A prototypical realization of the splitting operator may be implemented by use of an essentially mathematical predicate $Split_{U,\nu}$, where $U$ is one parameter (such as pitch, onset, etc.) in the given hierarchy, i.e., of the top space BP, and where $\nu$ is a real number parametrizing a coordinate value in the $U$ parameter. Then if $X$ is an event in one space of the hierarchy, the predicate $X/Split_{U,\nu} = \top$ iff either $X$ does not share this parameter $U$, or if it shares it and the $U$-value $X_U$ of $X$ verifies $\nu \le X_U$. So $X/Split_{U,\nu} = \bot$ iff $X_U$ exists and $X_U < \nu$. We then get the logical combination predicate $Split_{U,\nu,\mu} = Split_{U,\nu} \,\&\, \neg Split_{U,\mu}$ which selects those events with either no $X_U$ coordinate or else with $\nu \le X_U < \mu$. In the RUBATO® software, the splitting operator has been implemented such that it may perform any logical conjunction of splitting operators of the types $Split_{U,\nu}$ and their negations $\neg Split_{U,\nu}$. For example, this enables us to select all events within a half-open onset interval $[E_1 = \nu, E_2 = \mu[$ and a half-open loudness interval $[L_1, L_2[$, and having durations $D$ with $D_1 = \delta \le D$.
So the splitting operator Split (we omit further specifications in this notation) is defined by a Boolean predicate which extracts a given set K' of events from a given kernel K. This means that Split produces two daughters, one with the K' kernel, the other (by logical negation of the former) with the remainder kernel K - K'. In the nomenclature of section 38.3.3, we have a coupling
$$O^{(i)} \propto \mu = \lambda_i, \quad i = 1, 2$$
which is also written more intuitively as
$$Split \propto \mu, \tag{39.28}$$
$$RemainderSplit \propto \mu. \tag{39.29}$$
This operator may also be applied to a GPS $(\mu_i)_i$ just by applying the operator simultaneously to all the member LPS $\mu_i$ of this GPS. We should add that all the other data of the mother(s) in $\mu$ (in $(\mu_i)_i$) are left as they are, and everything still works fine. The new LPS just have some points of the original kernel being removed. In the formal setup of the weight system of an LPS, the predicate Split would be viewed as being a weight of both daughters and the operator would then act on the mother via the Boolean selection and its negation which is defined by this weight.
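The splitting mechanism can be sketched as a Boolean predicate that partitions a kernel into two daughters. This is only an illustration: events are modeled as dictionaries of parameter values, the names `split`, `window` and `split_kernel` are hypothetical, and nothing here reflects the actual RUBATO® code. The example kernel contains only events that carry the parameters being tested.

```python
def split(param, nu):
    """Split_{U,nu}: true iff the event lacks parameter U or nu <= X_U."""
    return lambda ev: param not in ev or nu <= ev[param]

def window(param, nu, mu):
    """Conjunction Split_{U,nu} & not Split_{U,mu}: nu <= X_U < mu."""
    lower, upper = split(param, nu), split(param, mu)
    return lambda ev: lower(ev) and not upper(ev)

def split_kernel(kernel, predicate):
    """Return the two daughters: selected kernel K' and remainder K - K'."""
    selected = [ev for ev in kernel if predicate(ev)]
    remainder = [ev for ev in kernel if not predicate(ev)]
    return selected, remainder

kernel = [
    {"onset": 0.0, "loudness": 50.0, "duration": 1.0},
    {"onset": 2.0, "loudness": 70.0, "duration": 0.5},
    {"onset": 4.0, "loudness": 60.0, "duration": 2.0},
    {"onset": 6.0, "loudness": 60.0, "duration": 1.0},
]
in_onset = window("onset", 2.0, 6.0)
long_enough = split("duration", 1.0)
selected, remainder = split_kernel(
    kernel, lambda ev: in_onset(ev) and long_enough(ev)
)
```

Every event lands in exactly one daughter, so the two kernels together recover the mother's kernel, in line with the remark that only some points of the original kernel are removed in each new LPS.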
39.4.2
Symbolic Operators
Summary. A symbolic operator affects the score data before they are performed whatsoever. This means that the operator really changes the composition which is to be interpreted. Such an operator type is seemingly contradictory to the very objective of performance. Nonetheless, primavista weights suggest an intervention of operators before any real performance, Le., performance is already initiated on the very level of the score's interpretation. -~-
Symbolic performance operators are a delicate species since they intervene at a very early stage of the performance process: on the kernel level of the LPS hierarchy. One could see this fact as a natural completion of the performance philosophy in that composition is the first stage of performance, so why not alter the kernels, i.e., make a new composition out of an old one? We adopted this integrative point of view while developing RUBATO®. From the software engineering perspective, this is indeed tempting since it would in the limit yield an integrated software for performance and composition. In this generic setup the program routine must, however, recalculate virtually every ingredient of the given LPS. The change of the basic score elements in a recomposition entails that the frames, the initial sets, the initial performances, the fields, and the weight list must be updated, or rather rewritten from scratch. Potentially, nothing will be the same again. So the operator would then just be a reset of the complete LPS data for a new composition. And this is not what performance was meant to do. After all, having rehearsed a given piece and then being told that you get a new score but may go on with the old tempo curves is not the kind of thing you will enjoy, since it makes no sense to perform on such schizophrenic data. For example, if a tempo curve has been developed, and then all onsets and durations are reset to half of the former values, the tempo curve becomes useless unless you also redefine the curve by a so-called time stretching operation (multiply the time arguments by 0.5). In order to avoid such risks, one should really recalculate all weights whenever a symbolic operator has changed the kernel data. But even then this would not necessarily be the right solution since some primavista weights of analytical nature really need to be calculated on an unrefined score, i.e., on symbolic data which do not yet share some sophisticated symbolic
explications, for example in dynamics. This problem could be attacked by an implementation of weight calculation routines which average out the over-refined symbolic data, but this is not very elegant. It remains a fact that weights should also be given the parameter of the stemma LPS where the weight is instantiated. By this method, one would then be sure that a weight is related to a determined kernel in the stemmatic inheritance tree and not exclusively to the primary kernel data. With these caveats in mind, we may nevertheless define a useful (though also risky!) symbolic operator. We name it SymbolicBruteForceOperator to remind you of this dangerous enterprise. It takes as arguments a "directional factor" $d \in W$, $W$ a space of the hierarchy $H$, with components $d_R$ for parameters $R$ of $W$, and a weight $w$. Then, if $X \in U$ is an event of $\mathrm{Kernel}_U$, we set
$$\mathrm{SymbolicBruteForceOperator}_{d,w}(X) = w_d(X) \tag{39.30}$$
where $w_d(X)$ is the new event in $U$, whose $R$-coordinate $w_d(X)_R$ is the product $w(X)\cdot d_R\cdot X_R$ if $R$ is a coordinate of $W$, and $X_R$ else. So we only change the $X$-coordinates named by the directional factor, and there we scale the coordinates of $X$ by the weight and by a fixed directional coordinate. The risk here is that for $d$ or $w(X)$ values far from 1, the image $w_d(X)$ is likely to fall out of the given frame. But one may then easily redefine the frame, the field, etc., since usually this symbolic operator is applied in a stemma stage where we are just given the default data. As we have seen in section 38.3.2, this data can be adapted to special kernels without difficulty: The frame is extended to include new events, the performance field is the constant field, and the initial set and performance are not a function of special kernel data. Under general stemmatic conditions this operator is risky, but it is also an elegant solution of some nasty parameter problems which arise from the fuzzy score notation. For example, if we are given a weight for primavista dynamics $w_{\mathrm{Evt.},\mathrm{RelEvt.}}$ (see section 39.2.1), then the directional factor $d = (1) \in \mathrm{Loudness}$ gives the complete dynamics values if the given loudness was set to a preliminary value 1, say. The same holds for other primavista weights relating to symbolic parameter values. For dynamics, it seems to be clear that the symbolic operator is the right one to be applied. For other primavista weights this is not so easy to decide. For example, the articulation weights for slurs, legati, or staccati could be seen as symbolic prescriptions to alter duration within the score event framework. We contend that this is not analogous to the dynamics situation because absolute dynamical signs (such as ff, mf) are not quantified and even less are relative dynamical signs (such as crescendo). We therefore have a clear mission to transform such verbal signs into quantitative data.
In contrast, duration is a perfectly quantified parameter, so it needn't be generated on that level. We therefore decided to deal with these weights in the context of field operators which alter the mother's articulation field. This minor dilemma demonstrates (once more) that one may have the same performance output with very different infrastructure in the transformation process from the score to the physical events. This subject has not been dealt with from a more theoretical point of view: It pertains to the inverse performance theory to be dealt with in part XII, i.e., the theory of the variety of performance scores leading to a given performance output. But there is no result on the variety of local or global performance scores with variable score data inducing a given performance output; everything to date supposes that we are given a fixed score. Especially in ethnomusicological contexts, where the very concept of a score is uncertain, this is not
acceptable. For completeness of this taxonomy, we should add that we also call an operator symbolic if it changes the initial set of a performance cell. This may be necessary if we want to enrich certain boundary conditions on the performance of selected groupings of notes.
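The brute-force symbolic operator of equation (39.30) amounts to a coordinate-wise scaling of the symbolic event. The following Python lines are a minimal sketch under the assumption that an event is a dictionary of symbolic coordinates; the function name and the sample weight are hypothetical and do not reproduce any RUBATO® routine.

```python
from typing import Callable, Dict

Event = Dict[str, float]   # assumed event model: parameter name -> symbolic value

def symbolic_brute_force(event: Event, d: Dict[str, float],
                         w: Callable[[Event], float]) -> Event:
    """Return w_d(X): scale each coordinate R of d by w(X) * d_R, keep the rest."""
    factor = w(event)
    return {r: (factor * d[r] * v if r in d else v) for r, v in event.items()}

# Loudness carries the preliminary value 1; the (hypothetical) weight supplies dynamics.
note = {"onset": 4.0, "pitch": 60.0, "loudness": 1.0}
dynamics_weight = lambda x: 70.0 + 2.0 * x["onset"]
louder = symbolic_brute_force(note, {"loudness": 1.0}, dynamics_weight)
```

Since the result is a new symbolic event, a caller would still have to check that it lies in the frame, which is exactly the risk discussed above.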
39.4.3
Physical Operators
Summary. Physical operators affect given results of performance which are inherited from the mother's data. They act in a very simplistic way since they do not relate to the deeper process of performance; they only take for granted what the mother had already produced as a performance output.
At the other end of the operator scale, we have the physical operators. While symbolic operators should be applied at the very beginning of a stemmatic unfolding, physical operators should be applied after all other operators have been deployed, when there is no longer any risk that those are applied once more. An exception to this general rule is the case where another operator is applied after a physical operator, but not in any of the coordinates which the physical operator has changed. But whoever applies such exceptional stemmatic operations must know very well what is being done, as with pointers in C-style programming. Formally, what we call the PhysicalBruteForceOperator is quite analogous to the operator SymbolicBruteForceOperator discussed in equation (39.30). We are again given a directional factor $d \in W$, $W$ a space in the hierarchy $H$, with components $d_R$ for parameters $R$ of $W$, and a weight $w$. Then, if $X \in U$ is an event of $\mathrm{Kernel}_U$, we set
$$\mathrm{PhysicalBruteForceOperator}_{d,w}(X) = w_d^{\wp}(X) \tag{39.31}$$
where $w_d^{\wp}(X)$ is an event in the physical space $u$ corresponding to $U$, whose $r$-coordinate ($r$ corresponding to $R$ according to our general nomenclature) is the product $w(X)\cdot d_R\cdot\wp(X)_r$ if $R$ is a coordinate of $W$, and $\wp(X)_r$ else. So this operator really changes the performance at the point $X$, whereas the symbolic operator only affects the symbolic point and does not intervene on the $\wp$-level or the output. There is another very pleasant difference: Whereas the symbolic operator is risky as to the frame, field, and initial conditions, the physical operator is completely indifferent to these conditions. It only needs the mother's performance, and the rest will always work! This operator is also very fast on the programming level; it produces very few changes and is therefore quite comfortable to handle. This suggests another application of PhysicalBruteForceOperator as a test operator. In some situations, we would just like to know how an alteration of the output looks. Instead of applying a complicated operator (acting on the performance field and consuming much calculation power, say), we first apply this physical operator and obtain a first impression of how the given weight might influence the mother's performance. If we like the effect, we may go on, first resetting the stemma to the mother's level, and then applying a more sophisticated operator with the weight in question. For completeness of taxonomy, we should add that operators which alter the initial performance map are also called physical operators. They tell initial events where to be mapped in the
physical space. This is however not completely satisfactory since it usually depends upon the way the mother told its initial events where to be mapped. And this could have been done on a more field-theoretic level. The actual change could also involve field-theoretic considerations.
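In contrast to the symbolic case, the brute-force physical operator of equation (39.31) leaves the symbolic event alone and rescales only the mother's output. A minimal Python sketch, assuming that the mother's performance is available as a function from symbolic to physical events and that a dictionary maps each symbolic parameter name $R$ to its physical counterpart $r$ (all names here are hypothetical):

```python
from typing import Callable, Dict

Event = Dict[str, float]

def physical_brute_force(event: Event, d: Dict[str, float],
                         w: Callable[[Event], float],
                         perform: Callable[[Event], Event],
                         physical_name: Dict[str, str]) -> Event:
    """Scale the r-coordinates of the mother's output by w(X) * d_R."""
    output = dict(perform(event))        # the mother's performance of X
    factor = w(event)
    for R, d_R in d.items():
        r = physical_name[R]
        if r in output:
            output[r] = factor * d_R * output[r]
    return output

def mother_performance(x: Event) -> Event:
    """Hypothetical mother: constant tempo 2 on onset, identity on loudness."""
    return {"e": x["E"] / 2.0, "l": x["L"]}

names = {"E": "e", "L": "l"}
note = {"E": 4.0, "L": 60.0}
result = physical_brute_force(note, {"L": 1.0}, lambda x: 1.5,
                              mother_performance, names)
```

The sketch needs neither frame nor field nor initial set, only the mother's output, which is why such an operator is cheap enough to serve as a test operator.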
39.4.4
Field Operators
Summary. Field operators are the very essence of shaping performance. They include a variety of more or less general operators according to the direction where the field has to be altered. The theoretical complexity is paralleled by a computational complexity (numerical integration of ordinary differential equations) which is very important for software implementation of such operators.
Whereas symbolic and physical operators are an easy chapter in performance classification, field operators constitute an entire world of various approaches to the definition and deformation of performance fields. There are several reasons for this. The first is the mathematical complexity of fields: the definition and deformation of a field requires differential-geometric tools of local and global nature. The second is that performance fields have to be inserted in the hierarchies of their LPS. Therefore, the change of a field is always coupled to the fellow fields in the hierarchy. The third reason is that the stemmatic inheritance enforces operators which are capable of acting upon given mother fields of quite general nature: the stemmatic framework has to face unexpected field inheritance. Therefore, the definition of field operators is not completed once a particular type of deformation is implemented; one must also allow for mother fields which do not fit in the naive setup of such a deformation type. The fourth reason is that there is no such thing as a performance field theory in traditional performance theory, and not even in modern approaches, such as the KTH school or the Todd approach. Performance fields are a new paradigm in performance theory, although, as we have shown, they are a completely natural and mandatory setup to control performance on an artistic level and from the more esthetic point of view as put forward by Adorno and Benjamin. We shall start our investigation of performance field operators with the classical tempo operator. We then generalize its setup to cope with more general mother fields. Then we generalize this approach to a more systematic framework, the so-called scalar operator. Another approach will be discussed under the title of basis-pianola operators, which arose from a systematic generalization of a physical operator to quite general situations.
It turns out that this type is an interesting general approach involving Lie derivations of weights along the given mother fields. We then specialize this setup to operators for basis parameter field and pianola parameter field deformations. Although the field operators look very attractive, they have a big practical drawback: If one has to compute integrals of ordinary differential equations in order to solve the integral curve problem for performance fields, one has to control the type of vector field. In fact, not every wild field can be integrated by one and the same numerical ODE routine. For example, we have implemented Runge-Kutta-Fehlberg routines in RUBATO®'s PerformanceRUBETTE®. We often experienced that the application of a field operator, such as the tempo operator or the scalar operator (see sections 39.5 and 39.6, respectively, for these operators), causes a breakdown
of these numerical routines. So the routines should be adapted to the field, but this is beyond this discussion and should be left to numerical ODE specialists.
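To make the numerical issue concrete, here is a small adaptive integrator for a one-dimensional field. It is a sketch under explicit assumptions: it uses classical Runge-Kutta steps with step doubling for error control, which is a simpler stand-in and not the Runge-Kutta-Fehlberg scheme mentioned above, and the function names and tolerances are hypothetical. When the field is too wild for the tolerance, the step size underflows and the routine gives up, which is how the sketch models a breakdown.

```python
import math

def rk4_step(f, x, h):
    """One classical fourth-order Runge-Kutta step for dx/dt = f(x)."""
    k1 = f(x)
    k2 = f(x + 0.5 * h * k1)
    k3 = f(x + 0.5 * h * k2)
    k4 = f(x + h * k3)
    return x + h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0

def integrate(field, x0, t_end, tol=1e-9, h=0.1, h_min=1e-12):
    """Follow the integral curve of `field` from x0 for curve time t_end."""
    t, x = 0.0, x0
    while t_end - t > 1e-14:
        h = min(h, t_end - t)
        coarse = rk4_step(field, x, h)
        fine = rk4_step(field, rk4_step(field, x, 0.5 * h), 0.5 * h)
        error = abs(fine - coarse)
        if error <= tol:                 # accept the step
            t, x = t + h, fine
            if error < tol / 32.0:       # error is tiny: try a larger step
                h *= 2.0
        else:                            # reject and retry with half the step
            h *= 0.5
            if h < h_min:
                raise ArithmeticError("step size underflow: field too wild")
    return x

# A tame field: dx/dt = x has the integral curve x(t) = x0 * exp(t).
end_point = integrate(lambda x: x, 1.0, 1.0)
```

A production routine would adapt the stepper to the field type, which, as said above, is a matter for numerical ODE specialists.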
39.5
Tempo Operator
Summary. The tempo operator is the classical field operator which alters the mother's tempo as a function of given weights. We expose the different variants of such an intervention.
This operator acts on the tempo field $\mathit{Ts}_E$ of a performance cell, so we suppose that $E$ is a member of the space hierarchy $H$ of this cell. We are given a weight $w$ and set
$$\mathrm{TempoOperator}_w(\mathit{Ts})_E(X_E) = w(X_E)\cdot\mathit{Ts}_E(X_E) \tag{39.32}$$
for an $E$-event $X_E$. This operator is only defined if the weight is a strictly positive continuous function on the given onset frame, which we shall from now on tacitly assume. If the tempo operator is clear, we shall write the shorthand $\mathrm{TempoOperator}_w(\mathit{Ts}) = \mathit{Ts}_w$. This operator is as easy as it is problematic if we do not take into consideration other parameters which are canonically tied to onset. Duration is such a parameter. Let us look at the combined articulation field that is influenced by the weight in the tempo operator. Let us first start with the parallel articulation field (see section 33.2.1), i.e., $\mathit{Ts}_D = 2\,\mathit{Ts}_E\circ\alpha_+ - \mathit{Ts}_E$. So if we deform the tempo field $\mathit{Ts}_E$ to $\mathit{Ts}_{w,E}$, the $D$-component of the parallel field is deformed to the $D$-component of the parallel field with the weight contribution, i.e., $\mathit{Ts}_{w,D} = 2\,\mathit{Ts}_{w,E}\circ\alpha_+ - \mathit{Ts}_{w,E} = w\circ\alpha_+\cdot 2\,\mathit{Ts}_E\circ\alpha_+ - w\cdot\mathit{Ts}_E$. However, this formula is not satisfactory since it only works if we are given a parallel field situation. Otherwise, we would destroy the given mother $D$-component and retain only the mother's tempo information. In order to take the mother's duration field component into consideration, we need a more invariant formula. To this end, rewrite
$$2\,\mathit{Ts}_E\circ\alpha_+ = \mathit{Ts}_D + \mathit{Ts}_E \tag{39.33}$$
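and then get the formula
$$\mathit{Ts}_{w,D} = w\circ\alpha_+\cdot(\mathit{Ts}_D + \mathit{Ts}_E) - w\cdot\mathit{Ts}_E = w\circ\alpha_+\cdot\mathit{Ts}_D + (w\circ\alpha_+ - w)\cdot\mathit{Ts}_E \tag{39.34}$$
or for the total articulation field $\mathit{Ts}_{w,ED} = (\mathit{Ts}_{w,E}, \mathit{Ts}_{w,D})$:
$$\mathit{Ts}_{w,ED} = \begin{pmatrix} w & 0 \\ w\circ\alpha_+ - w & w\circ\alpha_+ \end{pmatrix}\mathit{Ts}_{ED}. \tag{39.35}$$
The new articulation field therefore results from the old one by a linear action
$$Q_w = \begin{pmatrix} w & 0 \\ w\circ\alpha_+ - w & w\circ\alpha_+ \end{pmatrix} \tag{39.36}$$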
on the tangent bundle of the $ED$-frame⁵: $\mathit{Ts}_{w,ED} = Q_w\cdot\mathit{Ts}_{ED}$. The shape of the operator $Q_w$ is precisely that which one obtains for the parallel field at tempo $w$. This means that instead of a real tempo, we take the 'weight tempo' $w$ and the associated 'performance' $\wp_w$. Its inverse 'is' $Q_w$, and the new articulation field $\mathit{Ts}_{w,ED}$ is the inverse image of the old articulation field $\mathit{Ts}_{ED}$ under the 'weight performance' $\wp_w$. Usually, the change from the given tempo curve $\mathit{Ts}_E$ to $\mathit{Ts}_{w,E} = w\cdot\mathit{Ts}_E$ also changes the duration $\Delta = \int_{E_0}^{E_1} 1/\mathit{Ts}_E$ to the duration $\Delta_w = \int_{E_0}^{E_1} 1/\mathit{Ts}_{w,E}$. If the integration limits $E_0, E_1$ are initial onsets, this is bad news, and we have to adapt the tempo deformation to the condition that these integrals should coincide. Moreover, one usually also requires that the tempi at the initial points should not change. To this end, one introduces a continuous support function supp which vanishes outside $[E_0, E_1]$, is identically 1 in a slightly smaller interval $[E_0 + \beta, E_1 - \beta]$ for a relatively small positive real number $\beta$, and is the cubic spline with zero slopes on the boundary intervals $[E_0, E_0 + \beta]$, $[E_1 - \beta, E_1]$. Then the weight factor in
$$S(\mathit{Ts}_E, w, t) = \mathit{Ts}_E(t)\,\bigl(1 + \mathrm{supp}(t)\,(w(t) - 1)\bigr)$$
coincides with $w$ within $[E_0 + \beta, E_1 - \beta]$ and is 1 outside $[E_0, E_1]$. We then try the new duration
$$\delta(\mathit{Ts}_E, w) = \int_{E_0}^{E_1} \frac{1}{S(\mathit{Ts}_E, w, t)}\,dt$$
and compare the durations:
$$r(\mathit{Ts}_E, w) = \frac{\delta(\mathit{Ts}_E, w)}{\delta(\mathit{Ts}_E, 1)}.$$
We then try the obvious correction $w_1 = r(\mathit{Ts}_E, w)\cdot w$ and get a weight which gives us nearly the right duration since within the interval $[E_0 + \beta, E_1 - \beta]$, the error is corrected. We look for the new error $r(\mathit{Ts}_E, w_1)$, set the new weight $w_2 = r(\mathit{Ts}_E, w_1)\cdot w_1$, and repeat this procedure until the error becomes small enough. This procedure has been implemented in RUBATO®'s PerformanceRUBETTE®, but we do not know whether it converges⁶.
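The correction loop just described is easy to state in code. The following Python sketch is not the PerformanceRUBETTE® routine; it assumes a midpoint quadrature, a cubic smoothstep for the boundary spline, and hypothetical function names. Because convergence is not known in general, the loop is capped and reports whether it met the tolerance.

```python
def supp(t, e0, e1, beta):
    """Support function: 0 outside [e0, e1], 1 on [e0+beta, e1-beta], cubic between."""
    smooth = lambda u: u * u * (3.0 - 2.0 * u)   # cubic with zero end slopes
    if t <= e0 or t >= e1:
        return 0.0
    if t < e0 + beta:
        return smooth((t - e0) / beta)
    if t > e1 - beta:
        return smooth((e1 - t) / beta)
    return 1.0

def duration(tempo, w, e0, e1, beta, n=2000):
    """Midpoint approximation of the integral of 1/S(tempo, w, t) over [e0, e1]."""
    h = (e1 - e0) / n
    total = 0.0
    for k in range(n):
        t = e0 + (k + 0.5) * h
        total += h / (tempo(t) * (1.0 + supp(t, e0, e1, beta) * (w(t) - 1.0)))
    return total

def correct_weight(tempo, w, e0, e1, beta, tol=1e-6, max_iter=50):
    """Iterate w -> r * w; return the accumulated scale and a convergence flag."""
    target = duration(tempo, lambda t: 1.0, e0, e1, beta)
    scale = 1.0
    for _ in range(max_iter):
        r = duration(tempo, lambda t: scale * w(t), e0, e1, beta) / target
        if abs(r - 1.0) < tol:
            return scale, True
        scale *= r
    return scale, False

# Constant tempo 1 and constant weight 2 on [0, 10] with margin beta = 1.
scale, converged = correct_weight(lambda t: 1.0, lambda t: 2.0, 0.0, 10.0, 1.0)
```

In this simple example the corrected weight is `scale * w`, and the scale tends to 0.5, i.e., the weight is pulled back to 1 because a constant weight leaves no room for redistribution.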
39.6
Scalar Operator
Summary. The scalar operator is a first generalization of the tempo operator to articulation. The conceptual background is that tone parameters can be split into basis and pianola parameters. Onset, pitch, loudness are basis parameters. The corresponding pianola parameters are duration, glissando, crescendo. Pianola components of performance fields are coupled to the corresponding basis components of the performance fields. This coupling can be distorted by making operator weights act on specific parameter sets. The scalar operator does this job.
⁵ Recall that performance fields are defined on open neighborhoods of frames.
⁶ Perhaps this could be settled by use of Banach's fixed-point theorem; we do not know, however, whether the map is a contraction.
Suppose that we are given a space $B$ of basis parameters, and that $P$ is the corresponding space of pianola parameters. We suppose given a weight $w$ on the $B$ space. The scalar operator $\mathrm{ScalarOperator}_w$ deforms a performance field $\mathit{Ts}_{BP} = (\mathit{Ts}_B, \mathit{Ts}_P)$ to a new field $\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = (\mathit{Ts}_B^w, \mathit{Ts}_P^w)$ as follows. We have two parameters: parallel/not parallel, and a two-bit parameter $B$ = yes/no, $P$ = yes/no. The not-parallel case is a deformation of the components of the given field according to the second parameter:
1. B = P = no: We leave the field unchanged.
2. B = P = yes: We set $\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = w\cdot\mathit{Ts}_{BP}$.
3. B = yes, P = no: We set $\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = (w\cdot\mathit{Ts}_B, \mathit{Ts}_P)$.
4. B = no, P = yes: We set $\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = (\mathit{Ts}_B, w\cdot\mathit{Ts}_P)$.
The parallel case means that we refer to the parallel field structure in one way or another. The four cases run as follows:
1. B = P = no: No weight influence, but the parallel structure is installed, i.e.,
$$\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = \partial\mathit{Ts}_B. \tag{39.37}$$
2. B = P = yes: This is the generalized tempo operator, i.e.,
$$\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = \begin{pmatrix} w & 0 \\ w\circ\alpha_+ - w & w\circ\alpha_+ \end{pmatrix}\mathit{Ts}_{BP} \tag{39.38}$$
with the matrix being understood as a block matrix of scalar endomorphisms on the basis and pianola spaces, respectively. The original tempo operator is the special case $B = E$, $P = D$.
3. B = yes, P = no: We set $\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = \partial(w\cdot\mathit{Ts}_B)$.
4. B = no, P = yes: We set $\mathrm{ScalarOperator}_w(\mathit{Ts}_{BP}) = w\cdot\partial\mathit{Ts}_B$.
We should remark that the ScalarOperator and a fortiori the TempoOperator do not change any standard hierarchy. But the initial data could be affected. First of all, the field changes in the articulation plane of onset and duration could throw some kernel events out of the reach of the given initial set, excluding them from being performed. Second, the initial performance could be turned into an incompatible state, a problem we have already dealt with at the end of the TempoOperator discussion. For the general situation of the ScalarOperator, one must develop specific routines to control initial data, which is not a trivial task.
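The case distinction can be sketched for one basis and one pianola coordinate. The Python functions below are an illustration with hypothetical names; they cover the four not-parallel cases and the generalized tempo operator, the latter via the matrix $Q_w$ of equation (39.36), where $w\circ\alpha_+$ is evaluated as `w(b + p)`.

```python
def scalar_operator(field, w, on_basis, on_pianola):
    """Not-parallel case: scale the chosen components of the field by w."""
    def deformed(b, p):
        zb, zp = field(b, p)
        return ((w(b) if on_basis else 1.0) * zb,
                (w(b) if on_pianola else 1.0) * zp)
    return deformed

def generalized_tempo(field, w):
    """Parallel case B = P = yes: apply Q_w = [[w, 0], [w.a+ - w, w.a+]]."""
    def deformed(b, p):
        zb, zp = field(b, p)
        wb, wa = w(b), w(b + p)          # w and w composed with the alterator
        return (wb * zb, (wa - wb) * zb + wa * zp)
    return deformed

def parallel(tempo):
    """Parallel field of a tempo curve: (T(b), 2 T(b + p) - T(b))."""
    return lambda b, p: (tempo(b), 2.0 * tempo(b + p) - tempo(b))

T = lambda b: 1.0 + 0.1 * b              # hypothetical tempo curve
w = lambda b: 1.0 + 0.5 * b              # hypothetical positive weight
via_matrix = generalized_tempo(parallel(T), w)(1.0, 2.0)
via_tempo = parallel(lambda b: w(b) * T(b))(1.0, 2.0)
basis_only = scalar_operator(parallel(T), w, True, False)(1.0, 2.0)
```

On a parallel mother field, `via_matrix` and `via_tempo` agree up to rounding, which illustrates the remark that $Q_w$ has the shape obtained for the parallel field at tempo $w$.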
39.7
The Theory of Basis-Pianola Operators
Summary. The tempo and scalar operators can be generalized to operators which act on the basis and pianola parameter grouping and their coupling on the level of performance fields. The formalism is deduced from the standard situation and stated in its general shape. The point of this approach is the introduction of Lie derivative as a general device for the production
of deformations of performance fields by use of scalar potential functions. The Lie formalism is applied when stressing basis rather than pianola parameters. Although it is a general formalism for operators, we do not know whether its application can generate all important deformation operators needed for a musically satisfactory performance construction.
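The standard situation which motivates the basis-pianola operators is this: We are given a performance map $\wp(X, Y) = (x(X), y(X, Y))$ on variables $X, Y$ of two disjoint spaces $U, V$, with a hierarchy projection $U \oplus V \to U$. For example, this could be an articulation ($U = E$, $V = D$) or a direct product ($U = EL$, $V = H$), etc. We now want to deform this performance on the second factor, and we are using a positive deformation weight function $\lambda$ on $U$. The deformed performance map is defined by $\wp_\lambda(X, Y) = (x(X), \lambda(X)\cdot y(X, Y))$. So this is a kind of physical operator we are familiar with. Let us calculate the Jacobian of this transformation (supposing that everything is smooth). The Jacobian for $\lambda = 1$, the start situation, is
$$J\wp_1 = J\wp = \begin{pmatrix} \partial_X x & 0 \\ \partial_X y & \partial_Y y \end{pmatrix} = \begin{pmatrix} A & 0 \\ B & C \end{pmatrix} \tag{39.39}$$
where $\partial_X x$ is the submatrix with the partial derivatives of all $x$ coordinates with respect to the $X$ arguments, etc. The inverse Jacobian is
$$(J\wp)^{-1} = \begin{pmatrix} A^{-1} & 0 \\ -C^{-1}BA^{-1} & C^{-1} \end{pmatrix}. \tag{39.40}$$
And the general inverse Jacobian is
$$(J\wp_\lambda)^{-1} = \begin{pmatrix} A^{-1} & 0 \\ -C^{-1}BA^{-1} - \lambda^{-1}C^{-1}\cdot(d\lambda\otimes y)\cdot A^{-1} & \lambda^{-1}C^{-1} \end{pmatrix} \tag{39.41}$$
with the dual gradient $d\lambda = \mathrm{grad}\,\lambda^*$ and the identification⁷ $\mathrm{Lin}(U, V) \cong U^*\otimes V$. The performance field is calculated by use of the diagonal unit vector $\Delta = (\Delta_X, \Delta_Y)$ and yields
$$\mathit{Ts}_\lambda = (J\wp_\lambda)^{-1}\Delta = \bigl(A^{-1}\Delta_X,\ -C^{-1}BA^{-1}\Delta_X - \lambda^{-1}C^{-1}\cdot(d\lambda\otimes y)\cdot A^{-1}\Delta_X + \lambda^{-1}C^{-1}\Delta_Y\bigr).$$
If we set $\mathit{Ts}(X, Y) = (Z(X), W(X, Y))$ and $\mathit{Ts}_\lambda(X, Y) = (Z(X), W_\lambda(X, Y))$, we get
$$W_\lambda(X, Y) = W(X, Y) - \lambda^{-1}C^{-1}\cdot(d\lambda\otimes y)\cdot Z(X) + \Bigl(\frac{1}{\lambda} - 1\Bigr)C^{-1}\Delta_Y = W(X, Y) - L_Z(\Lambda)\,C^{-1}y + \Bigl(\frac{1}{e^{\Lambda}} - 1\Bigr)C^{-1}\Delta_Y \tag{39.42}$$
with the Lie derivative $L_Z(\Lambda)$ of the function $\Lambda = \ln(\lambda)$, knowing that $\lambda$ is a strictly positive function. Everything is taken at the argument $X$, $Y$, or $(X, Y)$, respectively.
⁷ See appendix E.3.2.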
This gives us the formula
$$\mathit{Ts}_\Lambda = \mathit{Ts} - L_Z(\Lambda)\,i_V C^{-1}y - (1 - e^{-\Lambda})\,i_V C^{-1}\Delta_Y \tag{39.43}$$
where $i_V$ is the canonical injection $V \to U \oplus V$. So we obtain a deformation of the original field $\mathit{Ts}$ by two additive terms $L_Z(\Lambda)\,i_V C^{-1}y$ and $(1 - e^{-\Lambda})\,i_V C^{-1}\Delta_Y$, which we want to interpret in the following sections. Both terms involve the weight function. The first term $L_Z(\Lambda)\,i_V C^{-1}y$ involves the basis field $Z$, while the second term only involves contributions from the $Y$ component of the field. Therefore, we call the first term the basis deformation, whereas the second is called the pianola deformation.
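Formula (39.43) can be checked numerically in the simplest case of one basis and one pianola coordinate. The Python sketch below is only a sanity check under assumptions: the sample maps `x`, `y` and the weight `lam` are hypothetical, and derivatives are taken by central finite differences. It computes the deformed field once directly from the Jacobian of $\wp_\lambda$ and once from the two deformation terms.

```python
import math

def diff(f, v, h=1e-5):
    """Central finite difference of f at v."""
    return (f(v + h) - f(v - h)) / (2.0 * h)

# Hypothetical smooth performance map p(X, Y) = (x(X), y(X, Y)) and weight lam > 0.
def x(X): return 2.0 * X + 0.1 * X * X
def y(X, Y): return 0.5 * X + 3.0 * Y + 0.2 * X * Y
def lam(X): return 1.0 + 0.3 * math.sin(X)

def field(xf, yf, X, Y):
    """Ts = (Jp)^(-1) applied to the diagonal vector (1, 1): returns (Z, W)."""
    A = diff(xf, X)
    B = diff(lambda s: yf(s, Y), X)
    C = diff(lambda s: yf(X, s), Y)
    Z = 1.0 / A
    return Z, (1.0 - B * Z) / C

def deformed_direct(X, Y):
    """Field of the deformed map (x, lam * y), computed from its own Jacobian."""
    return field(x, lambda s, t: lam(s) * y(s, t), X, Y)

def deformed_formula(X, Y):
    """Field according to (39.43): basis term plus pianola term."""
    Z, W = field(x, y, X, Y)
    C = diff(lambda s: y(X, s), Y)
    Lam = math.log(lam(X))
    lie = Z * diff(lambda s: math.log(lam(s)), X)      # L_Z(Lambda)
    return Z, W - lie * y(X, Y) / C - (1.0 - math.exp(-Lam)) / C

direct = deformed_direct(1.0, 2.0)
formula = deformed_formula(1.0, 2.0)
```

Both computations agree up to the finite-difference error, which supports the sign conventions of the two terms.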
39.7.1
Basis Specialization
Summary. The general basis-pianola theory is specialized to basis parameters and explicated through concrete formulas.
If we have a weight $\Lambda \in U_\epsilon(0)$ in a very small neighborhood of 0 (or else $\lambda$ in a small neighborhood of 1), we may neglect the pianola term in formula (39.43). If further the performance map in the second variable is affine, i.e., $y = A\cdot Y + B$, and does not depend on the first variable $X$ as is the case with pianola coordinates $Y$, then $C^{-1} = \text{const.} = A^{-1}$, i.e., $C^{-1}y = Y + \text{const.}$ So we have a deformation of type $L_Z(\Lambda)\,i_V C^{-1}y = L_Z(\Lambda)\,i_V(Y + \text{const.})$. For example, consider the two-dimensional situation $U = E$, $V = L$ of two basis spaces. The performance map for a weight $\Lambda \in U_\epsilon(0)$ on $E$ is $\wp_\Lambda(X, Y) = (x(X), e^{\Lambda}\cdot y(Y))$, where we start from a primavista performance map $\wp = \wp_0$. We may therefore suppose that $y = A\cdot Y$. Then one has the primavista product field $\mathit{Ts} = T \times D$ of tempo $T$ and dynamics $D$, and the deformed field is
$$(T \times D)_\Lambda = T \times D - L_T(\Lambda)\,i_L Y = T \times D - T\cdot\frac{\partial\Lambda}{\partial E}\,i_L Y.$$
In other words, we deform the given tempo by the gradient of $\Lambda$ and project this component to the loudness field component via a field $i_L Y$. So musically speaking, the loudness field deformation is essentially controlled by the change of $\Lambda$ and the mother tempo.
Let us now propose a generalization of the basis deformation. We are given
• two not necessarily disjoint spaces $U, V$ of the space hierarchy,
• a weight $\Lambda$ on the first (the basis) space $U$,
• an affine directional endomorphism $\mathrm{Dir}(Y) \in V@V$ of the $V$ space.
With these data, we define a field deformation of the performance field $\mathit{Ts}$ on a superspace of $U \cup V$ in the hierarchy by the formula
$$\mathit{Ts}_{\Lambda,\mathrm{Dir}} = \mathit{Ts} - L_{\mathit{Ts}_U}\Lambda\cdot i_V\,\mathrm{Dir} \tag{39.44}$$
where the argument of Dir is the $V$-component, and that of the Lie derivative is the $U$-component of the total argument, whereas $i_V$ is the embedding of $V$ in the space of $\mathit{Ts}$. The following is immediate from the linearity of the Lie derivative in both arguments.
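For one-dimensional disjoint spaces $U$ and $V$, formula (39.44) reduces to a one-line computation. The Python sketch below is illustrative only; the function name and the sample tempo, dynamics field, weight, and direction are hypothetical, and the derivative of $\Lambda$ is passed in explicitly.

```python
def basis_operator(field_u, field_v, d_lambda, direction):
    """Deform (Ts_U, Ts_V) to (Ts_U, Ts_V - L_{Ts_U}(Lambda) * Dir)."""
    def deformed(x, y):
        lie = field_u(x) * d_lambda(x)   # Lie derivative in one dimension
        return (field_u(x), field_v(x, y) - lie * direction(y))
    return deformed

tempo = lambda x: 2.0                    # hypothetical primavista tempo T
dynamics = lambda x, y: 1.0              # hypothetical primavista dynamics field D
d_lambda = lambda x: 0.1                 # derivative of Lambda(E) = 0.1 * E
identity_direction = lambda y: y         # Dir(Y) = Y

deformed_field = basis_operator(tempo, dynamics, d_lambda, identity_direction)
value = deformed_field(0.0, 3.0)
```

Here the loudness component drops from 1 to about 0.4 at loudness coordinate 3, since the tempo 2 times the slope 0.1 of the weight is multiplied by the direction value 3.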
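Lemma 53 With the above notation:
(i) The basis operator is linear in the performance fields, i.e., $(\mu\,\mathit{Ts}_1 + \nu\,\mathit{Ts}_2)_{\Lambda,\mathrm{Dir}} = \mu\,\mathit{Ts}_{1\,\Lambda,\mathrm{Dir}} + \nu\,\mathit{Ts}_{2\,\Lambda,\mathrm{Dir}}$.
(ii) If $U$ and $V$ are disjoint, and if $\Lambda_1, \Lambda_2$ are two weights, then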
Example 57 An example of a non-disjoint union of spaces $U, V$ is the elementary deformation of tempo by a positive $C^1$ weight $\gamma$. On onset arguments $X$ in a frame of positive onsets, we set $\Lambda(X) = \int_0^X (1 - \gamma)$. This implies $\gamma T = T - L_T(\Lambda)\cdot 1_E$, and we have another example of a basis operator.
Example 58 For this example, we suppose given a two-dimensional $C^2$ performance map $\wp_{EH}: EH \to eh$ on the plane of onset and pitch with a hierarchy $EH \to E$ over the onset performance $\wp_E: E \to e$ with tempo curve $T$. So the performance field $\mathit{Ts}_{EH}$ is $C^1$. Setting $I_\wp = (\partial_H h)^{-1}$, $T = (\partial_E e)^{-1}$, the field reads
$$\mathit{Ts}_{EH} = \bigl(T,\ I_\wp\cdot(1 - T\cdot\partial_E h)\bigr). \tag{39.45}$$
We want to view such a field as a deformation of the primavista product field $T \times I$ which is composed of the same tempo factor $T$ and an intonation factor $I$, which is also $C^1$ and strictly positive, as usual for primavista fields. We are looking for a weight $\Lambda$ on the onset-pitch frame $R$ and a pitch shift $e^{-Y_0} \in H@H$ such that the deformation $L_{T\times I}(\Lambda)\,i_H e^{-Y_0}$ of the primavista field yields $\mathit{Ts}_{EH}$. This defines a linear partial differential equation (PDE) in the onset variable $X$ and the pitch variable $Y$:
$$T\cdot\partial_X\Lambda + I\cdot\partial_Y\Lambda = Q. \tag{39.46}$$
This equation has $C^1$ coefficients $T, I, Q$ if we let $Y_0$ be smaller than the lower boundary value of the pitch frame interval. Using the method of characteristic curves (see appendix 1.6) for quasilinear PDEs, we can see that there is a solution of equation (39.46). In fact, the characteristic curve projection ODE reduces to the pair $d_tX = T(X)$, $d_tY = I(Y)$, whereas the third curve component $Z(t)$ is defined by the ODE $d_tZ = Q(X, Y)$. This means that the characteristic curve projection onto the $XY$-space is an integral curve of the primavista field $T \times I$. Clearly, a curve $\Gamma$ transversal to the characteristic curves exists since the Jacobian criterion $T_\Gamma \nparallel T \times I$ can be met for the non-vanishing field $T \times I$. We therefore have this result:
Proposition 59 Let $\wp_{EH}: EH \to eh$ be a two-dimensional $C^2$ performance map on the plane of onset and pitch with a hierarchy $EH \to E$ over the onset performance $\wp_E: E \to e$ with tempo curve $T$. For the primavista product field $T \times I$ which is composed of the same tempo factor $T$ and an intonation factor $I$, which is also $C^1$ and strictly positive, there is a $C^1$ weight $\Lambda$ on the $EH$-frame of this performance such that the performance field $\mathit{Ts}_{EH}$ is a deformation of the primavista field $T \times I$ by a basis operator:
$$\mathit{Ts}_{EH} = (T \times I)_{\Lambda,\,e^{-Y_0}}. \tag{39.47}$$
The basis operator is designed for distributing weight information from the "basis" subspace $U$ over any other space $V$, independently of whether the latter is also in the hierarchy or not. This creates a considerable freedom of shaping performance. The way this shaping is related to the given weight is the Lie derivative with respect to the performance field $\mathit{Ts}_U$ on $U$. This derivative also measures the angle between the weight's gradient and $\mathit{Ts}_U$. If the two are perpendicular, the derivative vanishes and the operator has no effect.
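The remark on perpendicularity is easy to verify, since the Lie derivative of a function along a field is the pairing of its gradient with the field. A minimal Python sketch with hypothetical constant data on $U = EH$:

```python
def lie_derivative(field, gradient, point):
    """L_Ts(Lambda) at a point: pairing of grad(Lambda) with the field vector."""
    return sum(g * z for g, z in zip(gradient(point), field(point)))

T, I = 2.0, 0.5                              # hypothetical constant tempo, intonation
field = lambda p: (T, I)                     # Ts_U on U = EH
grad_along_onset = lambda p: (1.0, 0.0)      # Lambda(E, H) = E
grad_perpendicular = lambda p: (I, -T)       # Lambda(E, H) = I*E - T*H

effective = lie_derivative(field, grad_along_onset, (0.0, 60.0))
ineffective = lie_derivative(field, grad_perpendicular, (0.0, 60.0))
```

With the second weight, the basis deformation vanishes everywhere although the weight itself is far from constant.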
39.7.1.1
Deforming Hierarchies
Summary. We discuss the change in an existing hierarchy after the application of a basis operator.
The most dramatic effect of the basis operator is the deformation of the given hierarchy. For example, in the situation of proposition 59, the projection $EH \to H$ is destroyed since the weight $\Lambda$ is, in general, a function not only of $H$ or of $E$, but of both. So only the projection $EH \to E$ survives this deformation. Let us describe more systematically which hierarchy spaces disappear a priori by a basis operator deformation. Suppose that we have a hierarchy space $U$, a weight $\Lambda$ on $U$, any other space $V$ within the top hierarchy space, a directional endomorphism $\mathrm{Dir} = e^B\cdot A$, and any hierarchy space $W$ which we want to test for survival after the subtraction of the basis deformation $L_{\mathit{Ts}_U}(\Lambda)\,i_V\,\mathrm{Dir}$. Consider the projection $p: V \to V \cap W$ onto the intersection space, including the empty intersection which then defines the projection onto the zero space. Let $[p, A] = p\cdot A - A\cdot p$ denote the commutator endomorphism of $p$ and $A$ on $V$.
Proposition 60 With the above notation, the mother hierarchy space $W$ remains a priori alive (i.e., a member of the daughter hierarchy deduced from the mother hierarchy) after the basis deformation $L_{\mathit{Ts}_U}(\Lambda)\,i_V\,\mathrm{Dir}$ iff either $p = 0$, i.e., $V \cap W = \emptyset$, or $p[p, A] = 0$ and $U \subset W$. In particular, for $[p, A] = 0$ and $U \subset W$, $W$ remains alive.
Proof. Suppose that $W$ remains alive. If $p \neq 0$, we have at least one coordinate that is common to $V$ and $W$. On that coordinate, the functional dependence on $U$ is inherited via $L_{\mathit{Ts}_U}(\Lambda)$. Therefore we must have $U \subset W$. Since no functional dependence on $W - V$ arguments can be the case on the coordinates in $V \cap W$, and since the constant part of Dir is irrelevant here, we must have $p\cdot A\cdot(1 - p) = 0$, which is the condition $p[p, A] = 0$. On the other hand, if $p = 0$ everything is clear, and under the conditions $p[p, A] = 0$ and $U \subset W$, the deformation arguments all stem from $W$, and we have saved the life of $W$, QED.
Example 59 Let the top coordinate space be all six usual basis and pianola parameters $EHLDGC$. Let a weight $\Lambda$ act on $U = EH$, take $V = ED$, whereas the directional endomorphism is $\mathrm{Dir} = e^B\cdot A$ with a rotation $A$ as its linear part. Suppose that the hierarchy is the parallel hierarchy, i.e., the hierarchy generated by the basis hierarchy $T \times I \times D$, $T \times I$, $T \times D$, $T$ and the parallel fields $\partial(T \times I \times D)$, $\partial(T \times I)$, $\partial(T \times D)$, $\partial T$ with the corresponding projections. Since the rotation $A$ has no proper invariant subspaces, we must have either $ED \cap W = \emptyset$ or else (because of $p[p, A] = 0$) $ED \subset W$. But every field in our hierarchy contains $E$, hence $U \cup V = EHD \subset W$. This is only the case for the sub-hierarchy $EHD$, $EHDG$, $EHLD$, $EHLDG$, $EHLDC$, $EHLDGC$.
39.7.1.2
Lie Derivatives
Summary. Basis-pianola operator theory leads to Lie derivatives as a device for operator definition. We discuss the Lie formalism in its realization as a component of a performance grammar.
The appearance of the Lie derivative in this context is quite surprising. Its usage is well known from classical mechanics, for example, where the Lie derivative of a function with respect to a Hamiltonian vector field is related to the Hamiltonian function H via the Poisson bracket [2]. Presently, we do not know of any analogous structures of Hamiltonian or Lagrangian type in performance field theory. But it is good to have this perspective in mind for a future 'dynamics of performance'. We should however observe that the Lie derivative L'IS associated with a vector field 'IS induces an isomorphism L? : X R .:+ Der(F(R)) between the real vector space XR of smooth vector fields over the frame R and the vector space Der(F(R)) of derivations on the real algebra F(R) of smooth functions (see appendix 1.2.4). So L'IS can be identified with 'IS, and the basis deformation means taking into account the 'weight' L'IS(A) deduced from the weight A. In this sense, weights become natural mathematical objects associated with performance fields: They are just the natural objects, these fields act upon qua derivations, they are not only justified by the quantification argument given when we introduced weights in section 39.l. In other words:
Thesis 6 A performance field is not only a construction principle for the performance map, but equivalently an 'interpretation of weights'; this is effectively the mathematical transfiguration of the rational approach to performance.
This thesis suggests that one should study the natural properties of the Lie algebra $\mathrm{Der}(\mathcal{F}(R))$ with respect to performance theory, in particular the question of what is the musical interpretation of the Lie bracket $[Ts_1, Ts_2]$ of two performance fields. We have to leave this subject to future research.
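For orientation, the standard differential-geometric definitions behind this question can be written out as follows. This is only a sketch: we write $Ts$ for a performance field and $\Lambda$ for a weight, and the coordinates $x_1, \ldots, x_n$ on the frame R as well as the component notation $Ts^i$ are assumptions made for illustration, not notation taken from the text.

```latex
L_{Ts}\Lambda = \sum_{i=1}^{n} Ts^{i}\,\frac{\partial \Lambda}{\partial x_i},
\qquad
L_{[Ts_1, Ts_2]} = L_{Ts_1} \circ L_{Ts_2} - L_{Ts_2} \circ L_{Ts_1},
\qquad
[Ts_1, Ts_2]^{i} = \sum_{j=1}^{n}\left( Ts_1^{j}\,\frac{\partial Ts_2^{i}}{\partial x_j} - Ts_2^{j}\,\frac{\partial Ts_1^{i}}{\partial x_j} \right).
```

Under the identification of fields with derivations, the bracket thus measures the failure of two 'interpretations of weights' to commute when they are applied one after the other.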
Exercise 81 Consider the performance $\wp : ED \to ed : (X, Y) \mapsto (x(X), f(d))$, $d = x(X + Y) - x(X)$, which is a functional change in duration, built upon the parallel performance, with a $C^1$ invertible deformation function f of physical duration d. This is an example of a physical operator. Show that its field is
$Ts = (\,\ldots\,)\,\partial T$, (39.48)
i.e., a linear automorphism of the tangent bundle, of which we have already seen an example for the tempo operator. Observe, however, that the automorphism is also a function of the tempo curve T! Show by use of the characteristics method for quasilinear PDEs that this deformation of the parallel field $\partial T$ can also be obtained by a basis operator with U = ED, V = D.
39.7.2
Pianola Specialization
Summary. The general basis-pianola theory is restricted to pianola parameters and explicated through concrete formulas.
Compared to the basis deformation, the second contribution $(1 - e^{-\Lambda})\, i_V C^{-1} A Y$ in the general deformation formula (39.43) plays a different role. Whereas the basis contribution is sensitive to the gradient of $\Lambda$, i.e., its local changes, the second contribution is sensitive to the absolute values of the weight, so this contribution is relevant if the weight changes little, but has values different from zero. We give a more precise interpretation of this contribution in the case of a basis-pianola-space situation, i.e., U is a space of basis parameters, and V is the corresponding space $P_U$ of pianola parameters. We then have the alterator $\alpha_+ : U \oplus P_U \to U$, and we have a canonical isomorphism $T : U \xrightarrow{\sim} P_U$. If the original field is defined on $U \oplus P_U$ and is a parallel field $\partial Ts_U$, then we have $C^{-1} A Y = T \circ Ts_U \circ \alpha_+$. Therefore we obtain the pianola operator for this special space configuration:
(39.49)
This formula only involves a weight on any subspace of the top space and a hierarchy space U consisting of basis parameters, together with the corresponding pianola space $P_U$ which must also be a subspace (but not necessarily hierarchic!) of the top space.
39.8
Locally Linear Grammars
Summary. According to section 38.3.4, interaction between "inherited" performance score structures of sisters, or farther relatives, can be envisaged. We describe this formalism, which is a basic approach in inverse performance theory (see section 46.2).
Until now, we have only considered operators which are directly related to the mother data, and not to farther relatives, such as sisters, or daughters of sisters, etc. In the following discussion, we shall present an essentially linear model for such a more global interconnection of a stemma's LPS. We start with a stemma, i.e., a local performance score A whose graph is an undirected tree. We want to forget about the mother of A and concentrate on this tree, which we denote by $T_0$. For the following construction, we need the stemma quiver⁸ $T = T_A$ associated with A:
Definition 109 The stemma quiver is a finite directed graph T = (V, A) with vertex set V and arrow set A, including multiple arrows and loops. It is constructed as follows. We start with a directed tree $T_0 = (V, A_0)$ with root r, i.e., each vertex can be reached by a unique path starting from the root. If $x \to y$ is an arrow of $T_0$, we say that x is the mother of y and y is a daughter of x (in combinatorics they are known as 'father' and 'son', respectively; here we try to be politically correct). If we have a path $x \to y \to z$, then z is a granddaughter of x, while x is the grandmother of z, and so on. For a vertex x of $T_0$, the set of vertexes which are daughters of x is denoted by $D_x(T_0)$. The vertexes x which are not mothers, i.e., $D_x(T_0) = \emptyset$, are called final (the 'leaves' in graph theory). Similarly, for each vertex $x \in V(T)$ we define $M_x(T_0) \subseteq V(T_0)$ as the set of vertexes lying on the unique path from x to the root (x included). In order to define the stemma quiver T, the directed tree $T_0$ is enriched by the following set of arrows (no vertexes added): First, each vertex x is given a loop at x, and for any couple of sisters $x_1, x_2$, i.e., of daughters of a common mother y, we add an arrow $x_1 \to x_2$. The resulting quiver T = (V, A) is called a stemma quiver of the tree $T_0$, which is uniquely determined by the stemma quiver and is called the stemma tree. We therefore may define $D_x(T) = D_x(T_0)$ and $M_x(T) = M_x(T_0)$.
⁸ See definition 123 in appendix C.2.2 for the quiver concept.
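The construction in definition 109 can be sketched in a few lines of Python. This is a minimal illustration, not code from RUBATO®: the input format (a dictionary from mothers to lists of daughters), the function names, and the arrow labels "tree", "sister", and "loop" are all assumptions introduced here.

```python
from itertools import permutations

def stemma_quiver(tree):
    """Build the arrow list of a stemma quiver from a rooted tree.

    `tree` maps each mother to the list of her daughters (hypothetical
    input format). Arrows are returned as (source, target, kind) triples.
    """
    vertices = set(tree)
    for daughters in tree.values():
        vertices.update(daughters)
    arrows = []
    for mother, daughters in tree.items():
        # Original tree arrows mother -> daughter.
        arrows += [(mother, d, "tree") for d in daughters]
        # One arrow in each direction for every couple of distinct sisters.
        arrows += [(x1, x2, "sister") for x1, x2 in permutations(daughters, 2)]
    # One loop at every vertex.
    arrows += [(x, x, "loop") for x in sorted(vertices)]
    return vertices, arrows

def ancestors(tree, x):
    """M_x: the vertexes on the unique path from x up to the root, x included."""
    mother_of = {d: m for m, ds in tree.items() for d in ds}
    path = [x]
    while path[-1] in mother_of:
        path.append(mother_of[path[-1]])
    return path

# Toy stemma tree: root r with daughters a and b; a has one daughter c.
tree = {"r": ["a", "b"], "a": ["c"]}
vertices, arrows = stemma_quiver(tree)
```

For the toy tree, the quiver has three tree arrows, two sister arrows (a to b and b to a), and four loops, and `ancestors(tree, "c")` lists c, a, r.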
Definition 110 With these graph-theoretical data, a locally $\mathbb{R}$-linear grammar is a family of $\mathbb{R}$-linear representations⁹ of a stemma quiver T which is defined by the following data:
1. For each vertex $x \in V(T)$ let $A_x, B_x$ be two real vector spaces, $B_x$ of finite dimension $s_x$.
2. Each mother-daughter arrow $x \to y$ is represented by a surjective linear map $r_{x,y} : A_x \to A_y$.
3. For all $x \in V(T)$ let $\varphi_x : B_x \to \mathrm{End}(A_x)$ be an affine map, i.e., a linear map $\varphi_x^0$, followed by a displacement by $\varphi_x^1 \in \mathrm{End}(A_x)$. So each loop at x is represented by a family of endomorphisms $(\varphi_x(b) : A_x \to A_x)_{b \in B_x}$, parametrized by the parameter space $B_x$.
4. For all pairs of sisters $x, y \in D_m(T)$, the sister arrow $x \to y$ is represented by an isomorphism $i_{x,y} : A_x \to A_y$ with $i_{y,x} = i_{x,y}^{-1}$ and $i_{x,x} = Id_{A_x}$.
⁹ These are representations of the quiver algebra over $\mathbb{R}$; see appendix E.2.1 for the concept of a linear representation.
For concrete performance field configurations associated with A this axiomatic setup is realized as follows. For the LPS of the stemma tree, we suppose a number n of real parameters for the top space S of the LPS hierarchies. We concentrate on this top space S of all the LPS, and do not discuss the other cell data of these space hierarchies. For each vertex x of $T_0$, we have the frame $R_x = [l_{x1}, u_{x1}] \times \ldots \times [l_{xn}, u_{xn}]$ of the top space of its hierarchy. For each mother m and daughter x, we suppose that $R_x \subset R_m$, and that for each couple of sisters $x_1, x_2$, $R_{x_1} \cap R_{x_2} = \emptyset$. This corresponds to a restriction of a larger portion of a musical score to a disjoint grouping of smaller portions. Here is the realization of our above system (properties 1-4) of quiver representations: Consider the vector space $\mathcal{F}_x$ of $C^\infty$ functions on $R_x$. We then set $A_x = \mathrm{Der}(\mathcal{F}_x)$, the space of derivations, i.e., the $C^\infty$ vector fields on $R_x$. The surjective maps $r_{x,y}$ are defined as the restrictions of vector fields on the mother's rectangle $R_x$ to the daughter's rectangle $R_y$. To define the representations for a sister arrow $x_1 \to x_2$, consider the unique affine morphism $a_{x_2,x_1} : R_{x_2} \to R_{x_1}$ on the sisters' rectangles such that the respective vertexes are mapped onto
each other. Then the sister arrow representations are isomorphisms $i_{x_1,x_2} : A_{x_1} \to A_{x_2}$ defined by the transport of a vector field Z on $R_{x_1}$ to $Z \cdot a_{x_2,x_1}$ on $R_{x_2}$. To define the operation of a parameter family, first take a vector field $Z \in \mathrm{Der}(\mathcal{F}_x)$ and a weight function $\Lambda \in \mathcal{F}_x$. For $Dir \in S@S$, consider the corresponding vector field $r_{Dir}(t) = (t, Dir(t))$ on $R_x$. Then we have a new vector field $Z - L_Z\Lambda \cdot r_{Dir}$, corresponding to a basis deformation. This is an $\mathbb{R}$-linear operator on Z, and the 'deformation' part $L_Z\Lambda \cdot r_{Dir}$ is $\mathbb{R}$-bilinear in the weight function and the affine endomorphism. We now take a finite-dimensional subspace $W_x$ of $\mathcal{F}_x$ which in performance theory represents the weight functions issued from analyses of metrical, motivic, and harmonic structures of the given score. We now set $B_x = W_x \otimes_{\mathbb{R}} S@S$, and we obtain an $\mathbb{R}$-linear map $\varphi_x^0 : B_x \to \mathrm{End}(A_x)$ defined by $\varphi_x^0(\Lambda \otimes Dir)(Z) = -L_Z\Lambda \cdot r_{Dir}$. Setting $\varphi_x^1 = Id_{A_x}$, we have defined the required affine map $\varphi_x = \varphi_x^1 + \varphi_x^0 : B_x \to \mathrm{End}(A_x)$ with
$\varphi_x(\Lambda \otimes Dir)(Z) = Z - L_Z\Lambda \cdot r_{Dir}$.
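The deformation of a field Z by a weight and an affine endomorphism Dir, i.e., the passage from Z to Z minus the Lie derivative of the weight along Z times the field of Dir, can be tried out numerically. The following is a rough sketch under stated assumptions: fields and weights are plain Python functions on coordinate lists, the Lie derivative is approximated by central differences, and all names (`lie_derivative`, `deform`, the toy field, weight, and direction) are hypothetical illustrations rather than parts of any RUBATO® code.

```python
def lie_derivative(Z, weight, t, h=1e-6):
    """Approximate (L_Z weight)(t) = grad(weight)(t) . Z(t) by central differences."""
    z = Z(t)
    total = 0.0
    for k in range(len(t)):
        up = list(t)
        up[k] += h
        down = list(t)
        down[k] -= h
        total += z[k] * (weight(up) - weight(down)) / (2 * h)
    return total

def deform(Z, weight, direction):
    """Return the field t -> Z(t) - (L_Z weight)(t) * direction(t)."""
    def new_field(t):
        factor = lie_derivative(Z, weight, t)
        return [zk - factor * dk for zk, dk in zip(Z(t), direction(t))]
    return new_field

# Hypothetical toy data on a two-dimensional frame with coordinates (E, D).
Z = lambda t: [1.0, 1.0]                      # constant field
weight = lambda t: 0.5 * t[0] + t[1] ** 2     # weight function
direction = lambda t: [t[0], 0.0]             # linear endomorphism (E, D) -> (E, 0)

deformed = deform(Z, weight, direction)
value = deformed([1.0, 2.0])
```

At the point (1, 2) the gradient of the toy weight is (0.5, 4), so the Lie derivative along Z is 4.5 and the deformed field is approximately (1 - 4.5, 1) = (-3.5, 1). The deformation only acts in the direction singled out by Dir, and its size is governed by the local change of the weight along Z.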
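Let $m \in V(T)$ be a mother. Then for each daughter $x \in D_m(T)$, we define a triaffine (affine in each argument) map
$I_x : A_m \times C^{\#D_m(T)} \times \prod_{y \in D_m(T)} B_y \to A_x$, (39.50)
by
$(a_m, (c_{y,x})_{y \in D_m(T)}, (b_y)_{y \in D_m(T)}) \mapsto \sum_{y \in D_m(T)} c_{y,x}\, i_{y,x}(\varphi_y(b_y)(r_{m,y}(a_m))) = \sum_{y \in D_m(T)} c_{y,x}\, i_{y,x}(\varphi_y^0(b_y)(r_{m,y}(a_m))) + \sum_{y \in D_m(T)} c_{y,x}\, i_{y,x}(\varphi_y^1(r_{m,y}(a_m)))$.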
Referring to the above example, this formula describes the following: In order to determine the performance field on the frame $R_x$, we use the field of its mother m. We first restrict that field to any of its daughters y and get the fields $r_{m,y}(a_m)$. These sister fields to daughter x are then deformed under the endomorphisms $\varphi_y(b_y) = \varphi_y^1 + \varphi_y^0(b_y)$ induced by the system parameters $b_y$. These deformed fields are then transported to x and weighted by the factors $c_{y,x}$. These sister fields influence the final value of the field at daughter x. Musically, this means that the performance at x is influenced by surrounding sister fields, which are typically the fields of past and future times (past or future periods, bars, etc.). We shall pursue this model in the course of the inverse performance theory of chapter 46.
A final word on a perspective of generalized stemmata which seems to be suggested by the locally linear grammars as discussed above: One could envisage continuous stemmata. They are based on a generalization of the stemma's ramification structure to one-parameter families and a narrowing of the daughters' extents to 'infinitesimal' quantities. This construction would take care of the fact that psychologically, interaction between neighboring performance moments is continuously 'updated'. A theory of continuous stemmata is still pending.
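Returning to formula (39.50), the four steps (restrict the mother's field, deform, transport, weight and sum) can be mimicked with a toy computation. This sketch is deliberately simplified and rests on assumptions: fields are replaced by finite lists of numbers, the restriction and transport maps are passed in as ordinary functions, and every name (`daughter_field`, `shrink`, the dictionaries) is hypothetical.

```python
def daughter_field(a_m, daughters, c, b, restrict, phi, transport):
    """Evaluate the sum over y of c[y] * transport[y](phi[y](b[y])(restrict[y](a_m)))."""
    total = None
    for y in daughters:
        restricted = restrict[y](a_m)          # restriction of the mother's field to y
        deformed = phi[y](b[y])(restricted)    # endomorphism selected by the parameter b[y]
        moved = transport[y](deformed)         # transport from sister y to daughter x
        term = [c[y] * v for v in moved]
        total = term if total is None else [s + t for s, t in zip(total, term)]
    return total

identity = lambda field: list(field)
# Hypothetical affine family: phi(b) = Id + b * (-Id), i.e. field -> (1 - b) * field.
shrink = lambda b: (lambda field: [(1.0 - b) * v for v in field])

daughters = ["x", "y"]
result = daughter_field(
    a_m=[2.0, 4.0],
    daughters=daughters,
    c={"x": 0.5, "y": 0.5},
    b={"x": 0.0, "y": 0.5},
    restrict={d: identity for d in daughters},
    phi={d: shrink for d in daughters},
    transport={d: identity for d in daughters},
)
```

Here the field at daughter x is the average of her own undeformed field (2, 4) and the sister's field shrunk by one half, which gives (1.5, 3). In an actual stemma the lists would be vector fields on the frames and the maps would be genuine restrictions and transports along the affine morphisms of the rectangles.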
Chapter 40
Architecture
The most rigorous test of the efficiency of theories in modern cognitive science is the production of a working computer program whose external behaviour mimics that to be explained. John Sloboda [491]
Summary. RUBATO® is a metamachine designed for representation, analysis, and performance of music. It was developed on the NEXTSTEP environment during two SNSF grants from 1992 to 1996 by the author and Oliver Zahorka [348, 347, 350, 357, 588, 590]. From 1998 to 2001, the software was ported to Mac OS X by Jörg Garbers in a grant of the Volkswagen Foundation. RUBATO®'s architecture is that of a frame application which admits loading of an arbitrary number of modules at run-time. Such a module is called RUBETTE®. There are very different types of Rubettes. On the one hand, they may be designed for primavista, compositional, analytical, performance stemma or logical and geometric predication tasks. On the other, they are designed for subsidiary tasks, such as filtering from and to databases, information representation and navigation tasks, or else for more specific subtasks for larger "macro" Rubettes. A RUBETTE® of the subtask type is called OPERATOR and implements, for example, what we have called performance operators in section 44.7. The RUBATO® concept also includes distributed operability among different peers. This software is conceived as a musicological research platform and not as a hard-coded device; we describe this approach. Concluding this chapter, we discuss the relation between frame and modules.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
In the original concept of RUBATO® [345], we had defined RUBATO® as being a software for analysis and performance, divided into two submodules: one for "structuring" a score, and the other for "shaping" this score. This meant that structuring would yield analytical structures, whereas the other would yield a shaped performance transformation, fed by analytical data from the structuring process. In the course of the software development, we learned that no data model for music objects known to the developers at that time would be sufficient for all requirements of a comprehensive
music analysis and performance. This led to the concept of denotators and forms, as realized under the title of the "PrediBase" database management system (DBMS) of the first RUBATO® implementation in 1994, as described in [589]. At that time, it became clear that under such a universal data model, RUBATO® would split into an application framework comprising the "PrediBase" DBMS and a series of dynamically loadable software modules as implemented in the Objective-C language of those NeXT computers driven by the NEXTSTEP OS. According to the diminutive convention for modules, such a module was coined "RUBETTE®". In 1996, at the end of a grant of the Swiss National Science Foundation where RUBATO® was realized, three analytical Rubettes and one for performance had been developed¹, which will be described in chapter 41. The PerformanceRUBETTE® is connected to five OperatorRubettes (at that time still named "OPERATORS"). The PrimavistaRUBETTE® takes care of paratextual score predicates. In 2001, this software was ported (and improved in many data management aspects) to Mac OS X by Jörg Garbers and is now available as an open source project on the internet [357], or on the CD appended to this book, see page xxx. The RUBETTE® screenshots in chapter 41 are all taken from this version of RUBATO®. Figure 40.1 shows the info panel of the Mac OS X version. However, the version included in the book's CD-ROM is the latest version before the book went into production, whereas the screenshots are somewhat older. We hope that the reader will excuse us for this slight asynchronicity.
[Info panel: RUBATO, by Guerino Mazzola & Oliver Zahorka, ported to Mac OS X by Jörg Garbers, Beta Release. Copyright G. Mazzola, O. Zahorka, J. Garbers. All rights reserved.]
Figure 40.1: The info panel of the Mac OS X version of RUBATO®.
In the following sections of this chapter we shall, however, not describe the Mac OS X implementation; rather, we will present the more advanced and flexible architectural principles of the ongoing Java-based implementation of the distributed RUBATO® version.
40.1
The Overall Modularity
Summary. RUBATO® is a modular engine, both for metamachine rationales and because research is itself increasingly modular. Built upon the denotator language, the RUBATO® concept is fully modular: all parts that can be split into modules have been split in this way.
¹ For NEXTSTEP, RUBATO® as well as these Rubettes are available on the internet, see [357]. The source code is GPL and is contained in the book's CD-ROM, see page xxx.
The modularity of RUBATO® has two aspects: First, it shows a composition of the software from an arbitrary, a priori undetermined, number of functional units, the RUBETTE® modules. Second, the available Rubettes are a dynamic factor: According to the research progress, new Rubettes of any flavor may be added to the existing arsenal. Modularity is an old principle; in fact, the traditional disciplinarity of science preconizes modules of scientific activities which have or pretend to have a relative autonomy in knowledge production. What is new with respect to the traditional disciplinarity is that this modularity is a dynamical one: at any time new modules of knowledge processing may be added or old ones removed. Discipline becomes a task-driven decision instead of being a rigid preset splitting. Such a modularization can only work on the common ground of unrestricted cross-communication among any subgroup of knowledge modules. Without a common language ground, which in our case is the denotator and form data model, dynamic disciplinarity would inevitably collapse since a new module would require language modifications, adaptations, and extensions. To be clear, we view dynamic disciplinarity as the idealized version of inter- and transdisciplinarity. The unity of knowledge cannot be achieved without a temporary and task-driven compartmentation of research fields; grouping and regrouping is inevitable, and there is no direct path to the unity of knowledge. This credo can, however, not be realized without a common language basis. Otherwise, language engineering frictions would paralyze any major effort of dynamical grouping of disciplines. Evidently, the present RUBATO® environment is limited to musical and musicological scopes. Here dynamic disciplinarity is not that utopian. After all, the denotators and forms are universal language approaches issued from music(ologic)al requirements.
But it has been shown in [464] that denotators are not only a priori applicable to non-musical concept modeling, but also in concrete cases such as geographic information systems. This suggests that dynamic disciplinarity could be realized on RUBATO® for modules of completely general scopes as long as the denotator data model is adopted. This is one of the most intriguing vectors of future developments concerning the RUBATO® environment. The principle of dynamic disciplinarity has its social form: a so-called collaboratory. According to Bill Wulf, this is "a 'center without walls' in which the nation's researchers can perform their research without regard to geographical location, interacting with colleagues, accessing instrumentation, sharing data and computational resources, and accessing information in digital libraries" [278]. To collaborate in this way requires adequate software platforms, and RUBATO® is precisely this type of software in the field of musicology.
40.2
Frame and Modules
Summary. Modularity has been realized on the basis of a frame application which offers interfaces to an arbitrary number of modules. This is one of the technical core features in the realization of a metamachine. We describe its splitting interfaces and their functional positions.
Figure 40.2: The RUBATO® layers of Rubettes with denotator communication, and the RUBATO® framework layer with class libraries that are related to different Rubettes. Each peer has one such configuration. Different peers are interconnected via remote method invocation (RMI).
The RUBATO® platform consists of a number of installations of the software on different peers which may communicate via Java's remote method invocation protocol (RMI, see Sun's Java documentation on the internet). For each peer, RUBATO® contains two layers: the RUBETTE® layer and the RUBATO® framework layer. The first contains a number of Rubettes which are autonomous Java applications that communicate with each other and with Rubettes of another peer by exchanging denotators via RMI. These are instances of the denotator class. The class library on the RUBATO® framework layer contains corresponding basic Java classes for denotators, forms, diagrams of presheaves, and modules. It also contains other classes which provide Rubettes with the necessary routines. The concept of these libraries is that they should contain all classes and methods that are of general interest, while classes and methods with specific interest for a RUBETTE®'s functionality should be installed in that RUBETTE®. There are a number of mandatory Rubettes: The InfoRUBETTE® (with an "i" in figure 40.2) is the initialization RUBETTE®. It is automatically started when RUBATO® starts and informs the user about available peers and Rubettes on the distributed environment.
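The division of labor between a frame and its run-time modules can be pictured with a small registry sketch. It is written in Python purely for illustration and is not the Java implementation described here: the classes `Denotator` and `Frame`, their methods, and the toy module `count_onsets` are hypothetical names, and remote communication between peers is left out entirely.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

@dataclass(frozen=True)
class Denotator:
    """Toy stand-in for a denotator: a named value of a given form."""
    name: str
    form: str
    value: object

# A module is modelled as a function from denotators to denotators.
Rubette = Callable[[Denotator], Denotator]

class Frame:
    """Frame application keeping a run-time registry of loaded modules."""

    def __init__(self) -> None:
        self._rubettes: Dict[str, Rubette] = {}

    def load(self, name: str, rubette: Rubette) -> None:
        self._rubettes[name] = rubette

    def unload(self, name: str) -> None:
        del self._rubettes[name]

    def available(self) -> List[str]:
        return sorted(self._rubettes)

    def send(self, name: str, denotator: Denotator) -> Denotator:
        """Pass a denotator to a loaded module and return its answer."""
        return self._rubettes[name](denotator)

def count_onsets(score: Denotator) -> Denotator:
    """Hypothetical module: counts the distinct onsets of a toy score."""
    return Denotator("onset count", "Integer", len(set(score.value)))

frame = Frame()
frame.load("CountOnsets", count_onsets)
score = Denotator("toy score", "OnsetList", [0, 1, 1, 2])
answer = frame.send("CountOnsets", score)
```

The point of the sketch is the design choice stressed in the text: modules only share the denotator data model with the frame, so a module can be added or removed at run-time without touching any other module.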
The visualization of all Rubettes' content and manipulation structures is managed by the PVBrowserRUBETTE®, whose functionality is to visualize any denotator in 3D space via Java3D classes. In figure 40.2, this RUBETTE® is represented by a lens symbol. The concept of this RUBETTE® has been described in chapter 20. The advantage of this centralization is that no other RUBETTE® designer has to take care of graphical and other multimedia representations: every denotator is piped to the PVBrowserRUBETTE® in case a multimedia representation is required. And every such representation is uniform according to this Rubette's visualization routines, which makes orientation much easier than individual design for every RUBETTE®. Nonetheless, as explained in chapter 20, the flexibility of the Satellite form for multimedia objects allows an unlimited multiplicity of shape and behavior. Two further Rubettes are devoted to the storage of denotators. The first, to the upper left of the RUBETTE® layer, is the DenotexRUBETTE®. It takes care of the storage and editing of denotators which are given in the Denotex ASCII format. The second storage RUBETTE® is a filter to an SQL DBMS such that SQL databases can be transformed into denotators for RUBATO®. A last central RUBETTE® is the LoGeoRUBETTE®, shown by a toothed wheel symbol to the right front of the RUBETTE® layer. It manages the logical and geometric operations on denotators (see section 18.3.4) and can be used by any RUBETTE® for its specific needs. The methods of this RUBETTE® are encoded in the class library of the RUBATO® layer.
Chapter 41
The RUBETTE® Family
Vögel, Vieh und alles, was auf Erden kriecht, die laß heraus mit dir, daß sie sich tummeln auf der Erde und fruchtbar seien und sich mehren auf Erden. [Birds, cattle, and everything that creeps on the earth, let them go out with you, that they may swarm over the earth and be fruitful and multiply on the earth.] The Holy Bible, Genesis 8
Summary. We give an overview of the analytical MetroRUBETTE®, MeloRUBETTE®, HarmoRUBETTE®, the PerformanceRUBETTE®, and the PrimavistaRUBETTE®, which have been realized on the NEXTSTEP and then on the Mac OS X environment.
Originally, the Rubettes as such were not the central concern of the RUBATO® project; this was rather to establish their collaboration and the realization of the whole transmission process from analytical data to the performance shaping operators of the PerformanceRUBETTE®. Each of these Rubettes was more an experimental prototype without the claim of being a high-end tool in the specific domain. The interest in such experiments lies in the fact that when one starts the design of a RUBETTE®, it turns out that musicology and music theory do not offer any reasonable support, be it in conceptual or in operational aspects. The path from the given score to a specific analysis reveals an incredible complexity of what in musicology and music theory looks like an easy enterprise. For example, in the design of the HarmoRUBETTE®, the mere definition of what a chord is cannot be traced from traditional literature. Should we only look for local compositions of pitches that stem from notes with a common onset, or should one also consider durational aspects? The standard answer (or rather: excuse) states that it depends on the particular context, and the context of the context, but this is no way out if one has to implement clear concepts and methods rather than rhetoric. So the design of a RUBETTE® is always a very good test of the validity of a model and of its adequacy with traditional fuzzy understanding. But it is also a test for the tradition: After all, who decides what is a good model for harmony? Here, the alternative between general speculative nonsense theories and concrete, but possibly insufficiently general implementation and operationalization becomes dramatic. At least, one can hope that this confrontation will force everybody to rethink ill-defined approaches.
41.1
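MetroRUBETTE®
Die Zählzeiten (Schlagzeiten, rhythmische Grundzeiten) gewinnen unter allen Umständen erst reale Existenz durch ihre Inhalte. [The counting beats (beats, basic rhythmic times) gain real existence, under all circumstances, only through their contents.] Hugo Riemann [453]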
Summary. The MetroRUBETTE® is an elementary analysis module which shows that seemingly simple approaches yield complex but informative results. We also make evident that operationalization of abstract concepts reveals unexpected insights into generically not foreseeable structures.
Figure 41.1: The 31-part score denotator deduced from Richard Wagner's composition "Götterdämmerung".
The MetroRUBETTE® is built on the sober weight calculation which we developed in chapter 21 in the frame of the maximal meter nerve topologies. The formula in example 43 of section 21.2 is realized, except that there is no upper length limitation (i.e., it is set to $\infty$). The mixed weight formula $W(x) = \sum_{X_i \in SP_I(X)} W_i(x)$ is also realized in this RUBETTE®. The input is a score denotator (although still in the early shape of the PrediBase data model, which is the very special denotator form of a list of lists, which start from the simple form of strings). Figure 41.1 shows the 31-part score deduced from Richard Wagner's composition "Götterdämmerung". This data is imported to the RUBETTE® and then evaluated according to the said formulas. For example, the weight of the union of all the onsets of the 31 parts is shown in figure 41.2. Its parameters are: profile = 2, minimal length = 2, and the unit of the onset grid in the graphical representation of the weight is 1/8. The mixed weight is also an option of this RUBETTE®. Figure 41.3 shows the mixture of the parts: clarinet (part 11), bassoon (part 16), horn (part 14), and violin (part 27). The parameters are the same as above; the weights are scaled by the factors 3, 2.5, 1, and 0.1 for the respective parts.
Figure 41.2: The weight graphics of the above composition with profile = 2, minimal length = 2. Horizontal axis: onset, vertical axis: (relative) weight. The unit of the onset grid in the graphical representation of the weight is 1/8.
Figure 41.3: The mixed weight graphics of four parts clarinet (part 11), bassoon (part 16), horn (part 14), and violin (part 27) with scaling factors 3, 2.5, 1, and 0.1 for the respective parts. The unit of the onset grid is again 1/8, as in figure 41.2.
For a more in-depth application of this RUBETTE® in musicology, we refer to [155]. This paper is an excellent example of an unexpected musicological application of a RUBETTE® which was not intended to be interesting on its own. The metrical analysis turns out to be quite sophisticated, although the concept of a local meter is a very elementary one. The surprising effect of this setup is that the combination of the simplistic concept reveals unprecedented insights into the time structure of classical works. This is a good hint to musicologists, teaching them that interesting insight can result from a complex aggregation of simple ingredients. In this case, the simple elements are provided by the maximal local meters, whereas their complex aggregation is conceived by the nerve of the covering they define. A second very interesting application of this RUBETTE® has been presented in [349]. It was recognized that the longest possible minimal lengths of local meters for the left hand part of Schumann's "Träumerei" yield a 3+5 quarters periodicity over two bars, whereas the same analysis of the right hand yields the expected periodicity of 4 quarters with stress on the barlines. The sonification of this fact can be heard on the audio example in the book's CD-ROM, see page xxx. In this RUBATO® version, however, the output is a weight which is by no means a denotator. This output is a final data and must be used in its special format. This will be the case for operators of the PerformanceRUBETTE®.
For the Java-based distributed RUBATO® (figure 40.2), such a restrictive usage would be forbidden.
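To give a feeling for how simple ingredients aggregate into a metrical weight, here is a small sketch in Python. It is not the MetroRUBETTE® code and rests on assumptions that should be checked against section 21.2: a local meter is taken to be an arithmetic progression of onsets, its length is the number of onsets minus one, the weight of an onset is the sum of length to the power `profile` over all maximal local meters of at least `min_length` that contain it, and the mixed weight is a sum of part weights scaled by factors. Function names and the toy onset list are hypothetical.

```python
from itertools import combinations

def maximal_local_meters(onsets, min_length=2):
    """Maximal arithmetic progressions of onsets, as frozensets."""
    xs = sorted(set(onsets))
    present = set(xs)
    candidates = set()
    for a, b in combinations(xs, 2):
        d = b - a
        if a - d in present:
            continue  # a is not the first onset of its progression
        meter = [a]
        while meter[-1] + d in present:
            meter.append(meter[-1] + d)
        candidates.add(frozenset(meter))
    # Keep only progressions that are not properly contained in another one.
    maximal = [m for m in candidates if not any(m < other for other in candidates)]
    return [m for m in maximal if len(m) - 1 >= min_length]

def metric_weights(onsets, profile=2, min_length=2):
    """Weight of each onset: sum of length**profile over meters containing it."""
    meters = maximal_local_meters(onsets, min_length)
    return {x: sum((len(m) - 1) ** profile for m in meters if x in m)
            for x in sorted(set(onsets))}

def mixed_weight(weights_per_part, factors):
    """Sum of the part weights, each scaled by its factor."""
    mixed = {}
    for weights, factor in zip(weights_per_part, factors):
        for x, w in weights.items():
            mixed[x] = mixed.get(x, 0) + factor * w
    return mixed

# Toy onset set in grid units (integers avoid rounding problems).
weights = metric_weights([0, 1, 2, 3, 4, 6])
```

For the toy onsets the maximal local meters of length at least 2 are {0, 1, 2, 3, 4}, {0, 2, 4, 6}, and {0, 3, 6}, so onset 0 receives 16 + 9 + 4 = 29 while onset 1 receives only 16. The brute-force search is quadratic in the number of onsets before the containment test and is meant for small examples only.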
41.2
MeloRUBETTE®
In general, the author does not believe in the possibility or even desirability of enforcing strict musical definitions. Rudolph Reti on the concept of a motif [444]
Summary. The MeloRUBETTE® is an excellent example of the tension between abstract concepts and operational implementation. We expose the routines for motivic analysis, the interface concept, and we discuss the performance problem, including proposals for performance improvement and their theoretical limits.
Figure 41.4: The weight graphics for the celli part 30 in the above score denotator deduced from Richard Wagner's "Götterdämmerung".
The MeloRUBETTE® refers to the theory of motivic topologies in chapter 22 and, in particular, to section 22.9 about motivic weights. The score is loaded as for the MetroRUBETTE®; its projection to the onset-pitch space is then analyzed and yields a numeric weight for each onset-pitch event. One such weight is visualized in figure 41.4. It corresponds to part 30 (celli) of the above composition "Götterdämmerung". The weight values are encoded in gray-levels of the discs which represent the events in onset and pitch. The calculation relies on these parameters which relate to melodic topology:
• Symmetry Group. This is the paradigmatic group of the shape type. In each group, we include the translation group in pitch and onset. The choice is then between the translation group (encoded by "trivial"), the one generated by the translations plus the retrograde, the one generated by the translations plus inversion, and the full counterpoint group, i.e., the one generated by the inversion and retrograde over the translations.
• Gestalt Paradigm. This is one of three possible shape types: diastematic, elastic, and rigid. The first candidate is in fact the type which we called "diastematic index shape type" in chapter 22. Observe that we have no topology for the diastematic type, but may nevertheless define neighborhoods!
• Neighborhoods. This is the neighborhood radius $\epsilon$ which was used in the expression $D_\epsilon(M)$ in section 22.9.
• Span. This is the maximal admitted onset difference between motive events.
• Cardinality. This is the maximal admitted cardinality of motives. Together with the Span, this condition defines the selection $\mu$ of motives which is addressed in the approach from section 22.9.
Figure 41.5: The main window of the MeloRUBETTE®.
The presence and content functions are defined as follows:
1. Presence. For a motif M, the presence value $pr_{\mu,\epsilon}(M)$ is the sum of all these numbers: For each $N \in D_\epsilon(M)$, we count the number m of submotifs M' of N that lie at a distance less than $\epsilon$ from M. We also look for the difference of cardinalities d = card(N) - card(M). This gives a contribution $p(N) = m \cdot 2^{-d}$, and we add all these numbers.
2. Content. Similarly, for each motif $N \in \mu$ such that $M \in D_\epsilon(N)$, we take $p(N) = m \cdot 2^{-d}$ with d = card(M) - card(N) and m the number of submotifs N' of M that lie at a distance less than $\epsilon$ from N. We add all these numbers p(N) and obtain the content $ct_{\mu,\epsilon}(M)$.
3. Weight. Given a motif $M \in \mu$, this is the product $n_{\mu,\epsilon}(M) = pr_{\mu,\epsilon}(M) \cdot ct_{\mu,\epsilon}(M)$, i.e., taking the function $w(x, y) = x \cdot y$ from section 22.9.
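The presence, content, and weight recipe above can be sketched as follows. This is a simplified illustration and not the MeloRUBETTE® routine: it assumes the rigid shape type with the translation group only, compares shapes by Euclidean distance, admits motives of at least two notes, and enumerates motives as arbitrary note subsets within the span. All function names and the three-note example are hypothetical.

```python
from itertools import combinations
from math import dist

def motives(notes, span, max_card, min_card=2):
    """All note subsets with min_card..max_card notes whose onsets lie within span."""
    notes = sorted(notes)
    return [m for k in range(min_card, max_card + 1)
            for m in combinations(notes, k)
            if m[-1][0] - m[0][0] <= span]

def shape(motif):
    """Rigid shape modulo translation: coordinates relative to the first note."""
    o0, p0 = motif[0]
    return [c for (o, p) in motif for c in (o - o0, p - p0)]

def close_submotifs(small, big, eps):
    """Number m of submotifs of big, of the size of small, closer than eps to small."""
    return sum(1 for sub in combinations(big, len(small))
               if dist(shape(sub), shape(small)) < eps)

def motivic_weights(mu, eps):
    """Return (presence, content, weight) dictionaries indexed by motif."""
    presence = {M: 0.0 for M in mu}
    content = {M: 0.0 for M in mu}
    for M in mu:
        for N in mu:
            if len(N) < len(M):
                continue
            m = close_submotifs(M, N, eps)
            if m:
                p = m * 2.0 ** -(len(N) - len(M))
                presence[M] += p   # N lies in the neighborhood of M
                content[N] += p    # M is, up to eps, contained in N
    weight = {M: presence[M] * content[M] for M in mu}
    return presence, content, weight

# Toy melody: three notes (onset, pitch) rising by a whole tone each.
mu = motives([(0, 60), (1, 62), (2, 64)], span=2, max_card=3)
presence, content, weight = motivic_weights(mu, eps=0.5)
```

In the toy melody the two stepwise motives ((0, 60), (1, 62)) and ((1, 62), (2, 64)) have the same shape and each obtains presence 3, content 2, and weight 6, whereas the leap ((0, 60), (2, 64)) only reaches weight 1.5. The nested loops over all pairs of motives, each with a scan over submotifs, already hint at the combinatorial cost of the full routine.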
We have already given musicological comments on this construction in chapter 22. It is, however, remarkable to see the overwhelming amount of calculations which arise in this routine. For example, we have calculated the number C of comparisons of motives (for distance measurements) for Schumann's "Träumerei" ("Kinderszenen" number 7), which comprises 463 notes. If we take Span = 1/2 bar and Cardinality = 4, we obtain 25'745 motives and C = 1'023'490'904 ≈ 1.023 × 10^9. This is beyond any explicit human calculation power. It demonstrates that the task of finding a dominant motif is a very hard one, and that this one, if it is recognized by a human listener, can at most be present in a very hidden layer of consciousness. This becomes even more dramatic for larger pieces, such as an entire sonata, say! Here, the combinatorial extent of motivic units exceeds any calculation power of humans and machines, as is easily verified. This means that a huge composition bears a motivic complexity that will escape (human or machine-made) classification forever. Nonetheless, the usage of statistical methods, of simplified approaches to motivic topology, or of topological invariants that are more easily perceived could help find a rough orientation in the virtually infinite motivic variety of music. This implementation also makes evident the tension between fuzzy concepts in musicology and implementation of a precise model. Although the concept of a motif is rather elementary, it entails a very sophisticated motivic analysis which could eventually converge to the intensions hidden¹ behind those fuzzy motive theories.
¹ We are not sure whether they are really hidden and not only faked ...
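To get a feeling for the combinatorial growth mentioned above, the following Python sketch enumerates all motives of a toy onset list that respect a given Span and Cardinality. It is an illustration only: the function name, the representation of events by their onsets alone, and the minimal cardinality of 2 are assumptions, and the sketch does not attempt to reproduce the comparison count $C$ of the actual routine.

```python
from itertools import combinations

def enumerate_motives(onsets, span, max_card, min_card=2):
    """Return all index tuples of events whose onset spread is at most `span`.

    Each motive is listed once, with its earliest event first.
    `min_card=2` is an assumption of this sketch.
    """
    order = sorted(range(len(onsets)), key=lambda i: onsets[i])
    motives = []
    for pos, first in enumerate(order):
        # Later events that may join `first` without exceeding the span.
        window = [j for j in order[pos + 1:]
                  if onsets[j] - onsets[first] <= span]
        for k in range(min_card - 1, max_card):
            for rest in combinations(window, k):
                motives.append((first, *rest))
    return motives
```

Calling the function with growing `max_card` on the same onset list shows how quickly the number of admitted motives increases, even before any distances between motives are compared.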
41.3 HarmoRUBETTE®
But a theory that fails precisely where the phenomenon it is supposed to explain also drifts into the vague and indeterminate may be regarded as adequate.
Carl Dahlhaus on Hugo Riemann's harmony [100]

Summary. The HarmoRUBETTE® makes clear that a vague theoretical approach does not reflect a vague phenomenon but an extremely complex one. The implementation of this RUBETTE® reveals several deep deficiencies of traditional "messy" analysis in harmony. We account for this on the level of preferences that have to be defined in order to get off the ground with the analysis. The chapter concludes with a discussion of combinatorial problems due to the global complexity of harmony, and to the local character of tonal paradigms.
The HarmoRUBETTE® is probably the most interesting RUBETTE®, since it is situated at a turning point of several critical issues in harmony. To begin with, the context problem in harmony is a multilayered and ramified one which is (as we have said repeatedly) not clarified by music theory. This is manifest in the preliminary question of what a chord is in a given score. Should one only look at groups of notes with a common onset, should one also consider onset groups which are not manifest but can be deduced from plausible rules, or is the selection of the relevant set of chords within a score also a function of the harmonic statements which could result thereof?

We have implemented two variants. The first one takes as the sequence $(a_i)_i$ all maximal zero-addressed local compositions $a_i = \{a_{i,j} \mid j = 1, \ldots, t_i\}$ of pitches of note events with identical onsets. The second one is less naive. Within the given score, we take all local compositions $a_i = \{a_{i,j} \mid j = 1, \ldots, t_i\}$ of pitches with this property: There is at least one onset which is the offset time of an event, and the chord $a_i$ is the non-empty set of pitches of all note events which either start or still last at this offset time. This option is chosen by the button "use Duration" on the RUBETTE®'s main window, see figure 41.6. This second variant encodes all changes of chord configurations, not only the onset-commonalities.

Using either of these methods, the generated chord sequence is the basis for the following analysis, which at the end will yield a harmonic weight for each note event, and for which we refer to the harmonic tension theory presented in section 27.2.2. According to that approach, each chord $a_i$ of the chord sequence $(a_i)_i$ must be given a Riemann matrix $(TF_{f,t}(a_i) = \phi_{f,t}(a_i))_{f,t}$, from which we deduce the weights $w(f, t, a_i) = \ln(\phi_{f,t}(a_i))$ and also call this data the Riemann matrix of $a_i$. According to that discussion, we may also downsize weights below a threshold $\phi_{min}$ to $-\infty$.
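The two chord extraction variants can be sketched as follows. This Python fragment is a minimal illustration, not the RUBETTE®'s code: note events are assumed to be (onset, duration, pitch) triples, the function names are hypothetical, and the second function follows one possible reading of the "use Duration" option, namely that the sounding pitches are collected at every time where some event starts or ends.

```python
from collections import defaultdict

def chords_by_onset(notes):
    """Variant 1: one chord per onset, made of the pitches starting there."""
    groups = defaultdict(set)
    for onset, duration, pitch in notes:
        groups[onset].add(pitch)
    return [(t, groups[t]) for t in sorted(groups)]

def chords_with_duration(notes):
    """Variant 2 (one reading): at each onset or offset time, collect the
    pitches of all events that start at or still last at that time."""
    times = sorted({on for on, dur, p in notes} |
                   {on + dur for on, dur, p in notes})
    chords = []
    for t in times:
        sounding = {p for on, dur, p in notes if on <= t < on + dur}
        if sounding:  # only non-empty chords are kept
            chords.append((t, sounding))
    return chords
```

For a held note under a moving voice, the second variant reports a new chord at the moment the moving voice changes, whereas the first variant only sees the pitch that starts there.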
This threshold is what the user sets when defining the "Global Threshold" in the main window. The local threshold is just the same for relative weights within a given Riemann matrix. The percentages in the main window mean that we downsize values below a defined percentage relative to the global or local (only within a fixed Riemann matrix) value range. Following the rules for the value $-\infty$, we may neglect any path through a chord which has this value. So this singular value means that a chord is "inharmonic" insofar as it cannot contribute to a positive harmonic evaluation. This is a mathematical rephrasing of the classical, but fuzzy concept of inharmonic chords. Here it just means that the harmonic weight of a chord is too small to be considered as a contribution to the global harmonic path, and that the minimal size of allowed weights is set without further
theoretical justification. It is a regulatory limit for the sensitivity of the path maximization with respect to the involved weights.

Figure 41.6: The main window of the HarmoRUBETTE®.

We should stress that this matrix $(w(f, t, a_i))_{f,t}$ defines Riemann weights without any contextual considerations. This is information that comes from the isolated calculation on the chord $a_i$. There are different calculation methods for this matrix: one by the author, using chains of thirds, as discussed in section 25.3.3, and one by Noll, using self-addressed chords. The user may choose the preferred method by a pull-down button on the RUBETTE®'s main window. The Riemann matrix $(w(f, t, a_i))_{f,t}$ is visualized in the "ChordInspector" window for each chord (see figure 41.7). In this window, the non-logarithmic values $\phi(a_i)$ are shown. The window also shows the chord's pitch classes as well as all its minimal third chains.

Once the Riemann matrices $(w(f, t, a_i))_{f,t}$ are calculated, the optimal path in the Riemann quiver is calculated. This follows the method discussed in 27.2.2. To this end, we need preferences for the matrices $T_{VAL_{type}}$, $T_{VAL_{mode}}$, $T_{TON}$. The matrices $T_{VAL_{mode}}$, $T_{TON}$ are defined in the windows shown in figure 41.8. To the left, we have $T_{TON}$, however as a distance value according to the number of fourths between pairs of tonalities; to the right, we have $T_{VAL_{mode}}$. The matrix $T_{VAL_{type}}$ is shown in the lower left corner of the window in figure 41.9. In the middle lower part of that window, we have check buttons for every Riemann locus (i.e., the position in the Riemann matrix), meaning that if the check is disabled, no path is possible through such a locus. The large upper matrix encodes the third weights needed for chord weight calculations according to the formulas (25.8) and (25.9). With these settings, the best path is calculated. This is however a tedious task, to say the least.
In fact, if we are given 200 chords (a very small example), we may choose from a number of
$12^{200} \approx 6.8588169039290515 \cdot 10^{215}$ paths. This number exceeds any calculation power of present computers. This exuberant number is due to different factors. First of all, no larger paths are taken into account, i.e., we have not implemented cadences as preferred paths, nor have we implemented modulatory constraints. More precisely, we do not give preference to maximal subpaths within a fixed tonality. We only take into account tonality changes a posteriori, i.e., via the weights of paths of length 1, when they show a tonality change. So we have to calculate the entire path and then hope that the negative points for tonality changes rule such paths out.

Figure 41.7: The ChordInspector of the HarmoRUBETTE® shows each chord of the chosen chord sequence with its pitch classes, the third chains, and the Riemann matrix according to the chosen calculation method. The grey level of the values is proportional to their relative size.

It is also not clear whether human harmonic logic really can take into account such global path comparisons. In other words, it is more likely that humans only consider local optimization of paths. This is what we have in fact implemented in the following sense. At each index $i$ of a chord $a_i$, we consider only a local part of the entire chord sequence. Such a part is defined by two non-negative integer variables CD = Causal Depth, FD = Final Depth. This means that we look at the subsequence of chords from index $i - CD$ to $i + FD$ (inclusive) and therein select an optimal path $p_{i,CD,FD}$. Within this path, chord $a_i$ is positioned at a determined Riemann locus $(f(i), t(i))$. The path which we finally select is the path $p$ through all triples $(f(i), t(i), a_i)$. The causal part is a tribute to the influence of preceding chords down to index $i - CD$ on the harmonic position of chord $a_i$. The final part influences the harmonic position of $a_i$ relating to the future chords up to index $i + FD$. The result is visualized in the Riemann Graph which is shown in figure 41.10. This is however not the end of the job. We do not have the weights of the single notes. To
this end, we first need the globally calculated weight of each chord $a_i$. In a provisional form, the weight of chord $a_i$ is defined by the weight $W(p_i)$ of the chosen path $p$ from the first chord to $a_i$. This value is finally modified by a global slope preference $\alpha$ such that the final and the first weight can be set to build a defined slope. Thereby, we meet the requirement of a control over the global tension which cannot be deduced from the locally (only 0 and 1 lengths of sub-paths) defined weights. Let us denote by $W_i$ the weight $W_\alpha(p_i)$ corrected by the slope preference.

Figure 41.8: This preference window (for the author's third chain method) shows the tonality distance matrix $T_{TON}$ (left) and the mode matrix $T_{VAL_{mode}}$ (right).

Figure 41.9: The upper matrix encodes the weights of thirds (relative to a fixed reference tonic), the lower left shows the matrix $T_{VAL_{type}}$. The lower middle matrix encodes the a priori allowed Riemann matrix locus positions.

Using the weight of a chord $a_i$, we finally may define the weight $w(x)$ of a determined event $x$ in $a_i$ (in this notation, we omit all the preferences). To this end, the weight $w(f(i), t(i), a_i)$ is compared to the weight $w(f(i), t(i), a_i - \{x\})$ of the chord obtained by omitting $x$ from $a_i$. With a positive preference quantity $0 < d \leq 1$, we consider the factor

$$\lambda(x) = \frac{1}{d + (1 - d)\, e^{w(f(i),t(i),a_i - \{x\}) - w(f(i),t(i),a_i)}} \tag{41.1}$$
which measures the weight differences. It evaluates to 1 for the difference zero and yields $1/d$ for the weight $w(f(i), t(i), a_i - \{x\}) = -\infty$. This means that the influence of $x$ in the building of the chord's weight is accounted for. If the weight decreases after omission of $x$, its influence is important and the factor $\lambda(x)$ increases. So we finally get the weight

$$w(x) = \lambda(x) \cdot W_i.$$

Figure 41.10: The Riemann graph is the sequence of chords, together with their functional values as they result from the optimal path.

The graphical representation of this weight is the same as for the MeloRUBETTE®, and we may omit this window of the HarmoRUBETTE®. Whereas the Riemann graph conforms to the usual function-theoretic analysis (although it need not provide the common data in general), the weights of chords and events are far beyond the usual harmonic analysis and therefore cannot be compared without caution to established knowledge in harmony. It is however a common approach to harmony in its performance aspects to weight chords or notes in a more or less metaphoric way. Our present approach in the HarmoRUBETTE® is a concretization of these metaphors and also a point to be discussed with traditional performance theorists.
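The factor of formula (41.1) and the resulting event weight are easy to check numerically. The following Python sketch uses hypothetical function names; `w_full` stands for $w(f(i), t(i), a_i)$, `w_without` for $w(f(i), t(i), a_i - \{x\})$, and `w_full` is assumed to be finite.

```python
import math

def influence_factor(w_full, w_without, d):
    """lambda(x) = 1 / (d + (1 - d) * exp(w_without - w_full)), with 0 < d <= 1."""
    if not 0 < d <= 1:
        raise ValueError("the preference quantity d must satisfy 0 < d <= 1")
    if w_without == -math.inf:
        # The chord without x is "inharmonic": the exponential vanishes.
        return 1.0 / d
    return 1.0 / (d + (1.0 - d) * math.exp(w_without - w_full))

def event_weight(W_i, w_full, w_without, d):
    """w(x) = lambda(x) * W_i for an event x of chord a_i with chord weight W_i."""
    return influence_factor(w_full, w_without, d) * W_i
```

With equal weights the factor is 1, so the event simply inherits the chord weight; the smaller the weight of the chord without $x$, the closer the factor gets to its upper bound $1/d$.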
41.4 PerformanceRUBETTE®
All words, And no performance! Philip Massinger (1583-1640)
Summary. The PerformanceRUBETTE® is a 'macro' RUBETTE®: It manages the stemma generation, the weight input and recombination, the operator instantiation, and the production of output of performance data on the level of music technology.
Figure 41.11: The main window of the PerformanceRUBETTE® shows the stemmatic inheritance, descending from left to right. In the figure, the "Mother LPS" has a daughter named "PhysicalOperator", and this one (all mothers are indicated on top of the daughters' column) has two daughters, "Split Operator 1" and "Split Operator 2", generated by the SplitOperator, etc.

Originally, the PerformanceRUBETTE® was the very focus of RUBATO®. Its purpose was the implementation of a type of performance logic with arguments from an analytical output. Although the analytical Rubettes have earned a growing importance, one of the cornerstones of analysis is its success in the construction of a valid performance. In fact, playing a good performance is a way of demonstrating one's understanding of music. Therefore the performance theory implementation is important beyond its autonomous interest. The PerformanceRUBETTE® implements the stemma theory of chapter 38. The starting point is a selection of a score denotator. This will play the primary mother's role, i.e., we are constructing a primary mother LPS in the sense of definition 35.3. The score form is provided in the same format that we have known as input for the other Rubettes. For this RUBETTE®, the kernel is always given as a (zero-addressed) local composition in the space form EHLDGC.
This local composition is the kernel in the top space of a cellular hierarchy pertaining to the primary mother's LPS. The primary mother's LPS is instantiated according to "hard coded" default parameters in the Objective C source code³. However, for each specific performance operator, the LPS data are adopted and define specific daughters. After setting the kernel (see figure 41.11) of the primary mother, the main window shows the stemmatic ramification with individual names for each LPS, arranged in a browser, with stemmatic inheritance running from left to right. In the figure, the "Mother LPS" has a daughter named "PhysicalOperator", and this one (all mothers are indicated on top of the daughters' column) has two daughters, "Split Operator 1" and "Split Operator 2", generated by the SplitOperator, etc.
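The stemmatic ramification shown in the browser is essentially a tree of LPS nodes. The following Python sketch models only this genealogy; the class, its fields, and the method names are hypothetical, and no performance data (kernel, fields, operator parameters) are represented.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass(eq=False)
class LPS:
    """A node of the stemma: a named LPS with its mother and daughters."""
    name: str
    operator: Optional[str] = None            # operator that generated this LPS
    mother: Optional["LPS"] = field(default=None, repr=False)
    daughters: List["LPS"] = field(default_factory=list)

    def ramify(self, operator, names):
        """Generate one or two daughters of this LPS via the given operator."""
        if len(names) not in (1, 2):
            raise ValueError("an operator yields one or two daughters")
        new = [LPS(name=n, operator=operator, mother=self) for n in names]
        self.daughters.extend(new)
        return new

    def lineage(self):
        """Names from the primary mother down to this LPS."""
        node, path = self, []
        while node is not None:
            path.append(node.name)
            node = node.mother
        return path[::-1]
```

The stemma of figure 41.11 would then be built by calling `ramify` on the primary mother with one name, and on the resulting daughter with the two names produced by the SplitOperator.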
Figure 41.12: The Kernel View window shows the top kernel of the hierarchy of a selected LPS (here the LPS named "PhysicalOperator" in the above stemma browser) in common pianola (piano roll) rectangles, loudness being codified by grey levels. The vertical bars are set to four bar intervals in the given score.
³ Objective C is a programming language for the NEXTSTEP-, OPENSTEP-, and Mac OS X-based RUBATO® projects.
Figure 41.13: In the PerformanceRUBETTE®, weights are used in their splined interpolation shape. Here, we see a metrical weight issued from the MetroRUBETTE®'s analysis.
Figure 41.14: The Weight Watcher window shows the loaded weights (top), the upper and lower limit of their range, the non-linear deformation, and the Boolean flag of inverting/non-inverting the weight (button in the lower left corner).

For each ramification, one or two daughters are generated according to a chosen operator. For example, in figure 41.11, the SplitOperator generates two daughters, whereas the TempoOperator always produces one single daughter. The operators need only be loaded at run-time as they are needed. So this RUBETTE® is non-terminal in the sense that it allows further ramifications via an arbitrary number of dynamically loadable performance operators. For every highlighted LPS in the stemma browser of the main window, we can visualize the top kernel of its hierarchy in the Kernel View window, as shown in figure 41.12, by means of the usual pianola graphics. Here, the grey level indicates the loudness. In order to apply an operator to a given LPS, one next needs a list of weights; this conforms to the operator theory exposed in chapter 44.7.
The management of available weights as well as their concrete application are the business of the Weight Watcher system. Figure 41.13 shows a metrical weight in its splined interpolation shape. The weights to be used for a given operator can be loaded into the Weight Watcher, see top of figure 41.14. The loaded weights are then added or multiplied (according to the Boolean flag button "Combine as Product" to the right, below the weight list), and the resulting weight combination is applied to the given operator. For each weight, one can set the upper and lower limit of range (High Norm, Low Norm), the non-linear deformation quantity (Deformation), the inversion/non-inversion flag (Inverted Weight button in the lower left corner), the influence in a combination of several weights (Influence), and the slope of decrease to weight value 1 as the arguments tend to infinity (Tolerance). The moral of this Weight Watcher system is a gastronomic one: Weights may be mixed and dosed at will in order to experience their influence on a given operator. This is not merely a lack of theory, it is above all an experimental environment for effective performance research. In fact, since virtually nothing is known about the influence of weights on performance, we have provided the user with a great number of possibilities in order to realize an optimal testbed for future theory.
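The mixing of weights can be illustrated by a small Python sketch. It is a strongly simplified assumption of how such a combination might look: weights are given as sampled value lists on a common grid, each weight is linearly rescaled to the range [Low Norm, High Norm] and optionally inverted, and the prepared weights are then added or multiplied. The function names are hypothetical, and Deformation, Influence, and Tolerance are left out since their precise action is not specified here.

```python
from math import prod

def prepare_weight(values, low, high, inverted=False):
    """Rescale sampled weight values linearly to [low, high]; optionally invert."""
    vmin, vmax = min(values), max(values)
    if vmax == vmin:
        scaled = [0.5] * len(values)   # flat weight: put it mid-range (assumption)
    else:
        scaled = [(v - vmin) / (vmax - vmin) for v in values]
    if inverted:
        scaled = [1.0 - s for s in scaled]
    return [low + s * (high - low) for s in scaled]

def combine_weights(weights, as_product):
    """Combine several prepared weights point by point, as sum or as product."""
    if as_product:
        return [prod(column) for column in zip(*weights)]
    return [sum(column) for column in zip(*weights)]
```

For example, a metrical weight and its inverted copy, both normalized to [0.5, 1.5], sum to a constant, whereas their product dips at the extremes; this is the kind of effect the user can explore by mixing weights.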
Figure 41.15: The PhysicalOperator Inspector allows us to select a number of physical output parameters where the weight changes the values.

Figure 41.16: The SymbolicOperator Inspector allows us to select a number of symbolic input parameters where the weight changes the values.
Now, given a Weight Watcher combination of weights, an operator is fed with this combined weight and acts on the mother LPS to yield a new daughter LPS (or two in the case of the SplitOperator, where however no weight is needed). The detailed operation of a specific operator has already been described in chapter 44.7; we need not repeat these details here.
Figure 41.17: The effect of an operator, here a PhysicalOperator, is shown in the Kernel View of the performed kernel. This figure shows the performed version of the symbolic kernel shown above in figure 41.12.
Figure 41.18: The TempoOperator inspector allows us to select different integration methods. The "Real" method uses Runge-Kutta-Fehlberg routines, whereas the "Approximate" method uses simple numerical integration.

Figure 41.19: The ScalarOperator inspector allows us to select different options as defined in the scalar operator theory, but Runge-Kutta-Fehlberg ODE integration is mandatory in this situation.
Figure 41.20: The performance field of a selected LPS can be visualized. The user may select two parameters whereon the six-dimensional field is projected.
Figure 41.16 shows the inspector of the SymbolicOperator. The weight acts on selected parameters which are defined by Boolean buttons. The same procedure is performed for the PhysicalOperator, whose inspector is shown in figure 41.15. The action of an operator on the symbolic score is shown in the performed kernel in the same pianola representation as for the symbolic kernel. The action of a physical operator is shown in figure 41.17. A funny application of this operator to Schumann's "Träumerei" can be heard on the book's CD-ROM under the title Alptraeumerei, see page xxx. This piece is the dead-pan version of the score with the melodic weight being applied to pitch via the physical operator, and everything being played with Schumann's original tempo indication. The inspector windows of the tempo-sensitive operators, the TempoOperator and the ScalarOperator, are shown in figures 41.18 and 41.19, respectively. The TempoOperator implements basically two methods: "Approximate" and "Real". The former is a direct integration method, whereas the latter uses Runge-Kutta-Fehlberg numerical ODE routines, including different parameters for numerical precision. The ScalarOperator uses exclusively Runge-Kutta-Fehlberg numerical ODE routines since it is an operator that acts on two or more parameters, where the naive approximation method cannot work. For the visualization of performance fields, the window shown in figure 41.20 is available. Finally, the parameters for the SplitOperator are determined in the window shown in figure 41.21. Here, one may define those lower and upper parameter limits of the total six-dimensional frame, where the subframe of the split daughter is cast.
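The difference between a direct integration and an ODE-based one can be illustrated with a tempo curve. The Python sketch below rests on an assumption not restated in this section, namely that the tempo $T(E)$ is the inverse derivative of physical time $e$ with respect to symbolic time $E$, so that $e(E_1) - e(E_0) = \int_{E_0}^{E_1} dE / T(E)$. The first function is a plain trapezoid sum; the second solves $de/dE = 1/T(E)$ with the classical fixed-step Runge-Kutta method of order four, used here as a simple stand-in and not the adaptive Runge-Kutta-Fehlberg routine of the RUBETTE®. All names are hypothetical.

```python
def physical_time_approx(tempo, E0, E1, steps=1000):
    """Trapezoid sum of 1/T(E) from E0 to E1 (direct integration)."""
    h = (E1 - E0) / steps
    total = 0.0
    for k in range(steps):
        a = E0 + k * h
        total += 0.5 * h * (1.0 / tempo(a) + 1.0 / tempo(a + h))
    return total

def physical_time_rk4(tempo, E0, E1, steps=100):
    """Solve de/dE = 1/T(E), e(E0) = 0, with classical fixed-step RK4."""
    def f(E, e):
        return 1.0 / tempo(E)   # right-hand side; independent of e here
    h = (E1 - E0) / steps
    E, e = E0, 0.0
    for _ in range(steps):
        k1 = f(E, e)
        k2 = f(E + h / 2, e + h * k1 / 2)
        k3 = f(E + h / 2, e + h * k2 / 2)
        k4 = f(E + h, e + h * k3)
        e += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        E += h
    return e
```

For a single tempo curve both approaches agree closely; the ODE formulation becomes necessary once the right-hand side depends on several parameters at once, which is the situation described for the ScalarOperator.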
Figure 41.21: On this window, the user may define those lower and upper parameter limits of the total six-dimensional frame, where the subframe of the split daughter is cast for the SplitOperator.
41.5 PrimavistaRUBETTE®
It is hard if I cannot start some game on these lone heaths. William Hazlitt (1778-1830)
Summary. Several musical predicates from score notation are paratextually loaded. The PrimavistaRUBETTE® takes care of the paratextual signification for the most important predicates regarding dynamics, agogics, and articulation.
The PrimavistaRUBETTE® serves a different task insofar as it is neither analytic nor performance oriented. It deals with paratextual information as it is provided by verbal indications for dynamics, tempo, and articulation. Basically, it transforms verbal information into weights which may then be used to shape the symbolic data and the tempo before performance in the proper sense is shaped.
Figure 41.22: A number of preference windows for dynamics, articulation, and tempo allow us to define numerical values of paratextual predicates.

The input of this RUBETTE® is a local composition whose elements are events with verbal specification such as absolute dynamics (figure 41.22 right preference window), relative dynamics (figure 41.22 left middle preference window), articulation (figure 41.22 left upper preference window), and relative tempo (figure 41.22 left lower preference window). The functionality of this RUBETTE® is to transform these data into weights; this is performed in the window for primavista operations as shown in figure 41.23, and according to the numerical data that are defined in the above preference windows. These methods have been discussed in detail in section 39.2.
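As a rough illustration of how verbal marks may become a weight, the following Python sketch turns a list of absolute dynamics marks into a step function of symbolic onset. Everything in it is an assumption: the preference values in the table are invented placeholders (not the values of the preference windows), the default value and the function name are hypothetical, and the actual methods are those of section 39.2.

```python
from bisect import bisect_right

# Hypothetical preference table: verbal mark -> numerical value.
ABSOLUTE_DYNAMICS = {"pp": 36, "p": 50, "mf": 78, "f": 92, "ff": 106}

def dynamics_weight(marks, prefs=ABSOLUTE_DYNAMICS, default=64):
    """Build a weight E -> value from (onset, mark) pairs.

    The weight at onset E is the preference value of the latest mark
    at or before E, and `default` before the first mark.
    """
    ordered = sorted(marks)
    onsets = [onset for onset, _ in ordered]

    def weight(E):
        k = bisect_right(onsets, E)
        return default if k == 0 else prefs[ordered[k - 1][1]]

    return weight
```

A weight built this way could then be loaded like any analytical weight, for instance to shape loudness before the performance operators are applied.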
Figure 41.23: The main window of the PrimavistaRUBETTE® manages the transformation of verbal (paratextual) predicates into weights.
Chapter 42
Performance Experiments
Learning by Doing.
Summary. This chapter traces the analysis and synthesis processes which led to the historically first full-fledged RUBATO®-driven performance in July 1996 on the MIDI-Boesendorfer at the Staatliche Hochschule für Musik in Karlsruhe, as well as to the high-quality performance of contrapunctus III in Bach's Kunst der Fuge. We report the technical prerequisites, the analytical background generated by RUBATO®, and the step-by-step realization of the stemma and the overall parametrization.
42.1 A Preliminary Experiment: Robert Schumann's "Kuriose Geschichte"
It is not a story that is told here; rather, the impression that such a story evokes in a listener is characterized.
Thomas Koenig [267]
The first realistic experiment with RUBATO® took place in July 1996 on the MIDI-Boesendorfer grand piano at the Staatliche Hochschule für Musik in Karlsruhe. The experiment lasted three days and was led by the author, together with his assistant Oliver Zahorka and the musicologist Joachim Stange-Elbe. The experiment was executed on a NeXTStation. The results were digitally recorded on DAT and everything was logged. The technical support for the MIDI-Boesendorfer was offered by Sabine Schafer. In what follows, we want to give a very sketchy account of that experiment, however recalling the essentials of initial experiments in performance theory and the consequences thereof. The more elaborate discussion of such experiments is left to the following section 42.2.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
The preparatory work for this experiment was above all the analytical part, i.e., the metrical, melodic, and harmonic analyses on the respective Rubettes. We also experimented on the stemmatic and operator strategy, including the WeightWatcher mixtures as discussed in section 41.4. The selection of the MIDI-Boesendorfer was also a consequence of preliminary experiments on the MIDI-Yamaha grand insofar as this instrument turned out to respond in a much too coarse way to the input. In contrast, the Boesendorfer has a refined calibrating interface allowing a realistic MIDI input (with roughly a thousand dynamical values instead of 128 from MIDI), in particular for soft dynamics, where the Yamaha grand is completely inappropriate. The first day was used to adapt the different technical conditions, such as the calibration of the Boesendorfer, the weight ranges, and the dynamical limits. The second day was devoted to the evaluation of our preparatory material and strategy. The third day was devoted to the production of all stemmatic levels, as well as their recording up to the final output level. The most significant experiences were twofold: On the one hand, the judgment of the three experts concerning which weight mixture to use at which stemmatic knot, and the way to use it, with all the variables from range to inversion/non-inversion and deformation, was very precious! We learned that the effects of such strategies were almost constantly judged in three different ways-except for trivial failure or success situations. This means that performative coherence is a very personal affair, even when using very explicit techniques, analyses, and shaping tools. On the other hand, we had to face a very disagreeable side effect of such an extensive performance shaping: Let me call it the zoom of supersensitivity. 
The effect is this: After having listened to performances of a selected part of the piece, the ear begins to recognize a steadily refined differentiation in the different parameters, such as agogics, dynamics, and articulation. For example, if in the beginning phase, one only recognized a huge change of a weight influence on agogics, with progressing trials, one would believe that the slightest further change in weight influence could change the performance of tempo in an intolerable way. Maybe one was eventually incapable of really identifying the details of a performance, i.e., it is either simply too complex to be grasped by humans because the attention to different parameters is always selective, or it is not possible to memorize the samples and to compare them, or, and this is the worst case, our perception really changes so much from case to case that we cannot rely on the individual samples. The latter would mean that performance is greatly co-determined by the performance of the perceptive system. The result was nonetheless acceptable and strongly differs from the so-called dead-pan version, see [360, CD attachment], or this book's CD-ROM (cf. page xxx), where the performance and the dead-pan version can be inspected.
42.2 Full Experiment: J.S. Bach's "Kunst der Fuge"
The Art of Fugue probably appeared several months after Bach's death and cost four thalers. It found no buyers. (...) Disappointed, the son [Emanuel] sold the plates on which his father's last work had been etched for the value of the metal. Such was the fate of the Art of Fugue.
Albert Schweitzer [484]
This experiment is fully accounted for in [504]. Here, we give a concise presentation; the version of RUBATO® used in this experiment is the one compiled for OPENSTEP/Intel. The "contrapunctus III" in Bach's Kunst der Fuge has these characteristics: It is a four-voice composition, comprises 72 bars, has time signature 4/4, and tonality d-minor. The main theme of Kunst der Fuge is only used in its inversion and appears the first time in a rhythmically dotted and syncopated variant; the fugue starts with the theme in its comes shape and contains three complete developments (bars 1-19, 23-47, and 51-67).
42.3 Analysis
Summary. We give an account of the metrical and melodic analyses, whereas the harmonic analysis has not been done.

42.3.1 Metric Analysis
For the metric analysis of the "contrapunctus III", the calculations were made for each single voice, including the sum of the voice weights, and for the union of all voices. Please refer to the discussion of the MetroRUBETTE® in section 41.1 for what follows. The settings of the weight parameters are these: Metrical Profile is 2; Quantization is 1/16; Distributor Value is 1. Since the metrical profile of all voices should be viewed under the same valuation, the Distributor Value was set to a common neutral value; the value 2 for the Metrical Profile resulted from several trials of analyses and yields a well-ordered distribution of the weight profile. The value for Minimal Length of Local Meters was successively decremented starting from the length of the largest local meter, descending until value 2, where the smallest cells are caught in their signification for the metrical overall image. When considering single local meters, it is above all the onset time and the step size of the single time grids¹ which matter. Based upon the summation formula, this can however not be fully understood since from a certain superposition of periodical onset sequences, the observation of a single local meter becomes very difficult if not impossible (see also the consideration of the union of all onset times).

42.3.1.1 Single Voices
Like all other contrapuncti, "contrapunctus III" shows no regularities in the compositional structure, whether in the repetition of whole parts or of single bars. At least for the four bars of the theme, the onsets of themes in the developments of fugues do however create structural incisions, which only become relevant for the motivic analysis; the metric analysis remains unaffected. As to the analysis of single voices, it is interesting to observe that the longest local meters found relate nearly exclusively to relatively short parts, including only a few bars. These parts usually consist of uninterruptedly pulsating sequences of quavers or semiquavers, which in their regular sequence of notes differ significantly from the otherwise pronounced principle of tying stressed bar onsets to unstressed precursors. This principle, having its origin in the shaping of the main theme with its sequences of quavers that lead to the interluding sections, can be recognized as a rhythmical pattern in the interludes and the countersubjects of the theme. By tying this sequence of quavers to the preceding half note, the theme breaks the hitherto confirmed bar accent scheme, a peculiarity which will be further confirmed in the sequel of this contrapunctus. It even has effects on the theme onsets which, unusually for a 4/4 bar, are partly shifted by a quarter note (see also the theme variant which is extended by dots and quaver transitions in the bars 23, 29, and 35). When considering the single voices, one realizes that most of the longest grids (i.e., those with highest weights, according to our choice of system parameters) catch exclusively onsets on unstressed bar times in the middle voices. If the initial value lies on a stressed bar time, a dotted duration value as a step length for local meters also weights more onsets on unstressed times. Only the metrical weight of the bass voice behaves in the opposite way: here an increasing confirmation of the bar meter is observed, see figure 42.1.
¹ In this discussion, "grid" is synonymous with "local meter".
[Figure: four panels showing the metrical weight of Contrapunctus III for Soprano, Tenor, Alto, and Bass, each with its Min. Length of Local Meter (28 for the Bass).]
Figure 42.1: Contrapunctus III: metrical weights for the four voices.
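The role of the parameters Metrical Profile and Minimal Length of Local Meters can be illustrated with a minimal sketch. It assumes the usual reading of the summation formula, which is not restated in this section: a local meter is an arithmetic progression of onsets, it is maximal if no other local meter properly contains it, its length is its number of onsets minus one, and the weight of an onset is the sum of length raised to the Metrical Profile over all admitted maximal local meters through that onset. The function names and the toy onsets are hypothetical and are not part of the MetroRUBETTE®.

```python
from itertools import combinations


def maximal_local_meters(onsets):
    """Return the maximal local meters of an onset set as frozensets.

    Assumption: a local meter is an arithmetic progression of at least two
    onsets, and it is maximal if no other local meter properly contains it.
    """
    points = sorted(set(onsets))
    present = set(points)
    candidates = set()
    for first, second in combinations(points, 2):
        step = second - first
        if first - step in present:
            continue  # the progression starts earlier; it is found from there
        meter = [first]
        while meter[-1] + step in present:
            meter.append(meter[-1] + step)
        candidates.add(frozenset(meter))
    # Drop progressions that are properly contained in a larger one.
    return [m for m in candidates if not any(m < other for other in candidates)]


def metrical_weights(onsets, min_length=2, profile=2):
    """Weight of each onset: sum of length**profile over admitted meters.

    Assumption: the length of a meter is its number of onsets minus one.
    """
    meters = [m for m in maximal_local_meters(onsets) if len(m) - 1 >= min_length]
    return {
        x: sum((len(m) - 1) ** profile for m in meters if x in m)
        for x in sorted(set(onsets))
    }


# Hypothetical onsets in units of the 1/16 quantization, not taken from the score.
toy_onsets = [0, 1, 2, 3, 4, 6]
print(metrical_weights(toy_onsets, min_length=2, profile=2))
```

Lowering `min_length` admits shorter grids into the sum, which is the effect described above for the successive decrement of Minimal Length of Local Meters down to 2.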
42.3.1.2
Weight Sums of All Voices
When considering the weights of single voices, and in particular the sum of their weights, observe that the weights stemming from the longest local meters are the most important contributions. This means that the most important weights stem from the bass, followed by the alto, the tenor, and then the soprano voice. The latter has a weakened contribution since in the middle of the contrapunctus there is a pause of thirteen bars, which breaks the coherence of the soprano and makes it the metrically weakest voice; see figure 42.2 for the lengths of local meters for the four voices. Further, we observe that the bass voice prevails against the other voices in the confirmation of the bar meter since it creates high metrical weights by the longest metrical grids. Nearly all peaks of the metrical weight sum, except some cases in the middle part, coincide with stressed bar times. Generally speaking, the ambitus of the metrical weight profile increases towards the middle of the contrapunctus, an observation which is valid for all pieces of the "Kunst der Fuge" which we have analyzed, see also figure 42.3. The reason for the weak initial metrical profile lies in the formal construction of a fugue, where the voices do not appear simultaneously but one after another, such that the full voicing only appears after a number of bars. Another reason can be seen in the structure of the theme of the fugue, which does not develop its full motional impulse before the last five notes. These are used to shape countersubjects and interludes until the next thematic onset. The equally frequent decrease of the metrical profile towards the end of a piece is explained by different formal facts. In the case of the "contrapunctus III", this is due to long pauses of the single voices, in particular bass, soprano, and tenor, as well as the pedal point in the bass voice (bars 68-71).
[Figure: lengths of maximal local meters listed per voice (columns S, A, T, B), with values between 28 and 2.]
Figure 42.2: The lengths of maximal local meters for the four voices.
[Figure: Metro Weight View.]
Figure 42.3: The ambitus of the metrical weight profile increases towards the middle of the contrapunctus; minimal local meter length is 2.
42.3.1.3
Union of All Voices
Relating to the union of all onsets, the "contrapunctus III" shows at first sight dotted step lengths in metrical grids. However, by the two longest grids (Minimal Length of Local Meters being 502 and 271), starting at the final notes of the exposition of the theme in the tenor and at the end of the exposition of the theme in the alto, an uninterrupted pulse of quavers and quarter notes is established, and the only remaining grid (Minimal Length of Local Meters is 128) with step length a half note also falls on the non-stressed bar times of the 4th and 8th quaver. All other grids have step lengths of different dotted durations, however dominated by the doubly dotted half note, the said dotted quarter note, and the half note with tied quaver, as well as once the dotted half note. What is remarkable in this contrapunctus is the three 'double grids' of Minimal Length of Local Meters equal to 102, 74, and 73, which within the same bar are shifted with respect to each other by a quarter note, and by two quarter notes in the last case. Therefore, in the contrapunctus III, the tendency of 'metrical instability', which was already encountered in the single voices, persists, since by summing up the weight contributions of the grids with dotted step width, a metrical profile is established which breaks the bar meter as well as the pulse of uninterrupted quavers and quarters.
Length of metrical grid | Start of metrical grid (bar, note) | Step length | End of metrical grid (bar, note)
502 | 9, 4th Quarter | 1/8 | 72, 3rd Quarter
271 | 4, 4th Quarter | 1/4 | 72, 3rd Quarter
172 | 8, 1st Quarter | 3/8 | 72, 3rd Quarter
171 | 8, 2nd Quarter | 3/8 | 72, 4th Quarter
128 | 8, 4th Quarter | 1/2 | 72, 4th Quarter
108 | 4, 8th Quaver | 5/8 | 72, 4th Quarter
104 | 7, 3rd Quarter | 5/8 | 72, 3rd Quarter
102 | 8, 2nd Quarter | 3/8 | 72, 1st Quarter
102 | 8, 3rd Quarter | 3/8 | 72, 2nd Quarter
91 | 4, 1st Quarter | 3/4 | 72, 2nd Quarter
78 | 3, 4th Quarter | 7/8 | 72, 1st Quarter
75 | 6, 3rd Quarter | 7/8 | 72, 2nd Quarter
74 | 7, 3rd Quarter | 7/8 | 72, 2nd Quarter
74 | 7, 4th Quarter | 7/8 | 72, 3rd Quarter
73 | 8, 1st Quarter | 7/8 | 71, 8th Quaver
73 | 8, 3rd Quarter | 7/8 | 72, 4th Quarter
71 | 1, 1st Quarter | 4/4 | 72, 1st Quarter
Figure 42.4: Maximal local meters for the union of all four voices.
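The relation between start, step length, length, and end of a grid in figure 42.4 can be checked by simple arithmetic. The following sketch assumes that the length of a grid counts the steps between its first and last onset and that positions are measured in whole notes, so that one 4/4 bar has duration 1; the function name `grid_end` is hypothetical. Only rows whose entries are unambiguous in the figure are used, with the step of the grid of length 128 read as 1/2 (a half note, as stated in the text).

```python
from fractions import Fraction


def grid_end(start_bar, start_quarter, step, length):
    """Return (bar, quarter) of the last onset of a metrical grid in 4/4.

    Positions are measured in whole notes, so one 4/4 bar has duration 1.
    Assumption: 'length' counts the steps between the first and last onset.
    """
    start = Fraction(start_bar - 1) + Fraction(start_quarter - 1, 4)
    end = start + length * Fraction(step)
    bar, offset = divmod(end, 1)
    return int(bar) + 1, offset * 4 + 1  # the quarter may be fractional


# (length, start bar, start quarter, step length) as read from figure 42.4
rows = [
    (502, 9, 4, Fraction(1, 8)),
    (271, 4, 4, Fraction(1, 4)),
    (172, 8, 1, Fraction(3, 8)),
    (128, 8, 4, Fraction(1, 2)),
    (71, 1, 1, Fraction(1)),
]
for length, bar, quarter, step in rows:
    print(length, grid_end(bar, quarter, step, length))
```

Under these assumptions the five grids end in bar 72 on the 3rd, 3rd, 3rd, 4th, and 1st quarter respectively, in agreement with the last column of the figure.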
42.3.2
Motif Analysis
For the calculation of motivic weights each single voice of the "contrapunctus III" was analyzed separately. We refrained from a motivic analysis of the union of all voices since, given the contrapuntal structure of the single and autonomous voices within the polyphonic setting, a motivic setup across the voices seems rather unlikely. The settings for the motivic analysis were chosen as follows: Symmetry Group: counterpoint; Gestalt Paradigm: elastic; Neighborhood: 0.2. By the choice of the counterpoint symmetry group, the theme forms recta and inversa, as well as their (possibly appearing) retrogrades, were considered as being of equal weight. The neighborhood value was chosen on the basis of analytical experiments during the development period of RUBATO®. The elastic gestalt paradigm was preferred to the rigid and diastematic ones in order to obtain a more elastic point of view. As to the values for Motif Limits, compromises with the available computing power had to be made. By the choice of Span equal to 0.625 and Cardinality from 2 to 7, motives within a span of a half note plus a quaver were captured; this corresponds exactly to the duration of the theme where the transition of the virtual theme to the interludes must be recognized.
[Figure: motivic weight graphics for the Soprano, Tenor, Alto, and Bass voices, with the theme onsets numbered.]
Figure 42.5: Motivic weights for "contrapunctus III".
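The Motif Limits discussed above (Span 0.625, Cardinality from 2 to 7) bound a combinatorial enumeration, which is why compromises with computing power were needed. The following sketch only counts candidate motives; it does not compute motivic weights. It assumes that a motif within one voice can be identified with a set of onsets and that the span is the distance between its first and last onset, both of which are illustrative simplifications, and the function name `motives` is hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def motives(onsets, span=Fraction(5, 8), min_card=2, max_card=7):
    """Yield onset tuples of admitted cardinality lying within the span.

    Assumption: the span of a motif is the distance between its first and
    last onset, measured in whole notes (0.625 = half note plus quaver).
    """
    points = sorted(set(onsets))
    for i, first in enumerate(points):
        window = [p for p in points[i + 1:] if p - first <= span]
        for k in range(min_card - 1, max_card):
            for rest in combinations(window, k):
                yield (first, *rest)


# Hypothetical material: one 4/4 bar of uninterrupted quavers.
quavers = [Fraction(n, 8) for n in range(8)]
print(sum(1 for _ in motives(quavers)))
```

Even this single bar of quavers already yields more than a hundred candidate motives, which indicates how quickly the count grows with the span and the upper cardinality limit.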
With the results of the metrical analysis, some regularities in the microstructures can be read at first sight; herein we find in particular the onsets of the theme within a particular development. The representations of the single motivic weights in each voice with a mean value of 5 for the Cardinality may be elucidated in detail, see figure 42.5. Within these graphics, the theme onsets are numbered according to their temporal order; equal weights can be recognized on the onsets 1, 2, 3, 4, 8, 9, 12, and 5, 6, 7, as well as on 10 and 11. While further considering these weights, the overly long pauses in the soprano, tenor, and bass voices attract attention. Further, in the length proportion of the single weight representations, the succession of onsets of the single voices (tenor-alto-soprano-bass) is reflected. Moreover, a significantly lower motivic profile at the beginning and after the longer pauses of the respective weights can be observed; due to preceding pauses, this is the case for exposed thematic onsets. For the weight values, a neat exposition of the inverted gestalt of the original theme is observed, bearing nearly identical weights at the beginning of every motivic weight. Here even the differences between comes and dux forms are visible, since the weights of the tenor (first appearance) and soprano (third appearance) differ slightly, by the different initial interval of the theme (descending fourth in the comes, and descending fifth in the dux form), from the weights of the alto (second appearance) and bass (fourth appearance), see figure 42.6.
[Figure: motivic weight graphics for Tenor, Soprano, Alto, and Bass.]
Figure 42.6: Motivic weights for "contrapunctus III", bars 1-4.
Other clearly visible onsets of the theme in inverted shape are recognized after the long pauses in the soprano (eighth appearance), bass (ninth appearance), and tenor (twelfth appearance). Characteristically, the inverted shape always appears after pauses. At first sight these observations may seem to be tautological. However, if these weights are viewed with respect to their sense and purpose, their force to shape performance, then the transition from a quantitative to a qualitative information content becomes evident: the different onsets of themes can be shaped by these weights in one and the same way; if these weights are used, in inverted form, for the dynamic shaping, then the thematic onsets can be stressed with plasticity. When trying to evaluate motivic data in more detail, a problem appears that is inherent in RUBATO®'s intended purpose: since the platform was originally conceived for performance research, it is sufficient to present the analytical material for its performance shaping and to ascertain its reasonable construction. But these analyses also yield information about the musical structure, which in the case of metrical analysis can be traced without difficulty, given a certain extra effort outside RUBATO®. However, for the MeloRUBETTE®, any practical support beyond the brute calculation of weights, which is also sufficient for performance, is absent; the "introspection" of the details of weight calculation is hidden from the user. Thus, for the calculation of a motif's weight, it cannot be known which other motives are responsible for its presence and content. Although all motives are arranged in the motif browser in their temporal order, another structured introspection of motives, for example with regard to their weights, is not possible in this version of RUBATO®. This makes the introspection and evaluation of the 'heaviest' motives a difficult task. All analytical information is present, but it is hidden; a particular difficulty may be seen in the exuberant complexity with which a detailed motivic analysis, which extracts knowledge from the score in an immanent way, is loaded.
42.3.3
Omission of Harmonic Analysis
A harmonic analysis was omitted in this situation since, to our mind², the approach of Hugo Riemann which is implemented in the HarmoRUBETTE® is not really suited for Bach's harmonies. By use of the Riemann theory, which was developed from the Viennese classics, the specific harmonic structures of a contrapuntal maze, where harmony does not result from the progression of fundamental chords but from the linearly composed voices, can only be captured in an incomplete way.
42.4
Stemma Constructions
Summary. This section discusses the performance construction via the stemmatic paradigm, and using the analytical results investigated in section 42.3.
Before the stemmatic construction and thereby in particular the problem of performing "contrapunctus III" are discussed, some general remarks regarding the various performance strategies are necessary. In the course of the single performance parcours, two different approaches resulted which would turn the given analytical weights into expressivity: the target-driven and the experimental strategy. The target-driven strategy has its roots in the knowledge about existing performances; it is stamped by a preliminary experience of how the piece should sound and has been performed. With this procedure, the weights are used in a way which targets a predefined performance. One was oriented, just to name a pithy example, towards Glenn Gould's Bach interpretation; the corresponding weights were selected according to these targets to obtain particular effects. In this procedure, however, the intrinsic structural meaning of analytical weights was ignored! Stamped by the knowledge and the expectation of the existing performances, this strategy did not allow one to judge and categorize those performance constructions which did not suffice for the music-esthetic exigencies. The other approach, the experimental strategy, moves the analytical weight to the center in order to investigate how this weight could 'sound', and which analytical insight it could convey in the listening. With this procedure, which views the main performing agent entirely within the weight, one has to free oneself completely from horizons of expectation for any particular performance target. The working process on such performances, the acquisition of experience with the most different weights, and the playing with their effects taught us in the course of many experiments that this strategy would give rise to much more interesting performance aspects.
Here we also have the freedom to admit extremal positions which disclose more about the inherent musical structure and as 'daring ingredients' may evoke lively musical expression.
² Decision by Joachim Stange-Elbe.
Moreover, the experimental approach to single performance aspects, which starts from curiosity about the sonic realization of analytical weights, conveys a deeper insight into the score's musical structure. This path has its take-off in a "sonic analysis", or else in "the sonic analytical structure", and aims at a "musically reasonable performance". It is centered around the researcher's curiosity for a sounding and interpretational realization of analytical weights and for "the never heard", and it is paralleled by a liberation from expectational presets. Moreover, this strategy tries to apply as few weights as possible in order to couple the clearest possible analytical statements with the resulting performance.
[Figure: metrical weight graphics in original and inverted form for each voice: Soprano, Soprano inverted, Alto, Alto inverted, Tenor, Tenor inverted, Bass, Bass inverted.]
Figure 42.7: Metrical weights for "contrapunctus III".
42.4.1
Performance Setup
Summary. We discuss the detailed performance construction.
The performance of "contrapunctus III" took place in three parcours, out of which we only report the last two in more detail. The first one was entirely devoted to a target-driven strategy whereas the subsequent ones switched to an experimental strategy which yielded much more successful and conclusive results. Nonetheless, all these approaches contributed results that influenced the final result in a significant way.
Generally speaking, the procedure in all these parcours first focused on isolated single aspects of performance (articulation, dynamics, agogics), which were then put together for the final parcours. For the complete description of all these steps, see [504].
42.4.1.1
Results From First Performance Parcours
As a result of the first parcours, which was executed under the paradigm of a target-driven strategy, the usefulness of motivic weights for a determined shaping of dynamics was recognized. Using the inverted form of weights in the WeightWatcher for each voice, the thematic onsets for each development of the fugue could be modeled in an excellent way. At the same time, however, the global usage of these weights led to a completely unbalanced passage, so that this type of application was eliminated in the subsequent performance construction.
[Figure: stemma tree from the Prima Vista Score (Cpt-03), with node label "Mother".
Level 1 (PV-03-Agogik.stemma): Prima Vista Agogics: PV-03-Agogik.
Level 2: Division into Soprano/Alto and Tenor/Bass: 03-s-a, 03-t-b.
Level 3: Division into Soprano, Alto, Tenor, Bass: 03-s, 03-a, 03-t, 03-b.
Level 4 (03-Stimmen.stemma): Uniform loudness for all voices: 03-s-vel, 03-a-vel, 03-t-vel, 03-b-vel.
Level 5 (03-Art-1....stemma): Shaping of articulation: 03-s-Art-1, 03-a-Art-1, 03-t-Art-1, 03-b-Art-1.
Level 5 (03-Vel-1....stemma): Shaping of dynamics: 03-s-Vel-1, 03-a-Vel-1, 03-t-Vel-1, 03-b-Vel-1.]
Figure 42.8: Stemma construction for the second parcours.
42.4.1.2
Construction of Second Performance Parcours
In the second parcours, whose rather simple stemma (see figure 42.8) we are going to discuss hereafter, we changed to the experimental strategy, with the aim of investigating to which degree motivic weights can contribute to the elaboration of a determined performance aspect. Those motivic weights of a voice were used which cover motives of minimal cardinality (Cardinality: 2) within a half note plus a quaver (Span: 0.625); all these weights were applied in inverted form. As with the subsequent third performance parcours, the core shaping work by means of weights (grey shaded levels in figure 42.8) must be preceded by further steps. To begin with, this concerns the shaping of the score's prima vista components, such as dynamic, articulatory, and agogical prescriptions. In the case of contrapunctus III, which contains no performance prescription by the author, this task consisted only in the prima vista shaping of a short ritardando before the junction to the final tonic chord (Level 1). The next steps within the stemma (Levels 2 to 4) relate to the horizontal separation into the four voices (by the Split Operator), which were identified by specific (artificial!) different loudness values (Levels 2 and 3), and which in the fourth step are reset to a uniform loudness. Upon this basis, an attempt to shape articulation and dynamics was undertaken. In the course of six subsequent performance experiments it turned out that the global application of a single weight was capable of producing three slightly different variants of articulation shaping; however, one had to take care to change the parameter for the weights' influence (in the WeightWatcher) in a minimal and systematic way. Figure 42.9 shows the final choice.
Voice | High Norm | Low Norm | Influence / Deformation | Invert
Soprano | 1.6 | 0.2 | -0.75 | YES
Alto | 1.5 | 0.3 | -0.75 | YES
Tenor | 1.5 | 0.4 | -0.5 | YES
Bass | 1.4 | 0.6 | -1 | YES
Figure 42.9: Choice of WeightWatcher parameters for motivic weights.
Built upon these insights we tried to use the same motivic weights, the same systematic approach, and a similar handling of the change of intensities in order to produce comparable results for the shaping of dynamics. Without going into detail, it remains to be stated as a result that these attempts all failed. Typically, the dynamical relations between the voices turned out to be unbalanced, and although the dynamics was perfectly modeled within single voices, the dynamical profile of thematic parts was noticeably worse. To obtain conclusive dynamics, one had to find another shaping procedure.
42.4.1.3
Construction of Third Performance Parcours
Because of these different dynamical profiles, the principle of former performance experiments, the exclusive usage of a weight and its global extension, had to be given up. In a first step it was recommended to split the single voices at appropriate locations, and in a second step, a return to the metrical weights already used in the first parcours and their renewed application under other viewpoints (a mixed usage together with motivic weights) seemed reasonable. The shaping of articulation from the second parcours would be conserved. In a preliminary step, a division of the single voices had to be executed. To this end, one had to find structurally legitimate points from the musical context, such as articulation by harmonic incisions or thematic groupings for developments and interludes. The first division of all four voices took place in bar 39, legitimated by a harmonic close to the major parallel of the minor dominant (C major); at the same time this is viewed as a possible ending of the second (however incomplete) development and a beginning of a four-bar interlude.
[Figure: stemma tree from the Prima Vista Score (Cpt-03), with node label "Mother".
Level 1 (PV-03-Agogik.stemma): Prima Vista Agogics: PV-03-Agogik.
Level 2: Division into Soprano/Alto and Tenor/Bass: 03-s-a, 03-t-b.
Level 3: Division into Soprano, Alto, Tenor, Bass: 03-s, 03-a, 03-t, 03-b.
Level 4 (03-Stimmen.stemma): Uniform loudness for all voices: 03-s-vel, 03-a-vel, 03-t-vel, 03-b-vel.
Levels 5 and 6: 03-s-Agogik-1, 03-s-Agogik-2, and likewise for a, t, b.
Level 7: 03-s-I, 03-s-II, and likewise for a, t, b.
Levels 8 to 12: s-I, s-II, s-III, s-IV, and likewise for a, t, b.
Step labels for Levels 5 to 12: Shaping of agogics 1, Shaping of agogics 2, First division of single voices, Second division of single voices, Articulation shaping preparation, Shaping of dynamics I, Shaping of dynamics II, Shaping of dynamics III.]
Figure 42.10: The stemma of the third parcours.
In order to equilibrate the dynamical imbalances relating to the interludes from bars 19 and 46, a further division of the two halves of the fugue was necessary. A division of the first half was recommended in bar 19, which has the close of the first development (exposition of the fugue) and its half close on the dominant (A major). Because of the too strong dynamic sink of the three-voice interlude from bar 46/47, the division of the second half had to take place not later than at this point. This division was legitimized by the half close on the minor dominant (A minor) beginning in bar 46 on the one hand, and by the simultaneous ending of the second (then complete) development according to the three-part construction of the fugue on the other. For the subsequent performance shaping, consider figure 42.10. Besides the already known preparatory steps, horizontal division into single voices (Level 3) and equalizing of loudness (Level 4), two performance steps for the later shaping of global agogics were inserted (Levels 5 and 6). This trick is applied because agogics needs a long calculation time on the global level of single voices and should be calculated after the stemmatically subsequent shaping of articulation and dynamics. The vertical division of the single voices is applied in the previously described steps (Levels 7 and 8). For the subsequent shaping of articulation and dynamics, each voice had to receive its separate and individual performance shaping for the four sections. This enabled us to apply different parameter values for the intensity effects, one per used weight. For the shaping of articulation, the three already elaborated performance steps were inherited. As is seen in the stemma (figure 42.10), the shaping of dynamics was realized in three consecutive steps. Here, besides the known motivic weights, two additional metrical weights were applied. For the first step (Level 10), we applied the metrical weight from the union of all voices with Minimal Length of Local Meters equal to 2, in inverted form, and without deformation, see figure 42.11.
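The vertical division of each voice into four sections at bars 19, 39, and 46 can be sketched as a simple partition of notes by bar number. This is only an illustration of the bookkeeping, not of the Split Operator in RUBATO®: the note representation as `(bar, pitch)` pairs, the function names, and the convention that a division bar opens the following section are all assumptions made for the sketch.

```python
from bisect import bisect_right

VOICES = ("soprano", "alto", "tenor", "bass")
DIVISION_BARS = (19, 39, 46)  # division points named in the text


def section_of(bar, division_bars=DIVISION_BARS):
    """Return the 1-based section index of a bar.

    Assumption: a division bar itself belongs to the section it opens.
    """
    return bisect_right(division_bars, bar) + 1


def split_voice(notes, division_bars=DIVISION_BARS):
    """Group (bar, pitch) notes of one voice into consecutive sections."""
    sections = {i: [] for i in range(1, len(division_bars) + 2)}
    for note in notes:
        sections[section_of(note[0], division_bars)].append(note)
    return sections


# Hypothetical notes, not taken from the score.
toy_voice = [(1, "d"), (20, "a"), (40, "c"), (50, "f")]
parts = {voice: split_voice(toy_voice) for voice in VOICES}
print(len(parts) * len(parts["soprano"]))  # number of voice sections
```

With four voices and four sections each, sixteen separately shaped parts result, which matches the Part 1 to Part 4 entries per voice in the parameter tables of this section.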
[Figure: two Metro weight graphics labeled "Metro: 03-Sto2-p2", in original form (top) and in inverted form (bottom).]
Figure 42.11: Metrical weights in "contrapunctus III", union of all voices, Minimal Length of Local Meters equal to 2, in original form (top), and in inverted form (bottom).
Upon this stemma, the second step (Level 11) applied the metrical weights with value 5 for Minimal Length of Local Meters for each individual voice, in inverted form and also without deformation (the weight graphics were comparable to those from the second performance parcours described above). For the concluding shaping of dynamics, the already known motivic weights were applied
in order to give the thematic onsets a plastic relief. From the interplay of the various intensity values, we got the constellation documented in figure 42.12.
Level 10 (Influence 1, Deformation 0, Invert Y for all parts)
Voice | Weight | HighNorm (Parts 1 to 4) | LowNorm (Parts 1 to 4)
Soprano | Metro: St02-p2 | 1.1 1.1 1.1 1.1 | 0.6 0.6 0.6 0.8
Alto | Metro: St02-p2 | 1.1 1.1 1.1 1.1 | 0.6 0.6 0.6 0.8
Tenor | Metro: St02-p2 | 1.1 1.1 1.1 1.1 | 0.6 0.6 0.6 0.8
Bass | Metro: St02-p2 | 1.1 1.1 1.1 1.1 | 0.6 0.6 0.6 0.8
Level 11 (Influence 1, Deformation 0, Invert Y for all parts)
Voice | Weight | HighNorm (Parts 1 to 4) | LowNorm (Parts 1 to 4)
Soprano | Metro: S05-p2 | 1.0 1.0 1.0 1.0 | 0.7 0.7 0.7 0.8
Alto | Metro: A05-p2 | 1.0 1.0 1.0 1.0 | 0.7 0.7 0.7 0.8
Tenor | Metro: T05-p2 | 1.0 1.0 1.0 1.0 | 0.7 0.7 0.7 0.8
Bass | Metro: B05-p2 | 1.0 1.0 1.0 1.0 | 0.7 0.7 0.7 0.8
Level 12 (Invert Y for all parts)
Weights: Melo: s-625-2, Melo: a-625-2, Melo: t-625-2, Melo: b-625-2.
HighNorm and LowNorm rows as printed: 1.0 1.2 1.4 1.2 and 0.5 0.4 0.4 0.5; 1.0 1.2 1.05 1.2 and 0.7 0.7 0.7 0.7; 1.0 1.4 1.2 1.2 and 0.75 0.7 0.7 0.75; 1.3 1.7 1.3 and 0.7 0.8 0.8, with 1.3 and 0.7.
Influence and Deformation entries as printed: 0.25 0.25 0.25 0.25; 0.25 0.25 0.25 0.25; 1; 0.5 0.5 0.5 0.5; -0.25 -0.25 -0.25 -0.25.
Figure 42.12: Intensity values for the concluding shaping of dynamics.
The result of this performance communicates a relatively balanced dynamics, spread over the whole contrapunctus; the thematic onsets gain a profile, which can also be confirmed in the slight crescendo that leads to the beginning of the third development after the three-voiced interlude (from bar 46/47). Bringing together the dynamic and the already elaborated articulatory aspects, the result can be stated as a complementary shaping of both performance aspects, which on top of that reveals a musical sense in the elaboration of thematic onsets and the three-voiced passages of the interludes. For the shaping of agogics, the said Levels 5 and 6 of our stemma were reserved. We did two different subsequent performance parcours with two different metrical weights: the sum of all voice weights (Minimal Length of Local Meters: 2) and the weight of the voice union (Minimal Length of Local Meters: 91 (!)), see figure 42.13. The intensity values are shown in figure 42.14. As a result we may state that, according to the weight structure and a minimal intensity effect, a slight increase of the basic tempo towards the middle of the piece happens, but it is
balanced by a slightly slower tempo for the initial and final bars. For the resulting performance as traced on the audio track on the CD-ROM attached to this book, see page xxx. The corresponding stemmata are also found at this location on the CD-ROM.
[Figure: two Metro weight graphics, labeled "Metro: 03-Sto2-p2" (top) and "Metro: 03-SC91-p2" (bottom).]
Figure 42.13: "Contrapunctus III"; top: metrical weights, sum of all voice weights (Minimal Length of Local Meters: 2); bottom: weight of voice union (Minimal Length of Local Meters: 91).
42.4.1.4
Local Discussion
Starting from the experimental strategy, the performance of "contrapunctus III" was first centered around the shaping with a single weight in order to sound out the potential of a single weight. The shaping of articulation showed a reasonable local profile for every voice, i.e., a single analytical structure was capable of giving one performance aspect a reasonable expression. The global application of weights and the usage of a single weight showed their limits, as we have learned from the dynamical shaping of "contrapunctus III". For example, the global application of weights failed because of the different grades between the contributions of the four voices.
849
42.4. STEMMA CONSTRUCTIONS

Influence             High Norm   Low Norm   Deformation   Invert
Total Contrapunctus   1.025       0.975      0             NO
Total Contrapunctus   1.05        0.95       0             NO
Figure 42.14: Intensity values for the metrical weights.

Especially with the motivic weights of the tenor and bass voices, different weight profiles become visible which cannot be eliminated even by suitable deformations. These differing profiles of weights result from the compositional structure. As this structure splits into a number of parts (developments and interludes, groupings by harmonic closes and semi-closes), the division of the voices according to such compositional criteria is legitimized. Within these parts, the selected weights can be applied with different intensities and thus equalize the disparate shapings. Dynamics received a special significance in the shaping process: the elaboration of thematic onsets by an inverted motivic weight, which was justified by the structure of the theme. More precisely, the long durations of the initial notes resulted in a weak weight profile, which could be used to stress these incipits by the weight's inversion. This principle, which remains valid for almost all contrapunti, can however only be proved for the "Kunst der Fuge".
42.4.2
Instrumental Setup
Summary. The conditions and influence of the instrumental setup are discussed.
A discussion of adequate instrumentation must be in the forefront of virtual performance work, since the behavior of the respective sound generator is an essential basis for the shaping of the individual musical parameters. Since the performance results from RUBATO® are encoded in a MIDI file, we have the possibility to access a MIDI-driven acoustical piano or else to use a corresponding digital device. Here, we should observe some relevant differences which, besides the basic difference in sound, concern the repetition, the dynamic response, the resonance behavior, and the spatial environment, and which we cannot discuss here. These differences are not only present between acoustical and digital instruments; they also act within each category. For pianos and grand pianos, MIDI-driven access is offered by Bösendorfer and Yamaha models. At the time of our experiments, the comparison between these two brands could a priori not be made in a serious way. Even if the shaping of dynamics admits a limited bandwidth of variation within the 128 MIDI velocity values, the piano and pianissimo ranges (velocities between roughly 30 and 1) were much finer to tune for the Bösendorfer than for the Yamaha piano and grand piano, where these dynamical values make the keys move silently. This defect as well as the extreme differences in the repetition mechanics show the futility of a comparison between such Bösendorfer and Yamaha instruments and imply that the Bösendorfer is the only reference for a reasonable performance. In this sense, the Bösendorfer was already chosen for the first experiment with Schumann's "Kuriose Geschichte" as briefly reported in section 42.1.
CHAPTER 42. PERFORMANCE EXPERIMENTS
In principle, an acoustical piano should be chosen as a reference for computer-assisted performance, but because of the restricted availability of such instruments (footnote 3), a digital piano had to be selected in our case. Our experiences with digital pianos and their sounds showed significantly different behavior for identical dynamics and articulation when applying identical weights. And it must be remarked that at the present state of performance work, it is difficult to obtain exact information about the dynamics and articulation shaping in the sense of a possibly reliable performance grammar, since the judgment of the dynamical and articulatory aspects is a function of the available instruments; see [505] for a detailed presentation of sound experiments. Any conclusion regarding the sources and rationales for the performance shaping of a score is vitally influenced by these instrumental conditions. In other words, within a strictly scientific framework, the MIDI-encoded performance data enforce an instrumental selection. If research can be realized by means of one and the same MIDI piano, comparative statements can be made exactly for this instrument. In a somewhat broader sense, this is also true for digital maps of the acoustical piano, i.e., one has to restrict the research to one and the same digital piano, such as Kurzweil's Micro Piano as it was used in [504]. Any conclusion from the digital to the analog piano or vice versa is impossible; even among different digital or analog pianos no comparison is possible, see also [505]. One way out of this dilemma could be the application of the "physical modeling" or "virtual acoustics" technology (see appendix A.1.2.4), which enables direct access to the physics of an instrument through the virtualization (i.e., the software modeling) of its physical system. This methodology offers flexible instruments which can be deformed seamlessly and can respond without delay.
Although only first experiments have been initiated, this perspective opens an encompassing approach of computer-assisted performance research. The sound and resonance environment with its consequences for the shaping of performance aspects would not only yield new insights in the functioning of musical instruments, but also insights in the practice of instrumental playing. In this context, an interdisciplinary research team consisting of instrumentalists/interpreters, musicologists, computer scientists, mathematicians, and physicists would be required.
42.4.3
Global Discussion
Summary. We summarize the insight drawn from this second experiment.
In the course of the performance experiments, two different approaches and performance strategies crystallized. We tried to give the score's text an immanent shaping by means of two approaches:
• what is the sound of the analytical structure?
• can the sounding analytical structure yield a musically reasonable performance?
and two contrary performance strategies:
• the target-driven strategy,
• the experimental strategy.
In contrast to objective analytical approaches, for performance, subjective ingredients cannot be completely eliminated. They are present in their feedback with the performance result while weights and intensity parameters in the WeightWatcher are determined, but they play a fairly reduced role. From the first performance experiments, which have not been discussed in detail here, until the complete performance as described above, we passed milestones which demonstrated several problematic issues: It was not easy to eliminate the impression of an existing performance (in our case by Glenn Gould, say) and to stick strictly to what is written in the score; the performed version of the piece automatically resonates as a comparison while doing the performance work. This was the situation in which we started these experiments, with the ambitious task of approaching an artistic and esthetic performance as far as possible. Therefore, the target-driven strategy was to a certain degree determined by the comparison with traditional human performances. Under these conditions, weights were applied and results were judged. This turned the tradition into an obstruction: it positioned the expected performance in the foreground and the shaping weight in the background. Only the consistent questioning of the analytical structure and the systematic liberation from traditional performance expectations led to a performance strategy which positioned the analytical weights in the center of the investigation. This experimental strategy was characterized by a sounding realization of analytical structures that was as unbiased as possible, centered around the question of how a weight, when applied to a particular performance aspect, would sound.

Footnote 3: In Germany, the Bösendorfer MIDI grand exists only in two locations: at the conservatories in Lübeck and Karlsruhe. In both cases, access to this instrument is virtually impossible since it is located in rooms that are used for ordinary school activities.
Within this procedure it was possible to insert 'unheard' results and to admit deliberate exaggerations in the sense of the 'still more clear', whereas the question whether an interpreter would play in this way turned out to be completely irrelevant. From this point of departure, namely how a given analytical structure would sound, the experimental approach to shaping a musically reasonable performance was sought. This qualitative determination of what is a "musically reasonable" performance is inevitably a subjective one, which as such decided upon the subsequent steps towards the final performance. Similarly to the interpreter who puts up for discussion his provisionally final version while performing in concert (where in the last analysis it is more his personality than the musical performance which is judged), in the case of computer-assisted performance, the subject who works with the performance workstation RUBATO® presents his results as a provisionally final contribution to the ongoing discussion. When judging all these performances, one has to take into account that only metrical and motivic weights were applied and the effects of harmonic passages were not included in the shaping of performance (except for the motivations for the not machine-made subdivisions from global to more local applications of weights in the third parcours). Furthermore, a certain economy was observed in the choice of weights and their application. In this sense, we first had to check out which weights would entail what type of shaping consequences, and how the change of intensity parameters would influence the musical expressivity. It was only after this preliminary work that a systematic application of the weights and a partially purposeful work with their intensity parameters became possible.
The portability of the presently described performance technique must be deduced from the compositional structure (a fugue in general and the thematic structure of the "Kunst der Fuge" in particular) as well as from the instrumental context. In nuce it can be said that such systematic statements are still premature. Many more analyses and performances would be necessary, but these can only be realized as soon as RUBATO® has become a common tool of musicology. Then the question of whether general recipes which are valid beyond the limits of single compositions can be stated, or whether performance is rather bound to each individual composition, could be restated. Whatever is true for the transformation of the analytical structure in a scientific work targeting an artistically valid esthetic performance, one should not forget that emotional and gestural aspects have been eliminated and are nonetheless omnipresent. The realization of a sonification of analytical structures during the interaction with the computer always bears a degree of emotionality, a phenomenon that should be taken into account as a kind of "uncertainty relation". The judgment of the performance results took place along the same lines as the judgment of a human performance, and the work with RUBATO® was also proposed as a provisionally final contribution to the work's discussion. While describing the performance results, the emphasis on a scientific, analytical performance was central. The feedback to the analysis has a particular significance in that, possibly, the conclusive character of a performance could yield an analytical criterion. This implies an absolutely serious attitude towards analysis, and no closing off from emergent new aspects and innovative analytical ways of hearing. Therefore we refrain from a discussion of subjects such as "prejudices against results which are produced by a machine", or "performance and the soul of music versus soulless performance machines".
Instead, we favor representations of procedures and performance strategies, the exemplary demonstration of connections between analyzed structures, performed results, and the attempt at a generalization of these insights in the form of a performance grammar in its dependency on the instrumental conditions.
Part XI
Statistics of Analysis and Performance
Chapter 43
Analysis of Analysis

O sancta simplicitas!
Jan Hus (1370-1415)

Summary. Not unexpectedly, weight analysis turns out to be complex information that cannot always be handled intuitively. This suggests techniques that help analyzing weight analysis. The problem may be tackled by use of statistical methods. We expose the subject and Jan Beran's approach based upon hierarchical decompositions of weights, together with an application to comparison of analyses of Bach, Schumann, and Webern.
Although the MetroRUBETTE® seems to implement a very simple analysis, the metrical weights turn out to encode quite complex information. For performance applications such as RUBATO®, this may be acceptable, but for an analytical understanding per se, the weights are too complex to be used directly, except for direct visual inspection of evident surface properties. The prejudice that musical analysis should be a simple, intuitive affair, is therefore banned to the fairy tales of auto-incompetent humanities. In this chapter, we give an account of a statistical approach to understanding weights. More specifically, Jan Beran's method of hierarchical smoothing (see also [50]) is presented.
43.1
Hierarchical Decomposition
Summary. This section describes and motivates the hierarchical decomposition of weights by use of decreasing sequences of time bandwidths, the so-called hierarchical smoothing.
43.1.1
General Motivation
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002

Can additional structural insight into weights be gained by suitable analysis of the analytic weight curves? The idea of the following method is to find a "natural" decomposition of the weight functions in order to find hidden regularities. In time series terminology, the general problem can be stated as follows: Let $\{x_s(t_i),\ t_i \in \mathbb{R},\ s = 1, \dots, k,\ i = 1, \dots, n\}$ be a collection of $k$ time series, measured at the time points $t_i$. The aim is to find a decomposition

$$x_s(t_i) = \sum_{j=1}^{M} x_{j,s}(t_i)$$
such that the components $\{x_{j,s},\ s = 1, \dots, k\}$ reveal a maximal amount of 'regular structure'. One of the difficulties is to define what is meant by 'regular structures' and to define corresponding meaningful measures of the amount of 'regular structure'. Here, a pragmatic approach is taken, in that the amount of 'regular structure' is judged visually. Clearly, more formal definitions could be used. Before introducing the idea of hierarchical decomposition, a few general remarks should be made:

Remark 18 Traditionally, one of the main structures of interest for time series is periodicity. In particular, spectral decomposition based on sines and cosines may be used for this purpose (see e.g., [423], [69]). In our context, this is not applicable, because many compositions are likely to have much more interesting structures than just periodicities. In fact, some scores may not contain any nontrivial periodicities at all. More generally, the problem is that using the same basis of functions, irrespective of the structure of the score, results in focusing on a very limited number of predetermined features that may in fact not be present.

Remark 19 As a consequence, a nonparametric approach based on kernel smoothing will be proposed here. In a traditional setting, the bandwidth $b$ is chosen by minimizing a criterion such as the mean squared error as $n$ tends to infinity. In particular, $b$ tends to zero with increasing sample size. This concept is not directly applicable in our context. The main reasons can be summarized as follows:

1. Based on the definitions given above, the metric, melodic and harmonic aspects of a score are characterized respectively by one weight function only. In contrast, a composer is likely to have a hierarchical view. For instance, a piece has on one hand a global harmonic shape that makes the piece coherent as a whole, and on the other hand more local structures.
Some composers in fact consciously write a score using a hierarchical approach, first defining a global shape and then refining more and more local structures. Similarly, while rehearsing, a performer is likely to focus first on global features of the score and then successively refine more and more local features. This fact was also used in RUBATO® to design the process from a primavista performance to the refined artistic result [357]. Here, a genealogical tree, the stemma of the performance process, is responsible for successive refinement and localization of the performance. In order to obtain a better picture of the structure of a score it is therefore necessary to "extract" the hierarchy that is hidden in the weight functions. For smoothing, this means that there is not just one optimal bandwidth that is of interest. Instead, there is a hierarchy of relevant bandwidths $b_1 > b_2 > \dots > b_M$. Moreover, the structure of the score, rather than an omnibus statistical criterion (such as the mean squared error), is likely to yield the key information about which sets of bandwidths could be interesting.
2. The weight functions obtained from the analysis above are generally rather complex. In particular, the weights often jump abruptly up and down between very small and very large values (see figure 43.4). This can certainly not be carried over linearly to musical performance. For instance the tempo of a "musically acceptable" performance is unlikely to change up and down drastically and repeatedly within a few seconds. It is therefore reasonable to assume that a performance is not a linear function of the weights but rather a weighted sum of non-linearly deformed smoothed versions of these functions. Again, there may be a hierarchy of several bandwidths that need to be considered.
These general considerations motivate the idea of hierarchical smoothing and hierarchical decomposition described below.
43.1.2
Hierarchical Smoothing
Let $\{x_s(t_i),\ t_i \in \mathbb{R},\ i = 1, \dots, n,\ s = 1, \dots, k\}$ be a $k$-dimensional time series observed at time points $t_1, \dots, t_n$ and $K_b$ a smoothing kernel with bandwidth $b$ and support $[-b, b]$. Applying the smoothing operator

$$K_b x_s(t) = \sum_{i=1}^{n} K_b(t, t_i)\, x_s(t_i) \qquad (t \in \mathbb{R})$$

for a hierarchy of bandwidths $b_1 > \dots > b_M$, we obtain a hierarchy of $k$-dimensional curves

$$\{x_{j,s}(t) = K_{b_j} x_s(t),\ s = 1, \dots, k\}, \qquad j = 1, \dots, M.$$

Here, the Nadaraya-Watson kernel

$$K_b(t, t_i) = \frac{K\left(\frac{t - t_i}{b}\right)}{\sum_{j=1}^{n} K\left(\frac{t - t_j}{b}\right)}$$

with a triangular function $K(s) = 1\{|s| \le 1\} \cdot (1 - |s|)$ was used. For $b = 0$, we have $K_b x_s(t) = x_s(t)$.

Figures 43.1 and 43.2 display hierarchies of smoothed curves for Schumann's "Träumerei" and for Bach's "canon cancricans" (see also figure 8.7), resulting from the metric, melodic and harmonic weights. The figures illustrate that different bandwidths make different features more visible. In particular, for the metric weights, smoothing highlights places where high values occur more frequently. Also, some remarkable similarities between the metric, melodic and harmonic weights become apparent after smoothing.

Remark 20 The statistical technique of using smoothing kernels deserves a comment from the point of view of stemma and operator theory (section 39.8), a comment which strongly relates to the inverse performance theory to be exposed later in part XII. To begin with, taking into account neighboring values of the analyses by kernel smoothing has a musical meaning: The interpreter is rightly supposed to be conscious of what happened and will happen within the time bandwidth $b$. In inverse performance theory, the idea of kernel smoothing is introduced in the locally linear performance grammars, see section 39.8. There, the mutual influence of different local parts $C_i$ and $C_j$ of the composition $C$ for the performance shaping process is formalized by use of interaction matrices $(c_{i,j})$. The coefficient $c_{i,j}$ quantifies the influence of
$C_i$ on $C_j$. The general theory of locally linear performance grammars deals with the description of the de facto algebraic variety of interaction matrices inducing a fixed performance. From this point of view the triangular kernel smoothing means a selection of the a priori shape of interaction matrices, showing a peak around the diagonal.

[Figure 43.1 panels: "Träumerei: Kx for b = 0.1, 0.5, 1, 2, 4, 8" with metric, melodic, and harmonic weights; horizontal axis: onset time.]

Figure 43.1: Smoothed version of metric, melodic, and harmonic weights for Schumann's "Träumerei".
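The smoothing operator of section 43.1.2 can be illustrated with a minimal sketch. It assumes NumPy, evaluates the smoothed curve only at the observed onsets, and uses function names (`triangular_kernel`, `kernel_smooth`) and toy data that are hypothetical and not taken from RUBATO® or from the original study.

```python
import numpy as np


def triangular_kernel(s):
    """Triangular function K(s) = 1{|s| <= 1} * (1 - |s|)."""
    s = np.abs(np.asarray(s, dtype=float))
    return np.where(s <= 1.0, 1.0 - s, 0.0)


def kernel_smooth(t, x, b):
    """Return K_b x evaluated at the observed time points t_1, ..., t_n."""
    t = np.asarray(t, dtype=float)
    x = np.asarray(x, dtype=float)
    if b == 0:
        return x.copy()  # convention from the text: K_0 x = x
    # weights[i, j] = K((t_i - t_j) / b); the diagonal is K(0) = 1,
    # so every row sum is at least 1 and the division is safe.
    weights = triangular_kernel((t[:, None] - t[None, :]) / b)
    return weights @ x / weights.sum(axis=1)


# Toy data (hypothetical): three onsets with a single peak in the middle.
onsets = np.array([0.0, 1.0, 2.0])
weight = np.array([0.0, 3.0, 0.0])
hierarchy = [kernel_smooth(onsets, weight, b) for b in (2.0, 1.0, 0.0)]
```

In this toy case the bandwidth 2 spreads the central peak over its neighbors, while the bandwidths 1 and 0 leave the series unchanged, because no neighboring onset lies strictly inside the kernel's support.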
43.1.3
Hierarchical Decomposition
The approach of hierarchical smoothing suggests a decomposition of the weight function into components of varying smoothness. Thus, let $\{x_s(t_i),\ t_i \in \mathbb{R},\ s = 1, \dots, k,\ i = 1, \dots, n\}$ be a collection of $k$ time series. As discussed above, the aim is to find a decomposition $x_s(t_i) = \sum_{j=1}^{M} x_{j,s}(t_i)$ such that the components $\{x_{j,s},\ s = 1, \dots, k\}$ reveal a maximal amount of "regular structure". Structure can be, for instance: symmetry, repeated shapes/periodicities, relationship between different components etc. Note that with respect to cross-correlations, a number of methods are known in the literature for testing dependence between stationary time series
(see e.g., [181], [211], [230], also see [423] and references therein). A direct adaptation of these methods is not possible for the following reasons: 1) The series considered here are not stationary in a nontrivial way and can, in particular, not be reduced to white noise by applying a linear filter. 2) The time points are not equidistant. 3) The aim is not only to obtain high cross-correlations but also to highlight regular features of the individual series. 4) Not only cross-correlations between "residual" but between all components are interesting. 5) The musical context suggests that the decomposition should be hierarchical in the sense that, with increasing index $j$, $x_{j,s}$ should contain increasingly local features.

[Figure 43.2 panels: "canon cancricans: Kx for b = 0.25, 0.5, 1, 3, 9" with metric, melodic, and harmonic weights; horizontal axis: onset time.]

Figure 43.2: Smoothed version of metric, melodic, and harmonic weights for Bach's "canon cancricans".

We thus define the following decomposition:

1. Define a hierarchy of bandwidths $b_1 > b_2 > \dots > b_M = 0$, based on structural information from the score.

2. Define the smoothed function $x_{1,s} = K_{b_1} x_s$ and, for $1 < j \le M$,

$$x_{j,s} = K_{b_j}\Bigl(x_s - \sum_{l=1}^{j-1} x_{l,s}\Bigr).$$
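As a short check, the two steps above do yield an additive decomposition of $x_s$. The sketch below only uses the convention $K_0 x = x$ from section 43.1.2 and the recursion $x_{j,s} = K_{b_j}(x_s - \sum_{l<j} x_{l,s})$; it is a worked verification, not an additional result of the source.

```latex
% Assumption: K_0 acts as the identity on the observed time points (section 43.1.2).
\begin{align*}
x_{M,s} &= K_{b_M}\Bigl(x_s - \sum_{l=1}^{M-1} x_{l,s}\Bigr)
         = x_s - \sum_{l=1}^{M-1} x_{l,s}
         && \text{since } b_M = 0 \text{ and } K_0 x = x, \\
\sum_{j=1}^{M} x_{j,s} &= \sum_{l=1}^{M-1} x_{l,s} + x_{M,s} = x_s .
\end{align*}
```

The last component therefore collects whatever the smoother components have not yet explained, which is why the hierarchy must end with bandwidth zero.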
It should be noted that this decomposition is only one of many possible decompositions of $x_s$. The problem of choosing a meaningful decomposition of a time series is not new. In particular, in the context of regression analysis (see chapter 44), it is a special case of the general problem of defining meaningful explanatory variables in regression models. Here, subject-specific considerations provide important guidelines. From a pragmatic point of view, a chosen decomposition can be considered reasonable if the subsequent regression analysis leads to meaningful interpretable results. In our context, the above decomposition appears meaningful, since it decomposes $x_s$ in a simple additive way into components of decreasing smoothness. This translates, in a straightforward way, the generally accepted fact that a musical composition as well as a performance may be considered as a superposition of a hierarchy of local and global "shaping features", obtained by different degrees of "zooming in or out". For a given sequence of bandwidths $b_1 > b_2 > \dots > b_M = 0$, the first component $x_{1,s}$ represents the most global view of the score (or more specifically of the metric, harmonic or melodic structure, respectively), $x_{2,s}$ represents the next step of refinement by considering, in a more detailed fashion with a smaller bandwidth $b_2 < b_1$, the remaining information (obtained by subtracting the "global information" $x_{1,s}$), and so on.
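The hierarchical decomposition of this section can be sketched in a few lines. This is a minimal illustration under stated assumptions: NumPy, smoothing evaluated at the observed onsets only, and hypothetical names (`kernel_smooth`, `hierarchical_decomposition`) and synthetic data that do not come from the original analyses.

```python
import numpy as np


def kernel_smooth(t, x, b):
    """Nadaraya-Watson smoother with triangular kernel, evaluated at t."""
    t = np.asarray(t, dtype=float)
    x = np.asarray(x, dtype=float)
    if b == 0:
        return x.copy()  # K_0 x = x
    s = np.abs(t[:, None] - t[None, :]) / b
    weights = np.where(s <= 1.0, 1.0 - s, 0.0)
    return weights @ x / weights.sum(axis=1)


def hierarchical_decomposition(t, x, bandwidths):
    """Rows are x_1, ..., x_M with x_j = K_{b_j}(x - sum_{l<j} x_l)."""
    bandwidths = list(bandwidths)
    if any(b1 <= b2 for b1, b2 in zip(bandwidths, bandwidths[1:])):
        raise ValueError("bandwidths must be strictly decreasing")
    residual = np.asarray(x, dtype=float).copy()
    components = []
    for b in bandwidths:
        component = kernel_smooth(t, residual, b)
        components.append(component)
        residual = residual - component  # information not yet explained
    return np.vstack(components)


# Synthetic weight curve (hypothetical) on a grid of onsets.
onsets = np.arange(0.0, 32.0, 0.5)
weight = np.sin(onsets) + (onsets % 4 == 0)
parts = hierarchical_decomposition(onsets, weight, [4.0, 2.0, 1.0, 0.0])
```

Because the last bandwidth is zero, the rows of `parts` add up to the original curve, mirroring the additive decomposition described above.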
43.2
Comparing Analyses of Bach, Schumann, and Webern
Summary. The statistical method developed in the preceding section is applied to a comparative study of RUBATO-analyses of works by Johann Sebastian Bach, Robert Schumann, and Anton Webern.
Each of figures 43.3.a through 43.3.d displays the melodic (dotted, middle), metric (full, lower) and harmonic (dashed, upper) weights for Schumann's "Träumerei" op. 15/7 (Kinderszene No. 7), Webern's "Variationen für Klavier" op. 27/II, the "canon cancricans" from Bach's "Musikalisches Opfer" BWV 1079, and Schumann's "Kuriose Geschichte" op. 15/2 (Kinderszene No. 2). For onset times with more than one value of the melodic and harmonic weight respectively, the average of the values was taken. It is also interesting to look at scatterplots of the three types of weights against each other. The example for Bach's "canon cancricans" is displayed in figure 43.4. For each of the compositions, some simple regular features of the weights are visible:
• Träumerei: From the score it is clear that this composition may be divided into four parts $P_j$, $j = 1, 2, 3, 4$, corresponding to the onset intervals $I_1 = [0, 8]$ and $I_j = ((j - 1) \cdot 8,\ j \cdot 8]$, $j = 2, 3, 4$, respectively. It is also obvious that these four parts are similar to each other, and that $P_3$ differs most from the other parts. In fact, $P_2$ is, by definition, an exact replicate of $P_1$ (except for the slightly different up-beat). In figures 43.5.a through 43.5.c, the weights for the four parts are plotted on top of each other, i.e., onset time is taken modulo 8. The weights are indeed almost identical to each other. Interestingly, the fact that $P_3$ differs most from the other parts shows only for the melodic weights. Also, the scatter plots do not indicate any strong relationship between the three weight functions. The sample correlations are all in the range [-0.01, 0.09].
[Figure 43.3 panels: metric, melodic and harmonic weights for "Träumerei", Webern's op. 27/2, "canon cancricans", and "Kuriose Geschichte"; horizontal axis: onset time.]

Figure 43.3: Metric, melodic, and harmonic weights for Schumann's "Träumerei", Webern's "Variationen für Klavier", Bach's "canon cancricans", and Schumann's "Kuriose Geschichte".
• Variation op. 27/II: With respect to the melodic and harmonic weights and from the score, it is clear that the composition can again be divided into four parts $P_j$, $j = 1, 2, 3, 4$, corresponding to a division of the onset time into four intervals of equal length. Clearly, the first two parts are almost identical with respect to the melodic and the harmonic weights. The same is true for the last two parts. For the metric weights, however, $P_2$ is not a simple replicate of $P_1$. The same is true for the last two parts. Also, for $P_1$ and $P_2$, the maximal values of the metric weights are much higher than for $P_3$ and $P_4$. Again, no apparent relationship seems to exist between the three weights (not shown). However, the largest correlation (in absolute value) is much higher than in the previous example, namely -0.31 between metric and harmonic weights.

• Canon cancricans: As expected for a retrograde canon, there is an almost exact time symmetry with respect to the middle of the onset axis. The symmetry is not exact, because the retrograde is not just a reflection of onsets but rather a transvection in the
onset duration space parallel to the onset axis (see section 8.1.1, example 9). Also striking is the clustered nature of the weights and the apparently very regular high frequency oscillation of the metric curve. A high metric weight is almost always succeeded by a low weight and vice versa. Because of the clustered nature of the weights, scatter plots are not very useful in this case. The correlations between the weights are again very small, ranging between 0.03 and 0.04.

• Kuriose Geschichte: Here, the score is again divided into four parts corresponding to the onset intervals [0,6], (6,12], (12,21], (21,30], with $P_1$ equal to $P_2$ and $P_3$ equal to $P_4$. Again, it is difficult to tell how far the three different curves may be related to each other. Note however that the metric weights are much lower for onset times above 21. Thus, for the metric weights, the correspondence between $P_3$ and $P_4$ is much weaker. The reason is the breakdown of local meters at bar 21. Similarly to Webern, the strongest correlation between the weights is quite remarkable, namely -0.33 between melodic and harmonic weights.

[Figure 43.4: scatterplot matrix with panels "Bach metric weight", "Bach melodic weight", and "Bach harmonic weight".]

Figure 43.4: Scatterplots of analytical weights for Bach's "canon cancricans".
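The elementary computations used in the four comparisons above (averaging several weight values at one onset, assigning onsets to the parts $P_j$ with onset time taken modulo the part length, and sample correlations between weights) can be sketched as follows. This is an illustrative sketch assuming NumPy; the function names and the small arrays are hypothetical and are not the data of the study.

```python
import numpy as np


def average_by_onset(onsets, values):
    """Average all values that share the same onset time."""
    onsets = np.asarray(onsets, dtype=float)
    values = np.asarray(values, dtype=float)
    unique, inverse = np.unique(onsets, return_inverse=True)
    sums = np.bincount(inverse, weights=values)
    counts = np.bincount(inverse)
    return unique, sums / counts


def split_into_parts(onsets, period=8.0):
    """Part index j and position within the part, for I_1 = [0, period]
    and I_j = ((j - 1) * period, j * period] for j >= 2."""
    onsets = np.asarray(onsets, dtype=float)
    part = np.maximum(1, np.ceil(onsets / period)).astype(int)
    position = onsets - (part - 1) * period
    return part, position


def sample_correlation(a, b):
    """Pearson sample correlation between two weight series."""
    return float(np.corrcoef(a, b)[0, 1])


# Hypothetical toy values.
u, m = average_by_onset([0, 0, 1, 2, 2, 2], [1, 3, 5, 2, 4, 6])
part, position = split_into_parts([0.0, 8.0, 8.5, 16.0, 31.5])
r = sample_correlation([1.0, 2.0, 4.0], [3.0, 5.0, 9.0])
```

Plotting `position` against the weights, one curve per value of `part`, would correspond to overlaying the four parts as in figures 43.5.a through 43.5.c.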
[Figure 43.5 panels: "Träumerei": metric, melodic, and harmonic weights for onset in [0,8] (A), (8,16] (B), (16,24] (C), (24,32] (D); horizontal axis: onset time modulo 8.]

Figure 43.5: Analytical weights for Schumann's "Träumerei" against onset time modulo 8.

It is also interesting to compare the four compositions with each other. The weights of Bach's "canon cancricans" exhibit an extreme high frequency oscillation that is not observed for the other scores. Ignoring that onset times are not exactly equidistant, this can be seen for instance very clearly by comparing the sample autocorrelations of the metric weights (figure 43.6). Another property of interest is the marginal distribution of the weight functions. Eliminating global 'trends' by taking first differences $x(t_j) - x(t_{j-1})$, the histograms are given in figure 43.7 for the metric weights. For the compositions by Schumann and Bach, the first difference of the metric weights can essentially be classified into three clusters (low, medium, high). For Webern's score, the distribution is completely different and in fact rather close to a normal distribution. In contrast, the distributions of the differenced melodic weights are qualitatively similar for all four scores. For the harmonic weights, all distributions appear to be essentially symmetric. However, while for Schumann's "Träumerei" and the score by Webern there appear to be three clusters, the histograms for Bach and the "Kuriose Geschichte" are essentially unimodal.
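The two summary statistics mentioned in the paragraph above, first differences and sample autocorrelations, can be sketched as follows. The sketch assumes NumPy, treats the onsets as equidistant (as the text does for this comparison), and uses hypothetical function names and a synthetic alternating series rather than the actual metric weights.

```python
import numpy as np


def first_differences(x):
    """x(t_j) - x(t_{j-1}) for j = 2, ..., n."""
    return np.diff(np.asarray(x, dtype=float))


def sample_autocorrelation(x, max_lag):
    """Sample autocorrelations for lags 0, ..., max_lag."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    centered = x - x.mean()
    denominator = np.dot(centered, centered)
    return np.array([
        np.dot(centered[: n - lag], centered[lag:]) / denominator
        for lag in range(max_lag + 1)
    ])


# Synthetic series (hypothetical): a high value is always followed by a low one.
alternating = np.array([1.0, -1.0] * 10)
acf = sample_autocorrelation(alternating, 2)
diffs = first_differences([1.0, 4.0, 9.0])
```

For such a strictly alternating series the lag-one autocorrelation is strongly negative and the lag-two autocorrelation strongly positive, which is the kind of pattern one would expect in an autocorrelogram of a rapidly oscillating weight curve.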
[Figure 43.6 panels: "Series: Träumerei", "Series: Webern", "Series: Bach", "Series: Kinderszene2"; horizontal axis: lag.]

Figure 43.6: Autocorrelograms of metric weights for Schumann's "Träumerei", Webern's "Variationen für Klavier" op. 27/2, Bach's "canon cancricans", and Schumann's "Kuriose Geschichte".

In summary, a first look at the weight functions reveals certain elementary features of the score. In the following it will be demonstrated that a more thorough analysis leads to further new insights about the structure of the scores. In particular, note that the three weight functions were defined in a completely different way. It may therefore be expected that there is no strong relationship between the curves. The scatter plots of the weights seem to support this conjecture. But the following analysis will show that certain components of the weight functions are indeed closely related. Specifically, application to the four examples was carried out using $M = 4$. This choice was based on musicological considerations (time signature and bar grouping) as explained in the following. In this sense, the analysis here is exploratory, since no statistical selection criterion was used for choosing $M$. The following notation will be used here: $x_1 = x_{\mathrm{metric}}$ = metric weight, $x_2 = x_{\mathrm{melod}}$ = melodic weight, $x_3 = x_{\mathrm{hmean}}$ = harmonic (mean) weight, $x_{j,\mathrm{metric}} = x_{j,1}$, $x_{j,\mathrm{melod}} = x_{j,2}$, $x_{j,\mathrm{hmean}} = x_{j,3}$. The choice of the bandwidths was based on the time signature and bar grouping information. Example Schumann/Träumerei is written in 4/4 signature, the
43.2. COMPARING ANALYSES OF BACH, SCHUMANN, AND WEBERN TraQUllKlroi: hlslogrnm of flf'Sl diflQrgl1Cle of melric WGighls
·3
·2
WQbQrn'$ op.2712: histo<.Jam of first diflorona> 01 motricwQ~hts
.,
.,
Canon canaicans:: histcoram ot fIrst diff9nmcg of mgtric weights
., .,
865
Kuioslii Ge5Cticht9 : tisJogmm of Irst diffeJ'Wl('.::l; of
m~ric
WQighls
:.IUJI ·3
·2
-1
Figure 43.7: Histogram of first difference of metric weights for Schumann's "Triiumerei", Webern's "Variationen fUr Klavier", Bach's "canon cancricans" and Schumann's "Kuriose Geschichte" . grouping is 8 + 8 + 8 + 8. The chosen bandwidths are therefore 4 (4 bars), 2 (2 bars) and 1 (1 bar). Example Web ern is written in 2/4 signature, its formal grouping is 1 + 11 + 11 + 11 + 11; however, Web ern insists on a grouping in 2-bar portions [562]' suggesting the bandwidths of 5.5 (11 bars) , 1 (2 bars) and 0.5 (1 bar). Example Bach is written in 4/4 signature, the grouping is 9 + 9 + 9 + 9. The chosen bandwidths are 9 (9 bars), 3 (3 bars) and 1 (1 bar) . For example Schumann/Kuriose Geschichte, the time signature is 3/4, the grouping is 8 + 8 + 12 + 12. The chosen bandwidths are 3 (4 bars), 1.5 (2 bars) and 0.75 (1 bar). Figures 43.9 ("Triiumerei") and 43.8 ("canon cancricans") show remarkable regularities that have not been observed for the original weights (same for Webern and Schumann/Kuriose Geschichte, which we omit here). In particular, for all four compositions, much stronger similarities between the metric, melodic, and harmonic components can be observed than for the original weights, especially for j = 2,3. Moreover, for the first two scores, the same kind of relationship can be observed for j = 2,3, namely: positive correlation between Xj,melad and
CHAPTER 43. ANALYSIS OF ANALYSIS
866 x_{melric.1}. x_{hmean. 1} and x_{metod.1}
5
10
15
20
25
x_{melric.2}. {hmean.2} and x_{melod.2}
1- .-..
.-
30
..•.
1l..(1'-.71-
A
&
A.
5
i"""",:n
10
15
20
25
3Q
Or'lse(~
x_{melric.3}. x_{hmean.3} and x_{metod.3}
x_{melric.4}. x_{hmean .4} and x_{metod.4}
0_
.5
10
15
20
25
30
1-"-'.
,.-.-c.
_ _ • """"-41
5
10
15
20
25
30
onset time
Figure 43.8: Hierarchical components of metric (solid lines), melodic (dotted lines), and harmonic (dashed lines) weights for Schumann's "'Itaumerei", as defined in section 43.1.2: (a) b = 4; (b) b = 2; (c) b = 1; (d) remaining (residual) series. negative correlation between Xj,melod and Xj,metric, and negative correlation between and Xj,metric' Particularly surprising is the fact that Webern's score shows the same type of association as Schumann's "'Itaumerei". This leads to new insights into different approaches to composition. The weight functions are in fact very complex data and deserve a refined "analysis of analysis". Hierarchical smoothing is a possible approach to this problem. Webern's piece is written in a completely dodecaphonic way, and thus breaks with harmonic and homophonic tradition. This deserves a special methodological comment. The fact that we have nevertheless applied harmonic analysis could be viewed as being in contradiction to Webern's rupture with harmony. Now, we do not claim that this analysis corresponds to Webern's poietic position when composing his "Variationen". Nonetheles's, an objective analysis according to the Riemann approach is reasonable for two reasons: (1) Riemann intended to attribute tonality to any possible chord. The fact that he did not succeed in his goal is no reason for refraining from completion of his sketch. This is what the HarmoRUBETTE® is about: It Xj,hmean, Xj ,hmean
43.2. COMPARING ANALYSES OF BACH, SCHUMANN, AND WEBERN
867
Traeumerei: x_{1,.}
5
5
10
16
20
25
30
6
10
15
XI
26
onBettime
orsattime
Traeumerei: x_{3,.}
Traeumerei: x34,.}
10
Hi
20
onBattime
25
30
5
10
15
20
25
30
30
ol'llettime
Figure 43.9: Hierarchical components of metric (solid lines), melodic (dotted lines) ,and harmonic (dashed lines) weights for Bach's "canon cancricans", as defined in section 43.1.2: (a) b = 9; (b) b = 3; (c) b = 1; (d) remaining (residual) series. is a proposal to discuss possible completions of Riemann's theory. (2) Therefore it is also very interesting to discuss its application to apparently atonal compositions. Such an experiment is likely to yield a testbed for the universality of Riemann's approach. These considerations suggest that the following fact is not completely surprising, although it has not been established explicitly elsewhere in the literature: The correspondence between metric, melodic, and harmonic structure in Webern's "Variationen" is very similar to Schumann's "Triiumerei". It should be emphasized that this conclusion and in particular its quantitative demonstration is new in the musicological literature. Schumann's "Kuriose Geschichte" also shows a strong correspondence between the three curves for j = 1,2 and 3. But this time, the relations are different: For onset times below 12, we have the following:
1. For $j = 1$: $\mathrm{cor}(x_{1,\mathrm{metric}}, x_{1,\mathrm{melod}}) = 0.83$, $\mathrm{cor}(x_{1,\mathrm{metric}}, x_{1,\mathrm{hmean}}) = -0.71$, $\mathrm{cor}(x_{1,\mathrm{hmean}}, x_{1,\mathrm{melod}}) = -0.63$.
2. For $j = 2$, we have the following (rounded) correlation values: $\mathrm{cor}(x_{2,\mathrm{metric}}, x_{2,\mathrm{melod}}) = 0.00$, $\mathrm{cor}(x_{2,\mathrm{metric}}, x_{2,\mathrm{hmean}}) = -0.31$, $\mathrm{cor}(x_{2,\mathrm{hmean}}, x_{2,\mathrm{melod}}) = -0.82$.
3. For $j = 3$, we have $\mathrm{cor}(x_{3,\mathrm{metric}}, x_{3,\mathrm{melod}}) = -0.67$, $\mathrm{cor}(x_{3,\mathrm{metric}}, x_{3,\mathrm{hmean}}) = -0.20$, $\mathrm{cor}(x_{3,\mathrm{hmean}}, x_{3,\mathrm{melod}}) = -0.61$.
Observe in particular that, in contrast to the other scores, melodic and harmonic components are negatively correlated. After onset time 12, the correlations are:
1. For $j = 1$: $\mathrm{cor}(x_{1,\mathrm{metric}}, x_{1,\mathrm{melod}}) = 0.10$, $\mathrm{cor}(x_{1,\mathrm{metric}}, x_{1,\mathrm{hmean}}) = -0.38$, $\mathrm{cor}(x_{1,\mathrm{hmean}}, x_{1,\mathrm{melod}}) = -0.29$.
2. For $j = 2$: $\mathrm{cor}(x_{2,\mathrm{metric}}, x_{2,\mathrm{melod}}) = -0.47$, $\mathrm{cor}(x_{2,\mathrm{metric}}, x_{2,\mathrm{hmean}}) = -0.14$, $\mathrm{cor}(x_{2,\mathrm{hmean}}, x_{2,\mathrm{melod}}) = -0.11$.
3. For $j = 3$: $\mathrm{cor}(x_{3,\mathrm{metric}}, x_{3,\mathrm{melod}}) = -0.75$, $\mathrm{cor}(x_{3,\mathrm{metric}}, x_{3,\mathrm{hmean}}) = 0.58$, $\mathrm{cor}(x_{3,\mathrm{hmean}}, x_{3,\mathrm{melod}}) = -0.69$.
Finally, for Bach's composition, the only noticeable correlations occur between metric and harmonic weights, namely:
1. For $j = 1$: $\mathrm{cor}(x_{1,\mathrm{metric}}, x_{1,\mathrm{hmean}}) = 0.94$,
2. For $j = 2$: $\mathrm{cor}(x_{2,\mathrm{metric}}, x_{2,\mathrm{hmean}}) = 0.63$,
3. For $j = 3$: $\mathrm{cor}(x_{3,\mathrm{metric}}, x_{3,\mathrm{hmean}}) = 0.61$.
With respect to the shapes of $x_{j,\cdot}$ for $j = 2$ and 3, the two scores by Schumann and the one by Webern are clearly more similar to each other than to Bach's shapes. From the point of view of music history, this is quite plausible, since Webern's organic composition principle is more related to Schumann's rankly growing romanticism than to Bach's self-disciplined architectural setup (see also the following remarks).
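Correlations of the kind listed above, computed separately before and after a given onset time, can be sketched as follows. The data are hypothetical toy values, the function names are ours, and the treatment of the boundary onset (counted as "after") is an assumption, since the text only speaks of onset times "below 12" and "after onset time 12".

```python
import math


def pearson(u, v):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    suv = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    suu = sum((a - mu) ** 2 for a in u)
    svv = sum((b - mv) ** 2 for b in v)
    return suv / math.sqrt(suu * svv)


def windowed_correlation(onsets, u, v, split):
    """Correlation of u and v separately for onsets < split and >= split."""
    before = [(a, b) for t, a, b in zip(onsets, u, v) if t < split]
    after = [(a, b) for t, a, b in zip(onsets, u, v) if t >= split]
    cor_before = pearson(*zip(*before))
    cor_after = pearson(*zip(*after))
    return cor_before, cor_after


# Hypothetical toy data: the association flips sign at onset time 12.
onsets = [10.0, 10.5, 11.0, 11.5, 12.0, 12.5, 13.0, 13.5]
x_metric = [1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0]
x_melod = [2.0, 4.0, 6.0, 8.0, 8.0, 6.0, 4.0, 2.0]
cor_before, cor_after = windowed_correlation(onsets, x_metric, x_melod, split=12.0)
```

Splitting the series in this way makes a change of association visible that a single correlation over the whole piece would average out.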
Finally, note that the scatterplots in figure 43.4 show that Bach's harmonic weights are highly clustered and the smoothed curves in figures 43.8.a through 43.8.d are more 'edgy' than for the other compositions. In this sense, Bach's composition exhibits a high degree of organization. This confirms the general belief that the principle of architectural rather than processual construction plays a dominating role in Bach's music. Overall, we may conclude that hierarchical decomposition reveals interesting properties, in particular strong similarities between the metric, melodic and harmonic weights, that were not visible in the original series. The results are musically plausible in that the analysis of Bach's score turns out to be the most regular one and the analyses of Webern and Schumann appear to be closer to each other than to Bach's. The results are surprising in that (the analysis of) Webern turns out to be closer to (the analysis of) Schumann than expected. Also, the strong relationship between the three analytic curves could not be expected a priori, since the three weights were calculated using completely different aspects of the score and the scatter plots of the original curves showed almost no association. Based on the results, one may conjecture that appropriate matching of metric, melodic and harmonic structure plays an important role in music, independently of musical style. The tools introduced here provide the possibility of investigating which types of relationships may exist in which musical and historical contexts. An important task for future research will be to investigate such aspects for a larger variety of compositions.
Chapter 44
Differential Operators and Regression
Rejection by common sense, for whatever reason, proves nothing. Other fields of science are built on propositions that seem absurd but in fact are true.
Donald O. Hebb [213]
Summary. We give statistical evidence from 28 performances of Schumann's "Träumerei", as measured by Bruno Repp [438], that the rhythmic, motivic, and harmonic analyses provided by RUBATO® are shaping structures for the agogical streams. The statistical model is based on regression analysis and realizes shaping of agogics by a second degree linear differential operator as a function of analytical weights which are averaged over a natural grouping hierarchy (as described in chapter 43) of the score.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
At present, the best investigated aspect of performance theory (including appropriate software) is timing microstructure, i.e., agogics on the level of tempo curves and their hierarchies, see [272, 346] for further reading. This chapter deals with this topic: agogics as an expression of harmonic, melodic and rhythmic structures. Observe that we do not consider emotional or gestural rationales for agogics. This does not mean that these factors are negated. We merely restrict our investigation to the question whether and how strongly agogics could be explained by exclusive causal reference to structural analysis. Even in this neat reduction, the question is neither trivial nor even well defined, since musicology does not offer precise tools for rhythmical or melodic analysis, and even harmonic analysis is far from effective. Therefore, the question is only a scientific one if one specifies the analyses and their output data format. For the general setup for such an explicit and operationalized analysis framework, namely the RUBATO® analysis and performance workstation, we refer to part X.
The main concern is an empirical study regarding the basic question whether agogics (the tempo curve of the "Träumerei", also called "timing microstructure") may be expressed in mathematical terms by use of structural data obtained from a specific set of musicological analyses offered by RUBATO®'s modules. The experimental data are taken from Repp's timing measurements of 28 famous performances of Schumann's "Träumerei" [438]. Summarizing our results, we can state that
Result 2 There is strong statistical evidence for the timing microstructure in 28 famous performances of Schumann's "Träumerei" as being an expression solely of harmonic, melodic and rhythmic structures furnished by the RUBATO® analysis. In other words: These structural rationales are sufficient for explaining agogics, and emotional or gestural rationales can be disregarded for the present experimental material. In the framework of RUBATO®, the analytical output of any type of analysis is always a (smooth) weight, i.e., a differentiable function of one or several note event parameters, such as onset E, pitch H, loudness L, duration D, etc. With this specification, the above result can be restated in more mathematical but less intuitive terms:
Result 3 The tempo curve (timing microstructure) can be generated by an agogical operator $\Omega$ which is essentially a linear differential operator of second order as a function of harmonic, melodic and rhythmic smooth weights. Statistically speaking, this means that in our empirical context, the fiber (in the sense of Todd's approach) of the chosen performance transformation is not empty. Observe that the operator $\Omega$, which plays the role of the transformation $\Pi$ in Todd's theory [530] (see also section 36.3), does not rely on general encoding tempo curve functions. Intuitively, this means that our approach generates agogics from smooth weights as score-specific functions, and not from general, score-independent curve types, as proposed in [518, 532]. This does not contradict usage of general encoding tempo functions; it simply suggests that agogics is a superposition of more "primavista"-like tempo functions and of a strong and differentiated timing microstructure stemming from analytical data. Thirdly, the statistical results suggest
Result 4 Essential commonalities and diversities among tempo curves may be characterized by a relatively small number of analytical weight curves. There is in general no unique way of attributing features of the tempo to exactly one cause (harmonic, metric or melodic analysis). Results depend on which of the three analyses is given priority. However, there appear to be a certain number of canonical curves that are essentially independent of the priority. Overall, a large variety of musically meaningful results is obtained. This is in particular due to the fact that a score-specific basis of curves is used on which the tempo curves are projected. We thus may conclude
Result 5 The analytical curves obtained from (1) score-specific harmonic, melodic and rhythmic smooth weights and (2) a score-specific hierarchical decomposition of these weights, yield a natural score-specific linear basis in the space of tempo curves, for performances of the considered score. We should stress that all our results are intimately related to the concrete analyses which RUBATO® produces, together with the underlying theories. There is no unique analysis, and therefore, specification and numerical representation of musical analysis is not secondary and will in any case (!) influence the results. We should also remind the reader trained in natural sciences that musical analysis is not a neutral tool but pertains to the unavoidable "artifacts" of analysis in the humanities. There are no objective laws in human creations which subsist beyond interpretative interaction. This is a caveat to those who believe that performance can be more than a relation between what we understand (rationally, emotionally, gesturally) and how we express this understanding.
44.0.1
Analytical Data
Summary. This section describes the analytical weights used in this analysis.
We shall omit the notification of the used parameter lists Param which are related to the specific RUBETTE®, and of the predicate $P_{\text{Träumerei}}$ of Schumann's score, and abbreviate
$$\begin{aligned}
x_{\mathrm{metric}} &= x^{P_{\text{Träumerei}}}_{\mathrm{MetroRubette},\,\mathrm{Param}_{\mathrm{metric}}},\\
x_{\mathrm{melodic}} &= x^{P_{\text{Träumerei}}}_{\mathrm{MeloRubette},\,\mathrm{Param}_{\mathrm{melodic}}},\\
x_{\mathrm{hmax}} &= x^{P_{\text{Träumerei}}}_{\mathrm{HarmoRubette},\,\mathrm{Param}_{\mathrm{harmonic/max}}},\\
x_{\mathrm{hmean}} &= x^{P_{\text{Träumerei}}}_{\mathrm{HarmoRubette},\,\mathrm{Param}_{\mathrm{harmonic/mean}}}
\end{aligned}$$
to denote the four following weights used in our context. We are going to give their description when evaluated on onsets $E$ of note events occurring in $P_{\text{Träumerei}}$:
• $x^{P_{\text{Träumerei}}}_{\mathrm{MetroRubette},\,\mathrm{Param}_{\mathrm{metric}}}$: This is a metrical weight which measures the rhythmic relevance or "weight" of every onset of a note event in the composition along the lines of Riemann [453], Jackendoff-Lerdahl [243], and Mazzola [340]. A detailed description of the MetroRUBETTE® was given in section 41.1.
• $x^{P_{\text{Träumerei}}}_{\mathrm{MeloRubette},\,\mathrm{Param}_{\mathrm{melodic}}}$: This is a "boiled-down" melodic weight¹ which measures the sum of the melodic weights $w(Evt)$ of all note events $Evt$ at a given onset $E$. The calculation is extremely complex and time-consuming and goes back to theories of Reti [444], see chapter 22. See section 41.2 for a detailed description of the MeloRUBETTE®.
• $x^{P_{\text{Träumerei}}}_{\mathrm{HarmoRubette},\,\mathrm{Param}_{\mathrm{harmonic/max}}}$: This harmonic weight measures the harmonic relevance of a chord $ch$ which occurs at an onset $E$ in $P_{\text{Träumerei}}$. It is calculated by the same method as the fourth weight. The only difference is that this weight captures the harmonic relevance of the most important note in $ch$, whereas the fourth weight represents the average harmonic relevance among all notes of $ch$. See section 41.3 for a detailed description of the HarmoRUBETTE®.
• $x^{P_{\text{Träumerei}}}_{\mathrm{HarmoRubette},\,\mathrm{Param}_{\mathrm{harmonic/mean}}}$: As already mentioned, this weight is a variant of the third one, the only difference being an averaging instead of a maximizing procedure. We refer to the previous discussion for the basics.
¹ See formula (39.26) in section 39.3.
44.1
The Beran Operator
Summary. In this section, we define the conceptual setup for the following statistical analysis.
The general idea is that agogics is to be shaped by use of smoothed versions of the boiled-down weights, their first and second derivatives, and corresponding kernel-smoothed versions with respect to hierarchical (triangular) kernel functions.
44.1.1
The Concept
As we know, the general RUBATO® concept of shaping performance is built on smooth (actually $C^2$ in this context) weights $x$, where $d_E x$, $d_E^2 x$ denote the first and second derivatives with respect to symbolic time $E$. The kernel smoothing process relates to kernel functions $b(s) = \frac{1}{b} \cdot \chi\{|s| \le b\} \cdot (1 - |s|/b)$ with triangular, zero-symmetric support of extent $\pm b$, and characteristic function $\chi\{P\}$ for a predicate $P$. The linear smoothing operator $b \circ f$ on a function $f$ is defined by the convolution
$$b \circ f(E) = \int b(t - E) \cdot f(t)\, dt. \qquad (44.1)$$
It averages $f$ around $E$ with weighted center $E$ and bandwidth $b$. If this function is a weight, this means that the weight's analysis within the entire bandwidth neighborhood of a given onset is included instead of spiking the analysis to the singular onset. In the following process, this kernel smoothing process has been applied to a hierarchy of bandwidths, starting with $b = 4$ (= eight bars), then $b = 2$, then $b = 1$. The averaging process is taken to define successive remainder functions as follows:
$$f_1 = 4 \circ f,\quad f_2 = 2 \circ (f - f_1),\quad f_3 = 1 \circ (f - f_1 - f_2),\quad f_4 = f - f_1 - f_2 - f_3. \qquad (44.2)$$
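A discrete version of the smoothing (44.1) and of the successive remainders (44.2) can be sketched as follows. This is an illustration under assumptions that are ours, not the text's: the integral is replaced by a normalized weighted sum over the sampled onsets (which also copes with non-equidistant grids and with the ends of the series), the function names are hypothetical, and the toy series is invented.

```python
def triangular_kernel(s, b):
    """Kernel b(s) = (1/b) * (1 - |s|/b) for |s| <= b, and 0 otherwise."""
    return (1.0 - abs(s) / b) / b if abs(s) <= b else 0.0


def smooth(times, values, b):
    """Discrete analogue of (44.1): kernel-weighted average around each time.

    Assumption: the integral is replaced by a normalized weighted sum over
    the sampled onsets; the weight at the center is 1/b > 0, so the
    normalizing total is never zero.
    """
    result = []
    for e in times:
        weights = [triangular_kernel(t - e, b) for t in times]
        total = sum(weights)
        result.append(sum(w * v for w, v in zip(weights, values)) / total)
    return result


def hierarchical_decomposition(times, values, bandwidths=(4.0, 2.0, 1.0)):
    """Return the components f_1, ..., f_k and the residual, following (44.2)."""
    remainder = list(values)
    components = []
    for b in bandwidths:
        component = smooth(times, remainder, b)
        components.append(component)
        remainder = [r - c for r, c in zip(remainder, component)]
    components.append(remainder)  # residual series, f_4 for three bandwidths
    return components


# Hypothetical toy weight sampled on a grid of half units of symbolic time.
times = [i * 0.5 for i in range(16)]
values = [float(i % 4) for i in range(16)]
parts = hierarchical_decomposition(times, values)
reconstructed = [sum(column) for column in zip(*parts)]
```

By construction the four components add up to the original series, so nothing is lost by the decomposition; it only redistributes the weight over successively finer scales.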
This means that the decomposition
$$x = x_1 + x_2 + x_3 + x_4 \qquad (44.3)$$
for a smooth weight $x$ defines a "spectrum" of that weight with respect to successively refined neighborhoods of its ambit.
Remark 21 Musically speaking, as already observed before, this kernel smoothing process is completely natural. In fact, the kernel function alters the original time function $f(E)$ by a weighted integration of $f$-values in the kernel neighborhood of a given time $E$. This means that we now include the information about $f$ from the neighboring times to make an analytical judgment. The latter is a well-known and common consideration in musical performance: The interpreter looks at a full neighborhood of a time point to derive what has to be played at that point. Moreover, the repeated application of the kernel smoothing process with increasingly narrowed neighborhoods is understood as a successive refinement of local analysis: First, the interpreter makes a coarse analysis over eight bars ($b = 4$), then he/she looks for the remainder $f - f_1$ and goes on with refined actions, if necessary.
This procedure is applied to the metric, melodic and harmonic weights and to their first and second derivatives. This gives the following list of a total of 48 spectral analytical functions:
$$\begin{array}{llll}
x_{\mathrm{metric},1} & x_{\mathrm{metric},2} & x_{\mathrm{metric},3} & x_{\mathrm{metric},4}\\
d_E x_{\mathrm{metric},1} & d_E x_{\mathrm{metric},2} & d_E x_{\mathrm{metric},3} & d_E x_{\mathrm{metric},4}\\
d_E^2 x_{\mathrm{metric},1} & d_E^2 x_{\mathrm{metric},2} & d_E^2 x_{\mathrm{metric},3} & d_E^2 x_{\mathrm{metric},4}\\
x_{\mathrm{melodic},1} & x_{\mathrm{melodic},2} & x_{\mathrm{melodic},3} & x_{\mathrm{melodic},4}\\
d_E x_{\mathrm{melodic},1} & d_E x_{\mathrm{melodic},2} & d_E x_{\mathrm{melodic},3} & d_E x_{\mathrm{melodic},4}\\
d_E^2 x_{\mathrm{melodic},1} & d_E^2 x_{\mathrm{melodic},2} & d_E^2 x_{\mathrm{melodic},3} & d_E^2 x_{\mathrm{melodic},4}\\
x_{\mathrm{hmax},1} & x_{\mathrm{hmax},2} & x_{\mathrm{hmax},3} & x_{\mathrm{hmax},4}\\
d_E x_{\mathrm{hmax},1} & d_E x_{\mathrm{hmax},2} & d_E x_{\mathrm{hmax},3} & d_E x_{\mathrm{hmax},4}\\
d_E^2 x_{\mathrm{hmax},1} & d_E^2 x_{\mathrm{hmax},2} & d_E^2 x_{\mathrm{hmax},3} & d_E^2 x_{\mathrm{hmax},4}\\
x_{\mathrm{hmean},1} & x_{\mathrm{hmean},2} & x_{\mathrm{hmean},3} & x_{\mathrm{hmean},4}\\
d_E x_{\mathrm{hmean},1} & d_E x_{\mathrm{hmean},2} & d_E x_{\mathrm{hmean},3} & d_E x_{\mathrm{hmean},4}\\
d_E^2 x_{\mathrm{hmean},1} & d_E^2 x_{\mathrm{hmean},2} & d_E^2 x_{\mathrm{hmean},3} & d_E^2 x_{\mathrm{hmean},4}
\end{array}$$
For which musical reasons are these derivatives added to the analytical input data? The first derivatives measure the local change rate of analytical weights. Musically speaking, this is an expression of transitions from important to less important analytical weights (or vice versa), i.e., a transition from analytically meaningful points to less meaningful ones (or vice versa). This is crucial information to the interpreter: It means that he/she should change expressive shaping to communicate the ongoing structural drama. In the same vein, information about second derivatives is musically relevant since it lets the interpreter know that the ongoing structural drama is being inflected. Evidently, one could add higher derivatives, but we argue that an interpreter is already highly skilled if he/she can take care of all these functions, also because different analytical aspects from metrics to harmonics must be observed simultaneously. Besides these analytical input functions, we add three types of 'sight-reading' functions. They regard the following three instances: ritardandi, suspensions², and fermatas. It is clear that any text-sensitive performance should be aware of such information.
² Suspensions are notes which are tied by a slur while the harmony changes; we may attach to such events the time interval where the suspension does not start until the harmony change is terminated.
1. Ritardandi
The score shows four onset intervals $R_1, R_2, R_3, R_4$ for ritardandi, starting at onset times $E_0(R_j)$ ($j = 1, 2, 3, 4$) respectively. We define the four linear functions
$$x_{\mathrm{rit}_j}(E) = \chi(R_j) \cdot (E - E_0(R_j)),\quad j = 1, 2, 3, 4. \qquad (44.4)$$
2. Suspensions
The score shows four onset intervals $S_1, S_2, S_3, S_4$ for suspensions, starting at onset times $E_0(S_j)$ ($j = 1, 2, 3, 4$) respectively. We define the four linear functions
$$x_{\mathrm{sus}_j}(E) = \chi(S_j) \cdot (E - E_0(S_j)),\quad j = 1, 2, 3, 4. \qquad (44.5)$$
3. Fermatas
The score shows two onset intervals $F_1, F_2$ for fermatas. We define the two support functions
$$x_{\mathrm{ferm}_j}(E) = \chi(F_j),\quad j = 1, 2. \qquad (44.6)$$
Summarizing, we have a total of 58 = 48 + 4 + 4 + 2 onset functions of analytical and primavista types. Call $X$ the analytical vector of these 58 functions listed in a fixed order. The present approach is to define the tempo function at onset $E$ as being a linear function of these 58 variables. For a 'shaping' vector $w \in \mathbb{R}^{58}$, the shaping operator $\Omega_w$ for the tempo curve is defined by the canonical scalar product of $X$ with the shaping vector $w$,
$$\Omega_w = \langle X, w \rangle. \qquad (44.7)$$
This means that for every onset $E$, we have $\Omega_w(E) = \langle X(E), w \rangle$. Recapitulating the meaning of the analytical vector $X$, we are dealing with a second order differential operator which we call "Beran operator" since it was introduced by Jan Beran in [52]. On this basis, the central question of the following is whether tempo curves $T$ of the "Träumerei" as they appear in the context measured by Repp in [438] may be approximated via $\Omega_w$ by appropriate choice of the shaping vector $w$. The main result of this approach states that there is strong statistical evidence for the equation (44.8) for the given analytical vector $X$, a suitable shaping vector $w$, and a constant $C$. This means that the 58 coefficients of the shaping vector $w$ are random variables and that we prove a significant statistical correlation (in the mathematical form described by the Beran operator) between a certain subset of the analytical vector $X$ and tempo as it is measured for the 28 performances by Repp.
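The primavista functions and the scalar product that defines the shaping operator can be sketched in a few lines. The sketch is illustrative only: the onset intervals, the three-entry vector `X` (instead of 58 entries) and the coefficients `w` are hypothetical, the function names are ours, and the intervals are assumed to be closed.

```python
def indicator(interval):
    """Characteristic function chi(I) of a closed onset interval I = (start, end)."""
    start, end = interval
    return lambda e: 1.0 if start <= e <= end else 0.0


def linear_ramp(interval):
    """x(E) = chi(I)(E) * (E - E_0(I)), the form used in (44.5)."""
    chi = indicator(interval)
    start = interval[0]
    return lambda e: chi(e) * (e - start)


def shaping_operator(functions, w):
    """Return E -> <X(E), w> for the analytical vector X = functions."""
    if len(functions) != len(w):
        raise ValueError("one coefficient per onset function is required")
    return lambda e: sum(f(e) * c for f, c in zip(functions, w))


# Hypothetical miniature analytical vector with three entries instead of 58.
X = [linear_ramp((4.0, 6.0)), linear_ramp((10.0, 11.0)), indicator((8.0, 8.5))]
w = [-0.5, -0.25, -1.0]
omega = shaping_operator(X, w)
```

With negative coefficients the ramps model a tempo that decreases linearly from the start of the interval, and the indicator lowers the tempo by a constant amount; outside all intervals the operator contributes nothing.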
Thesis 7 One may therefore try to use the above formula (44.8) to define tempo as a function of analytical score data in the sense of a general performance grammar as described in chapter 37.
44.1.2
The Formalism
44.1.2.1
Tempo Information
In the following, a more detailed description of the tempo data used for the analysis is given.
• Onset times: The onset times are on a grid of 1/8th beats. Thus, for instance, grace notes are excluded. From this set of onset times, we consider only onset times where at least one note is actually played. This results in a set $T$ of $n = 212$ non-equidistant onset times $t_i$ ($i = 1, \ldots, n$) which are multiples of 1/8.
• Log-transformation: Instead of the original tempo $y$ we consider its natural logarithm $\ln y$. Intuitively this can be justified by the expectation that a performer may control the tempo in a relative rather than an absolute way. Also, the statistical results were more satisfactory on the logarithmic scale. In the following we refer to the logarithmic tempo as 'the tempo curve'.
• Standardization of individual curves: The data consist of tempo measurements (or tempo curves) for $m = 28$ performances. In the current analysis, the interest lies in investigating the shape of the tempo curves rather than the absolute tempo values. Therefore, each of the 28 tempo curves is standardized. More specifically, let $y^*(t_i, j)$ be the (natural) logarithm of the tempo of the $j$-th performance at onset time $t_i$ ($i = 1, \ldots, n$; $j = 1, \ldots, m$). Then the standardized tempo data are defined by
$$y(t_i, j) = \frac{y^*(t_i, j) - \bar{y}^*(j)}{s^*(j)}$$
where
$$\bar{y}^*(j) = n^{-1} \sum_{i=1}^{n} y^*(t_i, j)$$
and
$$s^*(j) = \Big[(n - 1)^{-1} \sum_{i=1}^{n} \big(y^*(t_i, j) - \bar{y}^*(j)\big)^2\Big]^{1/2}.$$
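The log-transformation and standardization of an individual tempo curve can be sketched as follows. The tempo values and the function name are hypothetical; the formulas are the ones given above, with the $(n - 1)$ denominator for the standard deviation.

```python
import math


def standardize_log_tempo(tempo):
    """Return (ln(tempo) - mean) / s with the (n - 1)-denominator s."""
    logs = [math.log(v) for v in tempo]
    n = len(logs)
    mean = sum(logs) / n
    s = math.sqrt(sum((v - mean) ** 2 for v in logs) / (n - 1))
    return [(v - mean) / s for v in logs]


# Hypothetical tempo values for two performances; "fast" is twice "slow".
performances = {
    "slow": [40.0, 50.0, 45.0, 30.0],
    "fast": [80.0, 100.0, 90.0, 60.0],
}
standardized = {name: standardize_log_tempo(y) for name, y in performances.items()}
```

Because a uniform change of tempo becomes an additive constant on the logarithmic scale, the two toy performances yield the same standardized curve, which illustrates why the procedure isolates the shape of a tempo curve from its absolute level.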
44.1.2.2
The Explanatory Variables
The following notation is used: Let $A$ be a $p \times q_1$ matrix and $B$ a $p \times q_2$ matrix, then $C = (A, B)$ denotes the $p \times (q_1 + q_2)$ matrix obtained by 'attaching' $B$ on the right-hand side of $A$. The following steps describe the definition of the matrix of explanatory variables in more detail. According to the concept in section 44.1.1, the score data (metric, harmonic and melodic weights, additional score information) are given in the form of a design matrix that is used subsequently in a regression analysis. The following definitions are used:
1. Derivatives. "Derivatives" are defined as finite differences divided by the difference of the onset times. Thus, for instance,
$$d_E x_{\mathrm{metric},j}(t_i) = \frac{x_{\mathrm{metric},j}(t_i) - x_{\mathrm{metric},j}(t_{i-1})}{t_i - t_{i-1}},$$
and analogously for the second derivative $d_E^2 x_{\mathrm{metric},j}$.
2. Hierarchical smoothing. Each of the weights and their first and second (discrete) derivatives are decomposed into four components of different smoothness as defined by equations (44.2) and (44.3).
3. Additional variables. Additional variables modeling ritardandi, suspensions and fermatas were defined by (44.4), (44.5), and (44.6). The aim is to model these musical events in a "minimal" way. For instance, the resulting linear model for a ritardando is only a crude approximation to a "true" ritardando. The reason for using only the simplest parametrization is that the main purpose here is to examine to what extent the metric, melodic, and harmonic weights alone, together with only absolutely necessary additional information from the score, contain enough information to "explain" the tempo of a performance.
4. Initial design matrix. Using the definitions above, we define for $j = 1, 2, 3, 4$ the $n \times 4$ matrices
$$\begin{aligned}
X_j(\mathrm{harmo}) &= (x_{\mathrm{hmean},j},\, x_{\mathrm{hmax},j},\, x_{\mathrm{metric},j},\, x_{\mathrm{melod},j}),\\
X_j(\mathrm{metric}) &= (x_{\mathrm{metric},j},\, x_{\mathrm{hmean},j},\, x_{\mathrm{hmax},j},\, x_{\mathrm{melod},j}),\\
X_j(\mathrm{melod}) &= (x_{\mathrm{melod},j},\, x_{\mathrm{hmean},j},\, x_{\mathrm{hmax},j},\, x_{\mathrm{metric},j}).
\end{aligned}$$
Clearly, there are more possibilities of permuting columns. Here, we consider only the representative permutations above. The first column of $X_j(\mathrm{harmo})$ is $x_{\mathrm{hmean},j}$ so that, due to the orthonormalization to be described in the following section, the main emphasis is put on the harmonic mean weights. Similarly, the metric and melodic emphases are chosen with $X_j(\mathrm{metric})$ and $X_j(\mathrm{melod})$. Furthermore, we define
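$$\begin{aligned}
d_E^2 X_j(\mathrm{harmo}) &= (d_E^2 x_{\mathrm{hmean},j},\, d_E^2 x_{\mathrm{hmax},j},\, d_E^2 x_{\mathrm{metric},j},\, d_E^2 x_{\mathrm{melod},j}),\\
d_E^2 X_j(\mathrm{metric}) &= (d_E^2 x_{\mathrm{metric},j},\, d_E^2 x_{\mathrm{hmean},j},\, d_E^2 x_{\mathrm{hmax},j},\, d_E^2 x_{\mathrm{melod},j}),\\
d_E^2 X_j(\mathrm{melod}) &= (d_E^2 x_{\mathrm{melod},j},\, d_E^2 x_{\mathrm{hmean},j},\, d_E^2 x_{\mathrm{hmax},j},\, d_E^2 x_{\mathrm{metric},j})
\end{aligned}$$
(and analogously the matrices $d_E X_j(\cdot)$ of first derivatives), and the $n \times 10$ matrix
$$X_{\mathrm{add}} = (X_{\mathrm{rit}},\, X_{\mathrm{sus}},\, X_{\mathrm{ferm}})$$
where $X_{\mathrm{rit}} = (x_{\mathrm{rit}_1}, x_{\mathrm{rit}_2}, x_{\mathrm{rit}_3}, x_{\mathrm{rit}_4})$, $X_{\mathrm{sus}} = (x_{\mathrm{sus}_1}, x_{\mathrm{sus}_2}, x_{\mathrm{sus}_3}, x_{\mathrm{sus}_4})$ and $X_{\mathrm{ferm}} = (x_{\mathrm{ferm}_1}, x_{\mathrm{ferm}_2})$.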
Finally define the $n \times p$ matrices (with $p = 58$)
$$\begin{aligned}
X(\mathrm{harmo}) = (&X_1(\mathrm{harmo}), X_2(\mathrm{harmo}), X_3(\mathrm{harmo}), X_4(\mathrm{harmo}),\\
&d_E X_1(\mathrm{harmo}), d_E X_2(\mathrm{harmo}), d_E X_3(\mathrm{harmo}), d_E X_4(\mathrm{harmo}),\\
&d_E^2 X_1(\mathrm{harmo}), d_E^2 X_2(\mathrm{harmo}), d_E^2 X_3(\mathrm{harmo}), d_E^2 X_4(\mathrm{harmo}), X_{\mathrm{add}}),\\
X(\mathrm{metric}) = (&X_1(\mathrm{metric}), X_2(\mathrm{metric}), X_3(\mathrm{metric}), X_4(\mathrm{metric}),\\
&d_E X_1(\mathrm{metric}), d_E X_2(\mathrm{metric}), d_E X_3(\mathrm{metric}), d_E X_4(\mathrm{metric}),\\
&d_E^2 X_1(\mathrm{metric}), d_E^2 X_2(\mathrm{metric}), d_E^2 X_3(\mathrm{metric}), d_E^2 X_4(\mathrm{metric}), X_{\mathrm{add}}),\\
X(\mathrm{melod}) = (&X_1(\mathrm{melod}), X_2(\mathrm{melod}), X_3(\mathrm{melod}), X_4(\mathrm{melod}),\\
&d_E X_1(\mathrm{melod}), d_E X_2(\mathrm{melod}), d_E X_3(\mathrm{melod}), d_E X_4(\mathrm{melod}),\\
&d_E^2 X_1(\mathrm{melod}), d_E^2 X_2(\mathrm{melod}), d_E^2 X_3(\mathrm{melod}), d_E^2 X_4(\mathrm{melod}), X_{\mathrm{add}}).
\end{aligned}$$
5. Orthonormalization. Each of the design matrices $X(\mathrm{metric})$, $X(\mathrm{harmo})$, and $X(\mathrm{melod})$ turned out to be singular, since the last column can be expressed as a linear combination of the previous ones. Hence, we omit the last column. For simplicity of notation, the new $n \times 57$ matrices will also be denoted by $X(\mathrm{metric})$, $X(\mathrm{harmo})$, $X(\mathrm{melod})$. Figure 44.1 shows that the corresponding columns of the three matrices are closely related, at least for $j = 1, 2, 3$. Intuitively this means that it is not possible to distinguish exactly whether certain characteristics of the tempo curve stem from the metric, the harmonic or the melodic analysis. Thus, results may depend on the sequence of orthogonalization. This sequence reflects whether, in our view, the harmonic, the metric or the melodic analysis has priority. Moreover, instead of focusing on names such as "metric weight", "first derivative of the metric weight", etc., we will also try to extract typical weight curves (canonical curves) that appear to be important for the tempo, independently of which of the three analytic approaches (metric, harmonic, melodic) has priority. More specifically, the three design matrices are defined in the following way: The columns of $X(\mathrm{harmo})$, $X(\mathrm{metric})$, and $X(\mathrm{melod})$ respectively are orthogonalized and standardized successively. We thus obtain three $n \times 57$ matrices which will be denoted by $Z(\mathrm{harmo})$, $Z(\mathrm{metric})$, and $Z(\mathrm{melod})$. Each of these matrices has orthonormal columns. The reason for computing three different matrices is that orthonormalization depends on the initial sequence of the columns. An artificial preference of the variables that are accidentally in the first (or first few) column(s) is avoided by carrying out three separate regression analyses with the respective matrices $Z(\mathrm{harmo})$, $Z(\mathrm{metric})$, and $Z(\mathrm{melod})$, and by comparing the common features of the three results.
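The dependence of the orthonormalized matrices on the column order can be illustrated with a small sketch. It is an assumption on our part that successive orthogonalization is carried out in the Gram-Schmidt manner (the text does not name an algorithm); the columns are hypothetical toy vectors, the function name is ours, and the absolute tolerance is chosen only for this toy scale.

```python
import math


def orthonormalize(columns, tol=1e-10):
    """Successively orthogonalize and scale columns (modified Gram-Schmidt).

    Columns that are numerically linear combinations of earlier ones are
    dropped, mirroring the omission of the redundant last column.
    """
    basis = []
    for col in columns:
        v = list(col)
        for q in basis:
            proj = sum(a * b for a, b in zip(v, q))
            v = [a - proj * b for a, b in zip(v, q)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > tol:
            basis.append([a / norm for a in v])
    return basis


# Hypothetical, strongly related toy columns standing in for weight curves.
x_hmean = [1.0, 2.0, 3.0, 4.0]
x_metric = [1.0, 2.0, 3.0, 5.0]
x_sum = [2.0, 4.0, 6.0, 9.0]  # x_hmean + x_metric, hence redundant

z_harmo = orthonormalize([x_hmean, x_metric, x_sum])   # harmonic emphasis
z_metric = orthonormalize([x_metric, x_hmean, x_sum])  # metric emphasis
```

Both orderings span the same space, but the first orthonormal column is proportional to whichever weight is listed first, which is why three separate regressions with three orderings are compared.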
[Figure 44.1, four panels: "x_{metric,1}, x_{hmean,1} and x_{melod,1}", "x_{metric,2}, x_{hmean,2} and x_{melod,2}", "x_{metric,3}, x_{hmean,3} and x_{melod,3}", "x_{metric,4}, x_{hmean,4} and x_{melod,4}"; horizontal axis: onset time.]
Figure 44.1: The four hierarchical levels of metrical, harmonic, and melodic analyses.
44.2
The Method of Regression Analysis
Summary. Inspired by visual comparison of logarithmic tempo curves, where strong similarities between the different performances become visible, we applied the following regression model.
44.2.1
The Full Model
Let Z be one of the three matrices Z(harmo), Z(metric), or Z(melod), respectively. The full (Le., biggest possible) model for the lhindividual tempo curve is
44.3. THE RESULTS OF REGRESSION ANALYSIS
881
where Zi is the ith row vector of Z, (3(j) = ({31(j), ... {357(j))t and E(ti,j) (ti E T) are (for each fixed j) identically distributed zero mean random variables. This means that we assume each performance to be essentially characterized by a 57-dimensional parameter vector (3(j). Under the present orthonormalized conditions, this vector corresponds to the shaping vector w introduced in section 44.1.1. Note that we do not assume the residuals Ei to be independent, since corrected p-values are used that take into account serial dependence. Also, due to standardization of y and of the columns of Z, there is no intercept in the model. The vector (3(j) is the parameter vector corresponding to the performance number j. Therefore, (3(j) is assumed to be a random vector, sampled from the space of all "possible" interpretations, with expected value E[{3(j)] = {3. We then may write
$$\beta(j) = \beta + \eta(j),$$

where $\eta(j)$ is a random vector with $E[\eta(j)] = 0$, and hence

$$y(t_i, j) = Z_i \beta + Z_i \eta(j) + \varepsilon(t_i, j).$$

Intuitively this means that, up to a small unexplained deviation $\varepsilon(t_i, j)$, the (logarithmic) tempo of the jth performance at onset time $t_i$ can be expressed as a "mean performance" $Z_i \beta$ plus an individual deviation from the mean that is equal to $Z_i \eta(j)$. For the mean tempo curve

$$\bar{y}(t_i) = m^{-1} \sum_{j=1}^{m} y(t_i, j),$$

we then have

$$\bar{y}(t_i) = Z_i \beta + \varepsilon(t_i),$$

where $\varepsilon(t_i)$ ($t_i \in T$) are identically distributed zero mean random variables.
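A small numerical sketch may help to see what the orthonormality of Z buys in this model: the least-squares estimate of each coefficient is simply the inner product of the corresponding column with the tempo curve, and the coefficients fitted to the mean curve are the averages of the individual coefficients. The matrix, the three coefficient vectors, and the residual below are hypothetical toy values (two explanatory variables instead of 57, three performances instead of 28).

```python
def fit_orthonormal(z_cols, y):
    """Least-squares coefficients for orthonormal columns: beta_hat_k = <z_k, y>."""
    return [sum(z * v for z, v in zip(col, y)) for col in z_cols]

# Two orthonormal toy columns observed at four onset times (assumption).
z_cols = [
    [0.5, -0.5, 0.5, -0.5],
    [0.5, 0.5, -0.5, -0.5],
]
# Hypothetical individual parameter vectors beta(j) for three performances.
betas = [[0.8, -0.2], [0.6, 0.1], [1.0, 0.4]]
# A toy residual that is orthogonal to both columns and sums to zero.
residual = [0.05, -0.05, -0.05, 0.05]

def tempo_curve(beta):
    """y(t_i, j) = Z_i beta(j) + eps(t_i, j) for the toy data above."""
    n = len(residual)
    return [
        sum(beta[k] * z_cols[k][i] for k in range(len(beta))) + residual[i]
        for i in range(n)
    ]

curves = [tempo_curve(beta) for beta in betas]
estimates = [fit_orthonormal(z_cols, y) for y in curves]

# Mean tempo curve and its fitted coefficients.
m = len(curves)
mean_curve = [sum(y[i] for y in curves) / m for i in range(len(residual))]
mean_estimate = fit_orthonormal(z_cols, mean_curve)

print(estimates)
print(mean_estimate)
```

Because fitting is linear in y, the estimate obtained from the mean curve coincides with the average of the individual estimates, which is the sense in which $Z_i \beta$ describes a "mean performance".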
44.2.2
Step Forward Selection
In the following, the main focus is on the individual curves. Some comments on the mean curve are also given. In order to decide which components of $\beta$ or $\beta(j)$ respectively are not zero (i.e., which explanatory variables contribute "significantly" to the tempo curve), stepwise forward selection [84] is carried out with F-to-enter level of significance 0.01. For the individual curves, a separate stepwise regression is carried out for each individual. The statistics software S-Plus [467] was used for the calculations.
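The selection step can be sketched as follows. This is a simplified illustration and not the S-Plus procedure used for the reported results: it assumes orthonormal columns and no intercept, and instead of converting the significance level 0.01 into a critical value it takes a hypothetical F-to-enter threshold `f_threshold` supplied by the caller. The toy columns and the value 12.0 are assumptions.

```python
import math

def unit(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]

def forward_select(z_cols, y, f_threshold):
    """Stepwise forward selection for orthonormal columns without intercept.

    At each step the candidate with the largest absolute coefficient (the
    largest drop in the residual sum of squares) is tested with a partial
    F statistic and entered only if it reaches f_threshold.
    """
    n = len(y)
    coefs = [sum(z * v for z, v in zip(col, y)) for col in z_cols]
    rss = sum(v * v for v in y)
    selected = []
    remaining = set(range(len(z_cols)))
    while remaining:
        k = max(remaining, key=lambda idx: abs(coefs[idx]))
        new_rss = rss - coefs[k] ** 2
        df = n - (len(selected) + 1)  # residual degrees of freedom after entry
        if df <= 0:
            break
        if new_rss <= 1e-12:
            f_stat = float("inf")  # the candidate explains all remaining variation
        else:
            f_stat = coefs[k] ** 2 / (new_rss / df)
        if f_stat < f_threshold:
            break
        selected.append(k)
        remaining.remove(k)
        rss = max(new_rss, 0.0)
    return selected, {k: coefs[k] for k in selected}

# Four orthonormal toy columns at eight onset times (assumption).
z_cols = [
    unit([1, 1, 1, 1, -1, -1, -1, -1]),
    unit([1, 1, -1, -1, 1, 1, -1, -1]),
    unit([1, -1, 1, -1, 1, -1, 1, -1]),
    unit([1, -1, -1, 1, 1, -1, -1, 1]),
]
noise = unit([1, 1, -1, -1, -1, -1, 1, 1])  # orthogonal to all four columns
y = [3.0 * a + 0.1 * c + 0.5 * e for a, c, e in zip(z_cols[0], z_cols[2], noise)]

selected, coefficients = forward_select(z_cols, y, f_threshold=12.0)
print(selected, coefficients)
```

In this toy case only the first column enters: its F statistic is far above the threshold, whereas the weak contribution of the third column is too small relative to the residual variation to pass.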
44.3
The Results of Regression Analysis
Summary. In the following discussion the main questions are: 1) Is there a relevant association between the analytical weights computed from the score and the observed tempo curves? 2) How complex is the relationship? 3) Are there commonalities and diversities; how can they be characterized?
44.3.1
Relations between Tempo and Analysis
Summary. The statistical analysis of the relation of tempo data and analytical weights via the Beran operator is carried out.
Figure 44.2: Mean tempo curve and LS-fit with orthonormalized Z matrices with harmonic, metric, and melodic emphasis. [Three panels: mean log-tempo curve and LS-fit with Z(harmo), Z(metric), and Z(melod), each with F-alpha = 0.01, plotted against onset time.]
To begin with, it is interesting to learn how much can be 'explained' at most by the analytical weights. Recall that in regression, $R^2$ denotes the proportion of the variability of y that is explained by the estimated regression function. Ideally, $R^2$ would be equal to 1.00, which would mean that the (log-)tempo curve can be expressed exactly as a linear function of the analytic information encoded by the design matrix Z. Such an exact correspondence between the analytic curves and each tempo curve can hardly be expected. The maximal achievable values of $R^2$, which are obtained by using the full matrix Z (i.e., without eliminating nonsignificant variables), are nevertheless quite high for the mean curve as well as for the individual curves. For the mean curve, $R^2$ is equal to 0.84. For the individual curves, we have $0.65 \le R^2 \le 0.85$. It is well known that $R^2$ can be increased by simply including a sufficiently large number of explanatory variables, even if these variables have nothing to do with y. It is therefore necessary to investigate which explanatory variables contribute significantly to the response y. To do this, we first applied stepwise forward selection with F-to-enter $\alpha = 0.01$. In all cases, all coefficients in the resulting model turned out to be significantly different from zero at the 5% level of significance, even after taking into account the possibility of serial correlations in the residuals. The values of $R^2$ for the mean curve and the individual curves respectively are still remarkably high: For the mean curve, $R^2$ is equal to 0.79 for Z(harmo), 0.79 for Z(metric) and 0.77 for Z(melod). Figure 44.2 shows that the fit (dotted line) to the mean tempo curve (full line) is very good, even if only significant coefficients are used. For the individual curves, we have $0.46 \le R^2 \le 0.79$ for Z(harmo), $0.48 \le R^2 \le 0.78$ for Z(metric) and $0.36 \le R^2 \le 0.77$ for Z(melod). The low value of 0.36 is obtained for Kubalek. Excluding this performance, we have 0.51 as the lower bound for Z(melod). Thus, the quality of the fit is also good in general, but varies individually. This is illustrated further by figures 44.3 to 44.6 for Z(melod). The results on commonalities and diversities given below in section 44.3.3 yield further evidence for the existence of a meaningful association between y and Z.
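The remark that $R^2$ can only grow when further explanatory variables are included is easy to verify in the orthonormal setting, where (for a standardized response and no intercept) $R^2$ is the sum of the squared coefficients divided by the sum of squares of y. The following sketch uses hypothetical toy data; the column named `extra` carries no analytical meaning and merely happens to align with the residual.

```python
import math

def unit(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(a * a for a in v))
    return [a / norm for a in v]

def r_squared(z_cols, y):
    """R^2 for orthonormal columns, zero-mean y, and no intercept."""
    explained = sum(sum(z * v for z, v in zip(col, y)) ** 2 for col in z_cols)
    total = sum(v * v for v in y)
    return explained / total

# Toy orthonormal columns at eight onset times (assumption).
signal = unit([1, 1, 1, 1, -1, -1, -1, -1])
weak = unit([1, -1, 1, -1, 1, -1, 1, -1])
extra = unit([1, 1, -1, -1, -1, -1, 1, 1])

# Toy response: strong signal, weak second component, residual along `extra`.
y = [3.0 * s + 0.1 * w + 0.5 * e for s, w, e in zip(signal, weak, extra)]

r2_steps = [
    r_squared([signal], y),
    r_squared([signal, weak], y),
    r_squared([signal, weak, extra], y),
]
print(r2_steps)
```

Each added column raises $R^2$, and a column that merely matches the residual drives it to 1. This is why the reported values for the full matrix Z are followed by a selection of significant variables.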
44.3.2
Complex Relationships
Summary. We discuss the complexity of the relationships between the individual results.
Even when using only significant coefficients, the estimated models are very complex. As an example, consider the performance by Brendel. With Z(melod), the $R^2$ is in this case equal to 0.76. No. 4.4 in figures 44.3-44.6 confirms the good fit. The following table A summarizes the result:
Table A: Coefficients of explanatory variables chosen by stepwise forward selection with F-to-enter = 0.01 and Z(melod), for the logarithmic tempo curve of Brendel. (The P-values given here do not take into account serial correlations.)
z-variable             est. coefficient   std. error   t-statistic   P-value
z_{melod,1}                 -0.3136         0.0353       -8.8760      0.0000
z_{hmean,1}                 -0.2737         0.0353       -7.7443      0.0000
z_{hmean,2}                  0.2781         0.0353        7.8713      0.0000
z_{hmax,2}                   0.2659         0.0353        7.5248      0.0000
z_{melod,3}                 -0.1258         0.0353       -3.5597      0.0005
z_{hmean,3}                  0.1303         0.0353        3.6882      0.0003
z_{hmax,3}                  -0.2800         0.0353       -7.9244      0.0000
z_{melod,4}                 -0.1562         0.0353       -4.4193      0.0000
d_E z_{melod,1}             -0.2663         0.0353       -7.5371      0.0000
d_E z_{melod,2}              0.1567         0.0353        4.4337      0.0000
d_E z_{hmean,2}              0.1143         0.0353        3.2356      0.0014
d_E z_{melod,3}              0.2938         0.0353        8.3143      0.0000
d_E z_{hmean,3}              0.1308         0.0353        3.7024      0.0003
d^2_E z_{metric,1}          -0.1361         0.0353       -3.8508      0.0002
z_{rit3}                    -0.1952         0.0353       -5.5249      0.0000
z_{rit4}                     0.1032         0.0353        2.9208      0.0039
z_{ferm1}                   -0.1425         0.0353       -4.0331      0.0001
The number of significant coefficients is very large. The model contains all four weight functions, first and second derivatives of various degrees of smoothness and also two ritardando variables and one suspension. Also note that all degrees of smoothness are used. Formally, even after adjustment for serial correlations, all p-values are below 0.05. (As a cautionary remark, it should be noted however that p-values obtained after model selection can be used as guidelines only.)
44.3.3
Commonalities and Diversities
Summary. In spite of the high complexity of the selected models for 28 individual tempo curves, there are interesting commonalities and diversities. They are characterized in this section.
Before going into the details of commonalities and diversities among the 28 given performances, we should make a remark on the performance selection and the performers as made available by Repp [438]. Above all, we should emphasize that Repp succeeds in a choice of first quality pianists, among others the celebrated "romantic virtuoso" Vladimir Horowitz, the "analytical mannerist" Alfred Brendel, or the "perfect but utterly cool" omnipresent Vladimir Ashkenazy, to name just three of them³. So from the point of view of performance culture, the selection is unprecedented and representative. But in this scientific context, we must refrain from further judgments. This discussion is not about journalistic criticism. However, we should encourage critics to review their understanding of performance by focusing on the question whether and to what degree the analytical structure of a score may be responsible for agogical expressivity. Our present answer to this question, incomplete as it must remain, may seem to place some of the artists in unexpected positions relative to each other. But this is not surprising since we do not claim that the overall judgment from common criticism really does represent the strictly analytical perspective of our approach. As mentioned in chapter 36, one should absolutely add emotional and gestural components to reach a complete description, an objective which was out of reach of this discussion.
³ For a complete list of all performers, see [438]. We shall only name selected performers who are relevant to this analysis.
Figure 44.3: Fit for Z(melod). [Panels 1-8: Argerich, Arrau, Ashkenazy, Brendel, Bunin, Capova, Cortot1, Cortot2; each plotted against onset time.]
Figure 44.4: (Cont.) [Panels 9-16: Cortot3, Curzon, Davies, Demus, Eschenbach, Gianoli, Horowitz1, Horowitz2; each plotted against onset time.]
In spite of the high complexity of the selected models for individual tempo curves, there turn out to be interesting commonalities and diversities that can be characterized by either of the matrices Z(harmo), Z(metric) and Z(melod) respectively. Recall that using Z(harmo) corresponds to an understanding of the score that puts first priority on the harmonic structure. Using Z(metric) corresponds to putting first priority on the metric structure. Using Z(melod) corresponds to putting first priority on the melodic structure. Therefore, depending on which of the three matrices is used, somewhat different results should be expected. The fundamental problem is the ambiguity of a performance. In general, based on one performance, it cannot be decided with certainty whether certain features of the tempo are 'due to' the harmonic, the metric or the melodic content. Nevertheless, the results below show a strong similarity between the three regression analyses. Thus, there appears to exist at least a core of tempo features that are unambiguously attributable to specific weight functions. A number of different aspects of commonality and diversity can be considered. Here, three possible aspects are described.
Figure 44.5: (Cont.) [Panels 17-24: Horowitz3, Katsaris, Klien, Krust, Kubalek, Moiseiwitsch, Ney, Novaes; each plotted against onset time.]
44.3.3.1
Signs of Coefficients
We ask the following question: For which $k = 1, \ldots, p$ do we have either $\hat{\beta}_k(j) \ge 0$ for all $1 \le j \le m$, or $\hat{\beta}_k(j) \le 0$ for all $1 \le j \le m$? In other words, which coefficients have the same sign for all performances? The result is quite amazing: For Z(metric), Z(harmo), and Z(melod), all except 3, 2, and 1 coefficients (out of 57) respectively have the same sign for all performances. Thus, the sign of the coefficients is a very strong commonality. The analytic curves 'act' in the same direction. In particular, the following general tendency can be observed:
• The tempo decreases as the original (not orthogonalized) harmonic weight increases.
• The tempo increases as the original (not orthogonalized) metric weight increases.
• The tempo decreases as the original (not orthogonalized) melodic weight increases.
Figure 44.6: (Cont.) [Panels 25-28: Ortiz, Schnabel, Shelley, Zak; each plotted against onset time.]
It should be noted however that these conclusions are valid under the assumption that all other variables are kept fixed. As we saw above, for the score considered here, the original weights are strongly correlated. This makes the actual relationship between weights and tempo much more complicated.
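The sign criterion of this subsection can be stated compactly in code. The sketch below is an illustration under assumptions: the coefficient table is a hypothetical toy example with three performances and four variables, and a coefficient of exactly zero (a variable that was not selected) is treated as compatible with either sign.

```python
def same_sign_for_all(betas):
    """For each variable k, report whether beta_k(j) >= 0 for all j or <= 0 for all j."""
    p = len(betas[0])
    flags = []
    for k in range(p):
        column = [beta[k] for beta in betas]
        flags.append(all(b >= 0 for b in column) or all(b <= 0 for b in column))
    return flags

# Hypothetical estimated coefficients, one row per performance (assumption).
betas = [
    [-0.31, 0.28, 0.00, 0.12],
    [-0.27, 0.19, -0.05, -0.08],
    [-0.40, 0.00, -0.11, 0.03],
]

flags = same_sign_for_all(betas)
exceptions = [k for k, ok in enumerate(flags) if not ok]
print(flags, exceptions)
```

In the toy table only the last variable changes sign between performances; in the analysis reported above, the analogous list of exceptions contains 3, 2, and 1 of the 57 variables for Z(metric), Z(harmo), and Z(melod) respectively.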
44.3.3.2
Frequency of Variable Inclusion
For $k = 1, \ldots, p$, let $n_k = \sum_{j=1}^{m} 1\{\hat{\beta}_k(j) \neq 0\}$ be the number of performances for which the explanatory variable number k was included in the model. Figures 44.7-44.9 show, for Z(harmo), Z(metric), and Z(melod) respectively, the curves of variables that were chosen at least 24 times (out of 28). The curves are multiplied by the sign of the coefficient. At least two types of curves are common to practically all performances, independently of the matrix that is used: 1) very smooth 'global' curves, such as z_{melod,1}, that shape the overall tendency of the tempo; 2) almost periodic curves, with a period of about four measures, corresponding to the approximate periodicity of the harmonic curve z_{hmean,2}.
Comment on Z(harmo). Note that z_{hmean,1} is identical with x_{hmean,1}, see figure 44.1a. Moreover, z_{hmean,2} is almost the same as x_{hmean,2} in figure 44.1b. Also, d_E z_{hmean,3} exhibits features that are very similar to x_{hmean,3}, see figure 44.1c. Thus, analytical weights obtained by local averaging without orthogonalization have a direct impact on the performance. In fact, by the above, the orthonormal curves selected as most relevant by the regression turn out to be closely related to the original curves.
Figure 44.7: Frequently selected curves for Z(harmo). [Six panels against onset time: z_{hmean,1} chosen 28 times, z_{metric,1} chosen 28 times, z_{melod,1} chosen 28 times, d_E z_{hmean,3} chosen 27 times, z_{hmean,2} chosen 25 times, d_E z_{metric,1} chosen 24 times.]
Comment on Z(metric). Here, resemblance of curves is similar to Z(harmo). Namely: z_{hmean,1} is almost the same as x_{hmean,1}, z_{metric,1} and x_{metric,1} are identical, and z_{hmean,2} is very similar to x_{hmean,2}. Also z_{melod,1} is almost the same as z_{melod,1} in Z(harmo), compare figures 44.7 and 44.8. Again, we conclude that the original averaged weight curves influence the performance directly.
Comment on Z(melod). Similar comments as for Z(harmo) and Z(metric) apply for Z(melod). Here, z_{melod,1} is the mirror image of x_{melod,1}. Further, z_{hmean,1} and z_{hmean,2} are very similar to x_{hmean,1} and x_{hmean,2}, respectively.
Comment on non-linear deformations. The melodic weights seem to play the most prominent role. Independently of the emphasis, z_{melod,1} is chosen for all 28 performances. For the melodic emphasis, z_{melod,1} is obviously identical with x_{melod,1}. For the harmonic and metrical emphasis, the corresponding z_{melod,1}-curves turn out to be non-linear deformations of x_{melod,1}. The method of non-linear deformations of analytical weights as arguments of refined shaping of performance is also implemented in the PerformanceRUBETTE® of RUBATO®.
Figure 44.8: Frequently selected curves for Z(metric). [Five panels against onset time: z_{hmean,1} chosen 28 times, z_{melod,1} chosen 28 times, d_E z_{metric,3} chosen 28 times, z_{metric,1} chosen 26 times, z_{hmean,2} chosen 24 times.]
In summary, we conclude that for the score of "Träumerei", there is a small number of "canonical" analytical weight curves that are relevant for most performances and essentially do not depend on the analytical emphasis.
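The inclusion counts $n_k$ used throughout this subsection amount to counting non-zero coefficients per variable. A minimal sketch, assuming that a variable not chosen by the stepwise selection is stored with coefficient exactly 0; the toy table (four performances, three variables) and the threshold of 3 are hypothetical stand-ins for the 28 performances and the threshold of 24.

```python
def inclusion_counts(betas):
    """n_k = number of performances whose selected model contains variable k."""
    p = len(betas[0])
    return [sum(1 for beta in betas if beta[k] != 0) for k in range(p)]

def frequently_chosen(betas, min_count):
    """Indices of the variables chosen at least min_count times."""
    return [k for k, n_k in enumerate(inclusion_counts(betas)) if n_k >= min_count]

# Hypothetical coefficients after selection; 0.0 marks "not selected" (assumption).
betas = [
    [-0.31, 0.28, 0.00],
    [-0.27, 0.00, 0.00],
    [-0.40, 0.21, 0.09],
    [-0.35, 0.17, 0.00],
]

counts = inclusion_counts(betas)
common = frequently_chosen(betas, min_count=3)
print(counts, common)
```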
44.3.3.3
Largest Coefficients
Figure 44.9: Frequently selected curves for Z(melod). [Four panels against onset time: z_{melod,1} chosen 28 times, d_E z_{melod,3} chosen 28 times, z_{hmean,2} chosen 27 times, z_{hmean,1} chosen 25 times.]
Since the design matrix Z is orthonormal, the importance of the kth explanatory variable may be assessed by the absolute value of the corresponding kth estimated coefficient (ranked in comparison to the other coefficients). (Also note that, due to orthonormality, all estimated slope components are uncorrelated and their standard deviations are the same.) For fixed j, let $r_k(j)$ be the rank of $|\hat{\beta}_k(j)|$ among all coefficients $|\hat{\beta}_s(j)|$ ($s = 1, \ldots, p$). Furthermore, for $1 \le l \le p$, let $f_k(l) = \sum_{j=1}^{m} 1\{r_k(j) > p - l\}$. Thus, $f_k(l)$ is the number of performances for which $|\hat{\beta}_k(j)|$ is at least the lth largest. Consider first $l = 1$. Thus, $f_k(1)$ is the number of performances for which the kth variable is most important. It turns out that $f_k(1)$ is not zero only for a very small set of variables. The results for the three matrices are:
Z(harmo): Figure 44.10 displays the four curves for which $f_k(1) \neq 0$, i.e., variables that are most important for at least one performance. Table B shows the clusters of columns H1 to H4 of performances for which the corresponding variables z_{hmean,2} (~ H1), z_{hmean,1}, z_{metric,1}, and z_{melod,4} are the most important ones. Thus, using Z(harmo) and therefore an approach that gives priority to the harmonic structure, the first cluster of performances has the 4-measures periodicity of the harmonic structure as the dominating feature. In particular, all Cortot performances are included. For the second cluster, which includes in particular Horowitz2 and Horowitz3, the more global shaping curve z_{hmean,1} is most prominent. For Bunin and Gianoli, a global curve with a peak around the 15th measure is most important. Finally, for the first performance by Horowitz (Horowitz1), the very detailed local structure of the melodic curve z_{melod,4} seems to dominate.
Figure 44.10: Important variables for Z(harmo). [Four panels against onset time: z_{hmean,2} with f_k(1) = 15, z_{hmean,1} with f_k(1) = 10, z_{metric,1} with f_k(1) = 2, z_{melod,4} with f_k(1) = 1.]
Table B: Overview of clusters as derived by the above criterion with l = 1. [Rows: the 28 performances ARG, ARR, ASH, BRE, BUN, CAP, CO1, CO2, CO3, CUR, DAV, DEM, ESC, GIA, HO1, HO2, HO3, KAT, KLI, KRU, KUB, MOI, NEY, NOV, ORT, SCH, SHE, ZAK. Columns: H1 to H4 for Z(harmo), M1 to M6 for Z(metric), ML1 to ML3 for Z(melod). A bullet marks, for each performance and each matrix, the cluster to which the performance belongs.]
Z(metric): The clusters of columns M1 to M6 in Table B show the results for Z(metric) (see figure 44.11). These six columns correspond to the variables z_{hmean,1}, z_{metric,2}, z_{metric,1}, z_{hmean,2}, z_{hmax,3}, and d_E z_{metric,3}. Hence, by use of an approach that gives priority to the metric structure, the cluster with Horowitz2 and 3 is exactly the same as for Z(harmo), with the exception of Ashkenazy who was not included before. Also, it corresponds essentially to the same curve. Similarly, the cluster with the three Cortot performances corresponds to a very similar curve. It is however smaller since it is a proper subset of the previous 'Cortot' cluster above. Bunin and Gianoli are again in the same separate cluster, this time together with Capova and Kubalek. The peak of the curve is now around measure 20. The cluster with Argerich contains several of those performances that were previously in the Cortot cluster. The dominating curve is still almost periodic with a period of about four measures. Finally note that Horowitz1 again forms a separate cluster with a locally very refined metric curve corresponding to the derivative of z_{metric,3}.
Figure 44.11: Important variables for Z(metric). [Six panels against onset time: z_{hmean,1} with f_k(1) = 11, z_{metric,2} with f_k(1) = 7, z_{metric,1} with f_k(1) = 4, z_{hmean,2} with f_k(1) = 4, z_{hmax,3} with f_k(1) = 1, d_E z_{metric,3} with f_k(1) = 1.]
Z(melod): For Z(melod) (figure 44.12), we obtain the clusters shown in columns ML1 to ML3 in Table B. The corresponding variables are: z_{melod,1}, z_{hmean,2}, and z_{melod,2}. The melodic approach yields very simple clusters. For almost all performances, including all Horowitz performances, the global shape of z_{melod,1} is the most important feature. For Cortot1 through 3, and Krust, the 4-measures periodicity of z_{hmean,2} is most important. For Ashkenazy, the 4-measures (almost) periodic curve z_{melod,2} is dominating. Note in particular that the minima and maxima of this curve do not occur at the same place as for z_{hmean,2}. In comparison, z_{melod,2} appears to be shifted to the right. Also, z_{melod,2} has an extreme local minimum around the beginning of measure 30.
Figure 44.12: Important variables for Z(melod). [Three panels against onset time: z_{melod,1} with f_k(1) = 23, z_{hmean,2} with f_k(1) = 4, z_{melod,2} with f_k(1) = 1.]
44.3.3.4
Argerich "Versus" Horowitz
The remarkable first performance Horowitz1 from 1947 shows a preference for very detailed local information, be it from the melodic or metrical analysis, in typical contrast to the highly coherent Argerich performance. This observation is confirmed by an investigation of the correlation coefficients in the algebro-geometric analysis of the performance genealogy in the sense of RUBATO®'s stemma theory, see chapter 38. When translated into common language these quantitative results are in perfect coincidence with the judgments of experts on Argerich's and Horowitz' specific differences in performance [42]. Let us therefore make these findings more meaningful to the common understanding. How would an interpreter such as Horowitz experience his performance? He would look up a few notes ahead and remember just a few of the past note events when hitting a couple of keys in a given moment. He would then realize a couple of neighboring analytical facts, such as a harmonic step or a melodic contour, in his imagination, and then shape the present note event in its tempo, dynamics, and articulation within this minor context, and thus express his analytical consciousness. Metaphorically speaking this resembles a near-sighted man who can only see and recognize nearby objects. It is as if he had no significant memory of what was happening several bars ago, or of what will happen in the larger time span ahead. In contrast, a performer of the Argerich type would be aware of lots of long-range facts in the overall analytical stream of the piece which is being played. She would then remember and plan everything and therefore hit the present notes in full consciousness of what was and what will be. This is what the semantics of the data tells.
Table C: Overview of clusters as derived by the above criterion with l = 3. [Rows: the 28 performances ARG, ARR, ASH, BRE, BUN, CAP, CO1, CO2, CO3, CUR, DAV, DEM, ESC, GIA, HO1, HO2, HO3, KAT, KLI, KRU, KUB, MOI, NEY, NOV, ORT, SCH, SHE, ZAK. Columns: M1, H2, H1, DM3, M2, DM2, HM2, HM3, M4, DM1, DDM1, SS2. A bullet marks each cluster to which a performance belongs.]
Similar results can be obtained for $l > 1$. Here, $f_k(l)$ is the number of performances for which the kth variable is among the l most important ones. Consider, for instance, $l = 3$. The following partially overlapping clusters corresponding to variables with $f_k(3) \neq 0$ are obtained (see Table C). The columns correspond to the following variables: M1 ~ z_{melod,1}, H2 ~ z_{hmean,2}, H1 ~ z_{hmean,1}, DM3 ~ d_E z_{melod,3}, M2 ~ z_{melod,2}, DM2 ~ d_E z_{melod,2}, HM2 ~ z_{hmax,2}, HM3 ~ z_{hmax,3}, M4 ~ z_{melod,4}, DM1 ~ d_E z_{melod,1}, DDM1 ~ d^2_E z_{melod,1}, and SS2 ~ z_{sus2}. The first cluster consists of all performances except Ashkenazy. Thus, using the 'melodic approach', apart from Ashkenazy, the global melodic curve z_{melod,1} is one of the three most important factors for all tempo curves. Again, the 4-measure periodicity determines clusters with Cortot performances. It is remarkable that, in spite of the large number of overlapping clusters, there is no cluster (except the first one) that contains at the same time Cortot and Horowitz. Moreover, there is one cluster consisting solely of Horowitz1 through 3 corresponding to the complex local melodic structure of z_{melod,4}. Evidently, many more detailed comments about figures 44.10-44.12 could be added.
We conclude the analysis by noting that the relative size of the coefficients suggests a natural way of obtaining simplified tempo curves that contain the most important features. For given j and $1 \le q \le p$, let the $p \times 1$ vector $\gamma_q(j) = [\gamma_{q,1}(j), \gamma_{q,2}(j), \ldots, \gamma_{q,p}(j)]^t$ be defined by $\gamma_{q,k}(j) = \hat{\beta}_k(j) \, 1\{r_k(j) > p - q\}$. Then $\hat{y}_q(t_i, j) = Z_i \gamma_q(j)$ is a simplified tempo curve that corresponds to using the variables (analytic curves) that are among the q most important ones for tempo curve j, importance being measured by $r_k(j)$. Thus, the resulting tempo curve is a simplified curve obtained by superposition of the q most important features only. Note that, for $q = p$, this yields the complete curve fitted by stepwise regression. Nos. 44.13.1-44.16.28 in figures 44.13-44.16 display $\hat{y}_q$ ($q = 1, \ldots, p$) for Z(melod) for all performances.
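The ranking criterion and the simplified tempo curves can be summarized in one short sketch. It follows the definitions of $r_k(j)$, $f_k(l)$, and $\gamma_q(j)$ given in the text, with two assumptions of its own: ties in absolute value are broken by the variable index, and the coefficient table and the matrix rows are hypothetical toy values (three performances, three variables, four onset times).

```python
def ranks(beta):
    """r_k = rank of |beta_k| among all |beta_s| (1 = smallest, p = largest)."""
    order = sorted(range(len(beta)), key=lambda k: (abs(beta[k]), k))
    r = [0] * len(beta)
    for position, k in enumerate(order, start=1):
        r[k] = position
    return r

def f_counts(betas, l):
    """f_k(l) = number of performances for which variable k is among the l largest."""
    p = len(betas[0])
    counts = [0] * p
    for beta in betas:
        r = ranks(beta)
        for k in range(p):
            if r[k] > p - l:
                counts[k] += 1
    return counts

def simplified_curve(z_rows, beta, q):
    """y_q(t_i) = Z_i gamma_q, keeping only the q largest coefficients of beta."""
    p = len(beta)
    r = ranks(beta)
    gamma = [beta[k] if r[k] > p - q else 0.0 for k in range(p)]
    return [sum(z * g for z, g in zip(row, gamma)) for row in z_rows]

# Hypothetical estimated coefficients, one row per performance (assumption).
betas = [
    [0.9, -0.3, 0.1],
    [0.2, -0.8, 0.0],
    [0.7, 0.4, -0.5],
]
# Toy matrix rows Z_i with orthonormal columns (assumption).
z_rows = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [0.0, 0.0, 0.0]]

f1 = f_counts(betas, 1)
f2 = f_counts(betas, 2)
coarse = simplified_curve(z_rows, betas[0], q=1)
full = simplified_curve(z_rows, betas[0], q=3)
print(f1, f2, coarse, full)
```

For $l = 1$ every performance contributes to exactly one variable, so the values $f_k(1)$ add up to the number of performances. The counts shown in figures 44.10-44.12 agree with this: 15 + 10 + 2 + 1 = 28, 11 + 7 + 4 + 4 + 1 + 1 = 28, and 23 + 4 + 1 = 28.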
44.3.4
Overview of Statistical Results
The main statistical conclusions from the analysis above can be summarized as follows:
• There is a clear association between metric, melodic and harmonic weights and the tempo.
• The exact relationship between the analytic weights Z and an individual tempo curve is very complex. However, a large part of the complexity can be covered by our model.
• Commonalities and diversities among tempo curves may be characterized by a relatively small number of curves. There is in principle no unique way of attributing features of the tempo to exactly one cause (harmonic, metric or melodic analysis). Which curves need to be used depends partially on which of the three analyses (harmonic, metric, melodic) has 'priority'. However, there seems to be a small number of canonical curves that are essentially independent of the priorities and which determine a large part of the commonality and diversity among tempo curves. Natural clusters can be defined.
• There is a natural way of reducing an individual tempo curve to a series of simplified tempo curves containing an increasing number of features.
Overall, the proposed method yields a variety of results that are interpretable from the point of view of music and performance theory. In particular, the hierarchic approach of decomposing each of the weight functions into components of different degrees of smoothness seems to be appropriate. The different choices of the bandwidth h correspond to a hierarchic approach to musical performance, starting with the most global features of the score and refining the performance successively in greater detail.
The results here are closely related to Repp's work [438]. Repp applied principal component analysis to the 28 tempo curves. One of his main results is that Cortot and Horowitz appear to represent two extreme types of performances. Thus, in a heuristic way, Repp suggested classifying the performances according to their factor loadings into a Cortot and a Horowitz cluster respectively. Our regression analysis confirms the basic findings. Due to the use of weights obtained from a musical analysis of the score, we obtain further information about the nature of the commonalities, diversities and clusters. For instance, as discussed above, for the Horowitz cluster, the most important feature appears to be the overall descending line of z_{melod,1} and very local variations of the tempo that correspond to the local variations of the analytic weights. On the other hand, for the Cortot cluster, the up and down movement of z_{hmean,3} with a period of about four measures is the most important characteristic.
Figure 44.13: (44.13-44.16) Superposition of the q most important features. [Panels 1-8: Argerich, Arrau, Ashkenazy, Brendel, Bunin, Capova, Cortot1, Cortot2; each plotted against onset time.]
Figure 44.14: (Cont.) [Panels 9-16: Cortot3, Curzon, Davies, Demus, Eschenbach, Gianoli, Horowitz1, Horowitz2; each plotted against onset time.]
Result 6. More generally, this approach reveals a set of canonical curves whose combination yields the most important features of a tempo curve. It should be emphasized that these curves are score-specific. Thus, for each score (in our case "Träumerei"), a new set of essential curves is obtained. The original weights as well as the decomposition into parts of different smoothness are based on the specific score that is performed. This is a crucial feature that is in sharp contrast to traditional mathematical 'omnibus-decompositions' such as provided by Fourier or wavelet analysis. The score-specific choice of the Z-matrix enables us to relate statistical results directly to the musical/analytic content of the score. The main point here is this: We argue that to understand the character of tempo, it is above all important to refer it to a "basis" of score-specific analytical curves and not to curves (such as sinusoidal curves in Fourier representation) which have a generic type that tells nothing about the particular context of the genealogy for the given tempo curve.
Figure 44.15: (Cont.) [Panels 17-24: Horowitz3, Katsaris, Klien, Krust, Kubalek, Moiseiwitsch, Ney, Novaes; each plotted against onset time.]
Figure 44.16: (Cont.) [Panels 25-28: Ortiz, Schnabel, Shelley, Zak; each plotted against onset time.]
Part XII
Inverse Performance Theory
Chapter 45
Principles of Music Critique
Never trust the artist. Trust the tale. The proper function of a critic is to save the tale from the artist who created it.
David Herbert Lawrence (1885-1930)
Summary. Inverse performance theory deals with the critic's problem of how to extract in his critique what is hidden behind a performance output. To initiate this theory, we therefore should question the perspectives which a critic has to envisage. To begin with, we inquire into the intriguing task of feuilletonistic critique: why is it a never-ending story? Is it substantially necessary beyond music business? We then position the critic within the sociological context: How do norms intervene in critique? This is exemplified by Glenn Gould's performative redefinition of classics. The chapter terminates with an ethnomusicological view on historicistic performance as it is typically undertaken by Nikolaus Harnoncourt.
45.1
Boiling down Infinity-Is Feuilletonism Inevitable?
Summary. It is not accidental that music critique has stuck to feuilletonism. Its scope is an infinite one in several dimensions: It has to cope with the infinite interpretative work with respect to a given text and with the infinity of expressive nuances of each given interpretation. We analyze the necessity of boiling down this infinite challenge in view of the poor tools of traditional musicology.
We have learned in previous chapters, in particular in section 13.4.1 and section 32.2, fact 18, that performance conveys an infinite message, and that this infinity is a double one¹: that of interpretative perspectives as they are realized in music analysis, and that of performative shaping variants as they may be expressed in the infinitesimal vocabulary of performance fields. It is not clear whether critics are aware of such a variety of backgrounds that may produce concrete performances. In particular with respect to (analytical) interpretation, they prefer to stick to the traditional canon of how the structure of a composition should be viewed and interpreted or analyzed, respectively. Of course, it is not clear whether music critics should be cognizant of possibly new interpretations, but once they have gone into their business, a creative dealing with analytical problems should be mandatory. One may understand that this is not necessary ante rem, but after the event, a re-reading of the text should be considered, be it only for comparative handling of the present performance: Could it be that the artist discovered a new interpretation of the given text? In practice, the selection of an analytical interpretation (in the best case, autoincompetent critics excluded...) is just a matter of limitations of time, energy, and interest, besides ignorance of the infinite variety of interpretations.

As to the infinity of performance nuances, this is beyond the vocabulary of music critics and it is also beyond the present measurement technology for such data: In a common concert, no performance field reconstruction is feasible. So critics are nolens volens limited to describing performance by use of common language expressions ("elegant diminuendo blended by a mysterious pedaling cloud...") which, beyond their imprecision, cannot relate expression to interpretation.

So is feuilletonism inevitable? Or rather: Is such a bad feuilletonism inevitable? Is it necessary to play the game of a unique "best" interpretation whose expression has to move along unreflected paths of prejudices? The alternative would be to embed one's judgment in the potential infinity of analytical interpretation and expressive performance, and to keep this embedding omnipresent in the critical discourse. We argue that the most precious role of a music critic would be that of putting the infinity of perspectives on a musical work into evidence in every concert or CD review. These would be the crucial points:
• Infinity of analytical interpretations,
• infinity of expressive performances,
• infinity of correlations between interpretative rationales and expressive performance shaping.
The question is not one of bad or good quality in these specifications; in the limit, the only quality is to teach us something about the work in question, and about the relativity of each perspective. Suggesting a boiled-down finitistic or even unicorned view of art is a destructive way of reduction and hinders every deepening or progress in the arts.

¹ Leaving aside the gestural and emotional rationales for the time being.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
45.2
"Political Correctness" in Performance-Reviewing Gould
Summary. Given the infinity of critical understandings of artistic performance, norms are easily infiltrated against unlimited variation of expressivity. We make the point concrete with the example of Glenn Gould's eccentric (ab-normal...) performance of classics from Bach to Beethoven.
A testing ground for a valid music critique is Glenn Gould's performance of classical works from Bach to Webern. His technically unprecedented performances have evoked strong reactions which unveil a number of limitations in common critique styles. Whereas Gould's Bach performances may be non-conformist, but still acceptable and adequate for Bach's compositions, his performances of Beethoven's sonatas are beyond the supportable deviation from common taste. The famous critic Joachim Kaiser has described in [257] the most famous "mis-performance" of a Beethoven sonata, using the example of Gould's presentation of op. 57 "Appassionata":
Bei Goulds Wiedergabe des allegro assai dürfte es sich um die verrückteste, eigensinnigste Darstellung handeln, die jemals ein Pianist einem Beethoven-Satz hat angedeihen lassen; und das will etwas heißen. Gould hält es für richtig, demonstrativ langweilig und gelangweilt den Kopfsatz so zu bieten, als ob ein Beethoven-Verächter seinen Plattenspieler nur mit halber Geschwindigkeit ablaufen ließe. Tranig langsam, langweilig und gelangweilt, die Triller während des pp im Schneckentempo, während der Fortissimo-Stellen etwas rascher, quält sich die Musik vorbei. Man meint, der Pianist imitiere ein Kind, das mit erfrorenen Fingern die Appassionata vom Blatt spiele. Nur selten vergißt er dabei, daß er ja vergessen machen wollte, der genialische Glenn Gould zu sein.

(In English: Gould's rendering of the allegro assai is probably the craziest, most self-willed presentation that a pianist has ever bestowed on a Beethoven movement; and that is saying something. Gould sees fit to offer the first movement in a demonstratively boring and bored manner, as if a despiser of Beethoven were running his record player at only half speed. Sluggishly slow, boring and bored, the trills during the pp at a snail's pace, somewhat faster during the fortissimo passages, the music drags itself past. One has the impression that the pianist is imitating a child who sight-reads the Appassionata with frozen fingers. Only rarely does he forget that he wanted to make us forget that he is the brilliant Glenn Gould.)

This critique is strongly based upon the commonly accepted reading of the Beethoven text as a passionate message which calls for temperament and stormy dynamics in performance, and not for a cool analytical vivisection of such a vital piece of literature. In Kaiser's characterization, Gould's production is like a "child with frozen fingers in a sight-reading performance". Here, the different and aberrant performance is incorrect, even forbidden. It is a norm which the politically incorrect Gould has broken and thus made the sonata ridiculous; Kaiser even comments that the sonata "remains silent" when confronted with such a misreading. The basic hypothesis behind such outraged indignation is that Kaiser knows what and when and how the sonata (which is personalized here) would have communicated, and that crazy Gould just destroyed that known and accepted messaging. Kaiser in fact evokes an installed performance grammar which requires a passionate forte seventh degree cascade towards the piano on the dominant in bars 14-16 of allegro assai. Instead, Gould descends like a noble, bored lady and snobbishly sits down on the boring dominant fermata. No passion whatsoever. The same, even more dramatic viz.
ridiculous deformation can be observed in Gould's performance of sonata op. 106, "Hammerklavier". This case is even worse since one is tempted to think that Gould did not understand a single word of the text, that he simply was too stupid for the performance task. What happened? And why was Gould's Bach so much more successful? Evidently, Gould's microscopic performance method works for Bach, and not for Beethoven. Why does this microscopic view fascinate and illuminate Bach's work whereas it virtually kills Beethoven's sonatas? The point is that in Beethoven's work, there is an inbuilt performance grammar which is not engraved in the score but stems from the performance tradition as such, an oral tradition so to speak, an element of rhetorical communication which transcends written code. Instead, Gould reads the same code from the Bach and from the Beethoven scores, and effectively demonstrates,
that there is a huge defect in Beethoven's written code; it is quite trivial: at least locally, the written script is simply boring. Gould has effectively given a quasi-mathematical demonstration that the same performance strategy cannot be applied to Bach and to Beethoven, that the same analytical insight and the same rhetorical shaping yield completely different results for these composers. To me, this is a sensational lesson to teach a characteristic difference between Bach and Beethoven. This is very clear in the descending seventh passage in bar 14, which runs on semiquavers after a triggering triplet of quavers at the end of bar 13. Gould effectively takes the double temporal rate of the semiquavers with respect to the quaver triplet, without any tempo increase, without any dynamic profiling, just letting us see the anatomy of this triadic descent structure. The common reading [534] of this passage is that of an explosion:
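Die Explosion (a tempo, Auftakt zu T.14) erfolgt im niedersausenden Dominant-Arpeggio und f-Sextakkord (T.15), wird aber sogleich abgedämpft durch einen C-Dur-Sextakkord, p, T.16.

(In English: The explosion (a tempo, upbeat to bar 14) takes place in the plunging dominant arpeggio and F minor sixth chord (bar 15), but is immediately dampened by a C major sixth chord, p, bar 16.)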
With Gould, there is no explosion, just the written text, cleanly played, but antagonistic to any such musical drama to which an explosion would testify. The common reading in fact classifies this sonata as a musical drama, and asks interpreters to integrate this semantics into their performance. Gould plays the "Appassionata" minus the commonly implied drama. The question here is whether this dramatic character is implicit in the score structure or whether it is an external determinant which has been added by historical standards, and which Gould filters away to lay bare what he believes is a poor structural essence [191]. So the question arises whether the commonly accepted dramatic performance is an expression of Beethoven's work or of an added character. Let us therefore analyze the specific performative shape of the passage in question. To begin, its agogics is profiled against the temporal neighborhood, i.e., not only is the indication "a tempo" valid from the last three quavers of bar 13, but in bar 14, the resumed tempo is again increased. The dramatic performance contains an increase of tempo, and within that level, also an increase of tempo towards the middle of the descent. Further, the dynamics is not only the forte at the end of bar 13, but the target tones of each descending intervallic movement of the descent are played louder, maybe up to a ff or sf. As a whole, this descent (with its added ascending tail in bar 15) is not only a musical structure, but more an explosive gesture whose very beginning goes to the top pitch, falls down and bounces back to the dominant fermata. This is not a written rationale, but it is a semantic unit which can easily be deduced if gestural semantics is to be included in the performance shaping. So Gould's experiment would demonstrate that Beethoven requires gestural rationales beyond analytical ones. This means that Beethoven's compositions have a performative added value of gestural nature which is not (yet?)
virulent in Bach's architectural music. Observe however that this gestural character is not on the level of the interpreter's gestures; it is a rationale in the performance grammar, a semiotic layer which is added to the score system. Summarizing, Gould's politically incorrect performance withdraws from the common dramatizing approach and gives us an insight into Beethoven which would not have occurred otherwise.
45.3
Transversal Ethnomusicology
Summary. Ethnomusicology is essentially confronted with the synchronic normalization problem: Given two (simultaneously existing) cultural areas, understanding one of them from the other's perspective means normalizing fundamental categories, e.g., the "score" concept, such that comparison of different ethnic traditions becomes feasible. When dealing with performance of works which belong to distant epochs, the ethnomusicological problem of understanding a performance is restated as diachronic normalization: what is the common ground for understanding the performance of a work which was written in the spirit of another epoch? We discuss the efforts of Nicolas Harnoncourt in this direction of "transversal" ethnomusicology.
In its common understanding, ethnomusicology deals with a transformation problem between synchronic music cultures. Such a transformation may cause major problems to the contents and forms of music, for example because of incompatible notation or even incompatible modalities of communication, such as oral traditions, rebuilding instruments for each new musical event, embedding of music in more global forms of art, etc. But synchronic ethnomusicology has the undeniable advantage that a feedback process may help deal with such problems, and eventually solve them in the ideal case². This advantage cannot be claimed for diachronic ethnomusicology. What is this type of ethnomusicology? It deals with the transformation of music cultures which are at a temporal distance instead of a geographic or social distance of contemporaneous cultures. For example, if we play an opera of the late Renaissance composer Claudio Monteverdi in the 20th century, this is a diachronic type of ethnomusicology. In fact, the cultures of Monteverdi's time and of the 20th century are very different, and the communication between them is restricted to the historical proliferation channel. The historical distance strongly influences the communication of forms and contents. It is by no means automatic that everything is transmitted without loss of information. Above all, the socio-cultural background of a musical composition is easily blurred by historical filters. But also, the instrumental practice and technology impose dramatic deformations of what was reality in a historically distant context. Nicolas Harnoncourt has with great success restarted a dialog with historically distant traditions in the sense of understanding those conditions and not imposing ours on a diachronically distant culture. His approach is based upon the basic position that instrumental constraints are very important for shaping one's performance and expressivity.
Restriction gives one a clear frame, a limited field of activity where a composition must unfold its semantics, and not an unlimited Wagner-tailored orchestra, where the quasi-infinity of instrumental power and colors competes with and actually replaces the efforts for better expressivity. All this intelligent effort does not solve the communication problem in the historical dimension: in contrast to synchronic ethnomusicology, diachronic communication is unidirectional: No real feedback from ancient times is possible, we only have the sources and must try to understand without really having answers to our experiments. Harnoncourt's experiments may be an attempt to transform our performance practice back to Monteverdi's, but it is not a demonstration of anything.

So performance is not only a hic et nunc affair, but also a process which is coupled to spatio-temporal distances of cultures. Within this transformational framework, there are different serious asymmetries of communication. In the synchronic direction, feedback can be dealt with, though within the limits of sociocultural asymmetries for inverse ethnology. In the diachronic direction, the communication is intrinsically unidirectional, and performance remains a challenge for the adequacy of cultural transformation.

² However, the real case may be far from ideal. For example, if we want to initiate inverse ethnology, i.e., the review of our own music culture from the point of view of an 'exotic' culture, major obstacles will occur, from financial ones to the intolerance against another culture which tries to relativize the usual perspective of occidental supremacy.
Chapter 46
Critical Fibers

To see a World in a Grain of Sand,
And a Heaven in a Wild Flower,
Hold Infinity in the palm of your hand,
And Eternity in an hour.
William Blake (1757-1827)

Summary. Stemma theory offers a model for a critical understanding of performance as a complex process between interpretation and the rhetorics of expressive performance grammar. We make the model and its limitations explicit.
46.1
The Stemma Model of Critique
Summary. Modeling performance through stemma theory allows us to define inverse images of performances within a well-defined variety of stemmatic situations leading to the given performance. A critical fiber is such a variety. A critique is a choice of a point within this fiber. We discuss criteria to make such a choice, i.e., to select one critique among all possible critiques within the stemma model.
In section 32.4 we discussed the four global aspects of performance, in particular the fourth point: stemmatic deployment of performance. In that discussion, we stressed the fact that stemmata are not just a learning process but much more a logical unfolding of deformation strategies. If we take this point of view for granted¹, music critique should deal with the stemmata which could possibly lead to a given performance, since they would unveil the anatomy and genealogy of a performance, and this must be the central issue of a critique which merits that name. Probably the very construction of a stemma with its genealogical factors from cell hierarchies to performance operators and ramification architecture would help a critic to shape his/her style and criteria for judging performances. But beyond that, the general framework of stemma theory is far too generic for any concrete goal. What is the concrete goal?

¹ Other models of performance genealogy are at hand, but we do not know of any such model that is technically as explicit as the stemma model.
Principle 27 Try to construct a stemma that produces a given performance from the existing score!

In this form, the problem has trivial solutions if we allow any performance field without further articulation and operator constraints. The (non-trivial!) construction of a performance field from experimental performance data (e.g., on a MIDI file record) has been implemented by Stefan Müller, see section 46.3, and can be used to construct a depth-one stemma whose operator just produces this performance field on a monolithic LPS. Such a solution is however not what a critical understanding of performance would recommend, since this is nothing more than the brute 'sampling' of a performance transformation, and would not lay bare any of the semantically valid rationales. The crucial point is that one should impose constraints on the admitted tools for stemma construction. This would lead to the more reasonable
Principle 28 Given a defined set of constraints regarding the construction tools for stemmata, try to construct a stemma that produces a given performance from the existing score!

In this form, the principle becomes a challenge for critics since it thematizes the strategies a critic could imagine to be used by a specific artist. Only after such a strategic preset can the reconstruction of a possible generating stemma be seriously tackled. The lesson to be learned from such an 'exercise' is that in criticism, one should learn to reflect very cautiously on the conditions under which critical judgments are made. The critical business now splits into two subtasks: comparative criticism of one and the same performance, and comparative criticism of a number of different performances of the given score. The first means that we are given a fixed performance and compare different reconstructions of backing stemmata: Which one is acceptable, which one is simpler but still adequate, in what respect could two criticisms be considered as being isomorphic, etc. The second one means that the phenomenological difference of performances is lifted to a genealogical difference of backing stemmata, and this one is the core activity of a music critic. However, it presupposes that the constraints on the stemma reconstruction potential are the same, a condition for comparability which is not automatic.
46.2
Fibers for Locally Linear Grammars
Summary. Locally linear grammars are a special approach to stemma varieties, see section 39.8. We give a first description of critical fibers in this context. They turn out to be varieties in the sense of algebraic geometry and are called grammatical varieties.
The general inverse problem of reconstructing a stemma for a given performance of a defined score is not solvable, for two reasons: first, we do not have a general formalism, and second, even a reasonable general formalism would imply wild² mathematical classification problems. We therefore want to consider a more tractable situation: the locally linear performance grammars which were introduced in section 39.8. We refer to that section and keep its notation, recalling in particular from definition 110 that a locally $\mathbb{R}$-linear grammar is a family of $\mathbb{R}$-linear representations of the stemma quiver $T$ which are parametrized by $\mathbb{R}$-vector spaces $E_x$ of finite dimension $s_x$, $x \in V(T)$, the vertex set of $T$, the parametrization being given by affine maps $\varphi_x : E_x \to \mathrm{End}(A_x)$.

² In algebra, a classification problem is said to be wild if its solution would imply the classification of any module category.

Within this very precise setup, where the stemma and the locally linear grammar are fixed, we may ask for the structure of the fibers lying above the performances which are defined on the stemma's leaves by the given locally linear grammar. Technically speaking, we proceed as follows: For each final vertex $z \in V(T)$ let $m_{z0} = r, m_{z1}, m_{z2}, \ldots, m_{zn_z} = z$ be the ordered sequence of elements of $M_z(T)$. For the root $r$ and for each final vertex $z \in V(T)$ let us fix arbitrary $e_r \in A_r$, $e_z \in A_z$, respectively. The main task of this inverse performance theory is to study the set of solutions of the system of equalities

(46.1)

where for $k = 1, \ldots, n_z$ the vertexes $y_k$ lie in the set $D_{m_{z,k-1}}(T)$. For the explicit calculation, we select a basis $v_{x1}, \ldots, v_{xs_x}$ of the vector space $E_x$. This means that for every vertex $x \in V(T)$ we may consider the linear operators $O_{xi} := \varphi_x^{0}(v_{xi}) \in \mathrm{End}(A_x)$, $i = 1, \ldots, s_x$, as a part of our data. Whence, if we identify $E_x$ with $\mathbb{C}^{s_x}$ through the basis $v_{x1}, \ldots, v_{xs_x}$, we can define the homomorphism (39.50) as a triaffine map

$$f_x : A_m \times \mathbb{C}^{\#D_m(T)} \times \prod_{y \in D_m(T)} \mathbb{C}^{s_y} \to A_x$$

by associating to $(a_m, (c^{y}_{x})_{y \in D_m(T)}, (b_{yj})_{y \in D_m(T),\, j = 1, \ldots, s_y})$ the sum

$$\sum_{y \in D_m(T)} \sum_{j=1}^{s_y} c^{y}_{x}\, b_{yj}\, i_{y,x}\bigl(O_{yj}(r_{m,y}(a_m))\bigr) + \sum_{y \in D_m(T)} c^{y}_{x}\, i_{y,x}\bigl(\varphi_y^{t}(r_{m,y}(a_m))\bigr).$$

The equations (46.1) become:

(46.2)

where the sum is over all $y_1 \in D_r(T)$, $y_2 \in D_{m_{z1}}(T)$, $\ldots$, $y_{n_z} \in D_{m_{z,n_z-1}}(T)$ and all $j_k = 1, \ldots, s_{y_k}$, for $k = 1, \ldots, n_z$. The leading vector summand of this linear combination equals

(46.3)

The general vector summand refers to a choice of endomorphisms $X_{\ldots} = O_{y_{n_z} j_{n_z}}$ or $X_{\ldots} = \varphi^{t}_{y_{n_z}}$ etc. and is equal to

(46.4)

In order to describe the solution variety we can interpret (46.2) as a system of linear equations with parameters,

(46.5)

Observe that $e_z$, $l_{yz}$ are vectors in $A_z$, so the solutions $c^{y}_{z}$ of equations (46.5) result from simultaneous solutions in the vectors' coordinates if we are given a basis of each $A_z$. The coefficients $l_{yz}$ equal linear combinations of the vector summands (46.3) and (46.4) with coefficients which are monomials in the remaining variables $c^{\cdot}_{\cdot}$ and the $b_{\cdot\cdot}$. If we now assume that all variables $c^{y}_{z}$ are independent, we see that for any two different $z \in D_{m_{n_z-1}}(T)$, the solutions of the equations (46.5) do not interact with each other. This means that the solution set of the system (46.5) for $z \in D_{m_{n_z-1}}(T)$ is either empty (if the value $e_z$ is not in the image of the linear map (46.5)) or equals

$\prod \; \prod$

where $L_z$ are linear subspaces of $\mathbb{C}^{\#D_{m_{n_z-1}}(T)}$. Their codimension can be read from the matrix of the coefficients. Since there is no other condition on the other variables $c^{\cdot}_{\cdot}$, we obtain altogether the following result:
Theorem 35 The solution space of (46.2) is a linear fibration over some appropriate affine space.

Corollary 22 The dimension of the non-empty fibers of this fibration is generically minimal, and it increases along some finite union of proper algebraic subschemes (the loci where the minors of the coefficients' matrices vanish).

This means that the non-empty fibers $\mathrm{Fib}(e_\cdot)$, i.e., the solution spaces defined by (46.1) over a given output set $e_\cdot = (e_z)_z$, are all generically isomorphic, i.e., isomorphic when restricted to appropriate open subschemes. However, the configuration of the specialization subschemes is not evident and depends on the particular vector summands. Also, the condition for non-empty fibers is not evident in general.
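The elementary fact behind theorem 35 and corollary 22 can be illustrated numerically: for a system of linear equations, the set of solutions over a given right-hand side is either empty or an affine space whose dimension is the number of unknowns minus the rank of the coefficient matrix, and this dimension jumps exactly where the rank drops. The following Python sketch shows this on small matrices; the function names and all numerical data are hypothetical illustrations and are not derived from equation (46.5).

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (list of rows) by Gaussian elimination over the rationals."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    n_cols = len(m[0]) if m else 0
    for col in range(n_cols):
        # Find a pivot row at or below the current rank position
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(len(m)):
            if i != r and m[i][col] != 0:
                factor = m[i][col] / m[r][col]
                m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def fiber_dimension(coeffs, e):
    """Dimension of the fiber {c : coeffs * c = e}, or None if the fiber is empty."""
    augmented = [list(row) + [rhs] for row, rhs in zip(coeffs, e)]
    rk = rank(coeffs)
    if rank(augmented) != rk:
        return None  # e is not in the image of the linear map
    return len(coeffs[0]) - rk


# Hypothetical data: a generic matrix (rank 2) and a special one (rank 1)
generic = [[1, 2, 3], [2, 4, 7]]
special = [[1, 2, 3], [2, 4, 6]]

print(fiber_dimension(generic, [1, 3]))  # minimal fiber dimension
print(fiber_dimension(special, [1, 2]))  # larger fiber on the special locus
print(fiber_dimension(special, [1, 3]))  # empty fiber
```

The special matrix lies on the locus where all 2x2 minors vanish, which is the kind of proper algebraic subset that corollary 22 refers to.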
How can we interpret this result in musicological terms? A quantitative measurement of a performance can be done by recording the values of the parameters that characterize a note, i.e., loudness, duration, and so on. In the last paragraph we have denoted this data by the vectors $e_z$. Equation (46.2) explains that in order to produce given values of the parameters from the weight functions, one has to find suitable values for all coefficients $c^{\cdot}_{\cdot}$. It would be desirable for the set of admissible coefficients to be small, because this would mean that in order to produce a given performance one does not have a lot of freedom in the choice of system parameters. However, in this model this is not the case. In fact, corollary 22 tells us that for a fixed performance one has either none or else infinitely many possibilities to choose system parameters which produce the given performance. Moreover, we learn that all performances with non-empty fibers $\mathrm{Fib}(e_\cdot)$ have open dense subsets $U(e_\cdot) \subset \mathrm{Fib}(e_\cdot)$ which are all isomorphic with each other, so these generic subsets $U(e_\cdot)$ are qualitatively equivalent, see also figure 46.1. In other words, the non-empty fibers only differ on their special loci apart from generic open subschemes $U(e_\cdot)$.

[Figure 46.1, labels: Lie operator parameters: weights, directions; affine transport parameters; output fields $e_\cdot$; $\mathrm{Fib}(e_\cdot)$.]
Figure 46.1: All performances with non-empty fibers $\mathrm{Fib}(e_\cdot)$ have open dense subsets $U(e_\cdot) \subset \mathrm{Fib}(e_\cdot)$ (elliptic regions) which are all isomorphic with each other.

In musical terms, a fiber $\mathrm{Fib}(e_\cdot)$ could be called a critical fiber because its points are the possible background parameters (in the present stemmatic model) which lead to the given performance output data $e_\cdot$, such as local tempi, articulations, dynamics, detunings, etc. So the fiber really includes the possible ways of understanding why a performer is playing his actual performance. In fact, finding out which parameters the interpreter could have used is (or should be) the core activity of a music critic. Having generically isomorphic fibers means that in any two critical fibers of two given performances, there are "dominant" open sets of "criticisms" (i.e., points in the fiber!) which are isomorphic with each other. This does not mean that the criticisms are the same for the two given performances, but that their structural contexts can be identified. Again, this does not mean that the relevant criticisms may be identified. On the contrary: maybe these isomorphic contexts just describe the criticisms which are what everybody could say if no supplementary information about the specific performance culture of the interpreter is known.

So we should not discard this model as insignificant; instead we need to look at it in the right way. In fact, it is like pretending to explain the geometry of smooth plane curves just using lines. The theory of lines can be useful for local questions, but not for global ones. The same thing happens here. The great flexibility of this model enables us to adapt it to the questions that one tries to answer, but they cannot be too general. Indeed, one can select subschemes that are appropriate for the study of a particular problem. For example, if one is interested in comparing several performers from the point of view of the local/global way of playing, one could try to restrict the research to the level of the daughters of the root, and to use some more of the structures of the vector space $A_x$ (if any). Another question that could be asked is that of the final/causal way of playing. This requires one to impose mutual dependence conditions upon the variables $c^{\cdot}_{z}$ for varying $z \in D_{m_{zk}}$ (see 46.4).
46.3
Algorithmic Extraction of Performance Fields
Summary. The algorithmic extraction of performance fields is a first step for systematic calculations in inverse performance theory. The extracted fields can further be used for visualization. We describe an approach to calculating performance fields from given scores and performances and present the tool that implements the theory.
In this section, we address the question of inverse performance theory which deals with the reconstruction of a performance field for a given performance on a determined score, including an implementation in the RUBATO® framework, named EspressoRUBETTE®. This question generalizes the well-known problem of constructing a tempo curve from a measured performance to the form space S = EHLD... of parameters such as onset E, pitch H, loudness L, etc. Such an attempt is first of all an interpolation task where a continuous performance field must be reconstructed from a discrete data set of performed notes. As such it is subject to (a) the problem of matching symbolic and performance events, (b) ambiguities in the local definition of field vectors, (c) algorithmic constraints for real-time objectives, and (d) visualization options for performance fields.
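The classical special case mentioned here, the tempo curve, can be sketched in a few lines. The sketch assumes the usual convention that tempo is the rate of symbolic onset E per physical onset e, so that between two consecutive matched notes the tempo is approximated by the quotient of the finite differences. The function name and the sample onsets are hypothetical and serve only as illustration.

```python
def discrete_tempo(score_onsets, performed_onsets):
    """Piecewise-constant tempo estimates between consecutive matched notes.

    score_onsets: symbolic onsets E (e.g., in quarter notes), increasing.
    performed_onsets: physical onsets e (in seconds) of the same notes.
    Returns a list of (E_start, tempo) pairs, tempo in score units per second.
    Assumption of this sketch: tempo is the finite-difference quotient dE/de.
    """
    if len(score_onsets) != len(performed_onsets):
        raise ValueError("score and performance must be matched note by note")
    curve = []
    for k in range(len(score_onsets) - 1):
        d_score = score_onsets[k + 1] - score_onsets[k]
        d_perf = performed_onsets[k + 1] - performed_onsets[k]
        if d_perf <= 0:
            raise ValueError("performed onsets must be strictly increasing")
        curve.append((score_onsets[k], d_score / d_perf))
    return curve


# Hypothetical data: four quarter notes, the last interval played at half speed
E = [0.0, 1.0, 2.0, 3.0]
e = [0.0, 0.5, 1.0, 2.0]
for onset, tempo in discrete_tempo(E, e):
    print(onset, tempo * 60, "quarter notes per minute")
```

A tempo curve only covers the onset dimension; the performance fields discussed in this section extend the same idea to several parameters at once.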
46.3.1
The Infinitesimal View on Expression
The concept of expression is ambiguous as far as the content of the expression and its reality layer are not a priori clear, see chapter 36 for the details. If we aim at analyzing expression, this does not regard the psychological perception of a performance by humans. The psychological aspect is a legitimate one, but it touches a category which relates the performed music to human
categorizations in terms of emotional response. Such a perspective is, for example, dealt with in [231] or in [289]. In contrast, our point of view is expression as a rhetorically shaped transfer of structural score contents by means of the deformation mapping of symbolic data into a physical parameter space. The psychological implications are not the subject of this perspective; it is a purely mathematical description of this mapping, not of the emotional correlates. The theory of performance fields is derived from the general hypothesis that performance is a smooth (continuously differentiable) isomorphism ℘ : R → RP on a frame neighborhood R of the given local composition C ⊂ S. This is of course a strong hypothesis, but it is, at least locally on the given composition, a reasonable one. In our inverse problem, we are not given ℘, but only its restriction ℘|C to the given local composition. Accordingly, we shall not really construct the performance field Ts associated with the unknown map ℘, but a discrete performance field, defined on the points of C, which is determined by the restriction ℘|C. We shall now construct such discrete fields, their interpolation on a neighborhood of C, as well as their visualization by means of color fields. It will also be possible to calculate difference fields in order to compare two performances of the same local composition.
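A discrete field vector and a difference field can be sketched as follows for a two-dimensional parameter space. The sketch assumes, by analogy with the tempo curve where tempo is the inverse of the derivative of physical onset with respect to symbolic onset, that the field vector at a point is the inverse Jacobian of the performance map applied to the diagonal vector (1, 1), and that the Jacobian is estimated from two neighboring notes. Both the defining formula and the choice of neighbors are assumptions of this sketch, as are all function names and data.

```python
def field_vector_2d(X0, X1, X2, x0, x1, x2):
    """Discrete field vector at the symbolic point X0, from two neighbors.

    X0, X1, X2: symbolic points, e.g., (onset, duration) in the score.
    x0, x1, x2: their performed images.
    The Jacobian J is approximated by J * B = b, where the columns of B and b
    are the symbolic and the performed difference vectors, respectively.
    Assumed field formula: J^{-1} * (1, 1) = B * b^{-1} * (1, 1).
    """
    B = [[X1[0] - X0[0], X2[0] - X0[0]],
         [X1[1] - X0[1], X2[1] - X0[1]]]
    b = [[x1[0] - x0[0], x2[0] - x0[0]],
         [x1[1] - x0[1], x2[1] - x0[1]]]
    det_b = b[0][0] * b[1][1] - b[0][1] * b[1][0]
    if det_b == 0:
        raise ValueError("performed difference vectors are linearly dependent")
    # Solve b * w = (1, 1) by Cramer's rule
    w0 = (b[1][1] - b[0][1]) / det_b
    w1 = (b[0][0] - b[1][0]) / det_b
    return (B[0][0] * w0 + B[0][1] * w1,
            B[1][0] * w0 + B[1][1] * w1)


def difference_field(field_a, field_b):
    """Pointwise difference of two discrete fields given as {point: (u, v)}."""
    return {p: (field_a[p][0] - field_b[p][0], field_a[p][1] - field_b[p][1])
            for p in field_a if p in field_b}


# Hypothetical data: one score point with two neighbors, performed twice
X0, X1, X2 = (0.0, 1.0), (1.0, 1.0), (0.0, 2.0)
perf_a = {X0: field_vector_2d(X0, X1, X2, (0.0, 0.5), (0.5, 0.5), (0.0, 1.0))}
perf_b = {X0: field_vector_2d(X0, X1, X2, (0.0, 1.0), (1.0, 1.0), (0.0, 2.0))}
print(perf_a[X0], perf_b[X0], difference_field(perf_a, perf_b)[X0])
```

In the first performance every symbolic unit lasts half a second, so both field components equal 2; in the second the map is the identity and the field is (1, 1).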
46.3.2
Real-time Processing of Expressive Performance
An implementation of the performance field theory should be able to operate in real-time, especially for interactive applications, where immediate feedback, either visible or audible, is desired. As we shall see, the complete calculation of the performance fields can be split up into dedicated, communicating modules for specific tasks. Particularly important for performance is the extraction of tasks that can be processed in advance. Figure 46.2 gives an overview of the modules and the flow of control (as shown by the vertical arrows) in our implementation.

[Figure 46.2, labels: input score (i.e., MIDI), input performance, input filtering, basis calculation, visualization; preprocessing (off-line), real-time processing.]
Figure 46.2: Overview of the modules and the flow of control (as shown by the vertical arrows) in our implementation of the EspressoRUBETTE®.

Modules are notified by events when new data for processing is ready. The modules themselves are stateless; they share their information with other modules in the 'real-time context', a data structure which contains all relevant information for the whole process, thus minimizing the risk of inconsistency. Of course, asynchronous accesses to the context have to be synchronized using locks or a similar synchronization technique. For increased flexibility and efficiency all modules accept lists of events, therefore making off-line and real-time processing structurally identical. For example, the input filtering modules for the score and for the performance are the same. The former accepts the score as a whole and the latter processes individual events as they are received in real-time. Further, modules can be prioritized and be put to sleep temporarily if there is not enough processing power to support all present modules. The following items give short descriptions for the modules shown in figure 46.2. Details will be presented in the subsequent sections. It is important to see that the described architecture allows the definition of additional modules as needed. This mostly depends on application requirements. Also, some applications might not need certain already defined modules, e.g., field interpolation in a computer accompaniment system.
Input filtering. This module translates incoming note events to the representation defined in the real-time context. It also processes structural information, such as different voices, tempo changes, etc. The input filtering module must be implemented for any external representation (e.g., MIDI or RUBATO®'s denotator format).
Basis calculation. The calculation of the bases depends only on the input score, not on the performance, and can thus be performed off-line. For each event an appropriate basis has to be calculated. Typically this is a time-consuming process.
Matching. The incoming performance events have to be matched to the corresponding score events. As we shall see in the designated section, this is a non-trivial task and has developed into a research field of its own.
Field calculation. The individual field vectors for each note are calculated based on the precalculated basis and the given match.
Field interpolation. The field vectors calculated by the former step are typically not aligned on a grid. However, for visualization, a 2D or 3D grid-like field with field vectors defined anywhere in this grid is desired. The interpolation step allows the definition of such a grid and performs the translation from the note field to the interpolated field.
Visualization. Finally, the calculated field is ready for visualization. Here, many user-defined parameters such as scaling, color specification, ranges, etc., have to be taken into account.
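The module architecture described above (stateless modules, a shared real-time context protected by a lock, and modules that accept lists of events so that off-line and real-time processing look the same) can be illustrated by a minimal sketch. The names `RealTimeContext` and `input_filtering`, and the event layout, are hypothetical and not taken from the EspressoRUBETTE® implementation.

```python
import threading
from dataclasses import dataclass, field


@dataclass
class RealTimeContext:
    """Shared state; the modules themselves keep no state (hypothetical layout)."""
    lock: threading.Lock = field(default_factory=threading.Lock)
    score: list = field(default_factory=list)        # filtered score events
    performance: list = field(default_factory=list)  # filtered performance events


def input_filtering(context, events, target):
    """Translate raw (onset, pitch) tuples into dict events and store them.

    `target` names the context list to extend: "score" or "performance".
    """
    converted = [{"onset": onset, "pitch": pitch} for onset, pitch in events]
    with context.lock:  # synchronize access to the shared context
        getattr(context, target).extend(converted)
    return converted


context = RealTimeContext()
# Off-line: the whole score is passed as one list of events.
input_filtering(context, [(0.0, 60), (1.0, 62), (2.0, 64)], "score")
# Real-time: the same module receives one-element lists as events arrive.
for raw_event in [(0.05, 60), (1.10, 62)]:
    input_filtering(context, [raw_event], "performance")
```

The same function serves both uses; only the length of the event list differs.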
46.3.3
Score-Performance Matching
A lot of research effort has been put into score-performance matching techniques. Score following, the real-time matching and tracking of soloists performing a given score, was first published by Dannenberg [108] and Vercoe [544]. Puckette [426] presented the methods used on the IRCAM Signal Processing Workstation (ISPW). Heijink et al. [216] have given an evaluation of different approaches to score-performance matching.
Literature has typically differentiated between two types of algorithms. For real-time algorithms, mostly used in real-time accompaniment software, good performance had higher priority than matching quality. Off-line algorithms, where calculation time is less important, were mostly used for in-depth analysis applications requiring a high level of matching quality. We experienced, however, that with today's processing power high-quality matching can be performed in real-time, particularly when the algorithms are well suited for extensive preprocessing of the given score.
Mathematically, the matching problem is complex and depends upon the difference which one allows between score and performance. For example, if chords remain chords and all notes are played exactly once, the problem is trivial. But normal performance includes more or less strong arpeggiation of chords, omissions of notes, or additional notes played by error or because of an ambiguous definition of the notes, as is common for trills and other ornaments. We have implemented an algorithm which will not be described in detail here since it is not our principal subject. The algorithm is a kind of matching along a 'wave front' of notes which is defined by the temporal unfolding of the performance and thereby fits the real-time constraints. We should nevertheless sketch the principal ideas behind our algorithm.
Usually, matching is thought bottom-up in that the performance map of the whole piece is constructed from the performance map $X \mapsto \wp(X)$ on the single elements $X$. We rather tried a top-down strategy, i.e., to rebuild the element images from maps on sets of specific coverings $I$, $J$ of the local composition $C$ and its performance $D$, respectively. Typically, one considers the covering $I$ of $C$ by hyperplane sections in each parameter (for example onset slices). On $D$, a covering $J$ is defined which is a fuzzier version of $I$, for example neighborhoods of hyperplane sections (for example $\epsilon$-neighborhoods in the onset dimension). If $\wp$ exists, different constraints can be imposed on the induced map on the coverings. First, $\wp$ induces a map $n_0(\wp) : I \to J$ such that $\wp(U) \subseteq n_0(\wp)(U)$ for all covering elements $U$ in $I$. This yields a map $n(\wp) : n(I) \to n(J)$ of the simplicial nerves, and thus conditions on the map on the covering sets. Second, the sets of these coverings are linearly ordered³ by $U \prec V$ iff either $U \subset V$ or both $U - V$ and $V - U$ are non-empty and $\min(U - V) < \min(V - U)$. In this ordering, we require that $U \preceq V \Rightarrow n_0(\wp)(U) \preceq n_0(\wp)(V)$. Third, if one defines a distance $d(U, V)$ between the covering sets (for example the elastic shape distance from motif theory, see section 22.2.1.3), one requires that $d(U, n_0(\wp)(U)) < \epsilon$ for a given positive distance limit $\epsilon$. With these constraints one may define the map on the coverings, and then recover $\wp$ if every point $X$ in $C$ may be seen as the intersection of all covering sets of $I$ which contain the point. This is evidently the case for the hyperplane sections described above.
Following these observations, an implementation typically makes use of structural properties of a musical score and a corresponding performance. Further, dynamic programming techniques help in coping with the real-time problem: multiple possible solutions are created, maintained, and discarded as the matching process is running.
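The second constraint can be made concrete with a small sketch of the ordering on covering sets and of the monotonicity test for a candidate map between coverings. Notes are modeled as `(onset, pitch)` tuples compared lexicographically, and the function names as well as the sample slices are illustrative assumptions, not part of the algorithm actually implemented.

```python
def precedes(u, v):
    """Return True if U strictly precedes V in the ordering on covering sets.

    U precedes V iff U is a proper subset of V, or both differences U - V and
    V - U are non-empty and min(U - V) < min(V - U).
    """
    u, v = set(u), set(v)
    if u < v:  # proper subset
        return True
    only_u, only_v = u - v, v - u
    return bool(only_u) and bool(only_v) and min(only_u) < min(only_v)


def order_preserving(mapping):
    """Check that U before V implies image(U) equal to or before image(V)."""
    keys = list(mapping)
    return all(
        mapping[u] == mapping[v] or precedes(mapping[u], mapping[v])
        for u in keys
        for v in keys
        if precedes(u, v)
    )


# Onset slices of a score and fuzzier onset neighborhoods of a performance.
score_slices = [frozenset({(0.0, 60), (0.0, 64)}),
                frozenset({(1.0, 62)}),
                frozenset({(2.0, 65)})]
performed_slices = [frozenset({(0.02, 60), (0.07, 64)}),
                    frozenset({(1.10, 62)}),
                    frozenset({(1.95, 65)})]

n0 = dict(zip(score_slices, performed_slices))
swapped = {score_slices[0]: performed_slices[1],
           score_slices[1]: performed_slices[0]}
```

Here `order_preserving(n0)` accepts the slice-by-slice assignment, while the `swapped` assignment, which exchanges the first two performed slices, is rejected.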
³ This is the usual linear ordering of powerset denotators as defined in section 6.8.
46.3.4
Performance Field Calculation
Let us consider the score space $S$, the performance space $P$, the performance transformation $\wp : S \to P$, and the constant vector field $\Delta(x) = \Delta = (1, \ldots, 1)$ for all $x \in P$. Recall from section 33.2.2, equation (33.11), that the performance field is the inverse image of $\Delta$ and evaluates to
$$Ts(X) = J(\wp)(X)^{-1} \cdot \Delta,$$
where $J(\wp)$ is the Jacobian matrix
$$J(\wp)(X) = \left(\frac{\partial x_i}{\partial X_j}\right)_{x_i = e,h,l,d,\ldots;\ X_j = E,H,L,D,\ldots} = \begin{pmatrix} \frac{\partial e}{\partial E} & \frac{\partial e}{\partial H} & \cdots \\ \frac{\partial h}{\partial E} & \frac{\partial h}{\partial H} & \cdots \\ \vdots & \vdots & \ddots \end{pmatrix}(X)$$
at $X$. In order to calculate the field vectors in an element $X$ of the given local composition $C$ in $S$, we have to determine $J(\wp)$. Now, assume that we are given a matrix $U_X$ of not necessarily orthogonal basis vectors based in $X$. $J(\wp)$ can be rewritten as
$$J(\wp)(X) = V_X \cdot U_X^{-1},$$
where the basis matrix $V_X$ is the image of the basis matrix $U_X$. Then
$$Ts(X) = U_X \cdot V_X^{-1} \cdot \Delta = \frac{1}{\det(V_X)}\, U_X \cdot \mathrm{Adj}(V_X) \cdot \Delta.$$
The last term is used to identify three cases:
1. $V_X$ is regular, thus $Ts(X)$ is defined,
2. $\det(V_X) = 0$, $\mathrm{Adj}(V_X) \neq 0$: only the direction of the vector is given, not its length, and
3. $\det(V_X) = 0$, $\mathrm{Adj}(V_X) = 0$: no information at all is given.
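As a worked illustration of the three cases, the following sketch computes a field vector in two dimensions. It assumes the relations $J(\wp)(X) = V_X U_X^{-1}$ and $Ts(X) = J(\wp)(X)^{-1}\Delta$, which is our reading of the passage above; the helper names are hypothetical and matrices are given as nested lists whose columns are the basis vectors.

```python
def det2(m):
    """Determinant of a 2x2 matrix given as [[a, b], [c, d]]."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def adj2(m):
    """Adjugate (classical adjoint) of a 2x2 matrix."""
    return [[m[1][1], -m[0][1]], [-m[1][0], m[0][0]]]


def matvec(m, v):
    """Product of a 2x2 matrix with a 2-vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1],
            m[1][0] * v[0] + m[1][1] * v[1]]


def field_vector(u, v, delta=(1.0, 1.0), tol=1e-12):
    """Evaluate U * Adj(V) * Delta / det(V) and report which case applies.

    Returns (1, vector) if V is regular, (2, direction) if det(V) = 0 but
    Adj(V) != 0 (the length is then undetermined), and (3, None) otherwise.
    """
    direction = matvec(u, matvec(adj2(v), delta))
    d = det2(v)
    if abs(d) > tol:
        return 1, [direction[0] / d, direction[1] / d]
    if any(abs(entry) > tol for row in adj2(v) for entry in row):
        return 2, direction
    return 3, None


identity = [[1.0, 0.0], [0.0, 1.0]]
# Assumed example: onsets are stretched by a factor of 2, pitch is unchanged.
stretched = [[2.0, 0.0], [0.0, 1.0]]
case, ts = field_vector(identity, stretched)
```

With the score basis equal to the identity and the performed basis stretched by 2 in the onset direction, the onset component of the field vector is 0.5, i.e., half the tempo of the diagonal field $\Delta$.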
While we are now able to calculate the field vectors, the question of how to find the appropriate bases is still open.
46.3.4.1
Obtaining the Bases
The only information available for basis calculations are difference vectors of the given score notes. Basically, any difference vectors could be considered as basis vectors, but due to the following restrictions, the candidates have to be selected carefully. First, only notes in a small neighborhood of $X$ should be considered. This principle of locality ensures that the basis consists of notes that are in the local musical context. The second restriction is of a mathematical nature: we have seen that the transformed basis has to be regular in order to be able to calculate the field vector. Because the performance is allowed to have arbitrary deviations from the score, there is no general solution to this problem. What can be done is to decrease the probability that the transformation of the basis $U_X$ yields a non-regular basis matrix. This can be accomplished by making $U_X$ as orthogonal as possible. Thus, the selection of the basis vectors is based on the following two criteria:
1. Locality: $|\det(U_X)|$ is minimal, and
2. Orthogonality: $|\det(U_X^{norm})|$ is maximal (where $U_X^{norm}$ is the matrix of normalized basis vectors).
Note that the two criteria are to some extent in competition, so they have to be weighted and combined. Consequently, a basis-calculation algorithm has to select bases by searching for
$$\min\left(w_{loc}\,|\det(U_X)| + \frac{w_{orth}}{|\det(U_X^{norm})|}\right)$$
with $w_{loc}$ and $w_{orth}$ being positive pre- or user-defined weight values. Unfortunately, there is still one case that has to be dealt with: the case where it is not possible to find a regular basis matrix $U_X$ in a small neighborhood of $X$. This may occur if all notes have the same loudness, if the basis has to be calculated for an isolated chord, where all onsets are equal, or for repeated notes with the same pitch. The only option left here is to construct orthogonal basis vectors that ensure that the basis remains regular. Finally, the pseudo-code for a basis-calculation algorithm can be given:

    for (each Note X in Score S) {
        List neighbors = S.getNeighborList(X, maxDist);
        List basisVectors = emptyList;
        for (each Note N in neighbors)
            basisVectors.add(N - X);
        List bases = getCandidates(basisVectors);
        X.basisCost = infinity;
        for (each Basis B in bases) {
            float basisCost = wLoc * abs(B.det()) + wOrth / abs(B.norm().det());
            if (basisCost < X.basisCost) {
                X.basisCost = basisCost;
                X.B = B;
            }
        }
    }

The function getCandidates(), whose pseudo-code is omitted here, generates a list of bases containing the permutations of the basis candidates, adding constructed basis vectors if necessary. The above algorithm can be optimized by generating the permutations on the fly, the best expected ones first. In that case, the candidate list can be sorted by increasing distance, and the distance is used to terminate the loop as soon as it is known that a lower basisCost cannot be reached anymore.
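A runnable two-dimensional counterpart of the pseudo-code may help to see the cost function at work. This is a sketch under simplifying assumptions: notes are `(onset, pitch)` tuples, candidates are unordered pairs of difference vectors (the cost does not depend on the order of the basis vectors), and singular candidates are skipped instead of being replaced by constructed vectors. The function name `best_basis` is hypothetical.

```python
import math
from itertools import combinations


def det2(a, b):
    """Determinant of the 2x2 matrix with columns a and b."""
    return a[0] * b[1] - a[1] * b[0]


def normalized(v):
    """Return v scaled to unit length (v must be non-zero)."""
    length = math.hypot(v[0], v[1])
    return (v[0] / length, v[1] / length)


def best_basis(x, score, max_dist, w_loc=1.0, w_orth=1.0):
    """Return (cost, basis) minimizing w_loc*|det U| + w_orth/|det U_norm|.

    Returns (inf, None) if no regular basis exists among the neighbors of x.
    """
    diffs = [(n[0] - x[0], n[1] - x[1]) for n in score
             if n != x and math.dist(n, x) <= max_dist]
    best = (math.inf, None)
    for a, b in combinations(diffs, 2):
        det_norm = abs(det2(normalized(a), normalized(b)))
        if det_norm < 1e-12:
            continue  # singular candidate; the text constructs vectors here
        cost = w_loc * abs(det2(a, b)) + w_orth / det_norm
        if cost < best[0]:
            best = (cost, (a, b))
    return best


score = [(0, 60), (1, 60), (0, 61), (2, 62), (1, 61)]
cost, basis = best_basis((0, 60), score, max_dist=1.5)
chord = [(0, 60), (0, 64), (0, 67)]  # isolated chord: all onsets are equal
```

For the note `(0, 60)` the orthogonal pair of unit differences wins with cost 2.0, whereas the isolated chord yields no regular basis at all, which is the case where orthogonal vectors have to be constructed.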
46.3.5
Visualization
One of the most straightforward applications of a calculated performance field is its visualization. Vector field visualization has been successfully used in many science and engineering domains, e.g., in gas and fluid dynamics. Thus, many different techniques and their corresponding implementations are available. Common to all those methods is that they should be accurate, fast, and display the field in an intuitive way. See [76] for an advanced method that is suited for 2D as well as for 3D visualization. This section shows how the calculated field vectors need to be processed in order to make them available to such standard visualization methods. So far, we have dealt with a score $C$, consisting of a set of notes, the corresponding performance $D$, and the associated set of calculated field vectors $F$. The points in those sets reside in an $n$-dimensional space, $n$ being the number of symbolic sound parameters. For visualization, $n$ will normally be too large, so as a first step it has to be decided which parameters are to be used for visualization. For instance, we may choose onset $E$ as the horizontal axis and pitch $H$ as the vertical axis in a 2D setup. The remaining sound parameters are omitted. Further, the desired field vector components have to be selected, for example $E$ in horizontal direction and $D$ in vertical direction for a tempo-articulation field.
46.3.5.1
Field Interpolation
Typically, when dealing with vector fields, the field vectors are arranged in a grid of a given resolution. In contrast, our setup implies that the score points reside at arbitrary locations, making it impossible to use standard vector field visualization methods. Thus, a conversion from the calculated field vectors to vectors located on a grid is necessary. This can be accomplished through interpolation. At first sight, a triangulation of the given set could be considered, making it easy to calculate the interpolated grid vectors in the resulting triangles. However, since the different symbolic sound parameters have different meanings, triangulation is not well suited here: interpolation should occur in a musically meaningful way. Therefore, it makes sense to perform interpolation in a defined recursive order. For instance, when interpolating an $ED$ field, first the $D$ axis of the grid is considered and then the $E$ axis. More precisely, one draws hyperplanes $H_1, H_2, \ldots, H_k$ perpendicular to the $n$-th axis in the symbolic parameter space $S$ such that every point of the given composition $C$ sits in one such hyperplane. By recursion, we suppose that the interpolation is available for the first $n - 1$ coordinates. To get the interpolation value at an arbitrary point $X$, one draws the straight line through $X$ parallel to the $n$-th axis. This line cuts two neighboring hyperplanes in points $P$, $Q$. The values in $P$ and $Q$ are then interpolated by a cubic spline with zero slope in $P$ and $Q$. For a detailed description see section 32.3.2.1. What happens at the boundaries of the given set? Since no field vectors are available, boundary vectors have to be defined. When having a look at the theory of the former sections, it becomes clear that outside the boundaries a frame of diagonal vectors has to be placed. The distance between the boundaries and the frame is constant and has to be predefined.
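The last interpolation step along the $n$-th axis can be sketched as follows. The cubic with zero slope at both ends is taken to be the Hermite polynomial $3t^2 - 2t^3$, which is one natural reading of "cubic spline with zero slope in $P$ and $Q$"; the boundary frame of diagonal vectors is not modeled, and values outside the outermost hyperplanes are simply clamped. All names are illustrative.

```python
def hermite_zero_slope(t):
    """Cubic on [0, 1] with value 0 at t=0, value 1 at t=1, zero slope at both ends."""
    return 3.0 * t * t - 2.0 * t * t * t


def interpolate_along_axis(coord, planes):
    """Interpolate a field vector at position `coord` on the last axis.

    `planes` is a list of (position, vector) pairs sorted by strictly
    increasing position, one per hyperplane; the vectors are assumed to be
    already interpolated within each hyperplane (the recursion hypothesis).
    """
    if coord <= planes[0][0]:
        return planes[0][1]  # simplification: clamp instead of a boundary frame
    for (p_pos, p_vec), (q_pos, q_vec) in zip(planes, planes[1:]):
        if p_pos <= coord <= q_pos:
            s = hermite_zero_slope((coord - p_pos) / (q_pos - p_pos))
            return tuple(a + s * (b - a) for a, b in zip(p_vec, q_vec))
    return planes[-1][1]


# Two hyperplanes at positions 0 and 2 carrying the vectors found in P and Q.
planes = [(0.0, (1.0, 1.0)), (2.0, (0.5, 1.0))]
midpoint = interpolate_along_axis(1.0, planes)
```

Halfway between the two hyperplanes the blend factor is 0.5, so `midpoint` is the average `(0.75, 1.0)` of the two vectors; near $P$ and $Q$ the interpolated field changes only slowly because of the vanishing slopes.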
46.3.6
The EspressoRUBETTE®: An Interactive Tool for Expression Extraction
The methods for algorithmic extraction of musical expressions have been implemented in a tool called EspressoRUBETTE®. The tool can run as a stand-alone Java application. The Swing and Java2D classes take care of the user interface, and the user can manipulate calculation and visualization parameters through a simple dialog panel. As an alternative and more flexible approach, the software also implements the RUBETTE® interface and can thus be integrated into the Distributed RUBATO® framework, see chapter 40.
Figure 46.3: A chromatic scale and its performance. Above: the field vectors; middle: the color-encoded and interpolated field; below: the performed scale.
46.3.6.1
Example 1: Tempo Field of a Chromatic Scale
Let us now give examples of calculated performance fields. Figure 46.3 shows a chromatic scale and its performance. In this case, the representation is close to that of a piano roll: the horizontal axis represents onset, and pitch is mapped to the vertical axis. The width of events corresponds to their duration. Note that the EspressoRUBETTE® allows arbitrary redefinition of those mappings. The top section shows the score containing the chromatic scale, twelve note events in increasing pitch order, all with the same duration. A hypothetical performance of the twelve events is shown in the bottom section. The first three events are played at the same speed as the original MIDI score. Then the performance is getting slower, and towards the end it is getting faster again. The last two notes are played faster than the MIDI score. This situation is depicted by the calculated and visualized vectors in the top section: the first, the second and the eleventh vector are diagonal vectors, stating that the notes are played at the given tempo. The angle of the other vectors depends on the local tempo played at a given note. The middle section shows the corresponding interpolated field, at a resolution of 400 times 200 cells. Here, the slope of each vector has been mapped to a color, and its length is related to the brightness of a cell.
46.3.6.2
Example 2: Excerpt from Czerny's Piano School
Figure 46.4: A performance field for an exercise from Carl Czerny's piano school.
The second performance is a real-world example, namely a performance field for an exercise from Carl Czerny [98], as recorded on a MIDI file. Figure 46.4 shows the first two bars of the exercise; axes and note representations are as in the previous example. The upper half shows the performance field on the score space S = EH, the lower half shows the physical space S = eh of the performed piece (physical parameters being written in small letters). The exercise shows the Chopin rubato, i.e., the right hand plays the melody slightly shifted in time against the firm left hand chords in such a way that synchronization is recovered at the end of bars. The field shows the E- and D-components of the four-dimensional EHLD-performance field, as encoded by colors. One recognizes that the left hand notes are quite near in color to the green color which encodes the diagonal unit field for mechanical, unshaped performance. We see that there is a right hand rubato effect in the middle of each bar, and significantly more in the second bar. The cyclic coloring effects are due to a multiple covering of the color circle in order to make the fine slope differences of the performance field more visible.
Remark 22 The results, i.e., calculated and interpolated performance fields, contain explicit expressive information and are available for visualization or for other performance analysis tools. The algorithms are not restricted to specific sound parameters, and the method can thus be used for extensive expressive analysis. Currently, basis calculation imposes the biggest limitation. In some cases, the calculated basis of a note does not correspond to its musical context, resulting in field vectors that are mathematically correct but hard to understand. Here, ongoing research will definitely deliver better results. A promising field of further research is also the insight that performance fields are not restricted to musical data. In medical applications and in computer-aided anthropology, the growth information of human bones and organs can be extracted in a similar manner, in which case we may talk about Nature's Performance.
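The color encoding used for figures 46.3 and 46.4 (slope mapped to color, length related to brightness, green for the diagonal field, and the color circle covered several times) can be imitated by a short sketch. The number of windings, the brightness scale, and the exact hue assignment are assumptions chosen for illustration; they are not the values used by the EspressoRUBETTE®.

```python
import colorsys
import math


def field_color(e_comp, d_comp, windings=3, max_length=2.0):
    """Map a 2D field vector to an RGB triple (all parameter choices are illustrative).

    The angle relative to the diagonal selects the hue, wound `windings`
    times around the color circle; the vector length sets the brightness.
    """
    angle = math.atan2(d_comp, e_comp) - math.pi / 4.0  # 0 for the diagonal
    hue = (1.0 / 3.0 + windings * angle / (2.0 * math.pi)) % 1.0  # diagonal -> green
    value = min(math.hypot(e_comp, d_comp) / max_length, 1.0)  # longer -> brighter
    return colorsys.hsv_to_rgb(hue, 1.0, value)


r, g, b = field_color(1.0, 1.0)  # the diagonal unit field
```

Winding the hue several times around the circle makes small slope differences visible, at the price of the cyclic color repetitions mentioned above.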
46.4
Local Sections
Summary. We study canonical sections, i.e., selectors of points in grammatical varieties. The subject deals with problems of how to choose a determined critique out of an entire variety of possible critiques.
The generic isomorphism of critical fibers as shown in the above corollary 22 suggests that we should look for musically motivated restrictions on the admitted system parameters in order to obtain more specific information. In this section, we propose a model which is based upon the causality and finality of locally linear grammars and gives a manageable approach to the structure of causality and finality in the transition parameters $c^m_{x,y}$ from a daughter $x$ of a mother $m$ to its sister $y$ (notation inherited from section 46.2). The idea is to parametrize the transition parameters by a small set of shaping parameters which give the system of all $c^m = (c^m_{x,y})_{D_m(T)}$ a causal-final coherence. Suppose that the daughters in $D_m(T)$ are linearly ordered with respect to time. For example, this may happen if these daughters are defined by onset intervals of a sequence of adjacent bars or periods. Then we may just enumerate these daughters $x_1 < x_2 < \ldots < x_{d(m)}$ in ascending order and temporarily abbreviate $c^m_{x_i,x_j} = c^m_{i,j}$. In this notation, if $i < j$, this means that $c^m_{i,j}$ measures the causal influence of the prior LPS $x_i$ on the later LPS $x_j$, while $i > j$ yields a measure $c^m_{i,j}$ for the final influence of the later LPS $x_i$ on the prior LPS $x_j$. The diagonal values $c^m_{i,i}$ measure the "autocorrelation" of LPS $x_i$. With this in mind, we want to give the value matrix $c^m$ a simplified shape as follows: Suppose that the index set $1, 2, 3, \ldots, d(m)$ is evenly distributed on the interval $[1, -1]$, i.e., $\lambda(i) = 1 - 2(i - 1)/(d(m) - 1)$. We now define the function deformLiGr on the square $[1, -1] \times [1, -1]$ on the real plane as follows:
$$
\begin{aligned}
\mathrm{spl}(x) &= e^{-(0.4x)^2},\\
\mathrm{LiGramm}(causal, final, x) &= \begin{cases} (1 + 0.1 \cdot causal \cdot x^2)\,\mathrm{spl}(x) & \text{if } 0 \leq x,\\ (1 + 0.1 \cdot final \cdot x^2)\,\mathrm{spl}(x) & \text{else,} \end{cases}\\
\mathrm{LiGr}(causal, final, x, y) &= \mathrm{LiGramm}(causal, final, (x + y)/2),\\
\mathrm{deform}(start, end, y) &= 0.5\,((start - end)\,y + (start + end)),\\
\mathrm{deformLiGr}(causalStart, causalEnd, finalStart, finalEnd, x, y) &= \mathrm{LiGr}(\mathrm{deform}(causalStart, causalEnd, y), \mathrm{deform}(finalStart, finalEnd, y), x, y).
\end{aligned}
\qquad (46.6)
$$
Figure 46.5: Nine shapes of the function deformLiGr for the system parameter 4-tuples causalStart, causalEnd, finalStart, finalEnd (from left top to right bottom): (3,3,3,3), (3,3,1.5,1.5), (3,3,0,0), (1.5,1.5,3,3), (1.5,1.5,1.5,1.5), (1.5,1.5,0,0), (0,0,3,3), (0,0,1.5,1.5), (0,0,0,0). The causal extremum (1, d(m)) is to the left, the final extremum (d(m), 1) is to the right of each surface. The horizontal diagonal is the autocorrelation area which has the constant value 1.
For each fixed pair $x, y$, the function deformLiGr is an affine function of the four system parameters causalStart, causalEnd, finalStart, finalEnd. The system parameters contain information on the strength of the causal and final correlations at start and end of the given time interval. Figure 46.5 gives an image of the function for nine different system parameter combinations. The function is evaluated on each pair $x = \lambda(i)$, $y = \lambda(j)$ and yields $c^m_{i,j}$ as an affine function of the four system parameters. We see that low causal and final parameter values give low correlations in causal and final direction, respectively. For example, the top right "flying carpet" is strong in final, but weak in causal direction. This means that we are looking at an algebraic variety which is defined by the substitution of the generic transition coefficients $c^m_{i,j}$ by functions
$$c^m_{i,j}(causalStart^m, causalEnd^m, finalStart^m, finalEnd^m)$$
of four system parameters which are a function of the given mother $m$. If $m(T)$ is the total number of mothers of the stemma quiver $T$, this yields a total number of $4 \cdot m(T)$ causal-final variables which add to the analytical variables from the modules $E_x$. To be clear, the shape of a "flying carpet" is a transcendental function involving exponential and quadratic components, but the causal-final system variables which define the actual shape of a carpet are only involved in an affine way.
46.4.1
Comparing Argerich and Horowitz
Summary. This section applies the comparative theory to a particular case of agogics performance in Robert Schumann's "Träumerei" by Martha Argerich and Vladimir Horowitz. It turns out that Argerich's agogical coherence is global compared to a rather local coherence with Horowitz.
In this section, we shall apply the preceding approach to the inverse problem regarding tempo curves and their shaping by use of motivic weights and associated operators. The piece is Robert Schumann's Kinderszene 7 "Träumerei"; the performances are those by Martha Argerich (ARG) and Vladimir Horowitz (HO1 in Repp's list), both measured by Bruno Repp [438] among a total number of 28 performances that were already discussed in chapter 44. For our example, we take the stemma on the onset space which is defined by the primary mother frame $R_M$ that extends from the first onset through the end of the last event. This mother has four daughters $R_A, R_{A'}, R_B, R_{A''}$. Frame $R_A$ is the time frame of part A from the very beginning to the end of bar 8. The second frame $R_{A'}$ is the time frame of the eight bars of the first period, the frame $R_B$ is that of the third eight-bar period, and the last frame $R_{A''}$ is that of the reprise eight-bar period to the end. Each of these daughters is the mother of eight bars, except the first, which has a first daughter including the upbeat quarter in the first full bar. Let us denote the daughters of $R_A$ by $R_{A,1}, \ldots, R_{A,8}$, then those of $R_{A'}$ by $R_{A',1}, \ldots, R_{A',8}$, those of $R_B$ by $R_{B,1}, \ldots, R_{B,8}$, and those of $R_{A''}$ by $R_{A'',1}, \ldots, R_{A'',8}$. For the analytical parameters, we take a constant vector space $E_x = E = (W_1, W_2, \ldots, W_t)$ generated by a finite family $(W_i)_i$ of global weights on the onset frame $R_M$. For a given onset frame $x$ of our stemma, we have the linear map $\varphi_x : E \to A_x$ into the vector space of $C^1$ tempo curves on the stemma frame $x$. It maps the weight $W$ to the scaling transformation by the quantity $\varphi_x(W) = |x|^{-1} \int_x W$. This operation leaves constant fields constant, polynomial fields polynomial, etc.; it conserves any reasonable special type of tempo curves. Observe that we really restrict global weights to local frames and do not recalculate local weights when averaging.
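Reading the scaling quantity as the mean value of a global weight over a frame, i.e., the integral of $W$ over the frame divided by the frame length (our reading of the formula above), it can be approximated numerically as follows. The function name, the midpoint rule, and the sample weight are illustrative assumptions.

```python
def average_weight(weight, frame, steps=1000):
    """Approximate the mean of `weight` over frame = (start, end) by the midpoint rule."""
    start, end = frame
    h = (end - start) / steps
    total = sum(weight(start + (k + 0.5) * h) for k in range(steps))
    return total * h / (end - start)


def global_weight(onset):
    """Hypothetical global weight, defined on the whole onset frame."""
    return 1.0 + 0.5 * onset


# The same global weight is restricted to two different frames (e.g. two bars).
factor_first = average_weight(global_weight, (0.0, 2.0))
factor_second = average_weight(global_weight, (2.0, 4.0))
```

Scaling a tempo curve on a frame by such a constant factor keeps constant tempi constant and polynomial tempi polynomial, in line with the remark above. The global weight is only restricted to the frame, not recomputed locally.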
Figure 46.6: The restriction of the generic fiber spaces to 'flying carpet' coherence domains yields characteristic differences between Argerich's and Horowitz' performances.
This being the case, we start by a global constant default tempo $dT$, then take its restrictions to the period daughters, then apply all the scaling transformations to the restricted tempi, then add up all the shifted contributions that are multiplied by the cross-correlation coefficients from a "flying carpet" with causal-final variables
$$causalStart^M,\ causalEnd^M,\ finalStart^M,\ finalEnd^M$$
on the periods. Then we have the 32 restrictions to the single bars, the application of the scalings according to the averaged weight restrictions to the bars, and then ending by the cross-correlations according to "flying carpets" for the four periods and eight bars each, with the total of 16 causal-final variables
$$
\begin{aligned}
&causalStart^A,\ causalEnd^A,\ finalStart^A,\ finalEnd^A,\\
&causalStart^{A'},\ causalEnd^{A'},\ finalStart^{A'},\ finalEnd^{A'},\\
&causalStart^B,\ causalEnd^B,\ finalStart^B,\ finalEnd^B,\\
&causalStart^{A''},\ causalEnd^{A''},\ finalStart^{A''},\ finalEnd^{A''}.
\end{aligned}
$$
We have applied this system to the one and only motivic boiled-down weight
$$XP^{\text{Träumerei}}_{\text{MeloRubette},\,\text{Param}_{\text{melodic}}}$$
which was also used in the statistical analysis of chapter 44. This yields a total of 21 variables and a total of 32 cubic polynomials (one for each bar) in these variables, whose solutions yield the variety lying above the average measured tempi on the 32 bars. Therefore, it cannot be expected that we really have zeros of all these equations. With Mathematica® routines, we have therefore calculated local minima of these polynomial equations and found the following results [348, III] in terms of causal-final variable values:
Result 7 Period level:
• In the inter-period coherence, Argerich is more final than Horowitz, whereas the causal level is more pronounced by Horowitz.
Result 8 Bar level:
• Horowitz plays the first period with pronounced causal and final coherence, whereas the causal coherence decreases to a very low level towards the end of the piece.
• The repetition A' of the first period A shows a 'relaxation of coherence' which may be justified by the repetitive situation.
• The development section B slightly increases the final character.
• The recapitulation seems to be quite 'tired': the causal character is very low, the final character is decreased.
• For Argerich, the first period has a less coherent ambitus than with Horowitz.
• In contrast to Horowitz, the final coherence of Argerich increases as the piece goes on.
• The development and the recapitulation are pronouncedly final. The development and the recapitulation show a consciousness of the end of the piece which is absent with Horowitz.
• In other words, Argerich's recapitulation is 'prospective' and not 'retrospective'.
These calculations are however only locally relevant, and a global solution space with more subtle estimations should also be calculated. Nonetheless, an identical algorithm is applied to both samples, and therefore comparing these inverse performances is admissible.
Part XIII
Operationalization of Poiesis
Chapter 47
Unfolding Geometry and Logic in Time

Keeping time, time, time,
In a sort of Runic rhyme,
To the tintinnabulation that so musically wells
From the bells, bells, bells, bells.
Edgar Allan Poe: The Bells (1849)

Summary. Musical poiesis in composition (and performance) is intimately related to a projection of abstract objects into time. We discuss the logical and geometric aspects of this mapping process. The subject is crucial to the entire art of music since music is involved in the creation of autonomous time beyond the physical "tyranny" of real time. This enforces a review of Michael Leyton's theory of time as a philosophical category which is derived from spatial symmetry transformations [303]. We also discuss the role of unfolding insight within the syntagmatic discourse of music.
After the development and analysis of the various aspects of musical structure and process in this book, it should no longer astonish that the creation of music involves an incredibly complex semiotic, communicative, and reality-critical construction. Far beyond a simple representation and performance of elementary sound events, music is a narration of strong logical and geometric categories, or, at least: without such an intense existentiality, music would never reach the status of a valid antiworld which takes us to an autonomous time and space. This is all the more valid as straightforward common signification processes fail, and meaning has to be built without external references. So the question is legitimate: What is the story that compositional narration is likely to convey to the listener? This question is critical in two regards. First, the narrative discourse without common, external content is not likely to be a remake of the narrative discourse in ordinary language. Second, the absence of an ordinary content (the story) makes it a questionable point whether narration in music is a reasonable category at all, or whether it is rather a metaphor which one instantiates à défaut de mieux, and which one should avoid rather than abuse. Of course, the musical performance is embedded in the physical time-line, just like the ordinary story which is being told in a piece of literature. But in literature, this time surface transports a hierarchy of time strata which owe their existence to told and telling times of a time-sensitive reality, be it a description of events and movements in a fictitious world, be it the story teller's discursive telling time. In music, this is not possible. Except physical time, there are no timely contents that are referred to and represented, including a beginning, middle, and ending part, as Aristotle has described narration in his poetics. This means that if narration does exist in music, it must not only invent an autonomous content layer, but in addition, it has to create the time hierarchy that organizes the narrative stream. In this short chapter, we want to give some remarks on how such a creative process and its contents could be conceived.
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
47.1
Performance of Logic and Geometry
Summary. In contrast to linguistic discourse, musical syntax does not, a priori, tell a story which has its own time dimension. We contend that the intrinsic story in music is about performing logic. This includes, in particular, that musical time means above all syntagmatic time, and not the material time shared by the parametrized music events. We relate the subject to Algirdas Julien Greimas' theory of narrativity [195].
Although there is no common storyboard in music, it does nevertheless realize a narrative discourse. We shall see in chapter 52 that the discursive dialog among four humanist persons (as represented by four instruments from the family of violins) is a characteristic feature of the string quartet. This is all the more remarkable since string quartet music is quite the contrary of program music: it has always been the art form of "absolute music" where an abstract "musical idea" is processed. So what are these musical personalities talking about? In common musicology, one would argue that they are organizing musical ideas. We suggest that, more precisely, they are constructing a poetical work along the principles of Jakobson's poetical function: projecting the paradigmatic axis into the syntagmatic axis, see section 11.6.1 for this function. Such a paradigm could be a pitch or chord class, a tonality, a motif or a melody, a contrapuntal line such as cantus firmus or discant, a rhythm, or an instrumental color, for example. The projection of such paradigms into the syntagmatic axis means (1) selecting representative instances of such paradigms and (2) arranging these instances along the syntagmatic contiguity. So the paradigms in absentia are syntagmatically represented in praesentia. This may be a contiguity in time, such as is the case for a succession of tonalities, or else a contiguity in pitch, such as we have discussed for contrapuntal voices, or a contiguity in a sound color space for a multi-instrumental projection. The point here is that these syntagmatic relations are not linear, i.e., the contiguity can extend to different dimensions. And it can also take place on relatively abstract levels in the sense that, for example, a tonality need not be the concrete score level, but a preliminary organizational level which is a generic scheme for the score realization. We shall also see in chapter 51 on the OpenMusic software that the abstract syntagms are completely natural in intelligent implementations of compositional strategies.
The syntagmatic arrangement of paradigmatic representatives creates the first instance of Greimas' fundamental categories of narration, i.e., succession. But we again have to stress that succession is not necessarily one in physical time; it is one in an abstract parameter space, or even in a space form of generic concepts. The concatenation of successive units is given a logical justification by the insertion of transformational process units, such as contrapuntal interval relations, contrapuntal or harmonic rules of progression, modulatory parts between adjacent tonalities, or transformations of motives to their variations. This enrichment is associated with Greimas' category of transformation which, together with the states of succession, defines his "programme narratif"¹. In the third stage of the narrative organization, the category of "mediation" is recovered in order to embed the narrative program in a global reasoning, i.e., the declaration of a purpose behind the narrative program. This can be, for example, the overall strategy for a modulatory plan as we have seen it in the analysis of the modulatory landscape in Beethoven's op. 106, see section 28.2. Summarizing, the narrative organization is a performance of the logical strategies which dispatch paradigmatic units in the syntagm of their logical concatenation, the latter being explicated by successive transformations, whereas the whole expresses what one really could call "the musical idea". But the syntagm of this organization is by no means a linear one, and even less one in the physical time of a typical telling instance.
And accordingly, the story being told is much more than a temporal succession of events: it is a logical construction of local and global compositions, of morphisms between such objects, of universal constructions in the corresponding categories, in short, of logically and geometrically motivated predicate instances. This is why constraint programming has become an interesting field in computer-aided composition, but see chapter 51 for details. At this point we have to a certain degree reached what Eduard Hanslick [206] calls "tönend bewegte Formen" (roughly, "tonally moving forms"). So what is missing? Essentially, there are two major gaps at this stage:
• A narrative time concept and its organization are still missing.
• The unfolding of the syntagmatic logical display into physical performance time is outstanding.
We are now going to discuss these issues.
47.2
Constructing Time from Geometry
Summary. We sketch predicates of logical and geometric nature which are designed to set up syntagmatic time in music. In this context, Michael Leyton's theory of time is briefly reviewed.
¹ "On appelle programme narratif (PN) la suite d'états et de transformations qui s'enchaînent sur la base d'une relation S-O [Sujet-Objet] et de sa transformation." (One calls narrative program (PN) the sequence of states and transformations that are linked on the basis of an S-O [subject-object] relation and of its transformation.)
As the present organism is a purely logical and geometric one, the only hope to generate narrative time is by means of mechanisms which turn logic and geometry into time. On the logical level, for example, the implication form $A \Rightarrow B$ induces a time category where the condition is antecedent to the implied statement. On the geometric level, there are several symmetry-related "time generators". In fact, a symmetry $S$ can be viewed as a transformation, where objects $x$ are moved around, i.e., $x \mapsto S(x)$, whence a time-stamped relation from $x$ to $S(x)$. A radical approach to time is proposed by Michael Leyton [303, 304]: In its cognitive and physical reality, the time concept is viewed as a derived one. Leyton's theory develops structures to reduce time ontology to a dynamical syntax of spatial symmetries. These symmetry groups are wreath products² $F \wr C$ of a control group $C$ and a fiber group $F$. Leyton's approach allows us to understand music as a natural reconstruction method of time via a syntax of symmetries. This syntax aligns symmetries and their broken variants as an ordering relation: first the symmetry, then its broken version. Time is generated via symmetry breaking. This implies that musical time is generated by such symmetry-breaking relations. Together with the logical time construction, we obtain a variety of ordering relations which are induced by symmetry-breaking processes. More generally speaking, time is generated by logical and geometrical succession relations. For example, if we recall the model for tonal modulation (chapter 27), the modulation quantum is a global composition whose inner symmetries relate the antecedent to the successor tonality, but it is not possible to distinguish these tonalities in their logical role from the structure of this symmetric quantum. Leyton would say [304, symmetry and asymmetry principles, p. 41] that symmetry is without memory, since it is symmetry breaking that constructs memory. The time ordering of the involved tonalities is only generated by marking the trace of the modulation quantum in the target tonality, which in fact defines the modulation steps.
This is also the Greimas transformation part relating the two units in the succession chain. Such a breaking of a symmetry in modulatory degrees creates time and bans the symmetry from the syntagmatic surface. A good example of memory-less construction by unbroken symmetries is Schönberg's dodecaphonic method. It starts from an original dodecaphonic series and proposes a display of some of its (generically) 48 transformed versions under the known group $D_{12,12}$ (see the discussion following definition 22 in section 8.1.1). There is no specification of the method regarding the instantiation of the involved symmetries: they are external to the composed syntagm, and there is no symmetry breaking that would create a time-line among the 48 variants in that orbit. So the narrative structure of this method is not specified. Maybe this drawback, which stands in pronounced contrast to the excellent narrative sonata form, is one of the reasons for the failure of pure dodecaphonism, i.e., dodecaphonism that is not enriched by syntagmatic concepts for the narrative construction.
So let us assume that syntagmatic time is generated as an ordering relation between logical and/or geometric units. Such a time is not necessarily linear; there may be several competing time strings. For example, recall from counterpoint theory that "punctus contra punctum" relates to the horizontal interval succession as well as to the vertical cantus firmus vs. discantus succession (as the melodic implication). So we have two time strings which are simultaneously present in the "contra" process. In a modulatory process, the time-line of tonalities can be superimposed by a vertical time-line of melodic deduction from the harmonic basement, and a time-line induced by an instrumental hierarchy. Such abstract time-lines are the narrative bricks of the musical counter-world.
Suppose that we now have such a narrative organization at our disposal. Then, its unfolding is the following project: In performance, we have to unwind the logical and geometric time-lines in a space of performance time. We have discussed at length the time hierarchies and stemmata of performance in parts VIII and IX. Evidently, these tools must be used to unfold the abstract narrative organization in physical reality. Presently, such a theory is lacking, but the prerequisites are at hand. The main problem with regard to human cognitive capacities is to find an equilibrium between the complexity of the narrative instances of logical and geometric time-lines and the limitation of comprehension of such structures and processes in the physical performance space-time. This means that we have to name principles that define the basic structures of a global composition with regard to its narrative communication. A global composition has a basic syntagm that is defined by its covering and, more specifically, its nerve. This is already present in Schönberg's harmonic strip between successive chords, which is in fact an elementary but prototypical syntagmatic junction (the transformational part between two adjacent succession units in Greimas' theory). So the composition's nerve is an excellent syntagmatic device in that it includes narrative paths (on the nerve's one-dimensional skeleton) of neighboring local charts in praesentia. For example, if we have the interpretation of a diatonic scale by the seven triadic chords, we obtain the harmonic (Möbius) strip as a global composition. The geometric relation of two neighboring degrees on the strip, $I_C$, $III_C$, say, can be given a narrative direction in the succession $I_C^6$, $III_C$ where the common notes $e$, $g$ of the inversion $I_C^6$ and the third degree may be held such that we only have the time event of a downward movement $c \mapsto b$ to indicate the time-line from $I_C^6$ to $III_C$.
² See Appendix 73.
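The nerve of the triadic interpretation of a diatonic scale can be computed directly. The following is a minimal sketch, not the book's implementation: it assumes the C major scale encoded as pitch classes modulo 12, the seven charts being the triads on the seven degrees, and the usual rule that a set of charts spans a simplex exactly when the charts share at least one element. The names `SCALE`, `triad`, `CHARTS`, and `nerve` are hypothetical.

```python
from itertools import combinations

# Pitch classes of the C major scale, indexed by scale degree 0..6 (assumed encoding).
SCALE = [0, 2, 4, 5, 7, 9, 11]

def triad(degree):
    """Triad on a scale degree: root, third, and fifth taken inside the scale."""
    return frozenset(SCALE[(degree + step) % 7] for step in (0, 2, 4))

# The covering: one chart per degree, labeled I..VII.
LABELS = ["I", "II", "III", "IV", "V", "VI", "VII"]
CHARTS = {LABELS[d]: triad(d) for d in range(7)}

def nerve(charts):
    """Return the simplexes of the nerve, grouped by dimension.

    A set of k+1 charts spans a k-simplex iff the charts share an element.
    """
    simplexes = {}
    names = sorted(charts)
    for size in range(1, len(names) + 1):
        found = [
            combo for combo in combinations(names, size)
            if frozenset.intersection(*(charts[n] for n in combo))
        ]
        if not found:
            break  # no simplex of this size, hence none of larger size
        simplexes[size - 1] = found
    return simplexes

N = nerve(CHARTS)
counts = {dim: len(s) for dim, s in N.items()}
euler = sum((-1) ** dim * n for dim, n in counts.items())
print(counts, euler)  # {0: 7, 1: 14, 2: 7} 0
```

The count of 7 vertices, 14 edges, and 7 triangles gives Euler characteristic 0, which is consistent with a Möbius strip but does not by itself prove it; the orientation argument is not reproduced here.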
We contend that any unfolding of the syntagmatic logical and geometric time-lines should locally on the physical time axis produce a small linear storyboard, i.e., minimize the multi-dimensionality of the abstract syntagm, or at least try to stress its most important components. To put it the other way round: Understanding the abstract narrative structure of a composition's nerve amounts to realizing a sequence of paths through the nerve's one-dimensional skeleton such that eventually, all important routes of the nerve's 'city map' have been exposed. The problem here is analogous to the problem of how the eye's path should look when it observes a painting in order to obtain a good understanding of what is seen, or else to the problem of how we should walk through a house in order to optimally understand it as an architectural organism. It is the problem of curvilinear reduction of high-dimensional objects, a problem which the human language stream of words has solved in a very special context and with the success of a very limited textual representation of the world.
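One way to make "a sequence of paths that exposes the whole city map" concrete is a closed walk that uses every edge of the one-dimensional skeleton exactly once. The sketch below is only an illustration under assumptions: it takes the skeleton of the triadic covering of a diatonic scale (degrees numbered 0 to 6, two triads adjacent when they share a note), reads "all important routes" as "all edges", and uses Hierholzer's algorithm, which applies here because the graph is connected and every vertex has even degree. The function names are hypothetical.

```python
def skeleton_edges():
    """Edges of the harmonic strip's 1-skeleton.

    The triad on degree i uses scale degrees {i, i+2, i+4} mod 7, so two
    triads on degrees i, j share a note exactly when (j - i) mod 7 is
    2, 3, 4, or 5.
    """
    return {(i, j) for i in range(7) for j in range(i + 1, 7)
            if (j - i) % 7 in (2, 3, 4, 5)}

def euler_circuit(edges, start=0):
    """Hierholzer's algorithm: a closed walk using every edge exactly once."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    stack, circuit = [start], []
    while stack:
        v = stack[-1]
        if adjacency[v]:
            w = adjacency[v].pop()      # follow an unused edge v-w
            adjacency[w].remove(v)      # and mark it as used in both directions
            stack.append(w)
        else:
            circuit.append(stack.pop())  # dead end: v is finished
    return circuit[::-1]

walk = euler_circuit(skeleton_edges())
labels = ["I", "II", "III", "IV", "V", "VI", "VII"]
print(" -> ".join(labels[v] for v in walk))
```

The walk visits 15 vertices (14 edges) and returns to its starting degree. Which of the many possible circuits is musically reasonable is exactly the question the text leaves open.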
47.3
Discourse and Insight
Summary. This final section locates the musical discourse as a means of gaining insight into complex multidimensional structures by syntagmatic sequentialization (linearization). This contrasts with the linguistic discourse, since the latter has rich semantic rootings which are absent in music and which must therefore be balanced by explicit discursivity.
The advantage of common language over music is the presence of external semantics: the language stream runs around any kind of complex entities that are not part of the language structure. This makes language an easy business since it does not include its objects, but only points to them. In contrast, music cannot point outside (except for trivial special cases of onomatopoeia); it has to build everything from an inner discourse. This is also its advantage and scope: to be able to invent worlds instead of pointing to them. In this sense it is an excellent exercise to learn what is really needed to create a 'world' in our imagination, i.e., a meaningful spiritual architecture without external roots. The investigation of the narrative discourse should teach us how we gain insight into things when telling their story, meaning: how to understand music while playing it.
Chapter 48
Local and Global Strategies in Composition
Aus den Auswahlkriterien also entsteht die Dialektik der Abfolge oder Verknüpfung von Lokalstrukturen, wobei diese Auswahlkriterien bestimmend sind für die Eingliederung der Lokalstrukturen in die große allgemeine Struktur, die Form.
(From the selection criteria thus arises the dialectic of the succession or linking of local structures, these selection criteria being decisive for the integration of the local structures into the large general structure, the form.)
Pierre Boulez [60, II]
Summary. This chapter sketches the compositional process between paradigmatic selection and syntagmatic combination within the musical sign system. Apart from these semiotic perspectives, the process is characterized by a "dialectic" interrelation of local and global criteria. These features, well known from the general structure theory of global compositions, reappear in the special light of poiesis: The construction of a composition resembles the step-by-step completion of a puzzle of logical units, distributed in syntagmatic time, and selected to optimize association to already placed units. This activity is rewarding and is fed back by a successive accumulation of poetical semantics.
We do not intend to describe the psychological path in the composer's mind, but the fundamental strategic steps, independently of the psychological or cognitive realization. We also do not impose this strategy on any composer, but want to sketch a possible system for composition which can also be implemented on the software level. We shall see that two implementations, presto® and OpenMusic, realize quite a portion of these ideas. Essentially, the poiesis of a musical composition is the construction of a global composition, i.e., a patchwork of local charts, together with gluing transformations. Moreover, the charts are selected from a set of paradigms and combined according to a set of syntagmatic rules. But the concrete making of a composition is not a one-step procedure; it results from a successive completion of an ensemble of charts. Ideally,
1. one is given a simplicial complex and would like to realize it as the nerve of a covering by local charts;
2. one also wants the charts to pertain to determined isomorphism classes or else to be specializations of representatives of such classes;
3. one finally wants particular gluing transformations on determined parts of the charts.
Point two reflects the analytical approach using the paradigmatic theme of Ruwet and Nattiez, see section 11.7. But it extends it in that the nerve is planned as a syntagmatic design pattern, further specialization and not only transformation of basic paradigms is allowed, and finally, the gluing transformations for the syntagmatic combination are thematized.
48.1
Local Paradigmatic Instances
Summary. Local paradigmatic strategies can be split into transformational and topological procedures. In each case, we describe the basic options.
Recall from chapter 10 that paradigmatic relations can be either transformational or topological, or combinations thereof. Whereas the former is related to symmetries, the latter is related to the idea of variation. Usually, transformations will quite brutally change the auditory impression of a local composition and must be used with care, whereas topological variation is more adequate for instantaneous recognition of similarity.
48.1.1
Transformations
Summary. Starting from Schönberg's dodecaphonic method, we describe varieties of local compositions produced by a determined set of transformation groups, the paradigmatic themes of poiesis, acting on a selected set of local "germs".
This method starts from a sequence $S = (G_1, \ldots, G_c)$ of local germs, for instance thematic units such as the singleton $S = (G_1)$ containing a dodecaphonic series, or a two-element set $S = (G_1, G_2)$ containing a main motif and a side motif of a sonata. Then, for each local germ $G_i$, we are given a transformation group $P_i$, the paradigmatic group of this germ, which describes the a priori allowed transformations. For instance, for dodecaphonic compositions, we have $P_1 = D_{12,12}$. It is however not necessary to realize the entire orbits $P_i \cdot G_i$ of these group actions; a composition will usually select a small number of such transformed germs. In the construction of the germinal melody of the "Synthesis" composition in section 11.6.3, see also figure 11.16, we have 26 isomorphism classes of three-element motives and take one representative of each class, where the transformation group is the same for all classes, i.e., the full affine group on $\mathbb{Z}_{12}$.
The main point in this procedure is the instantiation of the transformations within the composition. In fact, it is very rarely sufficient to place just two objects $G_i$, $p \cdot G_i$ in contiguous position in order to communicate the particular symmetry $p$ which is responsible for the succession. It is part of the failure of dodecaphonism that its exponents never cared about instantiating $p$. A nice example of such an instantiation in Beethoven's op. 106 was given in section 28.2.5.
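To see what "the entire orbit of a germ" means in the dodecaphonic case, one can enumerate the classical forms of a twelve-tone series. This is a minimal sketch under assumptions: the orbit is generated by transposition, inversion, and retrograde on pitch classes modulo 12, which is taken here to be what the count of 48 refers to; the series `ROW` is hypothetical and not taken from any score.

```python
def row_forms(row):
    """All prime, inverted, retrograde, and retrograde-inverted forms of a row."""
    forms = set()
    for n in range(12):
        prime = tuple((x + n) % 12 for x in row)      # transposition by n
        inverted = tuple((n - x) % 12 for x in row)   # inversion, then transposition
        forms.update({prime, inverted, prime[::-1], inverted[::-1]})
    return forms

# Hypothetical series, chosen only for illustration.
ROW = (0, 1, 3, 2, 5, 7, 4, 9, 11, 6, 8, 10)
assert sorted(ROW) == list(range(12))  # every pitch class exactly once

forms = row_forms(ROW)
print(len(forms))  # 48 for this row; rows with internal symmetries yield fewer
```

The enumeration makes the point of the text visible: the orbit is an unordered set, and nothing in it says which form should follow which, or how a listener would recognize the transformation relating two neighbors.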
48.1.2
Variations
Summary. Instead of transformation groups, a set of local "germs" can be deformed according to a set of similarity paradigms (e.g., gestalt topologies).
In this approach, there is no group action and the original germ $G_i$ must be associated with one of its specializations by use of deformation procedures. For example, one might superimpose a force field on the given germ and then move its points around according to the local force action. Such a procedure has been implemented in the OrnaMagic module of the presto® software, see section 49.3. This variational change of perspective is very valuable for understanding a germ's potential, but in general will not give us back the original germ. The quantity of change is a function of the gestalt topology used, whose choice depends on the composer's preferences.
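A force-field deformation can be sketched in a few lines. The following toy example is an assumption-laden illustration, not the OrnaMagic algorithm: germs are lists of (onset, pitch) points, the "field" is a set of attractor points, and each germ point is pulled a fixed fraction of the way toward its nearest attractor. The function name `deform` and the parameter `strength` are hypothetical.

```python
def deform(germ, attractors, strength=0.5):
    """Pull each (onset, pitch) point toward its nearest attractor.

    strength = 0 leaves the germ unchanged; strength = 1 snaps every point
    onto its nearest attractor.
    """
    deformed = []
    for (x, y) in germ:
        # Nearest attractor in squared Euclidean distance.
        ax, ay = min(attractors, key=lambda a: (a[0] - x) ** 2 + (a[1] - y) ** 2)
        deformed.append((x + strength * (ax - x), y + strength * (ay - y)))
    return deformed

germ = [(0.0, 60.0), (1.0, 64.0), (3.0, 67.0)]   # hypothetical three-note motif
grid = [(0.0, 62.0), (4.0, 62.0)]                # hypothetical attractor points
print(deform(germ, grid, strength=0.5))
# [(0.0, 61.0), (0.5, 63.0), (3.5, 64.5)]
```

As the text notes, such a deformation is in general not invertible: with `strength=1` distinct points can collapse onto the same attractor, so the original germ cannot be recovered.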
48.2
Global Poetical Syntax
Summary. According to rules of contiguity and semantic "added value", a variety of transformed and/or deformed local germs is distributed along the syntagmatic space. We systematize the possible procedures according to horizontal and vertical poetical functions.
The construction of a global assembly of local charts which are provided by the local techniques is very delicate, see also Pierre Boulez's reflections [60, II]. For instance, the germinal melody in "Synthesis" was built by starting from the generic motif class representative and then gluing it with a representative of the first different motif class that fits with the first representative on a subset of two points. But the possibility of continuing in this way until every class is represented exactly once is not obvious; it is a constraint on the compositional material that could have ended in an incomplete solution. The normal procedure is a puzzle reconstruction: One begins with any interesting germ and adds other germs at any syntagmatic position (not necessarily adjacent), and successively densifies the display, thereby observing constraints on the nerve that one wants to construct. For example, in the sonata "L'essence du bleu" [368], the plan as described in [328] was to construct a global composition on a motivic zig-zag {c, c♯, d, d♯, e, d♯, d, c♯, c} by use of nine three-element submotives in such a way that the nerve turns out to be a Möbius strip.
The construction of a nerve is not only complicated as a constraint problem, it is also a problem of narrative character: the intersection configurations must also be mediated on the concrete time axis, and in such a way that the transformations between associated units can be made explicit, e.g., the Möbius strip must be boiled down to the time axis such that its one-dimensional skeleton (we stick to the ideas from section 47.2) is traversed on reasonable paths. Evidently, the main point of this global patchwork is the syntagmatic distribution of paradigmatic representatives in order to achieve an added poetical value in the sense of Jakobson's poetical function. This objective splits into two subtasks: horizontal and vertical poetical functions according to Jakobson and Posner, respectively.
48.2.1
Roman Jakobson's Horizontal Function
Summary. Distribution of a variety of transformed and/or deformed local compositions can follow Jakobson's correspondence [245] along the syntagmatic axis.
Jakobson's poetical function was introduced in section 11.6.1. Counterpoint, harmonic syntax or dodecaphonic row distribution are examples of this mechanism. The most elementary realization of this function is the action of a translation group on rhythmical units in order to produce a cyclic character of our composition. This semantic enrichment creates a coherence of the compositional corpus which is basic to all other elements. It may, for example, connect motives, harmonies and similar units if they are distributed on the regular positions of the rhythmic basis. But it also creates a time circle that abolishes the straight physical time-line and rebuilds a "good" cyclic eternity (in Hegel's sense) which our finite life was to negate.
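The translation-group action on a rhythmic unit, and the "time circle" it induces, can be illustrated as follows. This is a hedged sketch: onsets are assumed to be integers in some fixed unit (eighth notes here), and the unit, the period, and both function names are hypothetical.

```python
def translate_orbit(unit, period, repetitions):
    """Copy a rhythmic unit (list of onsets) under the translations k * period."""
    return [onset + k * period for k in range(repetitions) for onset in unit]

def positions_mod_period(onsets, period):
    """Reduce onsets modulo the period: the positions on the time circle."""
    return sorted({onset % period for onset in onsets})

unit = [0, 3, 6]   # hypothetical onsets in eighth notes (a 3+3+2 pattern)
period = 8         # one 4/4 bar in eighth notes
onsets = translate_orbit(unit, period, repetitions=4)
print(onsets)                                # [0, 3, 6, 8, 11, 14, 16, 19, 22, 24, 27, 30]
print(positions_mod_period(onsets, period))  # [0, 3, 6]
```

Reducing modulo the period collapses the four bars onto the same three positions, which is one way to read the statement that the translation action replaces the straight time-line by a circle. Motives or harmonies placed at these regular positions become syntagmatically equivalent in Jakobson's sense.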
48.2.2
Roland Posner's Vertical Function
Summary. Jakobson's poetical function is orthogonal to Posner's vertical function which relates different signification levels in denotation and connotation. We discuss strategies which make use of this functionality in the production of poetical "added value".
Jakobson's poetical function relates signs of the same semiotic system, for example, two phonological signs such as "bad", "dad" which are positioned in syntagmatically equivalent places, in a rhyme, say. But this structure is enchained with the semantic sign system (by the double articulation of language, see [361]) where the phonological equivalence in the rhyming places produces a paradigmatic equivalence between "bad" and "dad", i.e., "dad is bad", a meaning which is inherent neither in "bad" nor in "dad". So the level of equivalence is shifted from the phonological to the semantic one. This shifting of equivalence in Jakobson's function illustrates the vertical function as proposed by Roland Posner [421]. In music, such a construction is not obvious since connotative semantic levels are not automatic, but must be constructed or simulated. This can be achieved by a small global composition which is attached to a local 'signal' composition whose equivalence to another local 'signal' composition on the horizontal Jakobson function induces an equivalence of the attached small global compositions. Such a small global composition can be a configuration consisting of a chord interpretation and a melodic unit, for example. So here, the connotative levels are constructed by successively enriched small global compositions within the total global composition.
48.3
Structure and Process
Summary. The poietic process is confronted with the resulting 'score' structure in order to recall a more general approach to the score concept.
We have stressed the global composition as a target of compositional strategies. However, the making of this composition can be more relevant than the result. In other words, the general score concept which we discussed in section 33.3.2 may be specialized to the interior score known from jazz: Instead of a more or less fixed score structure in the form of a global composition, the processes that lead to the effectively played material are stressed, while the resulting structure is quite secondary (albeit not irrelevant, as some fundamental critics of free jazz have argued) in that it is one possible variant of those processes, and another variant, as an exemplification of the operating processes, would do as well. The situation is comparable to algorithms for complex shapes, such as L-systems or fractals. They are defined as processes, and the resulting variants are all an expression of the same basic process type, as with biological phenotypical expression of the genotype¹. It is a very interesting research area to classify process score types and to implement corresponding tools in music composition software. An example of such an approach (the composition software OpenMusic) is given in chapter 51. The following chapter 49 describes software (the composition software presto®) which is mainly based on paradigmatic strategies. As a matter of fact, such strategies are very difficult to handle because a paradigm is, by definition, in absentia to the text, whereas syntagmatic structures are in praesentia, which means that software built on the LEGO-like juxtaposition of bricks will be accepted more easily than paradigmatic software. However, if the bricks are no longer concrete musical material, but abstract units of processual character, the ease of the LEGO approach is no longer valid.
¹ Phenotype is the "outward, physical manifestation" of the organism, while genotype is the "internally coded, inheritable information" (written in the genetic code) carried by all living organisms.
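The comparison with L-systems can be made tangible with a few lines of code. The sketch below uses Lindenmayer's classic "algae" rules; the reading of symbols as durations is a purely hypothetical mapping added for illustration, and the names `lsystem`, `RULES`, and `DURATIONS` are not from the source.

```python
def lsystem(axiom, rules, generations):
    """Rewrite every symbol in parallel, `generations` times."""
    word = axiom
    for _ in range(generations):
        word = "".join(rules.get(symbol, symbol) for symbol in word)
    return word

# Lindenmayer's "algae" system: A -> AB, B -> A.
RULES = {"A": "AB", "B": "A"}

# Hypothetical reading of symbols as durations in beats; any mapping would do.
DURATIONS = {"A": 1.0, "B": 0.5}

for n in range(5):
    word = lsystem("A", RULES, n)
    print(n, word, sum(DURATIONS[s] for s in word))
# 4 ABAABABA 6.5   (last line printed)
```

Each generation is one "variant" of the same process: the rules (the genotype, in the text's analogy) stay fixed, while the generated word (the phenotype) changes with the number of rewriting steps.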
Chapter 49
The Paradigmatic Discourse on presto®
Damit können Sie mich eine ganze Nacht lang allein lassen!
(With this you can leave me alone for a whole night!)
Herbert von Karajan to Guerino Mazzola on the occasion of the presentation of the presto® prototype in Salzburg 1984
Summary. The presto® composition software was developed as a commercial implementation and operationalization of mathematical music theory. Several large compositions have been successfully realized on this software [48, 49, 338]. One of them will be discussed in chapter 50. We describe the overall architecture and functionality of presto®. This is mainly driven by a local/global paradigmatic perspective. The paradigmata of transformation and deformation are realized on (1) the level of modular affine transformations in the four-dimensional space of onset, pitch, duration, and loudness, and (2) the level of variational deformations. Both paradigmata are discussed and exemplified. We conclude the chapter with a remark on the problem of abstraction in paradigmatic composition, since composers tend to have major difficulties in becoming familiar with abstract paradigmatic structures.
Although the software was implemented in ANSI C on now historic Atari® computers, we first describe its functional scheme since it represents a prototypical and still unique way of thinking about paradigmatic composition strategies (to our knowledge, no similar strategy has been implemented in software to date). The presto® software as well as the entire source code are released under the GPL and can be downloaded, together with a manual, examples, and a concept presentation software prestino.prg, from the book's CD-ROM, see page xxx.
49.1
The presto® Functional Scheme
Summary. We describe the functional scheme of presto®. The scheme is centered around the backbone of global score, score, and local score. This threefold architecture reflects an early stage of the global-local paradigm from mathematical music theory.
[Figure 49.1: diagram not recoverable from the extraction; legible labels: Easy, Controllers, Standard Transform., General Transform., Master]
Figure 49.1: The functional scheme of the presto® software. It is centered around the back bone of global score, score, and local score. The graphics of this chapter are all from the original presto® manual [338]. The functional architecture of presto® consists of a series of so-called scores, see figure 49.1. The global score, the score, and the local score deal with geometric representation and editing of notes (mental note events) on global to local space levels. The recording score allows for mousedriven and instrumental real-time input, the transformation score deals with graphical input of affine transformations, and the grid score allows for graphical input of ornamental grids. The input goes via recording via mouse or MIDI instrument, loading from MIDI or presto® data files, and painting with the mouse. Editing features split into affine transformations on charts of notes, building of ornaments, all-parametric variations according to ornamental attraction and repulsion fields, instrumental and parametric coloring of local charts, Boolean combination operations on groups of notes, and performance editing, especially construction of complex tempo hierarchies. For the latter, please refer to our corresponding discussion in section 38.2. Output is split into audio and SMPTE, saving is on MIDI and presto formats. Figure 49.2 shows the main window of presto®. The rectangular score window in the middle shows onset (horizontal) and pitch (vertical, can also be set to loudness or duration),
49.1. THE PRESTO® FUNCTIONAL SCHEME
947
and Schumann's Kinderszene "Haschemann". The black subrectangle marks a block which may be copied, cleared, moved, etc. Below the score, the tempo curve is visible. Above the score, the total composition space of the global score (narrow rectangle) is seen, below the tempo curve, six registers for provisional local parts of the score are placed. On top, we see 16 icons for instrumental colors (MIDI program change data), checked means the instrument is active; while editing, one may work on any checked subset. The different score and register windows
Figure 49.2: The main window of presto®. The rectangular score window in the middle shows onset (horizontal) and pitch (vertical, can also be set to loudness or duration), and Schumann's Kinderszene "Haschemann". implement some chart types in the global composition strategy. One may perform any Boolean operations on such parts, e.g., on two registers. Evidently, modern object-oriented windowing techniques could vastly generalize this elementary implementation. For a detailed editing of small composition parts, the local score window is available, see figure 49.3. The local score can be opened by double-clicking on a selected position on the score; after editing it can be merged to the score. A miniature view of the local score's content is visible on the right lower corner of the main window in figure 49.2. The local global paradigm is realized on the local score via the feature of "coloring". On the local score, we view a four-dimensional cube, representing the four-dimensional discrete torus Zil' in any of the six relevant projections Onset+ Pitch, Onset + Loudness, etc. onto two of the four parameters of pitch, onset, duration, and loudness. All parameters are integers modulo 71 and suitably calibrated: Pitch is a MIDI
key number in the interval [27, 97]; loudness is a MIDI velocity value between 0 and 127, equally distributed among 71 values; onset and duration are in integer units that can be set to different fractions of a bar, depending on the time signature and the resolution preferences for each metrical unit. The local-global paradigm is realized by the coloring feature: The user may define any closed polygon in a 2D projection, and thereby select all events that lie in that region with regard to the given projection. Such a coloring domain can be used to do many different things, such as moving around on $\mathbb{Z}_{71}^4$, copying, erasing, transforming, setting new instrumental icons, or playing. By mouse-driven drawing actions, new events can be inserted. The parameter plane can be changed at any time. To the right, we see the orchestration (only piano in figure 49.3, see below for complex orchestras), where any instrumental icon can be unchecked for instrumentally specified editing options. Of course, the selection of a toroidal representation and manipulation (for example for shifting operations, where events exit to the right and reenter from the left) is questionable, but once you are dealing with affine operations, such finiteness decisions have to be taken. The advantage of this selection will become clear in the next section 49.2.

Figure 49.3: The local score is the interface for detailed editing of small portions of the actual composition. The darkened polygon marks a region where any operation, such as copy, erase, transform, change the instrumental icon, or simply play can be performed. New events can be defined by mouse drawing actions.

Figure 49.4: The transformation score is the interface for detailed editing of affine transformations of the torus $\mathbb{Z}_{71}^2$. The user's editing acts on a number of standard transformations (buttons to the right, middle), matrix text fields, and a graphical editing option by direct drawing of affine images of the unit square.
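The calibration, the toroidal shift, and the polygon-based coloring described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the presto® implementation: the event layout (onset, pitch, loudness, duration), all function names, the rounding used for the velocity mapping, and the even-odd polygon test are hypothetical choices made here.

```python
MODULUS = 71          # every parameter is an integer modulo 71
PITCH_LOW = 27        # lowest MIDI key; 27..97 gives exactly 71 keys


def pitch_to_midi_key(pitch):
    """Map a torus pitch coordinate 0..70 to a MIDI key number 27..97."""
    return PITCH_LOW + pitch % MODULUS


def loudness_to_velocity(loudness):
    """Spread 71 loudness steps evenly over velocities 0..127 (assumed rounding)."""
    return round((loudness % MODULUS) * 127 / (MODULUS - 1))


def shift(event, delta):
    """Translate an (onset, pitch, loudness, duration) event; coordinates wrap mod 71."""
    return tuple((c + d) % MODULUS for c, d in zip(event, delta))


def inside_polygon(point, polygon):
    """Even-odd ray casting test for a point against a closed polygon."""
    x, y = point
    inside = False
    n = len(polygon)
    for k in range(n):
        x1, y1 = polygon[k]
        x2, y2 = polygon[(k + 1) % n]
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def color(events, polygon, axes=(0, 1)):
    """Select the events whose projection onto the two given axes lies in the polygon."""
    a, b = axes
    return [e for e in events if inside_polygon((e[a], e[b]), polygon)]


events = [(5, 5, 40, 2), (20, 5, 40, 2), (70, 30, 10, 1)]
square = [(0, 0), (10, 0), (10, 10), (0, 10)]
selected = color(events, square)            # onset/pitch projection
wrapped = shift(events[2], (1, 0, 0, 0))    # onset 70 + 1 reenters at 0
```

Changing `axes` corresponds to switching the parameter plane; the same polygon then selects a different set of events.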
49.2
Modular Affine Transformations
Summary. This section describes the mathematical framework of modular affine transformations in $\mathbb{Z}_{71}^2$, their four-dimensional extension and the combination options for such transformations, their action on selected local charts, and the graphical input features.
The selection of the torus $\mathbb{Z}_{71}^4$ was motivated by the small number of pixels on those early screens, together with the need for icons in order to represent instrumental 'colors'¹. Now, the selection of a prime number was not by chance. In fact, the choice offers us a finite field where non-zero determinants of matrices automatically yield invertible matrices. So a large number of transformations become invertible, and therefore a large number of compositional processes become non-destructive. The software offers a transformation interface where any affine endomorphism in $\mathbb{Z}_{71}^4 @ \mathbb{Z}_{71}^4$ can be defined graphically as well as numerically, see figure 49.4. We first use the fact (see section E.3.6, theorem 53) that $\mathbb{Z}_{71}^4 @ \mathbb{Z}_{71}^4$ is generated by its transformations which only move around two of the four dimensions. So the user may transform his/her material on the selected plane, then switch to another plane for further transformations, etc. The user may directly define a product of any number of plane transformations (for different planes) by a feature which memorizes a list of plane transformations. We have a total of

$$71^{16} \cdot 71^{4} = 10\,596\,610\,576\,391\,421\,032\,662\,867\,140\,133\,202\,401 \approx 1.05966 \times 10^{37}$$

elements in $\mathbb{Z}_{71}^4 @ \mathbb{Z}_{71}^4$, and the number of

$$10\,445\,260\,466\,832\,483\,579\,436\,191\,905\,936\,640\,000 \approx 1.04453 \times 10^{37}$$

elements in $\overrightarrow{GL}(4, 71)$ according to the formulas in appendix C.3.5. The transformation score in figure 49.4 shows the basic input choices: matrix coefficients, standard transformations, and graphical editing by definition of the images of the three points: origin, head of horizontal unit vector, head of vertical unit vector. The parallelogram image of the unit square is visualized on the local score. The geometric advantages of the prime number 71 are abundant. Among others, we have these facts: Since the multiplicative group $\mathbb{Z}_{71}^{\times}$ is cyclic of order 70, i.e., isomorphic to $\mathbb{Z}_{70} \cong \mathbb{Z}_2 \times \mathbb{Z}_5 \times \mathbb{Z}_7$, we have fifth and seventh roots of unity in $GL(4, 71)$. We also have $SO_2(71) \cong \mathbb{Z}_{72}$ (see appendix C.3.5), and therefore, a generator D of the special orthogonal group. Such a generator is

$$D = \begin{pmatrix} 30 & 38 \\ 33 & 30 \end{pmatrix},$$

which means that we may view D as a rotation by 360° : 72 = 5°. In the software, we have implemented the rotation by the triple, i.e., $D^3 \sim$ 15°-rotation. Of course, such rotations in the modular torus can transform harmless compositions into very wild looking variants, but we have always experienced that the transformed version maintains certain regularities that were present in the original form.
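The two counts and the order of the rotation D can be checked with plain integer arithmetic. The following Python sketch is an illustration only; the helper names are hypothetical, the count of invertible matrices uses the standard formula for the order of $GL(n, p)$ over a prime field, and D is read row by row as (30, 38) and (33, 30).

```python
P = 71

# Affine endomorphisms of Z_71^4: a 4x4 matrix (71^16 choices) plus a translation (71^4).
affine_endomorphisms = P**16 * P**4

# Invertible 4x4 matrices over the field with 71 elements, times the 71^4 translations.
gl_order = 1
for k in range(4):
    gl_order *= P**4 - P**k
affine_automorphisms = gl_order * P**4

IDENTITY = ((1, 0), (0, 1))
D = ((30, 38), (33, 30))   # determinant 30*30 - 38*33 = 1 (mod 71)


def mat_mul(a, b):
    """Multiply two 2x2 matrices with entries reduced modulo 71."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) % P for j in range(2))
        for i in range(2)
    )


def order(m):
    """Smallest n >= 1 with m^n equal to the identity (m must be invertible)."""
    power, n = m, 1
    while power != IDENTITY:
        power = mat_mul(power, m)
        n += 1
    return n


print(affine_endomorphisms)
print(affine_automorphisms)
print(order(D))
```

The order of D comes out as 72, in agreement with the statement that D generates a cyclic group isomorphic to $\mathbb{Z}_{72}$, so that $D^3$ has order 24 and plays the role of a 15° rotation.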
49.3
Ornaments and Variations
Summary. Modular affine transformations are used to build ornaments by translation grids of "cells" of notes. Such ornaments are used directly as periodic note sets, for instance in drum patterns. They may also be used as "background" ornaments whose points act as centers of attraction or repulsion for other notes. The latter method is a multidimensional generalization of the well-known variational technique, in particular of the alteration of pitch sets (scales, chords, motives, etc.).

¹ Because of low resolution, color was not a good commercial option in 1988.
Figure 49.5: An ornament is defined by a motif, together with a translation grid and a range in each direction of the defining grid vectors. The grid cell is the parallelogram spanned by the generating vectors.

The software's module for ornaments is termed OrnaMagic. The idea is this: The user first defines a motif M, either a small one on the local score, or an arbitrarily large one on the score. M is just a local composition in the four-dimensional EHLD space of the software. Next, two (usually linearly independent) translation vectors $g_h, g_v$ define the grid, i.e., the group $\langle e^{g_h}, e^{g_v} \rangle$. This group operates on the motif M and yields a translated motif $M_{i,j} = e^{i \cdot g_h + j \cdot g_v} \cdot M$ for each integer pair (i, j), see figure 49.5. The user defines a two-dimensional ornament of a special ornament window, the grid score,
Figure 49.6: The grid score is the interface for defining an ornament via its grid vectors (so-called "horizontal" $g_h$ and "vertical" $g_v$) and the range $a \le i \le b$, $c \le j \le d$, i.e., the interval of horizontal/vertical translation $e^{i \cdot g_h + j \cdot g_v}$ to be effected on the motif M.

have two local compositions, L and G, where G plays the role of the driving grid, whereas L is the composition which we want to deform. Usually, G is defined by an ornamental construction as described above, but this is not mandatory, i.e., one may also take a large motif M and just apply the grid group on the zero range (a = b = c = d = 0). Given these data, the alteration works as follows. Each event x of L is shifted towards or away from G according to a procedure which starts with the calculation of a 'nearest point' G(x) to x in G. This point is found by starting at x and then moving along a spiral outward around x until the first point of G is hit; this is G(x). Following this first algorithm, several alteration strategies are offered. First, one may choose a degree of deformation, say y%, a positive or negative real number. This means that the vector $G(x) - x$ is stretched to $y \cdot (G(x) - x)$, and that the alteration of x is set to $x + y \cdot (G(x) - x)$. However, this is not the last word in alteration: one may also define its direction. Typically, this is vertical (conserving onset time), or horizontal (conserving pitch), etc. But the vector $G(x) - x$ is neither one nor the other. So we have to introduce the projection $p_t(G(x) - x)$ of the difference $G(x) - x$ to the direction t. This yields the final alteration

$$A_{t,y,G}(x) = p_t(y \cdot (G(x) - x)) + x$$

of x according to the system variables t, y, G.
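The ornament construction and the alteration formula can be sketched together in Python. This is a minimal illustration under stated assumptions, not the OrnaMagic code: events are reduced to (onset, pitch) pairs, no reduction modulo 71 is applied, the spiral search for G(x) is replaced by a plain minimum over squared Euclidean distances, the projection $p_t$ is taken to be the orthogonal projection onto the line spanned by t, and y is given as a fraction (1.0 for 100%). All names are hypothetical.

```python
def ornament(motif, g_h, g_v, i_range, j_range):
    """Union of the translates M_{i,j} = e^{i*g_h + j*g_v} . M for a<=i<=b, c<=j<=d."""
    (a, b), (c, d) = i_range, j_range
    points = set()
    for i in range(a, b + 1):
        for j in range(c, d + 1):
            for x in motif:
                points.add(tuple(xk + i * hk + j * vk
                                 for xk, hk, vk in zip(x, g_h, g_v)))
    return points


def nearest_point(x, grid):
    """Stand-in for the spiral search: closest grid point by squared distance."""
    return min(sorted(grid), key=lambda g: sum((gk - xk) ** 2 for gk, xk in zip(g, x)))


def alter(x, grid, y, direction=None):
    """A_{t,y,G}(x) = x + p_t(y * (G(x) - x)); direction=None means no projection."""
    gx = nearest_point(x, grid)
    diff = [y * (gk - xk) for gk, xk in zip(gx, x)]
    if direction is not None:
        # Orthogonal projection of diff onto the line spanned by the direction t.
        scale = sum(d * t for d, t in zip(diff, direction)) / sum(t * t for t in direction)
        diff = [scale * t for t in direction]
    return tuple(xk + dk for xk, dk in zip(x, diff))


# A one-point motif translated over a 3 x 3 range of the grid spanned by (4, 0) and (0, 3).
G = ornament({(0, 0)}, g_h=(4, 0), g_v=(0, 3), i_range=(0, 2), j_range=(0, 2))

full = alter((5, 1), G, 1.0)                        # moves onto the nearest point (4, 0)
vertical = alter((5, 1), G, 1.0, direction=(0, 1))  # only the pitch changes
half = alter((5, 1), G, 0.5)                        # 50% of the way towards (4, 0)
```

A negative `y` pushes the event away from the grid, which corresponds to the repulsion case mentioned in the text.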
Figure 49.7: Drum patterns in the composition "Synthesis".

This is all implemented in the OrnaMagic module of presto®. Moreover, given a block with onset limits u < v, the user can define a successively increased alteration by setting the percentage to y(u), y(v) and altering from $A_{t,y(u),G}$ to $A_{t,y(v),G}$ as onset moves from u to v. Such procedures have been used in the composition "Mystery Child" in [49]. This piece can also be heard from the book's CD-ROM, see page xxx. The effect resembles a morphing operator which lets the background ornament G act on L with successive variation of its "alteration force field". As a special case, we have the classical pitch alteration that is driven by a background tonality G, as well as the onset alteration, better known as the groove effect on sequencers, defined by an onset grid that is derived from a rhythmical onset configuration. These special effects are implemented as "easy alteration variants" in presto®.
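The successively increased alteration, specialized to the groove effect on onsets, might look as follows in Python. This is a sketch under assumptions: the text does not say how the percentage moves from y(u) to y(v), so linear interpolation is assumed here, and the function names are hypothetical.

```python
def degree_at(onset, u, v, y_u, y_v):
    """Alteration degree at a given onset, interpolated linearly between y(u) and y(v)."""
    s = (onset - u) / (v - u)
    return y_u + s * (y_v - y_u)


def groove(onsets, onset_grid, u, v, y_u, y_v):
    """Pull each onset towards its nearest grid onset with an onset-dependent degree."""
    altered = []
    for e in onsets:
        target = min(onset_grid, key=lambda g: abs(g - e))
        y = degree_at(e, u, v, y_u, y_v)
        altered.append(e + y * (target - e))
    return altered


# Degree grows from 0% at onset 0 to 100% at onset 10.
result = groove([1, 5, 9], [0, 4, 8, 12], u=0, v=10, y_u=0.0, y_v=1.0)
```

Early events stay almost where they are, while late events land nearly on the grid, which is the morphing behavior described above.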
Example 60 A special application of pitch alteration is tonal inversion: If we want, for example, to apply tonal alteration of the C-major scale around e (leaving e fixed, exchanging d and f, c and g, b and a), we may first apply the real inversion $U_e$ and then apply the tonal alteration downwards with respect to C-major.
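Example 60 can be traced step by step in Python. The sketch assumes MIDI key numbers with e = 64 and models the downward tonal alteration as "lower the pitch by semitones until it lies in the scale"; both choices and all names are assumptions made for illustration.

```python
C_MAJOR = {0, 2, 4, 5, 7, 9, 11}   # pitch classes of the C-major scale


def real_inversion(pitch, center):
    """The inversion U_center: reflect the pitch at the center pitch."""
    return 2 * center - pitch


def alter_down(pitch, scale=C_MAJOR):
    """Downward tonal alteration: move down to the nearest pitch of the scale."""
    while pitch % 12 not in scale:
        pitch -= 1
    return pitch


def tonal_inversion(pitch, center):
    """Real inversion followed by downward alteration with respect to C-major."""
    return alter_down(real_inversion(pitch, center))


E = 64
scale_pitches = [60, 62, 64, 65, 67, 69, 71]            # c d e f g a b
inverted = [tonal_inversion(p, E) for p in scale_pitches]
```

Here c (60) is first sent to g sharp (68) and then lowered to g (67), d (62) goes via f sharp to f (65), and e stays fixed, matching the exchanges listed in the example.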
49.4
Problems of Abstraction
Summary. Whereas syntagmatic contiguity is a relation in praesentia, paradigmatic associativity is a relation in absentia. This difference has heavy consequences on the ease of managing paradigmatic composition tools. As a consequence, composers rarely transcend straightforward paradigmatics and prefer sticking to syntactical composition software, such as MAX, or common sequencer software.
The composition software presto® was designed as a tool for everybody, even for non-experts in musical notation. The publicity and the product concept tried to invoke the statement "Beethoven's creativity at everyone's reach". The argument was that drawing music (see figure 49.8) is an interface that could give everybody access to the otherwise abstract and heavily codified music. Also, the simultaneous presentation in geometric coordinates of EHLD space, together with the 2D projections, and the presentation in notes (see figure 49.9) was thought to be a big advantage over classical approaches to composition. Now, at the center of this concept are evidently the TransforMaster (symmetries), the OrnaMagic (ornaments), and the AgoLogic (tempo curve hierarchies) modules. The first two of them are paradigmatic tools par excellence. They are powerful, but require strong abstraction capabilities. The absentia of the paradigm enforces a selection process of representatives, and a memorization of their relation to the paradigm. This reality switch is a major obstacle to a breakthrough of such a software concept. It seems to be a major task of future composition software development to build concepts that enable seamless transitions from abstract to concrete composition layers. In the limit, one should have very abstract objects, such as self-addressed compositions or even more generally addressed objects (why not play a functorial global composition, and how would one do that?) at one's fingertips and handle them completely naturally as if they were normal notes or chords. In fact, they are completely normal, only we did not learn to sonify them in appropriate shape. In this regard, Tom Johnson's compositional approach [253] is one of the most promising in trying to morph mathematical objects into musical events.
Figure 49.8: The presto® concept tried to merge Beethoven's creativity and ease of a graphically interactive interface for musical composition. The local score here shows a drawing of Beethoven in a geometric space, and using instrumental icons.

Figure 49.9: The geometric representation on the local score can also be viewed in classical score signs. One can immediately understand the geometric shape, whereas the common notation yields a completely cryptic object.
Chapter 50
Case Study I: "Synthesis" by Guerino Mazzola

I have now listened thoroughly to the CD and ask myself where the power comes from that creates something so beautiful.
Jazz saxophonist Werner Lüdi on "Synthesis" [309]

Summary. "Synthesis" is a composition for piano, percussion and e-bass. Its global and local organization was driven by classification of local compositions and modulation theory on one hand, and by the presto® software tool on the other. We describe the overall organization and the four movements.

"Synthesis" is documented on CD [339]. It is the result of a composition grant of the city of Zurich and was composed and recorded in 1990 using the presto® software, together with this hardware: one Atari® Mega ST4 computer, the synthesizers Roland R-8M (drums and percussion), Yamaha RX5 (drums, percussion, and special sounds), and Yamaha TX802 (bass), and a Steinway grand for the piano part played by the author. The drum, percussion, and bass parts of the composition were written on presto® in four months and then completed by the piano part in an additional two months. The music critics did not recognize from the CD (where no trace of the computerized music was given) that the whole composition's percussive and bass parts were synthetic. This is what Gerard Assayag rightly calls a Turing test for the viability of computer-aided composition, in particular since in the field of jazz this technology is thought to be an impossible tool. The original presto® files (extension .sto) as well as the audio files of "Synthesis" are available on the book's CD-ROM, see page xxx.

G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
50.1
The Overall Organization
Summary. We discuss the overall organization: its material, composition principles in the four movements, and the instrumentation strategy.

The overall organization is a four-fold view on a generic material: the 26 classes of three-element motives in $OnPiMod_{12,12}$ (see appendix M.3 for the class list). The multiplicity of views is given by classical forms:
• the sonata form,
• the cycle of variations,
• the scherzo, and
• a finale.
50.1.1
The Material: 26 Classes of Three-Element Motives
Summary. The entire composition is based on the 26 isomorphism classes of three-element motives in $OnPiMod_{12,12}$ (see section 11.3.8) and its specialization tree. We make precise the different usage modes which have been realized in "Synthesis".
After the inspiring analysis of the Schubert-Stolberg work in 11.6.2, the usage of the 26 classes was recommended. The classes were used in different contexts. We refer to section 11.3.8 for the theory and to appendix M.3 for the 26 classes, in particular figures M.1, M.2 for representatives and the specialization Hasse diagram, see section 12.2.2 for specialization. To begin with, representatives of all classes were patched together to build the germinal melody already described in section 11.6.3, see especially figure 11.16. This melody appears explicitly in the introduction to movement four, see figure 50.1. The 26 motives also appear as percussive "phonemes" in the third movement, as already discussed in section 11.6.3. In the second movement, the germinal melody is altered according to ornamental deformation techniques and yields a sequence of characteristic melodies which are played by the bass and harmonically ornamented by the piano. Already the very first percussive motif in movement one (just after the piano solo intro) is the germinal melody, however played with pitches being encoded by percussive sounds. And in movement four, these motives are used to define ornaments and fractal refinements thereof. More schematically, this strategy is shown in figure 50.2.
50.1.2
Principles of the Four Movements and Instrumentation
Summary. The four movements include a sonata form, a variational sequence, a scherzo and a finale. The instrumentation issue is intimately related to the structural aspects. Each movement bears its specific constraints on the role of instrumentation.
Figure 50.1: The beginning percussion rolls in movement four of "Synthesis", as extracted from presto®'s main window, together with the repeatedly accelerating tempo curve. The germinal melody comes twice just before the piano intro solo.

The principle of a four-movement concert, as encountered in the classical concert form, is realized as follows, each movement with a total of 122 instruments:

First movement: Earthquake/Full Force, duration: 10:46. It is devoted to the Greek element of earth. It is a sonata form which exposes the ideas, followed by a modulatory development and a reprise, and ending with a coda.
Instrumentation: Yamaha RX5, TX802, Roland R-8M (PCM cards: Jazz, Jazzbrush, Ethnic Percussion).

Second movement: Liquid Colours/Sea of Faces, duration: 14:05. It is devoted to the Greek element of water. It is a cycle of variations. These variations are taken from the germinal melody and are concatenated in a paradigmatic way.
Instrumentation: Yamaha RX5, TX802, Roland R-8M (PCM cards: Jazzbrush, Ethnic Percussion, Mallets).

Third movement: Poem of Wind/Fly! Fly! Fly!, duration: 09:27. It is devoted to the Greek element of wind. It is a scherzo. The motivic alphabet of the 26 classes is produced in a poetical arrangement of rhythmic patterns.
Instrumentation: Yamaha RX5, TX802, Roland R-8M (PCM cards: Jazz, Contemporary Percussion, Mallets).

Fourth movement: Burning Spears/Interstellar Space, duration: 10:28. It is devoted to the Greek element of fire. It is a finale. Self-similar repetitions and refinements of the germinal structure successively densify the musical material.
Instrumentation: Yamaha RX5, TX802, Roland R-8M (PCM cards: Jazz, Jazzbrush, Contemporary Percussion).

Figure 50.2: The overall strategy in "Synthesis" is driven by exploiting the 26 classes of three-element motives. (Legible diagram labels: 26 motif classes; correspondence; poetical production via Baudelaire; germinal melody; selection, reflections, ornaments; rhythms and their modulation; bass licks; decomposition, alterations, ornaments; variations according to Messiaen grids, 2nd movement; fractal refinements.)
50.2
1st Movement: Sonata Form
Summary. The first movement "Earthquake" is a sonata form which uses the modulation theorem (see section 27.1.4) in time dimension.
This movement is a sonata form for rhythmical structures. After the piano intro, the rhythmical germinal melody is played on toms (pitches are associated with different toms) from
the RX5 synthesizer. During the exposition, the germinal subject is enriched by drum and percussion, multiplied and profiled by bass lines. The flight of the piano over the rhythmical carpet indicates an intense cadence of the exposition, ending on a fermata. The development consists of three rhythmical modulations (see sections 28.3.1 and 28.3.2). This delicate process includes four different rhythmical macro-scales, each of which consists of seven rhythmical three-tone motives from the 26 classes (see also figure 28.13). After the fermata, the first scale is built and cadenced by regular "falling drops" from the ohkawa instrument. Subsequently, its character is neutralized and transformed into a second scale. You hear the cadence of this new scale as being again marked by the ohkawa's regular "falling drops", this time enriched by a rain of light piano pearls. After a further fermata, the second scale is altered into a third one, whose cadence coincides with an intermediate climax of the piano. When the piano finally recedes, the third scale is neutralized until a march-like turning point introduces the fourth scale, whose cadence terminates the development by another fermata. The reprise follows in a slightly altered instrumentation. The finale starts after a short fermata. It is recognized by a heavy rock rhythm which is played in reduced tempo on the kicks.
50.3
2nd Movement: Variations
Summary. The second movement "Sea of Faces" follows a syntax of melodic variations of a fundamental melody which entails a particular "harmolodic" color in the vein of Ornette Coleman's music/theory.
This adagio is a labyrinthic wandering of piano, bass, barafons, glockenspiel, sanzas, gender, tube bells etc. through seven melodic variations of the germinal melody. Drums and percussion also obey these deformation forces; each harmonic, rhythmical, and melodic change is a specific expression of these forces. Scales and rhythms are known to be periodic structures in pitch and onset. By the choice of two scales from the first three Messiaen scales, one for pitch, one for onset, we generate an ornament, a "harmonical-rhythmical" scale. After certain rotations of such ornaments, the harmonic and rhythmical Messiaen components in onset and pitch are completely mixed. Here, the deformation forces of our germinal theme become manifest: Each tone event is displaced according to the ornament alteration algorithm described in 49.3. The deformation uses the following grids (see figure 50.3): We take the Messiaen scales $M_1, M_2, M_3$ as described in example 13 of section 8.1.1. For each grid, we take a pair $(M_i, M_j)$, $i, j = 1, 2, 3$, and build the cartesian product $M_i \times M_j$ of two such scales, one in pitch, the other in onset direction, and each of them with a large range such that it acts on the germinal melody as if it were infinitely extended. The action is taken to be 100% without any directional constraints. This yields nine variations of the germinal melody, as shown in figure 50.4.
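The nine Messiaen grids can be generated mechanically. The Python sketch below is illustrative only and rests on assumptions: it uses the first transpositions of Messiaen's modes 1 to 3 (the representatives in example 13 of section 8.1.1 may differ), a period of 12 units in both pitch and onset, the convention that $M_i$ is laid out in pitch and $M_j$ in onset, and a finite window in place of the "large range". All names are hypothetical.

```python
from itertools import product

# First transpositions of Messiaen's modes of limited transposition 1, 2 and 3.
MESSIAEN = {
    1: (0, 2, 4, 6, 8, 10),
    2: (0, 1, 3, 4, 6, 7, 9, 10),
    3: (0, 2, 3, 4, 6, 7, 8, 10, 11),
}


def periodic(scale, low, high, period=12):
    """All integers in [low, high) whose residue modulo the period lies in the scale."""
    return [n for n in range(low, high) if n % period in scale]


def messiaen_grid(i, j, pitch_range, onset_range):
    """Cartesian product grid M_i x M_j as a set of (onset, pitch) points."""
    pitches = periodic(MESSIAEN[i], *pitch_range)
    onsets = periodic(MESSIAEN[j], *onset_range)
    return {(onset, pitch) for pitch, onset in product(pitches, onsets)}


# One grid per pair (i, j); each drives one variation of the germinal melody.
grids = {
    (i, j): messiaen_grid(i, j, pitch_range=(0, 24), onset_range=(0, 24))
    for i in (1, 2, 3)
    for j in (1, 2, 3)
}
```

Each grid can then serve as the driving composition G of the alteration procedure of section 49.3, applied at 100% and without a direction constraint.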
Figure 50.3: The original graphics for the grid construction. To the left two Messiaen grids, to the right their transformation via a rotation-dilatation which is used for rhythmical connections. The Messiaen grids are cartesian products of the Messiaen scales and rhythms. (Labels in the original German graphics: "Messiaen-Raster" (Messiaen grid), "Drehstreckung" (rotation-dilatation), Messiaen 2, Messiaen 3, $M_3 \times M_3$.)
Figure 50.4: The original graphics for the nine variations of the germinal melody G according to the Cartesian product grids $M_i \times M_j$ (upper part). The variations are played in the order shown below, however ending on variation $M_i \times M_j$, for reasons of the length limits of the movement. (The label "2. Satz endet hier" in the original graphics marks the point where the second movement ends.)
In the second movement, seven of the nine variations are used in the order shown in figure 50.4, lower part. The bass plays these variations in a very extended slow gesture, and the piano ornaments these melodic lines by a harmonization work. Each couple of such variations is connected by a rhythmically complex intermediate structure, see figure 50.5. The intermediate parts terminating in the $M_i \times M_j$-variation are constructed as follows: We take the Messiaen grid $M_i \times M_j$ and operate a dilatation-rotation by 45° on this object, see figure 50.3, right part, for two such transformations. From these transformed local compositions (or from a very similar skew transformation) we build an interpretation by a partition into five subcompositions which are again transformed as visualized in figure 50.6. These charts are then taken as motives of different ornaments which are generated by the grids as shown in figure 50.7. The upper part of figure 50.7 shows the superposition of the five ornaments generated from Messiaen grid $M_3 \times M_3$. This rhythmical construction is also seen in the right and lower part of figure 50.5.
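A dilatation-rotation by 45° that keeps integer coordinates can be written with the matrix with rows (1, -1) and (1, 1), which is a rotation by 45° combined with a dilatation by the factor $\sqrt{2}$. The Python sketch below applies it to a small grid; whether this is exactly the matrix used for "Synthesis" is an assumption, as are the (onset, pitch) coordinate order and the names.

```python
def rotate_dilate_45(point):
    """Apply the matrix ((1, -1), (1, 1)): rotation by 45 degrees times sqrt(2)."""
    onset, pitch = point
    return (onset - pitch, onset + pitch)


# A small square piece of a product grid, standing in for a Messiaen grid.
grid = {(o, p) for o in (0, 2, 4) for p in (0, 2, 4)}
image = {rotate_dilate_45(q) for q in grid}

# Squared lengths double, so lengths grow by sqrt(2) while all angles are preserved.
squared_length = lambda q: q[0] ** 2 + q[1] ** 2
doubled = all(squared_length(rotate_dilate_45(q)) == 2 * squared_length(q) for q in grid)
```

The horizontal and vertical lines of the grid become diagonals, so that the onset and pitch components of the grid are mixed, as described for the rotated ornaments.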
Figure 50.5: The original graphics for the intermediate structure between two successive variations, here connecting the $M_2 \times M_1$-variation to the $M_3 \times M_3$-variation. The connecting structure is a rhythmical construction that is deduced from the generating Messiaen grid $M_3 \times M_3$. (Labels in the original German graphics: "SYNTHESIS 2. Satz: Liquid Colours" (2nd movement), "4. Variation" $M_2 \times M_1$, "Übergang" (transition), "5. Variation" $M_3 \times M_3$, time signature 7/8.)
Figure 50.6: The original graphics for the interpretation of the rotated Messiaen grid and the symmetries applied to the partition's components. (Labels in the original German graphics: repetition counts 3x and 2x, "Umkehr" (inversion).)
Figure 50.7: The original graphics for the ornaments that are generated by charts of a rotated Messiaen grid, following different grid translation vectors. (Labels in the original German graphics: "fünf Gitter" (five grids); the legible grid translation vectors include [36, 0], [64, 0], [23, -4], [42, 0], and [24, 0].)
50.4
3rd Movement: Scherzo
Summary. The scherzo "Poem of Wind" is based on a transcription of Charles Baudelaire's last poem "La mort des artistes" in "Les fleurs du mal" .
Figure 50.8: The beginning of the third movement with its breakneck changes in agogics.

This movement is in complete contrast to the quiet flowing of the adagio. It resembles Ernst Jandl's concrete poetry. A poem of wind: without firm ground or romantic sky. Here, the piano builds an expressive dialog with a pointillistic percussion. The text consists of the first two strophes of Charles Baudelaire's "La mort des artistes", the last poem in his famous "Les fleurs du mal". The transformation of this poem was already discussed in section 11.6.3. This procedure transforms every word into a sequence of rhythmical motives; we hear it as a coherent sound sequence which is separated from its successor by a short fermata. At the end of each verse, longer interruptions are inserted. From this raw material the final composition is constructed by an extremely refined agogical architecture, changing from breakneck accelerandi to stagnating ritardandi. Here, the third movement makes extensive use of the presto® AgoLogic module for tempo curves. The single motives are also enriched by echo-like variations and repetitions, such that their color engraves itself on one's mind. The answer of the piano to this witty poetry is the albatross in the air, which flies, falls and plays with Cecil Taylor's florescence as if it were an old tale. Just one not too many salty swift-and goodbye.
50.5
4th Movement: Fractal Syntax
Summary. The fourth movement "Burning Spears" is constructed in a completely non-syntactic way from fractal principles.
This finale is a steadily evolving rhythmical organism whose form could be compared to Alexander Scriabin's ecstatic sonata "Vers la flamme". The movement gets off with a drum intro, followed by a bass intro which announces, fanfare-like, that something is to happen. It receives a promising answer from the low piano keys until a samba-like rhythm makes the final run. This starting rhythm is again derived from the germinal melody. You can hear its essential elements in the drum intro. After two roll groups, the samba reappears, but now with augmented time values, whereas the now opened gaps are filled up by new micro rhythms (see figure 50.9, where the original motives are seen in the low and high regions, and figure 50.10, where the stretched motives are now also visible in these regions). This process of intensification is cadenced until a cutting bass line terminates the developmental part. The piano dances like an entranced sorcerer over this rhythmical lava. In the reprise, the samba figure reappears, but with a higher tempo. It ends in a strong pulse which leads to a further time augmentation after a bass cadence. This time, the black dancing extension figure is refined by a cackling entwinement of wood blocks which is joined by a fast dialog with the piano. The following earthy rhythm is introduced by a bass drive as if we should be warned that we are now changing the civilizations. Here too, the construction is an augmentation of rhythm and an intensification. What we are hearing is in fact a fractal repetition of self-similar time structures. The last and extremely fast part expresses the force of a burning spear, which is thrown into the sound sky by a delirious dervish dancing on an exploding volcano... So the musical principle of this finale is not architectural, but a self-renewing process.
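The idea of augmenting a rhythm and filling the opened gaps with micro rhythms can be illustrated by a toy refinement step in Python. This is a hypothetical sketch, not the construction used in "Synthesis": it assumes that the micro rhythm placed at each stretched onset is an unstretched copy of the original pattern, and the function name and example pattern are made up.

```python
def augment_and_fill(onsets, factor):
    """Stretch a rhythm by an integer factor and start a copy of the original at each stretched onset."""
    base = onsets[0]
    shape = [t - base for t in onsets]              # pattern relative to its first onset
    stretched = [base + s * factor for s in shape]  # augmented time values
    refined = set()
    for start in stretched:
        refined.update(start + s for s in shape)    # micro rhythm filling the gap
    return sorted(refined)


motif = [0, 1, 3]
level_1 = augment_and_fill(motif, 4)
```

Repeating the step on a coarser time scale yields the same shape at several levels, which is one simple way to obtain the self-similar time structures mentioned in the text.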
Figure 50.9: The original motives are seen in the low and high regions of the local score.
Figure 50.10: The stretched motives are also visible in the low and high regions.
Figure 50.11: The end of the fourth movement shows a particularly dense and polyrhythmic percussive part.
Chapter 51
Object-Oriented Programming in OpenMusic
The problem that the computer poses to the composer is not primarily of a specifically musical order, but above all cultural and philosophical. It implies a complete recasting of the relations between the abstract and the concrete. The musician, the musicologist, and the listener must face up to the evidence of these changes, without, however, going as far as the sickness of adaptation.
Hugues Dufourt [130]

Summary. OpenMusic is a visual programming language for music composers. It was designed and implemented by the Musical Representation Team at Ircam-Centre Georges Pompidou. OpenMusic is based and implemented on CLOS (Common Lisp Object System) [510]. It shows several original features, such as reflexivity, meta-programming capacities, and handling of the duality between musical and computational time, and provides a framework of predefined musical objects for handling sound, MIDI and musical notation. OpenMusic combines different programming techniques, e.g., functional programming, constraint programming and object-oriented programming. We will focus in this chapter on the last one. Object-oriented programming is crucially connected with the categorical approach. Category theory helps formalize in an original way concepts like inheritance, methods, classes, etc. More details on this relation can be found in [306] or in section 9.4.2 of this book.
Although one may consider music composition to be an important issue in any computer music research or development, the term Computer Assisted Composition (CAC) has taken a specialized meaning during the past years. As opposed to the generation and processing of audio signals by means of DSP hardware or software technologies, CAC systems such as OpenMusic focus on the formal structure of music. The software technology is rather based on symbolic computation, where the typical data structures (trees, graphs, sets, collections, associative memory, etc.) and algorithms (often issued from discrete mathematics) are suited to handle the complex structures involved in a compositional process. The great diversity of
968
CHAPTER 51. OBJECT-ORIENTED PROGRAMMING IN OPENMUSIC
esthetic, technical, formal (or anti-formal) models, implies that one cannot conceive an environment of CAC as a fixed application which provides a fixed collection of generative procedures and musical transformations. On the contrary, we conceive an environment as a programming language, helping each composer to constitute his personal universe. Of course, there is no sense in providing a traditional programming language, the control of which requires a great technical expertise. For this reason, our purpose is to build a programming language conceived especially for composers. This leads us to think about the various existing models of programming, intuitive graphical interfaces (which enable control of this programming) and the internal as well as external representations of musical structures that will be built and transformed when using this programming. Therefore the main goal is to implement a language containing the concept of notation of the result (a musical score) as well as the concept of notation of the process leading to this result (visual program).
51.1 Object-Oriented Language
Summary. In this section we describe the main entities of our object language. Such language entities are usually called meta-objects. We describe each meta-object in a graphical and formal way. The basic calculus for object-oriented programming inherits the approach imposed by the precursor language Simula and its successors. The attempts to formalize this family of object models used the concept of parametric polymorphism [80]. More recently, languages based on multiple dispatch (methods that dispatch on a product of types rather than a single type), such as CLOS, could be formalized using concepts of overloading or ad-hoc polymorphism, found in the λ&-calculus [81]. OpenMusic, which is based on CLOS, may be formally described in this way. Although we do not give here a real formalization of OpenMusic, we use the λ&-calculus to give a general idea of each meta-object. From a visual point of view, the meta-objects of our calculus are represented as graphical entities called frames.
Object-oriented programming is based on simple concepts. A program can be seen as a set of entities called objects. An object is made of data (slots) and operations applied to it (methods). Objects communicate in a specific way, usually called message passing. Most object-oriented languages implement the notion of class in order to abstract similar objects. New classes can be created from existing classes using the mechanism of inheritance. Inheritance allows the extension or partial modification of a class. If a class A inherits from a class B, A is called a subclass of B and B is called a superclass of A. Object-oriented programming offers in a natural way a dynamic management of resources, which means that we can create new objects at any time. The mechanism of object creation from a class is called instantiation. The newly created object is called an instance of the class. Meta-objects are represented either as composed frames or simple frames. Several frames (i.e., different points of view) may be produced for the same object. The simple frames which represent an object are called views. They generally appear as icons. The composed frames representing an object are called editors. For more details about the formal and graphical description of OpenMusic see [11].
51.1.1 Patches
A patch is a meta-object specific to OpenMusic. It reifies the notion of program. A patch is the place where objects are interconnected in order to specify musical algorithms. From a formal point of view, a patch can be seen as a λ-function. Patches are composed of boxes (icons) and connections between them. Boxes represent function calls while connections represent functional composition. Figure 51.1 shows the view and the editor of a patch implementing an algorithm for the expression $x^2 + 2$. It can be formalized as the lambda function $\lambda x.(x \cdot x + 2)$.
Figure 51.1: View and editor of a patch.
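The reading of a patch as a λ-function can be illustrated with a minimal Python sketch. This is not OpenMusic code: the function names `times`, `plus` and `patch` are hypothetical stand-ins for the boxes of figure 51.1, and the connections are modeled as ordinary nested function calls.

```python
# Hypothetical sketch: each box is a function call,
# each connection passes the result of one box to an inlet of another.

def times(a, b):
    """Box for the multiplication function call."""
    return a * b


def plus(a, b):
    """Box for the addition function call."""
    return a + b


def patch(x):
    """The patch read as the lambda function  lambda x. (x * x + 2)."""
    squared = times(x, x)      # the input x feeds both inlets of the times box
    return plus(squared, 2)    # the outlet of times feeds the plus box
```

Evaluating `patch(3)` composes the two boxes and yields the value of the expression for x = 3.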
51.1.2 Objects
In the λ&-calculus, objects are represented as registers. A register can be seen as a set of labeled fields $l = v$, where $l$ is called the label and $v$ is called the value. For instance, an object representing the note C3 can be written as: $note = (pitch = C, octave = 3)$. Figure 51.2 shows the view and the editor of this note object.
Figure 51.2: View and editor of an object.
The next two rules define field selection and field writing, respectively:
$$(l_1 = v_1, \ldots, l_n = v_n) \circ l_i \;\mapsto\; v_i,$$
$$(l_1 = v_1, \ldots, l_i = v_i, \ldots, l_n = v_n)[l_i \mid v] \;\mapsto\; (l_1 = v_1, \ldots, l_i = v, \ldots, l_n = v_n).$$
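The two rules can be mimicked in a few lines of Python. The sketch below is only an illustration under the assumption that a register is modeled as a read-only mapping; the helper names `make_register`, `select` and `write` are hypothetical.

```python
from types import MappingProxyType


def make_register(**fields):
    """Build a register as a read-only mapping of labels to values."""
    return MappingProxyType(dict(fields))


def select(register, label):
    """Field selection: the register applied to label l_i yields v_i."""
    return register[label]


def write(register, label, value):
    """Field writing: return a new register in which l_i now holds value."""
    if label not in register:
        raise KeyError(label)
    updated = dict(register)
    updated[label] = value
    return MappingProxyType(updated)


note = make_register(pitch="C", octave=3)
```

Writing a field returns a new register and leaves the original untouched, which matches the rewriting style of the rules.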
51.1.3 Classes
If objects are seen as registers, classes will then be seen as generators of registers. The editor for a class is an ordered collection of views representing slots. Slots contain information about their name, their type (a class icon), a default value and a flag that indicates whether the slot is public or private. The view and editor for the class of the note object defined in the previous section are shown in figure 51.3.
Figure 51.3: View and editor for a class.
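A class viewed as a generator of registers might be sketched as follows. The names `Slot` and `RegisterClass` are hypothetical, and the default values chosen for the note class are assumptions made for the example, not values taken from OpenMusic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Slot:
    """One slot description: name, type, default value, public/private flag."""
    name: str
    type: type
    default: object
    public: bool = True


class RegisterClass:
    """A class seen as a generator of registers (here: plain dictionaries)."""

    def __init__(self, name, slots):
        self.name = name
        self.slots = tuple(slots)   # ordered collection of slots

    def instantiate(self, **values):
        unknown = set(values) - {slot.name for slot in self.slots}
        if unknown:
            raise TypeError(f"unknown slots for {self.name}: {sorted(unknown)}")
        # every slot gets the supplied value or else its default value
        return {slot.name: values.get(slot.name, slot.default)
                for slot in self.slots}


# Assumed defaults, for illustration only.
NOTE = RegisterClass("note", [Slot("pitch", str, "C"), Slot("octave", int, 3)])
```

Each call to `NOTE.instantiate(...)` produces a fresh register with one field per slot.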
51.1.4 Methods
Methods are simple functions (λ-abstractions) whose arguments are typed by classes. The editor in figure 51.4 shows a method with two inputs, self and num, of type Note and Integer respectively.
Figure 51.4: View and editor for a method.
The body of this method increments the value of the slot octave of self by num and returns the note object. This method is formalized by the expression:
$$\lambda\, self{:}note.\ \lambda\, num{:}integer.\ self[octave \mid self \circ octave + num].$$
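As a rough illustration, the expression can be transcribed into Python with the note modeled as a dictionary. The function name `raise_octave` is hypothetical, and the functional update (returning a copy) is an assumption that follows the formal expression rather than any particular implementation.

```python
def raise_octave(self_note, num):
    """Return the note register with its octave field increased by num.

    Mirrors self[octave | self o octave + num]: select the octave field,
    add num, and write the result back into a copy of the register.
    """
    return {**self_note, "octave": self_note["octave"] + num}


c3 = {"pitch": "C", "octave": 3}
c5 = raise_octave(c3, 2)
```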
51.1.5 Generic Functions
A generic function is a collection of methods $(\varepsilon \,\&\, M_1 \,\&\, \ldots \,\&\, M_m)$, where $\varepsilon$ is the empty method. The type of a generic function containing methods $M_i$ of type $U_i \to V_i$, $1 \le i \le m$, is $\{U_1 \to V_1, \ldots, U_m \to V_m\}$. However, not every set of methods can be seen as a generic function. A set of methods with types $U_i \to V_i$ is a generic function iff the following two conditions are satisfied for all $i, j$:
$$U_i \le U_j \;\Rightarrow\; V_i \le V_j,$$
$$U \text{ is maximal in } LB(U_i, U_j) \;\Rightarrow\; \exists\, U_k,\ U_k = U,$$
where $LB(U, V)$ denotes the set of common lower bounds¹ of the types $U$ and $V$. Figure 51.5 shows a generic function composed of two methods: the first one is the method described in the previous section; the second one is specialized for inputs of type Number.
Figure 51.5: View and editor for a generic function.
51.1.6 Message Passing
Message passing is achieved by generic function invocation. We distinguish the application of a simple function from the application of a generic function, which will be indicated by the operator •. The application of a generic function $G$ to arguments $N$ of type $U$ consists of two steps: selection of a method $M_j$ among the methods of $G$, and normal application of $M_j$ to $N$.
Note that $U$ may not be contained in the set of input types $U_i$ of the generic function. In this case we select the method $M_j$ satisfying: $U_j = \min_{i=1,\ldots,m}\{U_i \mid U \le U_i\}$. The CPL of a class is a linearization of its superclasses. The details of the linearization are not crucial for our purposes, see [510].
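The two-step application (method selection, then ordinary application) can be sketched in Python. This is a simplified model, not the CLOS algorithm: argument types are compared componentwise with `issubclass`, whereas CLOS orders applicable methods using class precedence lists. The names `GenericFunction`, `Note`, `RNote` and `om_plus` are hypothetical.

```python
import numbers
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Note:
    pitch: str
    octave: int


@dataclass(frozen=True)
class RNote(Note):
    rhythm: str = "quarter"


class GenericFunction:
    def __init__(self, name):
        self.name = name
        self.methods = {}   # maps a tuple of input classes U_i to a function

    def add_method(self, input_types, function):
        self.methods[tuple(input_types)] = function

    def __call__(self, *args):
        actual = tuple(type(a) for a in args)
        # Step 1a: keep the methods whose input types U_i satisfy U <= U_i.
        applicable = [sig for sig in self.methods
                      if len(sig) == len(actual)
                      and all(issubclass(a, s) for a, s in zip(actual, sig))]
        # Step 1b: pick the minimum, i.e. the signature below all the others.
        for candidate in applicable:
            if all(all(issubclass(c, o) for c, o in zip(candidate, other))
                   for other in applicable):
                # Step 2: normal application of the selected method.
                return self.methods[candidate](*args)
        raise TypeError(f"no unique most specific method of {self.name}")


om_plus = GenericFunction("om+")
om_plus.add_method((Note, int),
                   lambda self, num: replace(self, octave=self.octave + num))
om_plus.add_method((numbers.Number, numbers.Number), lambda a, b: a + b)
```

Calling `om_plus` with an `RNote` illustrates the case where the argument type is not itself among the input types: the method declared for `Note` is selected because `RNote` is a subclass of `Note`.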
51.1.7 Inheritance
The inheritance mechanism is defined by subtyping and by the mechanism of method selection. Subtyping is defined by the following rules:
$$\frac{U_2 \le U_1 \qquad V_1 \le V_2}{U_1 \to V_1 \;\le\; U_2 \to V_2} \qquad\qquad \frac{\forall i \in I,\ \exists j \in J,\quad S_j \to T_j \;\le\; U_i \to V_i}{\{S_j \to T_j\}_{j \in J} \;\le\; \{U_i \to V_i\}_{i \in I}}$$
The rule on the left is the usual contravariant-covariant rule for arrow types. The other one states that an overloaded type is smaller than or equal to another overloaded type if for every branch in the latter, there is a branch in the former smaller than or equal to this one [81]. Graphically, a class inherits from another one when there exists an arrow from the superclass to the subclass, as is shown in figure 51.6. The class rnote extends the class note by adding a new slot called rhythm of type string with default value "quarter".
¹ The ordering $\le$ is defined by the Class Precedence List (CPL) of the class $U$.
Figure 51.6: Graphic inheritance.
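The subtyping rules and the graphical inheritance of figure 51.6 can be tried out with a small Python sketch. Here an arrow type is modeled as a pair `(input_class, output_class)` and an overloaded type as a list of such pairs; the function names and the default values of the `Note` slots are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Note:
    pitch: str = "C"      # assumed default, for illustration
    octave: int = 3       # assumed default, for illustration


@dataclass
class RNote(Note):
    rhythm: str = "quarter"   # the new slot added by the subclass


def arrow_leq(f, g):
    """U1 -> V1 <= U2 -> V2  iff  U2 <= U1 (contravariant), V1 <= V2 (covariant)."""
    (u1, v1), (u2, v2) = f, g
    return issubclass(u2, u1) and issubclass(v1, v2)


def overloaded_leq(former, latter):
    """Every branch of the latter has a smaller-or-equal branch in the former."""
    return all(any(arrow_leq(s, u) for s in former) for u in latter)
```

For instance, a function from `Note` to `RNote` can be used wherever a function from `RNote` to `Note` is expected, but not the other way round.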
51.1.8 Boxes and Evaluation
Boxes are placed in patches and allow other meta-objects to be involved in the calculus. Boxes are composed of an icon and an ordered set of inlets and outlets. There are different types of boxes depending on the referenced meta-object. The user may create boxes in a patch by dragging meta-objects into it. Figure 51.7 shows different types of boxes and their references.
Figure 51.7: Boxes in a patch.
Inlets in function call and generic function call boxes correspond to the function arguments. For factory boxes, inlets and outlets refer to the slots of the class. Boxes and connections in a patch can be seen as a graph of functional compositions. Clicking on the output of any box triggers an evaluation of the graph starting from the corresponding point. A box evaluation can give rise to other box evaluations, creating a chain corresponding to the execution of a program.
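The chain of evaluations can be sketched as a demand-driven traversal of the box graph. The `Box` class below is a hypothetical model, not the OpenMusic implementation: evaluating a box first evaluates the boxes connected to its inlets, then applies its own function. Note that in this naive sketch a box connected to two inlets is evaluated twice.

```python
from operator import add, mul


class Box:
    def __init__(self, name, function, *inlets):
        self.name = name
        self.function = function
        self.inlets = inlets       # connected boxes or constant values

    def evaluate(self, trace):
        """Evaluate the upstream boxes, then this box; record the order."""
        args = [inlet.evaluate(trace) if isinstance(inlet, Box) else inlet
                for inlet in self.inlets]
        trace.append(self.name)
        return self.function(*args)


# The patch x * x + 2 as a graph of three boxes.
x = Box("x", lambda: 3)
times = Box("times", mul, x, x)
plus = Box("plus", add, times, 2)
```

Evaluating `plus` triggers `times`, which in turn triggers `x`; evaluating `times` alone only runs the part of the graph upstream of it.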
51.1.9 Instantiation
Users can create instances of a class with the aid of factory boxes (boxes built from a class). A factory contains a number of inputs corresponding to the public slots of the class. There are as many outputs as inputs. When an output is evaluated, a new instance is created. The values returned by the outlets are, from left to right, the instance itself, then the current values of its public slots. An instance can be visualized graphically as a box which can in turn be connected to other boxes (see figure 51.8).
Figure 51.8: Graphical instantiation.
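A factory box can be approximated in Python as a function that creates a new instance at each evaluation and returns the instance followed by its slot values. The names `make_factory` and `note_factory` are hypothetical, all slots are treated as public, and the defaults of `Note` are assumptions for the example.

```python
from dataclasses import dataclass, fields


@dataclass
class Note:
    pitch: str = "C"    # assumed default, for illustration
    octave: int = 3     # assumed default, for illustration


def make_factory(cls):
    """Build a factory for cls; every slot is treated as public here."""
    slot_names = [f.name for f in fields(cls)]

    def evaluate(**inputs):
        instance = cls(**inputs)   # each evaluation creates a new instance
        # outlets, left to right: the instance itself, then its slot values
        return (instance, *(getattr(instance, name) for name in slot_names))

    return evaluate


note_factory = make_factory(Note)
```

For example, `note_factory(pitch="D", octave=4)` returns a three-element tuple whose first element is the new note.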
51.2 Musical Object Framework
Summary. A framework is a set of reusable classes that one can use as building blocks for a specific piece of software [179]. This section describes the musical framework of OpenMusic.
OpenMusic offers a set of predefined classes and generic functions for musical representation and manipulation. This framework can be extended by using inheritance or by defining new methods in generic functions or by writing new generic functions. There are graphical editors for these definitions. In this section we will focus on the structure of musical objects (classes) rather than on their behavior (generic functions). The object-oriented concept of encapsulation can be defined as the separation between the internal representation and the interface of an object. These two aspects will be described in the next two sections.
51.2.1 Internal Representation
In OpenMusic any musical structure is a container which embeds other musical structures. All containers have a temporal extension (e) and an offset (o). Both e and o are expressed as multiples of a rational unit $v = 1/u$, $u \in \mathbb{N}^+$ (the set of positive integers), where v is considered a fraction of the quarter note. The value u can be redefined at each level of the embedded structure. From the temporal point of view, we have entities coming from different rational time scales. In order to express them in a homogeneous way, we have implemented a hierarchical unit system, which will be discussed briefly by showing an example. Figure 51.9 shows a music fragment and an integer hierarchy describing it.
Exercise 82 Define the corresponding form whose denotators are these rhythmical hierarchies.
Figure 51.9: Hierarchical representation of a rhythm.
We can calculate the extension of each element $e_i$ by the recursive formula
$$d(e_i) = \frac{e_i}{S}\, d(F(e_i)),$$
where $S$ is the sum of $e_i$ and its siblings, and $F(e_i)$ is the parent of $e_i$. The deepest triplet in our example has an extension given by a product of three such ratios. For this reason, we set u to 15 (the quarter note is equivalent to 15 units). In this scale the new extension of the triplet is 4. Following the idea of a variable scale adapted to each embedded level in a musical structure, the rhythm of figure 51.9 can be represented as shown in figure 51.10.
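The recursive formula lends itself to a short Python sketch using exact fractions. The nested-list encoding of the integer hierarchy and the sample hierarchy in the usage note are assumptions made for illustration; they do not reproduce the rhythm of figure 51.9.

```python
from fractions import Fraction


def extensions(hierarchy, parent_extension):
    """Apply d(e_i) = (e_i / S) * d(F(e_i)) to every element of a hierarchy.

    hierarchy is a list whose items are either an integer weight e_i
    (a terminal element) or a pair (e_i, sub_hierarchy).
    """
    # S: the sum of the weights of an element and all its siblings
    total = sum(item if isinstance(item, int) else item[0] for item in hierarchy)
    result = []
    for item in hierarchy:
        weight, children = (item, None) if isinstance(item, int) else item
        extension = Fraction(weight, total) * parent_extension
        if children is None:
            result.append(extension)
        else:
            result.append((extension, extensions(children, extension)))
    return result
```

With a hypothetical measure of four quarter notes, `extensions([2, (1, [1, 1, 1]), 1], Fraction(4))` gives a half note, a quarter note subdivided into a triplet of thirds, and a quarter note.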
Figure 51.10: Temporal organization of musical instances.
Some basic operations for container manipulation are described below:
• NewContainer. Creates an empty container (u equals 1 by default).
• AddTo(c1, c2, at). Inserts the container c2 into the container c1 at the position at and calculates new values for the attributes u, e and o.
• RemoveFrom(c1, c2). Removes the container c2 from the container c1.
• QReduce(c, [n]). Computes $u' = \mathrm{lcm}(u_i)$, where the $u_i$ are the units of all subcontainers of c. The attribute u of the container c is set to u'. New values for the parameters $u_i$, $o_i$ and $e_i$ are calculated as well. This operation allows comparison between all parts of a container. If a value n is given, then $u' = \mathrm{lcm}(u_i, n)$.
• QNormalize(c). Sets the value of u for c and each of its subcontainers to the optimum, i.e., the smallest possible integer, given that the offset and extension values have to be expressed as integer multiples of 1/u.
The QNormalize operation sets all extensions to integer values. If we want to compare all values, we need to choose for the whole measure one unit u equal to lcm(2, 3, 15) = 30. In this case the new values for u are 30 for the last eighth note and 30 for the triplet, see figure 51.11. The choice of using only integers for our representation allows an easier translation to music notation and the coexistence of objects described in hierarchical time and in continuous time. Moreover, in this way we avoid the reversibility problems that arise from translating between reals and integers (continuous and discrete time).
Figure 51.11: A normalized container.
This model of containers and their operations represents in a natural way traditional musical structures like polyphony, measures, chords, etc. We will see in the next section how these containers are presented to the composer in an object context.
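The effect of bringing all parts of a container to a common unit can be sketched in Python. The `Container` class and the function `q_reduce` below are hypothetical reconstructions based only on the description of QReduce given in this section; the sample values of e and o are invented for the example, while the units 2, 3 and 15 are those of the text.

```python
from dataclasses import dataclass, field
from functools import reduce
from math import gcd


@dataclass
class Container:
    u: int = 1      # number of units per quarter note
    e: int = 0      # extension, in multiples of 1/u
    o: int = 0      # offset, in multiples of 1/u
    children: list = field(default_factory=list)


def lcm(a, b):
    return a * b // gcd(a, b)


def all_units(container):
    yield container.u
    for child in container.children:
        yield from all_units(child)


def rescale(container, new_u):
    factor = new_u // container.u        # exact, since new_u is a multiple of u
    container.u = new_u
    container.e *= factor
    container.o *= factor
    for child in container.children:
        rescale(child, new_u)


def q_reduce(container, n=1):
    """Set one common unit u' = lcm of all units (and n) on the whole tree."""
    new_u = reduce(lcm, all_units(container), n)
    rescale(container, new_u)
    return new_u


# Hypothetical measure of four quarter notes with two sub-containers.
measure = Container(u=2, e=8, children=[Container(u=3, e=1),
                                        Container(u=15, e=4, o=5)])
```

Calling `q_reduce(measure)` yields the common unit 30 and rescales every e and o accordingly, so that all values become directly comparable.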
51.2.2 Interface
A summary of the predefined musical classes available in OM is given in figure 51.12.
Figure 51.12: Musical framework.
There are three main classes of musical objects:
• superposition: chords and polyphonies are made of other objects placed in parallel.
• sequence: objects like voices, measures, etc. are composed of other objects placed one after the other.
• simple-score-element: these objects are terminals (empty containers).
There is a predefined set of generic functions with methods for all musical classes. They are used to apply transformations to musical instances (e.g., transposition, inversion, etc.) or simply to play or visualize them. Editors of musical instances have been replaced by musical notation editors (figure 51.2 is replaced by figure 51.13). We use the Model-View-Controller paradigm, where the model is the musical instance, the view is its representation on the screen as a score and the controller is a user interface allowing us to change slot values.
Figure 51.13: View and editor of a note instance.
As we have seen in the previous section, musical entities can be represented as containers. However, it is not suitable for composers to have to build musical objects from containers. In order to create a musical instance we must provide the composer with a symbolic abstraction of the real musical object. For instance, MIDIcents are a good way to code pitch values. By contrast, problems arise when the rhythmic content is taken into account. We propose in the next section a representation of rhythmic information adapted to the composer.
51.2.2.1 Rhythmic Trees
Rhythmic Trees (RTs) are the base of the rhythmic representation in OpenMusic. They must be understood as an alternative description of symbolic rhythmic structures using traditional music notation, and an external specification for container objects.
• Syntax An RT is defined as a pair (D S) where D is an integer ratio (> 0) and S is a list of n elements. Each element in S can be either an integer or an RT. Here is an example corresponding to this syntax: (2 ((1 (1 1 1 1)) (1 (1 1 1 1)))).
Remark 23 Solving the above exercise 82, we have here a nice example of a doubly circular form. To begin with, given a form F, a list form over F is the circular form
$$\mathrm{List}(F) \underset{\mathrm{Id}}{\longrightarrow} \mathrm{Colimit}(\mathrm{Item}(F), \mathrm{Terminal})$$
with the terminal form
$$\mathrm{Terminal} \underset{\mathrm{Id}}{\longrightarrow} \mathrm{Simple}(\mathbb{Z})$$
in order to formalize the end of a finite list and to indicate its length with the integer of the coordinator module $\mathbb{Z}$. The form Item(F) is a Cartesian product
$$\mathrm{Item}(F) \underset{\mathrm{Id}}{\longrightarrow} \mathrm{Limit}(F, \mathrm{List}(F)).$$
Building on this form, we define the form RT by
$$\mathrm{RT} \underset{\mathrm{Id}}{\longrightarrow} \mathrm{Limit}(\mathrm{DurationValue}, \mathrm{List}(\mathrm{RT}))$$
and a simple duration form
$$\mathrm{DurationValue} \underset{\mathrm{Id}}{\longrightarrow} \mathrm{Simple}(\mathbb{Q}).$$
It is useful to visualize this doubly circular form in its construction graph as drawn in figure 6.5.
• Semantics For a given RT = (D, S), D expresses a duration and S defines a group of proportions of D. For instance, taking the quarter note as the unit, we have for RT = (1, (1, 1, 1, 1)) the rhythm shown in figure 51.14.
Figure 51.14: Rhythm for RT = (1, (1, 1, 1, 1)).
RTs allow us to represent, in a homogeneous way, different types of musical objects. Polyphonies, voices, measures, groups, etc. are expressed as RTs. When the value D is at the measure level, we express it in whole-note units. For example, the RT for the next rhythm will be (3/4, (2, 1)).
Figure 51.15: RT = (3/4, (2, 1)).
As defined above, S represents a sequence of proportions of D. The example in figure 51.16 shows the case for RT = (4/4, (1, 2, 1)).
Figure 51.16: RT = (4/4, (1, 2, 1)).
Until now, we have presented RTs where S was a list of integer elements, representing notes or beats. For an RT = (4/4, (1, (2, (1, 1, 1)), (1, (1, 1, 1)))), more complex elements appear, as shown in figure 51.17.
Figure 51.17: RT = (4/4, (1, (2, (1, 1, 1)), (1, (1, 1, 1)))).
The RTs contained in S represent what we call (rhythmical) groups. In general, groups are graphically represented as beamed notes. Like measures, groups can contain either notes or other groups. Here is an example for the RT = (4/4, (1, (1, (1, 1, (1, (1, 1, 1)), 1, 1)), 2)):
Figure 51.18: RT = (4/4, (1, (1, (1, 1, (1, (1, 1, 1)), 1, 1)), 2)).
We extend the RT syntax in order to integrate rests and ties, which will be represented respectively by negative numbers and floats². For RT = (4/4, ((1, (1, 1)), (2, (1.0, 1, -2, 1)), (1, (1.0, -1, 1)))) we obtain the following rhythm:
Figure 51.19: RT = (4/4, ((1, (1, 1)), (2, (1.0, 1, -2, 1)), (1, (1.0, -1, 1)))).
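The proportional semantics of RTs, including the convention for rests and ties, can be made concrete with a small Python sketch. It is an interpretation of the description given here, not the OpenMusic algorithm: an RT is encoded as a nested tuple, the function names are hypothetical, and a tie is merely labeled rather than merged with the preceding note.

```python
from fractions import Fraction


def _weight(element):
    """Proportion carried by an element: D of a sub-RT, or |number|."""
    return element[0] if isinstance(element, tuple) else abs(int(element))


def _expand(duration, elements):
    total = sum(_weight(e) for e in elements)
    events = []
    for element in elements:
        share = duration * Fraction(_weight(element), total)
        if isinstance(element, tuple):        # a group: recurse into its list S
            events.extend(_expand(share, element[1]))
        elif isinstance(element, float):      # a float denotes a tie
            events.append(("tie", share))
        elif element < 0:                     # a negative number denotes a rest
            events.append(("rest", share))
        else:
            events.append(("note", share))
    return events


def rt_events(rt):
    """Flatten RT = (D, S) into (kind, duration) pairs, in the unit of D."""
    duration, elements = rt
    return _expand(Fraction(duration), elements)


example = (Fraction(4, 4),
           ((1, (1, 1)), (2, (1.0, 1, -2, 1)), (1, (1.0, -1, 1))))
events = rt_events(example)
```

For the RT of figure 51.19, the first group splits a quarter of the measure into two equal notes, while the second group divides half of the measure in the proportions 1 : 1 : 2 : 1. The durations always sum to D.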
RTs may encode the metric intention in a rhythm. The two rhythms in figure 51.20 are encoded by the different RTs RT1 = (2/4, (2, 2, 2, 2)) and RT2 = (2/4, ((1, (1, 1)), (1, (1, 1)))).
Figure 51.20: RTs RT1 = (2/4, (2, 2, 2, 2)) and RT2 = (2/4, ((1, (1, 1)), (1, (1, 1)))).
The symbolic representation is essentially based on the hierarchical musical content. This allows an overall readability of the hierarchical structure and a symbolic format, which can be controlled by algorithms and other transformations such as inversion, recursion, etc. This is made possible by the fact that the format is the abstraction of the object itself. RTs are automatically encoded into optimal container structures.
51.3 Maquettes: Objects in Time
Summary. The maquette is an OpenMusic meta-object that aims to represent, in a single object, patches (musical processes) and containers (musical material). It is a new concept of score where the static description of musical structures and the definition of dynamic computational processes seamlessly coexist. The user may go back and forth between these two metaphors by considering the maquette as a score (in traditional or graphical notation) or as a set of interconnected processes. As external objects like MIDIFiles or SoundFiles may also be imported, maquettes offer an original environment for music creation.
² Perhaps a better distinction of domains for ties would be to take imaginary rational numbers i·x/y instead of floats, since rational numbers are not really different from floating point numbers. A more musical solution would be to take the form TimeSig defined in formula (6.92) instead of DurationValue for normal duration values (see also appendix A.2.1), to reserve DurationValue for ties, and to replace DurationValue by the colimit of DurationValue and TimeSig in the above definition.
The maquette is an original concept in OpenMusic which allows us to solve the problem of combining the design of high-level hierarchical musical structures, the arrangement of musical material in time, and the specification of musical algorithms. Just like other meta-objects, maquettes may appear as a view or may be opened in a maquette editor, which is basically a 2-dimensional surface with time flowing along the x-axis. This surface contains several blocks that we call temporal boxes. In maquettes the hierarchical nesting of musical structures and their temporal order can be represented in an explicit visual way. In the musical sketch presented in figure 51.21, made by the composer Mikhail Malt, temporal boxes have been arranged on the maquette surface.
Figure 51.21: Temporal boxes in a maquette.
Horizontal temporal box positions correspond to onset values in 'absolute' (physical) time. Durations and intensities are given by their horizontal and vertical extensions. Pictures have been associated with the boxes in order to give an elementary musical semiotics. Thus, triangles correspond to chords whose resonance decreases quickly. Multiple triangles are associated with chord ostinatos. Other figures are triangles whose intensity follows the geometrical contour of the picture. In figure 51.22, connections between temporal boxes are shown. They represent a different kind of musical information. We can see that temporal boxes are derived from one another by functional relations. For instance, the ostinatos are linked to the first chord. This level of information bears paradigmatic content, because analogies between any parts of the structure can be derived.
Figure 51.22: Functional relation of temporal boxes.
Figure 51.23: Patch calculating a temporal box.
If we open the editor of the third ostinato (figure 51.23), we can see that the chord coming from the first box (which is represented by the box called input) is transposed by an augmented fourth plus an octave (18 semitones, or 1800 MIDI-cents), then repeated six times and finally sent to a factory that builds an instance of the class chord-sequence. Elements of the musical material take their place syntagmatically in the final result. Let us now take the first chord that is provided as input for the ostinato. If we open its editor (figure 51.24), we can observe that the generating algorithm reduces to given data. This is the fourth level of information, which concerns the representation of the basic musical material.
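The processing chain of this temporal box (transpose, repeat, collect) can be sketched in Python. The function names are hypothetical, a chord is modeled as a list of pitches in MIDI-cents, a chord sequence as a list of chords, and the input chord used in the usage note is an invented example.

```python
def transpose(chord, midicents):
    """Shift every pitch of a chord (given in MIDI-cents) by midicents."""
    return [pitch + midicents for pitch in chord]


def repeat_n(chord, n):
    """Return n independent copies of the chord."""
    return [list(chord) for _ in range(n)]


def ostinato(input_chord, interval=1800, repetitions=6):
    """Transpose the input chord, then repeat it into a chord sequence."""
    return repeat_n(transpose(input_chord, interval), repetitions)
```

With a hypothetical C major triad as input, `ostinato([6000, 6400, 6700])` produces six copies of the chord transposed up by 1800 MIDI-cents.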
Figure 51.24: Patch with musical material for a temporal box. Finally, we can toggle the visual representation of boxes by showing the traditional representation of the musical result, see figure 51.25.
Figure 51.25: Musical notation of a maquette.
In the previous example, we tried to show how an OpenMusic object scheme enables us to structure the musical information at different levels:
• the static level of the form, allowing us to create visual semiotic markers;
• the dynamic and paradigmatic level of the form (i.e., the functional relations between the temporal boxes);
• the syntactical level, i.e., the calculus building the musical discourse inside the temporal boxes;
• the material level.
These four levels of information are obviously interconnected. The most important advantage of the maquette concept is that it offers a visualization of this interaction and, at the same time, interactive control of it. This produces a source of experimentation:
• Recombination at the form level: Temporal boxes are moved and stretched in time without changing the other three levels.
• Modification of functional relations: We do not change the position of the blocks but their causal relation.
• Syntax modification: Algorithms which build the material can be changed according to the compositional goals.
• Change of the material: The color of the piece may change while keeping all the formal organization.
When combining these procedures, sophisticated musical experiments can be carried out.
51.4 Meta-object Protocol
Summary. Meta-object protocols (MOP) provide an alternative framework that opens the language implementation to the user's intervention. OpenMusic was implemented using meta-programming techniques. We have extended CLOS meta-objects (methods, classes, generic functions, etc.) by adding visual counterparts. Just as we extended CLOS itself by means of meta-programming, the user can make extensions of the OpenMusic language thanks to the visual MOP. In this section we describe OpenMusic's graphical MOP, as well as musical applications.
Originally the MOP was conceived for solving problems in the design and implementation of CLOS [262]. Usually the internal architecture of a programming language is not of interest to the programmer but only to language designers. For object-oriented languages, a MOP of the language is an interface to the language, presented as a framework. Classes, generic functions, patches, maquettes and other entities discussed in the previous sections are simply instances of special classes. These instances are called meta-objects and the classes meta-object classes. The set of these classes, as a framework, constitutes the MOP. In addition, a language may support a MOP only if it has two characteristics: reflection and reification. Reflection is the ability of a program to inspect and modify its state at run time. In order to achieve this, it is necessary to have a mechanism to represent the program's data and state; this mechanism is called reification. A MOP is composed of a static and a dynamic part. The static part is given by the hierarchy of meta-object classes. The dynamic part is made of a set of generic functions that can be applied to meta-objects in order to control their behavior. Examples of such functions are:
• Get-Elements. We consider a meta-object to be composed of a set of meta-objects together with a relation between them. For example, a class is composed of an ordered list of slots, a generic function consists of a set of methods, a patch contains a list of boxes, etc. This function returns the list of elements of a meta-object.
• Get-View and Get-Editor. They return the two possible graphical representations of a meta-object.
• Open-Editor. It shows the editor for a meta-object.
• Add-Element and Remove-Element. They allow editing of any basic object.
• Box-Value. It enables us to trigger the evaluation of a visual expression in a patch.
Figure 51.26 shows the graphical representation of the dynamic and static parts of the OpenMusic MOP. One may be surprised by the simplicity of the protocol. In general, a protocol which is specified in too much detail is not very modular. This means that changes must be made in several places and, as a consequence, modifications are not very reliable. On the other hand, a poor specification of the protocol makes it very difficult to find the place where a modification should be made.
Figure 51.26: Dynamic and static part of the OpenMusic MOP.
The main tools for the meta-programmer are subclassing inside the static class part and redefining functions in the dynamic protocol part. It is clear that the user can change some features of the language, but the most interesting point is that he can extend the language without loss of compatibility with old programs. Programmers can profit from customizing the language semantics. The example in figure 51.27 shows the redefinition of the generic protocol function Box-Value, which is called at each evaluation of a box. Graphical redefinition of these two methods changes the behavior of the language by introducing a visual trace of programs. The first method, which is defined with the qualifier before, graphically selects any currently evaluated box before its execution. The second method, with the qualifier after, deselects the box after its evaluation. These two simple modifications allow us to extend the language by tracing the functional composition graphically and in an orthogonal way.
Figure 51.27: Changing a generic function of the MOP.
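Python has no before and after method qualifiers, but their effect on Box-Value can be imitated by subclassing and calling the inherited method, as in the following sketch. The classes `Box` and `TracedBox` are hypothetical; the `select` and `deselect` entries appended to the trace stand in for the graphical selection of the box.

```python
class Box:
    def __init__(self, name, function, *inlets):
        self.name = name
        self.function = function
        self.inlets = inlets
        self.selected = False

    def box_value(self):
        """Primary method: evaluate the inlets, then apply the function."""
        args = [inlet.box_value() if isinstance(inlet, Box) else inlet
                for inlet in self.inlets]
        return self.function(*args)


class TracedBox(Box):
    trace = []    # shared log standing in for the visual trace

    def box_value(self):
        # role of the "before" method: select the box
        self.selected = True
        TracedBox.trace.append(("select", self.name))
        result = super().box_value()          # the primary method
        # role of the "after" method: deselect the box
        self.selected = False
        TracedBox.trace.append(("deselect", self.name))
        return result


x = TracedBox("x", lambda: 3)
square = TracedBox("times", lambda a, b: a * b, x, x)
result = square.box_value()
```

The primary behavior is untouched; only the selection and deselection steps are added around it, which is why existing patches keep producing the same values.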
51.4.1 Reification of Temporal Boxes
Finally, this section shows how we have extended the temporal boxes in maquettes by applying meta-programming concepts. Indeed, thanks to the graphical reification of temporal boxes, users have access to the temporal boxes at the same level as other objects like notes, chords, etc. Figure 51.28 shows a first example of the new possibilities. By opening the editor of the temporal box, we can see a new box self (indicated by an arrow) that represents the temporal box itself. The three public slots, available as outlets, are from left to right: the meta-object itself, its offset and its extension. The process associated with this temporal box builds a major chord whose pitches are transposed according to the box's time offset in the maquette (which means that the transposition interval is directly proportional to the start time of the box).
Figure 51.28: Mixing calculations and temporal relations.
As has been shown above, temporal boxes may send and receive data by using connections. By reifying temporal boxes, we allow them to have access to other temporal boxes belonging to the maquette. In figure 51.29 the upper temporal box is sent as a whole to the other one, which sets its own offset to the same value as that of the upper box. One evaluation of the maquette will produce two chords and will align them in time.
Figure 51.29: Mixing calculus and temporal relations again.
We stress two principal temporal box relations within maquettes: a temporal relation, given by the horizontal position, and a causal relation, established by functional connections between boxes. The examples in figures 51.28 and 51.29 show how we have combined these relations by using the MOP. In the first example, the result of the calculation is affected by the position of the box. Informally, we can say that time changes calculation. In the second example, the evaluation of the functional composition of the boxes changes their position in time. In this case, the calculation changes the time organization. We finish this chapter with a musical example of composition using OpenMusic.
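Both directions of influence can be imitated with a small Python model. The class `TemporalBox`, the millisecond time unit, the root pitch and the proportionality factor `cents_per_second` are all assumptions chosen for illustration; only the general idea (offset determines transposition, and one box may take over the offset of another) comes from the two examples above.

```python
from dataclasses import dataclass


@dataclass
class TemporalBox:
    offset: int      # start time in milliseconds (assumed unit)
    extent: int      # duration in milliseconds (assumed unit)

    def major_chord(self, root=6000, cents_per_second=100):
        """Time changes calculation: transposition proportional to the offset."""
        shift = self.offset * cents_per_second // 1000
        return [root + shift + interval for interval in (0, 400, 700)]

    def align_with(self, other):
        """Calculation changes time: take over the offset of another box."""
        self.offset = other.offset
        return self
```

A box starting at two seconds then yields a major chord transposed by 200 MIDI-cents, and a box that receives another box as input and calls `align_with` ends up sounding at the same moment.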
51.5
A Musical Example
Summary. Encore, by Jean-Luc Hervé, for two ensembles, live electronics, and two MIDI-controlled pianos (commissioned by IRCAM-EIC). Premiered on April 9, 2000 by the Ensemble Intercontemporain, conductor: Patrick Davin. Frédéric Voisin was musical assistant and designed in OpenMusic the maquette described here.
This piece concludes with a cadence for two mechanical pianos (actually two MIDI-controlled pianos) followed by a short orchestral finale. The cadence is actually an OpenMusic maquette that is played through MIDI. The piece is based on the concept of instrumental gesture. A gesture is a small musical unit with a typical energy profile, such as a glissando, a strong note preceded by a group of grace notes, a repeated note, etc. The gestures performed by the instruments may be continuous, such as a glissando. The piano cadence is prepared by a densification of orchestral gestures, soon imitated by the piano. At the end of the cadence, the instruments re-enter and the cadence progressively sinks into the orchestral mass. The idea behind the cadence is to accumulate and superpose an enormous quantity of discrete gestures (played by the piano) into a well-defined architecture so that the total construction sounds like a quasi-continuous sculpted sound shape. This is why it has been realized as a maquette: it was in fact impossible to write it out manually in detail. Rather, the architecture is specified by the hierarchical inclusion of maquettes within maquettes (three levels). At the deepest level, one finds the elementary gestures. Elementary gestures are temporal boxes that contain all the algorithmics necessary to generate them. Groups of gestures inside a submaquette are linked by two kinds of relations. Firstly, temporal logic relations force them to begin and end simultaneously, even if they are moved or stretched in time by the composer. Secondly, they have inputs that are fed by links coming from a special block. This block is not a true temporal block in the sense that it is there only for computing the harmonic material with which the gestures are colored; it does not generate any in-time music by itself.
We will not detail the algorithmics behind gesture generation; we only want to make precise the relation between the computation of music material and the time information. Whenever the composer stretches a temporal block in order to experiment with duration, the note durations inside the block are not extended proportionally, as the default maquette behavior would enforce. Rather, the composer's intention is to keep the same density of notes within a different duration. This is achieved by using the 'extent' outlet of the 'self' reflexive box inside a temporal block. By connecting this outlet to the right place in the visual algorithm, a gesture possessing the same overall profile is generated by inserting more notes picked from the harmonic reservoirs. The whole process is illustrated in figures 51.30 to 51.36.
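The density-preserving behavior described above can be illustrated by a small sketch. It is not the visual algorithm used in "Encore", which is not detailed here; the function name `generate_gesture`, the ascending profile, the evenly spaced onsets, and the reservoir values are assumptions chosen only to show how the box's extent can drive the number of notes instead of stretching their durations.

```python
from typing import List, Tuple


def generate_gesture(extent_ms: int,
                     notes_per_second: float,
                     reservoir: List[int]) -> List[Tuple[int, int]]:
    """Return (onset_ms, pitch) pairs filling the box's extent.

    The note count grows with the extent, so the density stays the
    same when the box is stretched. The profile (here: a simple
    ascent through the sorted reservoir) is kept for any length.
    """
    count = max(1, round(extent_ms / 1000 * notes_per_second))
    pitches = sorted(reservoir)
    gesture = []
    for i in range(count):
        onset = i * extent_ms // count
        # Map the position inside the gesture to a reservoir index.
        if count > 1:
            index = round(i * (len(pitches) - 1) / (count - 1))
        else:
            index = 0
        gesture.append((onset, pitches[index]))
    return gesture


# Hypothetical harmonic reservoir in midicents.
reservoir = [6000, 6400, 6700, 7100]
short_gesture = generate_gesture(2000, 5, reservoir)
long_gesture = generate_gesture(4000, 5, reservoir)
```

Stretching the box from two to four seconds doubles the number of notes, while both gestures still start on the lowest and end on the highest pitch of the reservoir.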
Figure 51.30: The piano cadence maquette of "Encore".
Figure 51.32: The fourth submaquette (starting left) opened.
Figure 51.33: The third maquette level revealed.
Figure 51.34: The fifth block (starting left) opened. The block on the top generates the harmonic material. The seven blocks underneath are gestures. The vertical lines are temporal logic constraints.
Figure 51.35: A gesture opened. The algorithmics generate in-time music material. The link coming from the 'self' box informs the algorithm about time conditions.
Figure 51.36: A gesture opened in music notation mode. Manual modifications can be made at this stage.
Part XIV
String Quartet Theory
Chapter 52
Historical and Theoretical Prerequisites
Bei der nämlichen Gelegenheit fragte ich Haydn, warum er nie ein Violinquintett geschrieben habe, und erhielt die lakonische Antwort, er habe mit vier Stimmen genug gehabt.
Ferdinand Ries [458, p.287]
Summary. This chapter introduces the most highly developed theoretical part of instrumentation: string quartet theory. It starts with a short historical synopsis and then reviews Ludwig Finscher's work [151] on string quartet theory. We then focus on the technical core subject: the violin family as an instrumentation paradigm for the string quartet. The chapter concludes with a general discussion of the semantics of sound colors.
The scope of this part is the elaboration of a systematic foundation of the distinguished role of the string quartet at the end of the eighteenth century. This part is a synthesis of methods and results from modulation theory (chapter 27), classification (chapter 15), and counterpoint theory (part VII), combined with knowledge about the nature of sound parameters (see also appendix A). It is not astonishing that precisely the classical string quartet, one of the absolute highlights of instrumental music, needs a complex theoretical background for its comprehension. But since we deal with a systematically as well as historically founded phenomenon, it is adequate to prepend some remarks on the theory and history of the string quartet, remarks which we shall orient towards Ludwig Finscher's pioneering work [151]. This art form, whose instruments include two violins, one viola and one violoncello, can be characterized in the following way [374, p.409]: "In the string quartet, individuality and character of the single players are combined to a harmonic whole, where each one finds himself with an added profile. This reaches from a single complex chord which, in contrast to the piano, is not played and nuanced by the hand of one player, but by four players, to the entire performance. In the quality as a whole lies the specificity of the string quartet. In its harmonically contrasting togetherness, this chamber music corresponds to the ideal world view and to the high humanistic vision of the classical epoch."
G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
52.1
History
Summary. The history of the string quartet is exceptionally short, since this instrumental species appeared almost instantly around 1760 with the works of Luigi Boccherini and Joseph Haydn. It almost instantly established a leading stream of sophisticated instrumental expression. We discuss the historical background of this phenomenon.
The prehistory of the string quartet is more complicated than that of any other instrumental art form of the eighteenth century. It cannot be causally deduced from any single one of the threads of tradition from which it derives. To a certain degree it is the creative act, the invention out of a moment of delicate historical equilibrium, the kairos in the sense of ancient Greek thinking. The prehistory dates only from about 1720 to 1760, when Luigi Boccherini and Joseph Haydn independently invented the string quartet. In 1761, Boccherini wrote his first quartets in northern Italy; they were published in 1767-68 in Paris under the name of "quatuor concertant". Haydn had probably written quartet "divertimenti" already in the 1750s in Vienna; they became well known, however, only in 1760. The sparse regional, instrumental, and stylistic rootedness in the string quartet's prehistory, from which this new art form emerged quite spontaneously, provokes the question of whether, beyond historical rationales, a more systematic understanding could better illuminate the 'string quartet phenomenon'. The problem is to question this precise date (1760) of the rise of this precise instrumental art form (the string quartet) in the context of European music from the systematic point of view. On this question, Dahlhaus [104, p.105, p.119] is quite right in stating that
(...) erst die systematische Konstruktion den Blick dafür öffnet, welche Tatsachen einer Geschichte angehören, die zu erzählen lohnend scheint. (...) Daß etwa das Ausmaß in dem die Besetzung von Instrumentalmusik im 18. Jahrhundert gattungsprägend wurde, mit dem Grad ästhetischer Autonomie, mit der Herausbildung musikalischen 'Formdenkens' und mit der Festigung der Institution des öffentlichen Konzerts eng zusammenhing, ist keineswegs nur eine geschichtliche Tatsache, die sich empirisch feststellen läßt, sondern erscheint auch als Sinnzusammenhang, der sich einer phänomenologischen - also 'systematischen' - Analyse erschließt.
52.2
Theory of the String Quartet Following Ludwig Finscher
Summary. In his habilitation thesis [151] Ludwig Finscher investigated the theory of the string quartet and exhibited three perspectives: the texture of four parts, the topos of conversation among cultivated humanists, and the family of violins. We discuss this threefold theory.
If we have decided to turn the string quartet into the subject of a formally valid investigation, this is because the theoretical reflections on the string quartet have reached a scientifically valid status. Finscher remarks [151, p.279]:
Das Streichquartett ist die einzige Gattung der neueren Instrumentalmusik, die eine solche an einem einzigen künstlerischen Modell entwickelte, vergleichsweise genau und detailliert ausformulierte und als allgemeinverbindlich akzeptierte Theorie ausgebildet hat.
This theory is based on two fundamentals:
• the four part texture;
• the topos of a conversation of four humanistically educated persons.
52.2.1
Four Part Texture
Summary. The texture of four parts is a basic structural prerequisite for the string quartet theory. We review its implications.
The four part texture was the ideal type of structured polyphony, oriented towards counterpoint with its long tradition. This is the formal, or better, formalized element of string quartet theory. We have to take it in its full conceptual ambiguity: on the one hand, as the texture "note against note" in its linear temporal progression in the sense of classical counterpoint; on the other hand, as a texture of vertical units (charts in the terminology of global compositions), as an expression of harmonic relations. In the radically harmonic thinking realized in the work of August Kollmann [268] from 1796, it is even possible to reverse the tradition by proposing the following thesis: Counterpoint should not start from the intervallic two-part texture, but from the four-part texture, since "a complete harmony" is four-part and not two- or three-part. It seems that the formation of theories was already influenced by Haydn's success with his famous "Russian" string quartets from 1782. It is interesting to observe that this ideal type of an instrumental art form was fixed to exactly four voices, not one more. Especially with Haydn one could imagine that he might have added a fifth voice to "enrich the texture". But it is reported that he 'failed' on several occasions with this 'experiment'. Ries reports in 1838 [151, p.287]:
Bei der nämlichen Gelegenheit fragte ich Haydn, warum er nie ein Violinquintett geschrieben habe, und erhielt die lakonische Antwort, er habe immer mit vier Stimmen genug gehabt. Man hatte mir nämlich gesagt, es seien drei Quintette von Haydn begehrt worden, die er aber nie hätte komponieren können, weil er sich in den Quartettstil so hineingeschrieben habe, daß er die fünfte Stimme nicht finden könne; er habe angefangen, es sei aber aus einem Versuche am Ende ein Quartett, aus dem anderen eine Sonate geworden.
At present, Haydn's argument that he has "enough" with the four voices cannot be understood. We come back to this point at the end of our discussion in section 54.3.
52.2.2
The Topos of Conversation Among Four Humanists
Summary. String quartet tradition is intimately related to non-verbal humanistic conversation. This gives the species a rhetorical character which has important consequences for the instrumentation problem.
On the one hand, this is a topos which stems from the analogy of contrapuntal texture to the conversation and argumentation of humans. For example, Mattheson [151, p.285] says on imitation:
... daß eine Stimme die andere gleichsam gesprächsweise unterhalte.
This topos must be carefully distinguished from the well-known topos of a "Klangrede"¹, i.e., from the similarity of musical expression or semantics to common language. In the case of the string quartet, the dialog is more important than the speaking, a fact that becomes more evident in the French expression "quatuor dialogué" for "string quartet" (in fact the invention of a publisher). The association of a discourse with the string quartet was initiated by the musician Johann Friedrich Reichardt in 1777 [151, p.287]:
Bei dem Quartett habe ich die Idee eines Gesprächs unter vier Personen gehabt.
Like Haydn, Reichardt also views the number four as the upper limit for a good dialog. He tries to add a fifth person to the quartet, but he fails:
Die fünfte Person ist hier ebensowenig zur Mannigfaltigkeit des Gesprächs nothwendig, als zur Vollstimmigkeit der Harmonie; und in jenem verwirrt sie nur und bringt Undeutlichkeiten in's Stück.
The same happens to Schumann [151, p.289] in a discussion about a viola that was added in a quintet:
Man sollte kaum glauben, wie die einzige hinzugekommene Bratsche die Wirkung der Saiteninstrumente, wie sie sich im Quartett äußert, auf einmal verändert, wie der Charakter des Quintetts ein ganz anderer ist, als der des Quartetts. Die Mitteltinten haben mehr Kraft und Leben; die einzelnen Instrumente wirken mehr als Massen zusammen; hat man im Quartett vier einzelne Menschen gehört, so glaubt man jetzt eine Versammlung vor sich zu haben.
The quartet discourse as a dialog is very well suited to communicate understanding within music. This is the sense of the dialog. Goethe stresses this aspect in an enlightening comment on a concert by Niccolò Paganini [151, p.288], [185]:
Mir fehlte zu Dem, was man Genuß nennt und was bei mir immer zwischen Sinnlichkeit und Verstand schwebt, eine Basis zu dieser Flammen- und Wolkensäule. Wäre ich in Berlin, so würde ich die Möserschen Quartettabende selten versäumen. Dieser Art Exhibitionen waren mir von je her von der Instrumentalmusik das Verständlichste: man hört vier vernünftige Leute sich untereinander unterhalten, glaubt ihren Diskursen etwas abzugewinnen und die Eigentümlichkeiten der Instrumente kennen zu lernen. Für diesmal fehlte mir in Geist und Ohr ein solches Fundament; ich hörte nur etwas Meteorisches und wußte mir weiter davon keine Rechenschaft zu geben.
The connection between the dialogical discourse and the communication of understanding which Goethe indicates means a valuation of the dialog, its qualification as a function of the communication of understanding. This valuation in turn is related to the instance of varying competition by Karl Popper [420]: "The value of a dialog depends above all on the manifold of competing opinions." But this manifold of opinions, always related to a given subject, is nothing else than a variation of points of view, of the perspectives of the participants. Therefore the dialogical principle of the string quartet turns out to be an instance of the Yoneda philosophy discussed in section 9.3: understanding or classification, respectively, by a variation of the point of view. It is not astonishing that this phrase is thoroughly indebted to humanism, with which the string quartet is deeply associated.
Result 9 Summarizing, one root of the string quartet, the four part texture, appears as a form which, in the sense of Hanslick, is a carrier of musical spirit. This carrier in turn is made accessible to our understanding by the second root: the dialogical discourse of four violin personalities, by means of a variation of perspectives.
¹ German for "sound speech"; difficult to translate, however.
At this point of our discussion the question arises why the string quartet and the end of the eighteenth century are related to each other, more precisely: What is the connection between the four-ness within the family of violins (and it is above all violins, not violinists, that speak!) and the paradigm of four part texture in Haydn's time? This question leads to a mathematically tractable apparatus in the manifold of violin sounds in defined parameter spaces.
52.2.3
The Family of Violins
Summary. We discuss the exceptional role of the family of violins in the building process of the string quartet species. In Finscher's work, other instrumental families are compared to the violins; we comment on the results of this study.
The formation of the string quartet would not have been thinkable without the contribution of the homogeneous sound of the instruments of the violin family, which in the eighteenth century was discovered as an ideal type for four-part music and was also perfected on the artisanal level after the violin's creation from local variants of string instruments in the fifteenth century. The high quality of violins is guaranteed by a technical standard which, through its fineness and sometimes also imponderability (such as the choice of woods and the varnish covering), defies any normalization. The individual sound color of every good violin is characteristic of the family. But the family of violins is also strongly differentiated from other families of string instruments. Finscher describes [151, p.124/125] the characteristic of violins as compared to the gambas, which were the preferred solo instruments in the seventeenth century, as follows:
Die Violinen hatten gegenüber den Gamben jedoch noch eine weitere zukunftsträchtige Eigenart: Sie gliederten den Tonraum, der ehemals in Analogie zu den menschlichen Stimmengattungen gebildet, nun aber in der Tiefe wie in der Höhe längst kräftig erweitert worden war, klarer und sinnfälliger, mit deutlicherer Individualisierung ihrer jeweiligen Tonbereiche. (...) Für das klassische Streichquartett, das die Beweglichkeit, den Lagenwechsel, den Kontrast- und Farbreichtum des symphonischen Streichersatzes mit der größtmöglichen Annäherung an eine streng auskomponierte Vierstimmigkeit zu verbinden suchte, bot sich das vierstimmige Ensemble aus Gliedern der Violinfamilie als das ideale Instrument an.
It will be a task of the following chapter to access the specificities of the sound of the violin family via a mathematical description in the frame of parameter spaces.
Chapter 53
Estimation of Resolution Parameters
Si tous les instruments jouent l'accord staccato, je supprime ce moyen naturel d'analyse, et la perception ne peut plus discerner, à l'intérieur du bloc sonore, de quelle combinaison il s'agit. L'identification dépend, dans ce cas, de la présence ou de l'absence des éléments essentiels pour la perception.
Pierre Boulez [61, p.547]
Summary. This chapter is a technical account of the variety of sound parameters which intervene for the family of violins.
Based on the preceding historical, systematic, and philosophical reflection, we argue that the classification theory of global compositions (chapters 15 through 17) is an excellent candidate to be applied to the theory of the string quartet. From classification theory it follows that the resolution of a global composition is a formalization and optimization of the process of understanding this composition. We shall likewise apply this approach to string quartets, which means that we have to think of the composition as being a global composition $C^I$ whose resolution $\Delta_{C^I}$ one wants to represent. In chapter 54, we shall specify the global composition associated with a string quartet composition. Our approach to the resolution is essentially realized via a variation of the perspective of the resolution parameters, i.e., by a differentiated change of their dichotomy into essential and accessory parameters. Within the string quartet, this dynamics of weighting of performance parameters is a subtle tool for communication of understanding and articulation as a vehicle of dialog. It is as if you walk around an object (the resolution in our case) and observe it from one, and then from another perspective.
If we now envisage the resolution of a string quartet composition, we first have to investigate the possibility of parametrization by concrete physical performance parameters as they occur in the arsenal of the violin family.
53.1
Parameter Spaces for Violins
Summary. We describe the denotator types for violins, including generic geometric parameters, sound color parameters, and technical parameters, in particular those related to vibrato.
As in classification theory, we want to work over $\mathbb{Q}$, which is a reasonable postulate for string quartet theory since we are dealing with physical parameters. Here we face a fundamental mathematical problem, since we have to deal with a huge number of independent numerical parameters. If, for example, a sound color has to be represented by means of the amplitudes and phases of the Fourier representation, and by the envelope, we easily add up hundreds of numerical parameters. Even though locally the structure of musical thinking may concern a small number of parameters, the techniques of classification theory, by their requirement of points in general position, enforce a number of geometric degrees of freedom that may be far larger. We shall have to work with charts in modules $\mathbb{Q}^n$ of large dimension $n$. This should, however, not prevent us from visualizing essential aspects of our reflections in three-space $\mathbb{Q}^3$. The only point here is to build mathematically representative analogies in three-space.
What is the shape of the physical parameter space of a violin sound? To begin with, we have at our disposal parameters which we call geometric¹, i.e.,
• Onset
• Duration
• Pitch
¹ See appendix A.1.2.1.
The amplitude (loudness) is omitted in this presentation since the poietic violin parameters are coupled to the way of generation of the sound color aspect. The other attributes of the violin sound span the color space. To begin with, this aspect includes the following ingredients:
• Envelope
• Amplitude
• Fourier spectrum
What is the range of variability of these color aspects of a violin sound? Regarding this question, we refer to [371]. For example, we consider the amplitude spectrum for pitch g and g♯ of a "Guarneri del Gesù" as compared to an F horn, see figure 53.1. One recognizes that the spectra between g and g♯ are significantly more different for the "Guarneri" than for the F horn. In the representation of the amplitude spectrum as a vector in the color space, one may say that the change from g to g♯ for the F horn is essentially a dilatation, whereas the corresponding change for the "Guarneri" violin also includes a change of direction (figure 53.2). In other words: The g and g♯ spectral vectors of the "Guarneri" span a plane in the color space, whereas the corresponding vectors for the F horn lie on a line. This statement evidently is not meant in a strictly physical sense, but in the sense of valence theory (see appendix B.2): For the F horn, the g and g♯ spectral vectors are indistinguishable in auditory perception from a pair of linearly dependent vectors².
Figure 53.1: We consider the amplitude spectrum (frequency axis in kHz) for pitch g and g♯ of a "Guarneri del Gesù" (top) as compared to an F horn (bottom). One recognizes that the spectra from g to g♯ are significantly more different for the "Guarneri" than for the F horn.
² This discussion is somewhat speculative since precise measurements should be made and relations to valence theory should be investigated in a more quantitative way. However, the means for such an investigation depend on insight into the basic problem setup.
Figure 53.2: In the representation of the amplitude spectra as vectors (represented in 3D space here), the "Guarneri" vector differs beyond the valence limit when comparing the g sound to the g♯ sound.
This instrumental difference is justified by the fact that for winds, the sound color is essentially built from an air column, but not from the material. In contrast, for the violins, sound color is an essential function of the material, i.e., of the corpus' resonance properties. Moreover, the spectrum depends on the string on which a fixed pitch is intonated (to this end, the spectral envelopes for different strings are compared in [371]). All this turns the sound color of violins substantially into a function of the individual construction: from the material through the artisanal manufacturing to the individual history of the particular instrument. This variability is out of the question with wind instruments. For the same reasons, string pianos are inferior to violins. Figure 53.3 shows the principal configuration of the spectral vector for three violins as opposed to three such vectors for string pianos.
Figure 53.3: The instrumental sound parameters of violins are more variable individually than these parameters are for pianos. Here the amplitude spectra of three violins against three pianos are visualized schematically in 3D space. In contrast to the piano spectra, those of violins lie in general position.
To these instrumental parameter properties, central techniques of instrumental practice of sound shaping are added, techniques which for several other instruments do not even exist. The string player has the following possibilities to vary parameters:
• Bow pressure
• Bow velocity
• Contact point of bow and string
• Bow angle
Further, the string player may shape his/her vibrato according to the following points of view:
• Delay time with respect to the tone's onset
• Modulation frequency (frequency of the finger's movement)
• Pitch modulation (range of finger movement on the string)
• Amplitude modulation (contact point of the fingertip)
Compared to the color attributes envelope, amplitude, and spectrum, these vibrato parameters are new. They enable the violinist to realize sound in still larger spaces. Together with the four bow parameters, the four vibrato parameters define an additional eight-dimensional space. These eight parameters, which emerge in contrast to the instrumental parameters, are called technical parameters. Figure 53.4 shows the effect of bow pressure and contact point variation while bow velocity and sound color remain constant.
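The distinction between a dilatation and a change of direction of spectral vectors can be made concrete with a small numerical sketch. The amplitude values below are invented for illustration and are not measurements from [371]; likewise, the threshold `VALENCE_LIMIT` is only a hypothetical stand-in for the perceptual tolerance of valence theory.

```python
import math
from typing import Sequence

VALENCE_LIMIT = 0.05  # hypothetical tolerance, in radians


def angle_between(u: Sequence[float], v: Sequence[float]) -> float:
    """Angle (radians) between two non-zero amplitude spectra."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    # Clamp to guard against rounding slightly outside [-1, 1].
    cosine = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.acos(cosine)


def spans_plane(u: Sequence[float], v: Sequence[float],
                limit: float = VALENCE_LIMIT) -> bool:
    """True if the two spectra differ in direction beyond the limit."""
    return angle_between(u, v) > limit


# Invented partial amplitudes (first four partials) for two pitches.
horn_g = [1.0, 0.6, 0.3, 0.1]
horn_g_sharp = [0.9, 0.54, 0.27, 0.09]     # a pure dilatation of horn_g
violin_g = [1.0, 0.2, 0.7, 0.4]
violin_g_sharp = [0.5, 0.9, 0.1, 0.6]      # a change of direction

horn_is_planar = spans_plane(horn_g, horn_g_sharp)
violin_is_planar = spans_plane(violin_g, violin_g_sharp)
```

With these toy values the horn vectors are treated as lying on a line, while the violin vectors span a plane; real data would require the quantitative investigation that the footnote above calls for.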
53.2
Estimation
Summary. This section is devoted to a theorem giving an estimation of the maximal possible chart dimension ch(n) within a global composition, which can be produced by an orchestra consisting of n individual strings (in the violin family). We make use of the resolution theory for classification of global compositions (see chapter 15.2).
In the last step of our parameter analysis, we deal with differentiation within the technical parameters. Let us recall that we are searching for parameter spaces, i.e., coordinate functions which are suited for charts of global compositions. But this means that one has to distinguish parameters which can be varied independently of each other in short parts of the composition, in fact on charts, from those parameters which may very well vary from player to player or from situation to situation within a larger composition, but which are relatively constant on local regions. The latter, which may strongly depend on the player's personality, include:
• The vibrato parameters. Even for a professional violinist they can scarcely be separated from the personality and are difficult to control.
• The bow angle and contact point. These are relatively inert parameters and therefore not suited for extremely local purposes.
The local variability is therefore distributed over two dimensions:
• Bow pressure, which above all acts on the amplitude. (This is the reason why we did not add amplitude to the geometric parameters here: it is only an aspect of the action of bow pressure!)
• Bow velocity, which above all acts on the dynamics of partials.
Evidently the bow parameters are coupled with each other in their action; we have only indicated the main actions.
Figure 53.4: The effect of a variation of bow pressure and contact point while bow velocity and sound color remain constant. [Axes: bow pressure (min to max) versus contact point (between bridge and end of fingerboard).]
Summarizing, we have found three types of color parameters:
1. instrumental parameters (violin type, choice of strings, performance conditions),
2. global technical parameters (vibrato, bow angle, contact point), which are a strong function of the individual player,
3. local technical parameters (bow pressure, bow velocity), which can be steered quite independently of each other.
Therefore, the local technical parameters define a plane in the color space. Since these are under general control and can be steered objectively, we may assume that these planes $H_1, H_2, \ldots$ for player 1, player 2, ... are one and the same player-independent plane $H$ up to an individual translation (figure 53.5). Observe that our hypothesis is made in the context of the widespread string quartet at the end of the eighteenth century, and that in fact the signification of the string quartet is a pronounced reality of music sociology with regard to normal music practice in bourgeois salons. It would be unrealistic to model our theory upon elite ensembles in this context.
Figure 53.5: In the color space of violins a plane $H$ is appended to the heads of individual vectors $v_s$ of instrumental and global (inert) color parameters and yields $H_s = v_s + H$ for player $s$.
Based on the above observations about the individuality of colors and the violinists' personalities, we may set forth the following hypothesis in the spirit of string quartet theory:
Assumption 3 The $n$ "instrumental vectors" $v_1, v_2, \ldots$ which are spanned by the instrumental and global technical parameters can be chosen to be linearly independent of each other and of the common plane $H$.
This means that the submodule spanned by $v_1, \ldots, v_n$ and $H$ has dimension $n + 2$. In other words: If $H$ is spanned by two vectors $h, k$, then the zero vector and the heads of the vectors $v_1, \ldots, v_n, h, k$ are in general position³. If we add the three-dimensional module $G \cong \mathbb{Q}^3$ of geometric parameters to the color space (direct sum), we obtain a total parameter space in which the $n$ instrumental vectors are positioned, and to each of them is attached a $2 + 3 = 5$-dimensional space $H + G$ of local parameters, see figure 53.6.
Figure 53.6: Viewed as points for the $n$ instruments of the violin family, the sounds are distributed on $n$ affine subspaces $H + G + v_s$, $s = 1, \ldots, n$, in the total attribute space. Here $G$ is the three-dimensional space of sound geometry (pitch, onset, duration), $H$ is the plane of the local technical parameters bow pressure and bow velocity, and $v_s$ is the $s$th instrument vector. One may assume that the $\mathbb{Q}$-module spanned by $G$, $H$, $v_s$, $s = 1, \ldots, n$ has dimension $n + 5$.
In view of classification theory, our question is how many points in the total parameter space can maximally be distributed in general position on the $n$ affine subspaces $H + G + v_s$, $s = 1, \ldots, n$. Every such point has the form $x = v_s + h + g$, $h \in H$, $g \in G$, see figure 53.6. It can be shown (appendix E.2.1, theorem 47) that there are maximally $n + 5$ points, where 5 is the dimension of $G + H$. The musicological meaning of this result for classification theory is this:
Theorem 36 With n string players from the violin family, charts of global compositions in local parameters can be defined with maximally n + 5 points in general position.
³ See appendix E.3.4.
Chapter 54
The Case of Counterpoint and Harmony

Au fur et à mesure que l'orchestre s'agrandit, que le rôle de l'instrument devient, je ne dirais pas flou, mais ductile, multiple, les formes, elles aussi, s'amplifient. [As the orchestra grows, and as the role of the instrument becomes, I would not say blurred, but ductile and multiple, the forms too are amplified.]
Pierre Boulez [61, p.544]

Summary. This final chapter on string quartet theory deals with the analytical conditions on global compositions which are powerful enough to comprehend the structural richness of central European music in the epoch of Boccherini and Haydn. These structures are, essentially, counterpoint and harmony. As a germ for a systematic theory of instrumentation we propose an estimation of the maximal necessary chart dimension for Fuxian counterpoint and traditional harmony (including cadence and modulation).
As the last member of our model for the string quartet, we need the initially announced information to solve the question of how a string quartet should be defined as a global composition. To this end we imagine that a score, written by Haydn, say, is given as a local composition. From this one would like to construct an adequate interpretation (in the technical sense of interpretable global compositions). It is here that the historical moment comes into play. The interpretation should be such that the European structural music thinking in the four-part texture at the end of the eighteenth century is expressed in its essential features. In our presentation of the roots of the string quartet theory, the structural basis was presented first: the four-part texture with its polysemic meaning as a contrapuntal as well as harmonic setup. These two main components are therefore to be investigated for the construction of an atlas.
54.1
Counterpoint
Summary. In this section, we calculate the upper limit of chart dimensions for counterpoint.
1007 G. Mazzola, The Topos of Music © Birkhäuser Verlag 2002
We make use of the counterpoint model exposed in part VII.
In this section we shall refer to the discussion of the core theory of counterpoint, "note against note" , as it was presented in chapters 29 through 31. For intervals which are represented by arrows in counterpoint theory, we need two-element charts (admitting that what is physically played are not arrows, but their heads and tails). For the cantus firmus and the discant steps we also need two-element charts, one for each. For the consideration of a progression from interval to interval, one needs four-element charts. It is advantageous to include also the succession of two interval steps, for example regarding hidden/composed tritones, we need six-element charts. This is however not the statement of the rules of counterpoint! As we know, this would be much more complex. But we have defined the "cartographical" setup, and that is what we need. Therefore:
Result 10 The classical contrapuntal texture as it was codified by Johann Joseph Fux in 1725, the early days of the prehistory of the string quartet, requires in its core structure maximally 6-element charts. All more complex configurations can be reduced to this core structure: Thereby charts are glued together, but not enlarged.
54.2
Harmony
Summary. In this section, we calculate the upper limit of chart dimensions for harmony. We make use of the cadence (chapter 26) and modulation (chapter 27) models.
The degree theory of chords as vertical structures interprets a chord as being a subset of a triadic covering (see chapter 25), i.e., by three-element charts. A cadence (see chapter 26) can be conceived within the scheme consisting of three degrees (typically: IV-V-I), where the minimal cadential sets (here: IV, V) are completed by the first degree or else by the tonic note of the given tonality. This produces an interpretation of maximally nine tones per chart, consisting of one, three or nine points. The most complex situation in the harmony at the end of the eighteenth century is crystallized in the modulation process, which we have formalized in the modulation model that is based on triadic degrees, see chapter 27. In a modulation which is presented as a sequence of neutral degree, modulation degree, cadence degree, we may recognize these three charts, with three elements each. Moreover, as with a cadence, the entire process is collected in a nine-element chart that contains the three triadic degrees. The cadence as such, which we described above as a nine-element interpretation, intersects our modulation in the cadence degree and concludes it as a process. Therefore:
Result 11 Harmony has a maximum of nine points for the relevant local charts.
54.3
Effective Selection
Summary. As a result of the instrumental parameter estimation made in chapter 53 and the structural parameter estimation from this chapter, we obtain a global theorem estimating the minimal number four of string instruments (in the violin family) which is needed to express analytical music structures in the compositions of the classical epoch of Boccherini and Haydn. This theorem makes essential use of the resolution theory for global compositions, as exposed in chapter 15.2.
Finally, we are in a position to indicate the minimal number of instruments from the violin family needed to provide the resolution of an interpretation of a score, under the structure prescribed by harmony and counterpoint at the end of the eighteenth century, with a sufficient number of free parameters. In sections 54.1 and 54.2 we have seen that a maximal number of nine points per chart is present in such an interpretation. Since in a resolution all charts have their points in general position, the number n of instruments must, according to theorem 36 at the end of section 53.2, satisfy the inequality

$$n + 5 \geq 9.$$ (54.1)

We therefore need at least four instruments for these purposes. As announced, our model of the string quartet has been deduced from the information about parameters, modulation theory, counterpoint, and the classification technique. From this point of view, it is not astonishing that classical string quartet composers such as Haydn did not see any sense in the accumulation of instruments: From a purely economical point of view, they were superfluous. Four string players were perfectly sufficient in order to provide the textural structure with a profiled representation in its resolution.
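The arithmetic behind inequality (54.1) can be retraced in a few lines. The following is a minimal sketch, not part of the original theory: the function and constant names are illustrative assumptions, and the chart sizes are simply the ones named in sections 54.1 and 54.2.

```python
# Chart sizes named in the text (sections 54.1 and 54.2).
COUNTERPOINT_CHART_SIZES = [2, 2, 4, 6]  # interval, voice step, progression, two steps
HARMONY_CHART_SIZES = [1, 3, 9]          # tonic note, triadic degree, cadence/modulation

# dim(G + H): three geometric plus two local technical parameters.
LOCAL_DIMENSION = 5


def max_points_in_general_position(n_instruments: int) -> int:
    """Bound of theorem 36: at most n + 5 points per chart."""
    return n_instruments + LOCAL_DIMENSION


def minimal_instruments(max_chart_size: int) -> int:
    """Smallest n >= 1 such that n + 5 >= max_chart_size."""
    n = 1
    while max_points_in_general_position(n) < max_chart_size:
        n += 1
    return n


largest_chart = max(COUNTERPOINT_CHART_SIZES + HARMONY_CHART_SIZES)
print(largest_chart, minimal_instruments(largest_chart))
```

With the nine-point charts of harmony as the largest case, the loop stops at four instruments; counterpoint alone, with six-point charts, would already be covered by a single instrument under this bound.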
Part XV
Appendix: Sound
Appendix A
Common Parameter Spaces

This appendix chapter is an overview, not an exhaustive treatise, of the spaces which parametrize sound objects. These spaces where sounds are positioned always define an aspect, never the totality of music thinking, and every attempt to define a preferred space will narrow the music thinking, not the music. The best that can occur is that we offer an encompassing or at least a representative ensemble of parameter spaces which are interrelated by a precise relation. To this end, it is recommended to distinguish topographic positions, above all in their realities and communicative perspectives. This will also entail the corresponding mathematics. We start with the physical descriptions, turn to more mathematical abstractions, and then describe more symbolic viewpoints, which we call interpretative since they are not just a new mathematical rephrasing of an a priori equivalent physical description, but express abstraction with some mental background constructions.
A.1
Physical Spaces
As a physical object, a sound¹ is a more or less regular variation of normal air pressure² as a function of time. Starting at a determined onset time $e$ sec, it starts from a source $Q$ at position $q = q_Q$ m in the ordinary physical space and expands as a wave. At a location $x$ and time $t$, the pressure variation (the difference from the normal pressure) $p_x(t)$ N m$^{-2}$ is perceived as a longitudinal air wave, i.e., with a pressure front perpendicular to the wave's direction of expansion, see figure A.1. For a punctual sound source, however, the wave front at $x \neq q$ is a spheric surface; we can write

$$p_x(t) = \frac{1}{|x - q|}\, p_q\!\left(t - \frac{|x - q|}{v}\right),$$ (A.1)

where $v$ is the expansion velocity of the wave³. It is calculated by the formula $v = C \cdot \sqrt{T}$, where $T$ K is the absolute temperature in Kelvin degrees, and $C$ is a constant with value $C = 20.1$ for the normal pressure. For normal conditions, we have $v \approx 343$ m sec$^{-1}$. But with complex sound sources and room-specific reflection and refractions, the pressure variation of a sound may be an overlapping of different spheric wave components. If several sound sources $Q_1, \ldots, Q_s$ are given, the resulting pressure variation sums up to $p_x(t) = \sum_{i=1}^{s} p_x^{Q_i}(t)$, where $p_x^{Q_i}$ denotes the variation caused by the source $Q_i$ alone.

¹ German: "Klang".
² At zero height above sea level and zero degrees Celsius, this is $\approx 1.1013 \cdot 10^5$ N m$^{-2}$.
³ It is known [462] that the square of the pressure variation is proportional to the intensity, i.e., energy flow per surface and time unit, and the latter, by energy conservation, decreases in inverse proportion to the square of the distance $|x - q|$, whence the formula.

Figure A.1: The prototypical punctual sound source and the spherical sound wave.
A.1.1
Neutral Data
In order to describe the neutral⁴ sound data, let us first concentrate on the information at the source location $q = q_Q$ of a point source $Q$, in an idealized model of a single instrument. We shall come back to room acoustics in the next section A.1.1.1. The source sound (variation) event $p_q(t)$ is usually a finite event, starting at time $e$ and ending after the duration $d$. The variation between these time limits is also limited by the maximal amplitude $A$ of the total pressure variation. So the function $p_q(t)$ is the affine image of a normalized function $p_q^0(t)$ which starts at time $e = 0$, has duration $d = 1$, and amplitude $A = 1$. More precisely:

$$p_q(t) = \mathrm{Support}(A, e, d)(p_q^0)(t) = A \cdot p_q^0((t - e)/d).$$ (A.2)

⁴ See chapter 2 for the concepts of music topography, such as "esthesic", "neutral", or "poietic".
The operator Support(A, e, d) reduces the unknown sound event to a normalized event. What happens between the normalized unit supports is however completely arbitrary. It may be a percussive sound or the sound of a Stradivari violin. The normalization by the support operator is completely harmless, but not much more than this data can be traced on the neutral level. The theory of all the rest is far from neutral; we are going to deal with this in section A.1.2.
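To make the support operator of (A.2) concrete, here is a minimal sketch in Python. The function names and the particular normalized event (a 5 Hz sine under a half-sine envelope) are illustrative assumptions, not taken from the text; only the rule "scale by A, shift by e, stretch by d" comes from (A.2).

```python
import math


def support(A: float, e: float, d: float, p0):
    """Return the function t -> A * p0((t - e) / d), as in (A.2)."""
    if d <= 0:
        raise ValueError("duration d must be positive")

    def p(t: float) -> float:
        return A * p0((t - e) / d)

    return p


def normalized_event(t: float) -> float:
    """Hypothetical normalized event on [0, 1]: zero outside the unit
    interval, with an amplitude close to 1 inside it."""
    if t < 0.0 or t > 1.0:
        return 0.0
    return math.sin(math.pi * t) * math.sin(2.0 * math.pi * 5.0 * t)


# Onset e = 2 sec, duration d = 0.5 sec, amplitude A = 0.3.
p = support(A=0.3, e=2.0, d=0.5, p0=normalized_event)
print(p(1.9), p(2.225), p(2.6))
```

The deformed event vanishes before the onset and after onset plus duration, and in between it is the normalized event read at the rescaled time, which is all that the neutral level records.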
A.1.1.1
Room Acoustics
The room is an important part in the information chain from the information source (instrument, speaker, public address system) to the receiver (listener, director, artist). Room sizes vary from small living rooms to huge cathedrals or concert halls. In this section, we want to tackle only basic features, reverberation time and acoustical power. For a large, irregular room, we can visualize the acoustical conditions by imagining a wave traveling inside the room. This wave travels in a straight line until it strikes a surface. It is reflected off the surface at an angle equal to the angle of incidence and travels in this direction until it strikes another surface. Because sound travels at about 343 m sec$^{-1}$, many reflections will occur within a small time span.
Absorption. After a wave has undergone a reflection from a wall that is absorbing, its intensity will be less during its next traverse of the room. In a large, irregular room, the waves traveling are so numerous that at each surface all directions of incident flow are equally probable. The sound absorption coefficient $a$ is therefore taken to be averaged over all angles of incidence. All materials have absorption coefficients that are different at different frequencies⁵ [54]. In the frequency range of interest between 250 Hz and 4 kHz, plain walls and floors, as well as closed windows, have absorption coefficients below 0.2. Higher absorption can be achieved with acoustic tiles and, at least in the upper frequency range, with thick carpets and draperies. The total absorption $A$ of a room is the sum of the products of the surfaces $A_i$ m$^2$ and the absorption coefficients $a_i$: $A = \sum_i a_i A_i$. If there is an open window in the room, all the energy incident on its area will pass outdoors and none will be reflected.⁶ The absorption of an area of acoustical material in a room can therefore be expressed in terms of the equivalent area of an open window. For this reason, the total absorption $A$ of a room can be characterized by its equivalent "open window surface".
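The sum for the total absorption can be written out as a short computation. This is a sketch under invented numbers: the room below (wall, carpet and window areas with their coefficients) is a hypothetical example, and the function name is an assumption.

```python
def total_absorption(surfaces):
    """Sum of area * absorption coefficient over (area_m2, coefficient) pairs.

    The result is the equivalent "open window surface" in square metres.
    """
    total = 0.0
    for area, coeff in surfaces:
        if area < 0 or not 0.0 <= coeff <= 1.0:
            raise ValueError("need area >= 0 and 0 <= coefficient <= 1")
        total += coeff * area
    return total


# Hypothetical room: plain walls, a thick carpet, and one open window.
room = [
    (60.0, 0.05),  # plain walls
    (40.0, 0.30),  # carpet
    (2.0, 1.00),   # open window: everything incident passes outdoors
]
print(total_absorption(room))
```

An open window enters with coefficient 1, which is why its area can serve as the unit in which all other absorbing surfaces are expressed.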
Critical Distance. The sound field which builds up in a room is fundamentally different from the free field situation. Let us first assume a sound source in a free field, i.e., a loudspeaker on a high post emitting sound in all directions. If the sound radiation is omnidirectional and no reflections occur, the source will emit a spherical wave and the sound pressure will decrease in inverse proportion to the distance from the source. For loss-less reflecting walls on all sides around the source, the sound waves will be reflected over and over. In the case of no absorption in the air, the sound pressure will then keep rising, as no energy loss occurs. Small absorption will cause an equilibrium. However, the sound field in the room will no longer be a directional spherical field because the reflected waves will by far dominate over the direct sound wave. The sound field will be diffuse: reflections will arrive from all directions with equal probability. Only in the close vicinity of the small sound source is the sound field directional, because there the sound pressure of the unreflected direct sound wave will dominate over the diffuse sound field.

In summary, the sound field in a room will be directional only close to the sound source $Q$. There, as we know from the above, at point $x$, the sound pressure falls with $1/|x - q|$. The sound field at a large distance from the source is diffuse. Sound pressure is almost constant and much higher compared to a free sound field. The distance from the source where the transition between these distinct regions occurs is called critical distance or diffuse field distance $r_H$. If the average absorption $a \leq 0.4$, the critical distance for the absorption $A$ can be calculated with a precision of about 10% [54]: (A.3)

⁵ See section A.1.2 for the discussion of the frequency concept.
⁶ This statement is strictly true only if the window is several wavelengths wide and high, otherwise diffraction will occur.
By the inverse distance law, the sound pressure decrease factor $P_D$ in the diffuse field of a room can be estimated by the formula: (A.4)

Reverberation Time. Temporal effects during the onset and the decay of sound are essential features in rooms and are not present in a free sound field. In large rooms, such as a cathedral, long decay times are apparent. The human auditory system can distinguish whether a sound source is located in a large room, a small room or even whether it is not in a room at all, but outside. The human ear is obviously able to extract information about the room size from the temporal structure of sound. We will therefore discuss the onset and decay of sound.
When the sound leaves its source, the direct sound reaches the receiver first; its delay is determined by the distance the sound has to travel divided by the velocity of sound. With further delay, reflected sound waves with only a slightly longer travel distance arrive, and later, reflections with longer routes and multiple reflections arrive as well. The sound pressure increases until it reaches its steady state value. Only the direct sound gives information about the location of the source, which is exploited by our hearing system for sound localization. The build-up of the reverberation is only perceived in highly reverberant rooms. The reverberation is much more perceptible after the source is muted, and the echo may still be noticeable after seconds. The direct sound ceases after the propagation time from the source to the receiver; however, all the reflected sound waves still arrive. Their intensity will be reduced by a factor $1 - a$ after each reflection on a wall. Therefore, the sound pressure will decrease exponentially.
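The exponential decay can be illustrated by counting reflections. The sketch below assumes an idealized single ray that loses the fraction $a$ of its intensity at every wall contact; the function name and the sample coefficients are illustrative, not from the text.

```python
import math


def reflections_for_decay(absorption: float, decay_db: float = 60.0) -> int:
    """Number of reflections after which the level has dropped by at least
    decay_db, if each reflection multiplies the intensity by (1 - absorption)."""
    if not 0.0 < absorption < 1.0:
        raise ValueError("absorption must lie strictly between 0 and 1")
    # Level loss per reflection in dB (intensity ratio, hence factor 10).
    loss_per_reflection_db = -10.0 * math.log10(1.0 - absorption)
    return math.ceil(decay_db / loss_per_reflection_db)


for a in (0.05, 0.2, 0.5):
    print(a, reflections_for_decay(a))
```

After k reflections the intensity is $(1 - a)^k$ times its initial value, so the level in dB falls linearly with k. This is the discrete counterpart of the exponential decrease stated above.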
The reverberation time $T$ is defined as the time required for the sound to decay by a sound pressure level⁷ of 60 dB. Let $a_L$ m$^{-1}$ be the absorption in air (a highly frequency- and humidity-dependent variable), and let $V$ m$^3$ be the volume of the room. Then the reverberation time $T$ can be estimated with Sabine's formula [595]:

$$T \approx \frac{0.161\, V}{A + 0.46\, a_L V}\ \text{sec}.$$ (A.5)
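Formula (A.5) translates directly into a small function. This is a sketch only: the function and parameter names are assumptions, the constants 0.161 and 0.46 are those of (A.5), and the example room is invented.

```python
def sabine_reverberation_time(volume_m3: float,
                              absorption_m2: float,
                              air_absorption_per_m: float = 0.0) -> float:
    """Estimate T in seconds as 0.161 V / (A + 0.46 a_L V), see (A.5).

    air_absorption_per_m defaults to 0, i.e. absorption in air is neglected.
    """
    denominator = absorption_m2 + 0.46 * air_absorption_per_m * volume_m3
    if volume_m3 <= 0 or denominator <= 0:
        raise ValueError("volume and total absorption must be positive")
    return 0.161 * volume_m3 / denominator


# Hypothetical room of 200 cubic metres with 17 square metres of absorption.
print(sabine_reverberation_time(200.0, 17.0))
print(sabine_reverberation_time(200.0, 17.0, air_absorption_per_m=0.01))
```

The second call shows the qualitative effect of the air term: any positive $a_L$ enlarges the denominator and shortens the estimated reverberation time.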
In the frequency range below 4 kHz, sound absorption in air is usually neglected, whereas for frequencies above 4 kHz, the reverberation time $T$ is mainly determined by the absorption in air $a_L$. The reverberation time is the most important variable to describe the acoustics of rooms. In rooms with a long reverberation time, sources with a relatively low level yield a high sound intensity; however, speech intelligibility is decreased due to increased temporal masking. As a compromise, the reverberation time must be "appropriate" to the room size. For speech, reverberation time should be between 0.5 sec and 1 sec (increasing with room size); for music presentations, 1 sec to 2 sec are acceptable.

Beyond the global description of a room by the reverberation time, the temporal fine structure of the reverberation, i.e., the temporal incidence of the reflections, is of interest. From the first reflections, the human hearing system is able to extract information about the size of the room. If the first reflections occur very early (1 msec to 10 msec after the direct sound wave), the sound color is altered, especially for music recordings. Reflections in the time span from 10 msec to 50 msec increase the perceived loudness. Single reflections arriving with a delay of more than about 100 msec are perceived as echoes. Very disturbing are periodic echoes, which are generated between parallel walls, for example. For good room acoustics, the reflections should be homogeneous and the intensity should decrease with time. Single echoes should not be larger than 5 dB compared with their temporal vicinity. In concert halls, the reverberation time is measured as a function of frequency and additional sound absorbers and reflectors are placed to improve the acoustics.
Remark 24 This sketchy discussion shows that room acoustics is a complex topic; therefore acoustic experts should be consulted already in the planning of music rooms. This need is documented by plenty of examples, where the acoustics of rooms built for the purpose of audio presentations is so bad that speech intelligibility is severely hampered. Thoroughly planned room acoustics is essential not only for concert and lecture halls, but also for most other rooms such as offices and even hallways and production areas, to keep noise levels down and achieve an environment which is pleasing to the ear.

Remark 25 We have also included this discussion since it makes plausible that there is no chance to integrate a poor theory of room acoustics in a valid music-theoretic framework, and that great efforts should be made to lift this status in order to give the compositions with room acoustical specifications a firm background.

⁷ See section A.2.2 for the definition of loudness.
A.1.2
Sound Analysis and Synthesis
This central subject of acoustics relates to the poietic (synthesis) and esthesic (analysis) aspects of neutral sound data, more specifically, of the sound pressure variation function $p(t) = p_q^0(t)$ at a source location $q$ and cast in a determined standard support (see section A.1.1). More precisely, we are considering a time function $p$ which is defined for all times, vanishes outside a finite time interval, e.g., $[0, 1]$, and has a finite absolute amplitude supremum $\sup(|p|)$, e.g., $\sup(|p|) = 1$. Sound synthesis means that we have exhibited an operator $a$ which for each sequence of its finite or infinite number of numeric (mostly real-valued) parameters $x_1, x_2, \ldots$ yields a function $p = a(x_1, x_2, \ldots)$ of the described type. Analysis then means that we are given $p$ and the operator $a$ and would like to determine a sequence $x_1, x_2, \ldots$ of arguments such that $p = a(x_1, x_2, \ldots)$. The general map $a: (x_1, x_2, \ldots) \mapsto a(x_1, x_2, \ldots)$ will be neither surjective nor injective. So synthesis is not a synthesis of any imaginable $p$, nor is analysis unambiguous, i.e., the fiber of $a$ could be a large set of parameters. Moreover, one may have a synthesis operator $a_1$ and an analysis operator $a_2$ such that the analysis of a tame synthesis may become pathological. In the following sections, we shall discuss four such operators: Fourier, frequency modulation, wavelets, and physical modeling. We shall however not deal with mixed synthesis/analysis problems which, mathematically speaking, are wild ones, let alone the associated technological problems.

A.1.2.1
Fourier
The Fourier approach deals with periodic functions, in our case periodic pressure variations $p(t)$ as functions of time $t$. This however is not the direct approach to produce a pressure function which is inserted into the support operator, since such a $p$ has an infinite support. In order to turn a periodic function $w$ into one with a standard support $\mathrm{Support}(1, 0, 1)$, say, it is usually multiplied by an envelope function $H$, see figure A.2. This is a continuous, piecewise differentiable⁸ non-negative function on $\mathbb{R}$ which fits in the standard support $\mathrm{Support}(1, 0, 1)$, i.e., $H(t) = 0$ outside $[0, 1]$ and $\|H\|_\infty = \mathrm{Max}(H) = 1$. In most technological applications, $H$ is even a spline function (often even a linear, i.e., polygonal spline) modeling the attack and decay of a sound event. Then, the pressure function is given by $p = w \cdot H$. Observe that this is already a source of poietic ambiguities: neither $w$ (not even its frequency), nor $H$ are uniquely determined by $p$. We then say that the standardized pressure function $p$ is defined by the envelope $H$ and the wave $w$.

Fourier's theorem deals with periodic wave functions $w$ which are piecewise smooth⁹ waves. For any piecewise smooth function $w: \mathbb{R} \to \mathbb{R}$, the additive group $\mathbb{R}$ acts by translations: $(e^P \cdot w)(t) = w(t + P)$. The group of periods $\mathrm{Periods}_w$ of $w$ is the isotropy group of $w$ under this action. For any non-zero period $P$, the inverse $f_P = 1/P$ is called a frequency of $w$; its unit is Hertz, Hz. If $P_w = \inf(\{P \in \mathrm{Periods}_w, 0 < P\}) = 0$, $w$ is evidently constant; otherwise, $\mathrm{Periods}_w = \langle P_w \rangle$ is the discrete group generated by the smallest positive period $P_w$. To avoid ambiguities in the periods or frequencies of a wave, one addresses this smallest period $P_w$ or frequency $f_w = 1/P_w$ if one speaks about the "fundamental period" or the "fundamental frequency" of $w$ (otherwise, not even the period or frequency of a wave would be uniquely determined).

⁸ Differentiable except for a finite set of points.
⁹ Continuous, except for a finite set of discontinuities (it need not be defined in these points, but the left and right limits of the function exist in these points), with a continuous derivative, except for a finite set of points (where the derivative is not continuous or even not defined, but the left and right limits exist in all these singularities). Many examples are plain $C^1$ functions, but the saw-tooth function is not.

Figure A.2: The envelope $H$ (top left), the wave $w$ (top right), and its combination $H \cdot w$ (bottom left), as well as the affine deformation by the support operator $\mathrm{Support}(A, e, d)$ (bottom right).

Fourier's theorem is this (for a proof, see [276]):
Theorem 37 If $w$ is a piecewise smooth wave function and $P \in \mathrm{Periods}_w$ is a positive period, with $f = 1/P$ the corresponding frequency, then there are two sequences $(A_n, Ph_{n+1})_{n = 0, 1, 2, \ldots}$ of real numbers such that

$$w(t) = A_0 + \sum_{n = 1, 2, 3, \ldots} A_n \sin(2\pi n f t + Ph_n),$$ (A.6)

i.e., the infinite series converges and represents the wave at every time $t$ where the function is continuous. For the given period, the coefficients $A_n$ and $Ph_n$ are uniquely determined and can be calculated as follows:

$$A_0 = f \int_{-P/2}^{P/2} w(t)\, dt,$$
$$a_n = 2f \int_{-P/2}^{P/2} w(t) \cos(2\pi n f t)\, dt, \quad 0 < n,$$
$$b_n = 2f \int_{-P/2}^{P/2} w(t) \sin(2\pi n f t)\, dt, \quad 0 < n,$$
$$A_n = \sqrt{a_n^2 + b_n^2}, \quad 0 < n,$$
$$Ph_n = \arcsin(a_n / A_n) \text{ if } A_n \neq 0 \text{ and } Ph_n = 0 \text{ else.}$$
The $A_n$ is called the $n$th amplitude, whereas $Ph_n$ is called the $n$th phase of the wave with respect to the selected period. The sequence $(A_n)_n$ is called the amplitude spectrum, $(A_n^2)_n$ is called the energy spectrum since the energy of a wave is proportional to the square of the amplitude, and the sequence $(Ph_n)_n$ is called the phase spectrum. If the period/frequency is the fundamental period/frequency, one omits these specifications.
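The coefficient formulas of theorem 37 can be tried out numerically. The following is a minimal sketch, assuming a midpoint-rule approximation of the integrals and a square wave as test signal; all function names are illustrative. The phase is computed with atan2(a_n, b_n), which coincides with arcsin(a_n/A_n) whenever b_n is non-negative.

```python
import math


def fourier_coefficients(w, period, n_max, samples=20000):
    """Midpoint-rule estimates of A_0 and of (a_n, b_n, A_n, Ph_n), n = 1..n_max."""
    f = 1.0 / period
    dt = period / samples
    times = [-period / 2.0 + (k + 0.5) * dt for k in range(samples)]
    values = [w(t) for t in times]
    A0 = f * dt * sum(values)
    coeffs = []
    for n in range(1, n_max + 1):
        omega = 2.0 * math.pi * n * f
        a_n = 2.0 * f * dt * sum(v * math.cos(omega * t) for v, t in zip(values, times))
        b_n = 2.0 * f * dt * sum(v * math.sin(omega * t) for v, t in zip(values, times))
        A_n = math.hypot(a_n, b_n)
        # Tiny amplitudes are treated as zero, in which case the phase is set to 0.
        Ph_n = math.atan2(a_n, b_n) if A_n > 1e-9 else 0.0
        coeffs.append((a_n, b_n, A_n, Ph_n))
    return A0, coeffs


def square_wave(t, period=1.0):
    """+1 on the first half of each period, -1 on the second half."""
    return 1.0 if (t % period) < period / 2.0 else -1.0


A0, coeffs = fourier_coefficients(square_wave, period=1.0, n_max=3)
for n, (a_n, b_n, A_n, Ph_n) in enumerate(coeffs, start=1):
    print(n, round(A_n, 6), round(Ph_n, 6))
```

For this square wave only the odd partials survive, with amplitudes $4/(\pi n)$, so the printed amplitude spectrum falls off like $1/n$ over the odd indices.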
An equivalent representation (with coefficients $a_n, b_n$, unique for a given period) is obtained by expanding the sinoidal components via the goniometric formula $\sin(a + b) = \sin(a)\cos(b) + \cos(a)\sin(b)$ and yields

$$w(t) = A_0 + \sum_{n = 1, 2, 3, \ldots} a_n \cos(2\pi n f t) + b_n \sin(2\pi n f t).$$ (A.7)
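The step from the amplitude and phase form to the cosine and sine form can be spelled out term by term. The following worked lines use only the addition formula and the symbols of theorem 37; they are a check of the bookkeeping, not an additional result.

```latex
\begin{aligned}
A_n \sin(2\pi n f t + Ph_n)
  &= A_n \sin(Ph_n)\,\cos(2\pi n f t) + A_n \cos(Ph_n)\,\sin(2\pi n f t),\\
a_n &= A_n \sin(Ph_n), \qquad b_n = A_n \cos(Ph_n), \qquad a_n^2 + b_n^2 = A_n^2 .
\end{aligned}
```

Since arcsin takes its values in $[-\pi/2, \pi/2]$, the phase $Ph_n = \arcsin(a_n/A_n)$ satisfies both relations when $b_n \geq 0$; for $b_n < 0$ it is the other solution $\pi - \arcsin(a_n/A_n)$ of $\sin(Ph_n) = a_n/A_n$ that gives $b_n = A_n \cos(Ph_n)$.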
Remark 26 It is well known [307, Thm. 6.7.2] that the function sequences $(\sin(2\pi n f t))_{n = 1, 2, 3, \ldots}$, $(\cos(2\pi n f t))_{n = 0, 1, 2, \ldots}$ form an orthogonal basis of the pre-Hilbert space¹⁰ $C^0[-P/2, P/2]$ of the continuous functions on $[-P/2, P/2]$ for the 2-norm (see appendix 1.1.2), where $f = 1/P$. This follows in particular from the trigonometric orthogonality relations of the defining scalar product $(f, g) = \int_{-P/2}^{P/2} f(t) g(t)\, dt$ of functions $f, g \in C^0[-P/2, P/2]$, i.e.,

$$(\sin(2\pi n f t), \cos(2\pi m f t)) = (\sin(2\pi n f t), \sin(2\pi m f t)) = (\cos(2\pi n f t), \cos(2\pi m f t)) = 0$$ (A.8)

for $n \neq m$. There is an infinity of such orthogonal bases for $C^0[-P/2, P/2]$, and mathematically, nothing distinguishes the sinoidal basis chosen by Fourier from the other orthogonal bases. Moreover, sinoidal functions are anything but elementary. Mathematically, they are very complex, as is evident from Euler's identity $\cos(x) + i \sin(x) = e^{ix}$. A justification for using sinoidal waves lies in the fact that simple mechanical differential equations, such as the spring equation $m \ddot{x} = -k x$, have sinoidal functions as their solutions. But this is a physical argument which must be coupled with a dynamical system of this equational type in order to give these functions any preference.

¹⁰ A normed real vector space whose norm is defined by a positive definite symmetric bilinear form.
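The orthogonality relations (A.8) are easy to check numerically. The sketch below approximates the scalar product by a midpoint rule; the period value and the helper names are arbitrary assumptions made for the illustration.

```python
import math


def inner_product(g, h, period, samples=20000):
    """Midpoint-rule estimate of the integral of g(t) * h(t) over [-P/2, P/2]."""
    dt = period / samples
    total = 0.0
    for k in range(samples):
        t = -period / 2.0 + (k + 0.5) * dt
        total += g(t) * h(t)
    return total * dt


P = 0.01       # hypothetical period in seconds
f = 1.0 / P    # corresponding frequency


def sin_n(n):
    return lambda t: math.sin(2.0 * math.pi * n * f * t)


def cos_n(n):
    return lambda t: math.cos(2.0 * math.pi * n * f * t)


print(inner_product(sin_n(2), sin_n(3), P))  # distinct indices
print(inner_product(sin_n(2), cos_n(3), P))  # sine against cosine
print(inner_product(sin_n(2), sin_n(2), P))  # same function with itself
```

The first two values vanish up to rounding, while the last one equals $P/2$: the basis is orthogonal but not normalized.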
In order to meet the requirement for a unit amplitude, the coefficients of the Fourier representation can be dilated by a common factor, and we are done with the periodic wave. A common generalization of the Fourier representation (A.7) is defined if the frequency and coefficients are also functions of time: $f = f(t)$, $A_n = A_n(t)$, $Ph_n = Ph_n(t)$, a situation which is also needed to represent sounds of physical instruments with glissandi, crescendi, and their natural damping effects.

This construction is the poietic perspective. The esthesic one deals with the problem of constructing an envelope $H$, a periodic wave $w$ and its Fourier representation (A.7) for a given sound function $p$. As was already mentioned above, the wave and the envelope cannot be reconstructed unambiguously in general. Even if the wave is known, the envelope is not reconstructible, although a number of obvious candidates can be calculated, e.g., a polygonal envelope defined by the local maxima and minima of the enveloped wave. As to the wave, one candidate can be guessed by the analysis of a time window $[t_1, t_2]$ of $p$ within the supporting duration, such that the local maxima of $p$ are relatively constant (neither at the initial, nor at the decay phase of the sound). One can then take a multiple of the period as a time window, and calculate the Fourier representation of this time window, which is interpreted as a finite interval of a truly periodic function, i.e., the prolongation of this window to infinity. Although this period will not be the fundamental period of the wave, the Fourier representation will yield the right coefficients modulo a multiple of the fundamental frequency. If the fundamental period is small relative to the total duration of the sound, there is a chance to calculate the underlying wave. In general, this is a highly ambiguous situation. Once the wave is reconstructed, the Fourier coefficients are uniquely determined by the Fourier theorem, and we are done.
But this is only the last phase of a highly ambiguous situation. It is of course always possible to find an underlying wave: it suffices to take the total duration of the support and to set it to the wave's period, i.e., to prolong the sound to infinity by adding copies of itself to the left and to the right of the sound support. For the relation of these reconstructions to what is heard, see appendix B below.

Remark 27 A final remark on the terminology of sound frequencies in music. Usually, when we deal with "the frequency of a sound", we do not mean that this frequency is a neutral property of the sound (although it could happen that the sound really has a fundamental frequency), but the fundamental frequency of a wave that is used in the standard representation of the sound as a product of its (periodic) wave and a deformed envelope. This is a poietic definition, and this is what we will use because a neutral definition does not exist for general sounds. In this setup, we have the sound function $p = \mathrm{Support}(A, e, d)(w_{A., Ph.}(f) \cdot H)$. Here, we may assume that $w_{A., Ph.}$ is the formal representation of the trigonometric sum by the amplitude and phase spectra, such that the total amplitude maximum is 1; the frequency is given as an additional argument, and the envelope $H$ is given in its standard support. We call the 4-vector $(e, f, d, A)$ the geometric coordinates of the sound, and the pair $(w_{A., Ph.}, H)$ the color coordinates since they are responsible for the sound color (timbre) in this representation. For other poieses of sound which may also use the frequency coordinate (for example those to be discussed in the following sections), this one would also be referred to, but there is no neutral access to this concept.
A word of caution: The way humans "detect" sound frequencies is not neutral; it is an esthesic psycho-physiological system whose function is far from understood. So never mix up neutral facts with poietic or esthesic facts when dealing with sound attributes!
A.1.2.2
Frequency Modulation
In this section, we have a similar decomposition as discussed before: the sound p is written as a
product p = H.w, where H is the envelope, and w is a "wave" function. However, this time we do not use the additive combination of sinoidal functions of Fourier synthesis to build w. The combination is rather a functional concatenation of such functions, i.e., sinoidal functions have in their arguments other sinoidal functions, and so on. This synthesis operator was introduced by John Chowning [86] and implemented first in the legendary Yamaha's DX7 synthesizers. The formal definition of frequency modulation (FM) functions in terms of circular denotators is given in section 6.7, example 3, and yields this type of expressions: n
FMsound(myFMObjed)(t) = LAi sin(27rFit + Phi
+ FMsound(myModulatori) (t))
i
where $\mathit{myModulator}_i$ is the FM-Object factor of the limit type denotator $\mathit{Knot}_i$. In the terminology of FM synthesis, the (respective) interior functions $\mathrm{FMsound}(\mathit{myModulator}_i)(t)$ are called the modulators with respect to the (respective) exterior sinusoidal functions $A_i \sin(2\pi F_i t + Ph_i)$, which are named the carriers. We symbolize this relation by an arrow $\mathit{myModulator}_i \Rightarrow \mathit{myFMObject}$. So the FM functions start as a sum of carriers where modulators are inserted, and these modulators are again of this nature, etc., until no modulator appears and the recursion terminates. In FM synthesis it is also allowed to have circular denotators in the sense that a modulator can be a carrier in one and the same function! Observe that the existence of a denotator $\mathit{myFMObject}$ describing such a "self-modulating" function does not imply the existence of the corresponding function $\mathrm{FMsound}(\mathit{myFMObject})$. However, in digital sound synthesis, one often resolves this problem by taking the time argument of the modulator one digital step before the time argument of the carrier, i.e.,
$$\mathrm{FMsound}(\mathit{myFMObject})(t_n) = \sum_{i=1}^{n} A_i \sin\bigl(2\pi F_i t_n + Ph_i + \mathrm{FMsound}(\mathit{myModulator}_i)(t_{n-1})\bigr).$$
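The two formulas above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the denotator formalism of section 6.7: the class names `Knot` and `FMObject`, the function names, and the convention that the delayed modulator value is 0 before the first sample are all hypothetical choices made for this sketch.

```python
import math
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Knot:
    """One carrier A_i * sin(2*pi*F_i*t + Ph_i + modulator(t)) (hypothetical name)."""
    amplitude: float                         # A_i
    frequency: float                         # F_i in Hz
    phase: float = 0.0                       # Ph_i
    modulator: Optional["FMObject"] = None   # None terminates the recursion


@dataclass
class FMObject:
    """A sum of carriers, each possibly carrying its own modulator."""
    knots: List[Knot] = field(default_factory=list)


def fm_sound(obj: Optional[FMObject], t: float) -> float:
    """Evaluate a non-circular FM function at time t (seconds) by recursion."""
    if obj is None:
        return 0.0
    return sum(
        k.amplitude
        * math.sin(2 * math.pi * k.frequency * t + k.phase + fm_sound(k.modulator, t))
        for k in obj.knots
    )


def self_modulating(amplitude, frequency, phase, sample_rate, n_samples):
    """Single carrier that modulates itself, with the modulator taken one
    digital step earlier (assumed to be 0 before the first sample)."""
    samples = []
    previous = 0.0
    for n in range(n_samples):
        t_n = n / sample_rate
        current = amplitude * math.sin(2 * math.pi * frequency * t_n + phase + previous)
        samples.append(current)
        previous = current
    return samples
```

For instance, `fm_sound(FMObject([Knot(1.0, 440.0, 0.0, FMObject([Knot(2.0, 110.0)]))]), t)` evaluates one carrier at 440 Hz with a single modulator at 110 Hz; the circular case cannot be written as a finite `FMObject` tree, which is why it is handled by the separate sample-by-sample loop.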
In the DX7 implementation, the FM recursion scheme is presented in the graphical block diagram format of a so-called algorithm (not an adequate wording, though); figure A.3 shows Yamaha's 32 algorithms. Each sinusoidal carrier component is written as a block, whereas the modulators of a given carrier are those blocks which are above the carrier and are connected to it by a line. Each of the building blocks can be specified in its respective parameters. The point of FM synthesis is that it needs only a small number of sinusoidal functions (in fact, only six sinusoidal oscillators are needed for the DX7 algorithms) to simulate complex instrumental sounds, which is a theoretical and technological advantage. But the elegant FM synthesis also has drawbacks concerning the uniqueness question: How many different denotators $\mathit{myFMObject}$ yield the same function $\mathrm{FMsound}(\mathit{myFMObject})$? A general answer seems difficult; we have two partial solutions. The first regards a tower of modulators:
Lemma 54 Let $G : \mathbb{R} \to \mathbb{R}$ be a bounded $C^1$ function. Then $G$ is determined by its value $G(0)$ and by any function $F : \mathbb{R} \to \mathbb{R}$ of the shape $F(t) = a + b\sin(ct + G(t))$, with $b \neq 0$.
A.l. PHYSICAL SPACES
Figure A.3: Yamaha's 32 algorithms for FM synthesis. Only six sinusoidal oscillators are required to generate a variety of more or less natural sounds.
Proof. Evidently, by the boundedness of $G$, $a + b = \max(F)$, $a - b = \min(F)$, so $a = (\max(F) + \min(F))/2$, $b = (\max(F) - \min(F))/2$ are determined by $F$. Moreover, the continuous function
$$G' = \frac{F'}{\sqrt{b^2 - (F - a)^2}} - c$$
is a function of $F$, and the value $G(0)$, together with the integral of $G'$, which is a function of $F$, completely determine $G$, QED.
Proposition 61 If $M = (M_n \Rightarrow M_{n-1} \Rightarrow M_{n-2} \Rightarrow \dots \Rightarrow M_0)$ is an FM denotator defined by a sequence of relative modulators for the functions $M_i = A_i \sin(2\pi f_i t + ?)$, then the resulting sound function $\mathrm{FMsound}(M)$ determines all the modulators.
Proof. This follows from lemma 54 by recursion on the modulators, starting with the total function $\mathrm{FMsound}(M)(t) = A_0 \sin(2\pi f_0 t + \mathrm{FMsound}(M^1)(t))$, where $M^1$ is the denotator starting from $M_1$ instead of $M_0$. In the lemma, take $F = \mathrm{FMsound}(M)$, $G = \mathrm{FMsound}(M^1)$, and observe that $G(0) = 0$, QED.
The second solution regards a flat sequence of unrelated carriers:
Proposition 62 Let $F(t) = \sum_{i=1,\dots,k} a_i \sin(b_i t)$ with $0 < b_1 < \dots < b_k$, $0 < a_i$ be the function associated with a flat FM denotator. Then the coefficients, and therefore the denotator, are all uniquely determined by $F$.
Proof. We have $(-1)^n F^{(2n+1)}(0) = \sum_{i=1,\dots,k} a_i b_i^{2n+1}$ for all $n \geq 1$. Consider the function $H(a., b.)(x) = \sum_{i=1,\dots,k} a_i b_i^{x}$ of the real variable $x$. Suppose we have two sequences $(a_i)_{i=1,\dots,k}, (b_i)_{i=1,\dots,k}$ and $(a'_i)_{i=1,\dots,k}, (b'_i)_{i=1,\dots,k}$ such that they yield the same function $F$. Then we have $H(a., b.)(x) = H(a'., b'.)(x)$ for all $x = 2n + 1$, $n \geq 1$. Write $b_i = e^{\beta_i}$, $b'_i = e^{\beta'_i}$, and suppose $b'_k < b_k$. Then
$$\frac{H(a., b.)(x)}{H(a'., b'.)(x)} = \frac{\sum_{i=1,\dots,k} a_i e^{\beta_i x}}{\sum_{i=1,\dots,k} a'_i e^{\beta'_i x}} = \frac{\sum_{i=1,\dots,k} a_i e^{(\beta_i - \beta_k)x}}{\sum_{i=1,\dots,k} a'_i e^{(\beta'_i - \beta_k)x}} = \frac{a_k + \sum_{i=1,\dots,k-1} a_i e^{(\beta_i - \beta_k)x}}{\sum_{i=1,\dots,k} a'_i e^{(\beta'_i - \beta_k)x}}.$$
But for $x \to \infty$, the denominator goes to 0 whereas the numerator goes to $a_k$, contradicting the fact that this quotient is 1 for all $x = 2n + 1$, $n \geq 1$. By symmetry, $b_k < b'_k$ is excluded as well. Therefore $b_k = b'_k$, so in the above quotient, the limit for $x \to \infty$ is $a_k/a'_k$, which is also 1, and we have equal coefficients for the index $k$. Therefore, we may proceed by induction to $k - 1$ and we are done, QED.
These very special results show that given the tower or the flat FM schemes, the functions determine their coefficients (the background denotators) uniquely, but for general FM schemes, there is no such result. Moreover, if the FM scheme is not known, we have no idea of how the scheme should be determined from the function. This is a drawback compared to the Fourier operator, where the coefficients are always uniquely determined once the fundamental frequency is fixed. In other words, FM synthesis is much more efficient than Fourier synthesis, but one has to pay for this when turning to the respective analyses. The switch between Fourier and FM operators is essentially managed by Bessel functions. These are defined directly from a core situation from FM synthesis, i.e.,
Definition 111 Let $z$ be a real number. Then the Fourier expansion of the $2\pi$-periodic function $\sin(z\sin(t))$ of $t$ is
$$\sin(z\sin(t)) = 2 \sum_{n=0,1,2,\dots} J_{2n+1}(z)\sin((2n+1)t), \quad (A.9)$$
whereas the Fourier expansion of the $2\pi$-periodic function $\cos(z\sin(t))$ of $t$ is
$$\cos(z\sin(t)) = J_0(z) + 2 \sum_{n=1,2,3,\dots} J_{2n}(z)\cos(2nt). \quad (A.10)$$
The functions $J_m$, $m = 0, 1, 2, \dots$ are called the $m$th Bessel functions.
The above definition relies on the Fourier representation of the respective functions and on their properties as odd or even functions. An alternative definition of Bessel functions is $J_m(z) = \frac{1}{\pi}\int_0^{\pi} \cos(mt - z\sin(t))\,dt$. From definition 111, one obtains the following fundamental equations for Bessel functions:
$$\sin(r + z\sin(s)) = \sum_{n=-\infty}^{\infty} J_n(z)\sin(r + ns),$$
$$\cos(r + z\sin(s)) = \sum_{n=-\infty}^{\infty} J_n(z)\cos(r + ns).$$
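The first fundamental equation can be checked numerically. The following sketch uses only the integral formula for $J_m(z)$ quoted above, evaluated with a midpoint rule; the function names, the number of integration steps, and the truncation of the infinite sum at $|n| \leq 30$ are assumptions made for this illustration.

```python
import math


def bessel_j(m: int, z: float, steps: int = 2000) -> float:
    """J_m(z) = (1/pi) * integral over [0, pi] of cos(m*t - z*sin(t)) dt,
    approximated by the midpoint rule (integer m of either sign)."""
    h = math.pi / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * h
        total += math.cos(m * t - z * math.sin(t))
    return total * h / math.pi


def fm_by_sidebands(r: float, z: float, s: float, n_max: int = 30) -> float:
    """Truncated right-hand side: sum over n of J_n(z) * sin(r + n*s)."""
    return sum(
        bessel_j(n, z) * math.sin(r + n * s) for n in range(-n_max, n_max + 1)
    )


def fm_direct(r: float, z: float, s: float) -> float:
    """Left-hand side: sin(r + z*sin(s))."""
    return math.sin(r + z * math.sin(s))
```

For moderate modulation indices $z$, the Bessel coefficients decay very quickly in $|n|$, so the truncated sum and the direct evaluation should agree to many digits; for large $z$, `n_max` would have to grow accordingly.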
In particular, with $r = 2\pi f t$, $z = I$, $s = 2\pi g t$, we have
$$\sin(2\pi f t + I\sin(2\pi g t)) = \sum_{n=-\infty}^{\infty} J_n(I)\sin(2\pi(f + n \cdot g)t),$$
which is a Fourier type linearization; however, it is not a proper Fourier representation since the so-called "nth side band" frequencies $f + n \cdot g$ are in general not multiples of a fundamental frequency. This is rather a reduction of an FM concatenation to a flat FM configuration. Conversely, every (finite) Fourier decomposition is evidently a flat FM configuration.
A.1.2.3
Wavelets
Although the FM operator is much more efficient than the Fourier operator, it is still an operator which produces functions with an infinite support, a property which no real sound shares, and we have in fact added an envelope to cope with this requirement for Fourier and FM operators. From this point of view, wavelets are fundamentally better suited for handling finite sound objects without any envelope casting. Refer to [308] and [279] for the wavelet theory and its applications to sound and music. Let $f$ be a square integrable function (element of $L^2(\mathbb{R})$, i.e., $\int_{\mathbb{R}} |f(x)|^2\,dx < \infty$). Then its Fourier transform $\hat{f}$ is defined and is the function (A.11).
Figure A.4: Two wavelets: Murenzi's Mexican hat [387], $(1 - t^2)e^{-t^2/2}$ (left), and Morlet's wavelet (imaginary part) [192], $\sin(2\pi t)e^{-t^2/2}$, deduced from the sinusoidal function (right).
Definition 112 A square integrable function $\psi$ is called a wavelet if $0 < c_\psi = \int_{\mathbb{R}} \frac{|\hat{\psi}(\omega)|^2}{|\omega|}\,d\omega < \infty$. For a wavelet $\psi$, the wavelet transform of a square integrable function $f$ is the function $L_\psi f$ (of two variables $a, b$) (A.12) with $a \in \mathbb{R} - \{0\}$, $b \in \mathbb{R}$.
Figure A.4 shows two typical examples of wavelets. The wavelet transform is a function of two variables defined on every couple $(a, b) \in (\mathbb{R} - \{0\}) \times \mathbb{R}$. The point of this representation is that it is a kind of system of coefficients $(L_\psi f(a, b))_{(a,b) \in (\mathbb{R} - \{0\}) \times \mathbb{R}}$ which is parametrized by two real numbers, corresponding to a scalar product (A.13) of the function $f$ with an affinely deformed version$^{11}$ $\mathrm{Support}(|a|^{-1/2}, b, a)(\psi)$ of the "mother" wavelet $\psi$. This deformation $\mathrm{Support}(|a|^{-1/2}, b, a)$ is an isometry on the square integrable functions, see also figure A.5 for some deformed wavelets. By the following formula, the wavelet transformations $L_\psi f(a, b)$ redetermine the original function $f$:
$$f(t) = c_\psi^{-1/2} \int_{\mathbb{R}^2} L_\psi f(a, b)\,\mathrm{Support}(|a|^{-1/2}, b, a)(\psi)(t)\,\frac{da\,db}{a^2}. \quad (A.14)$$
Figure A.5: Various affine deformations $\psi((?-b)/a)$ of the mother wavelet $\psi$.
For selected wavelets, it is possible to generate an orthonormal basis of $L^2(\mathbb{R})$ which is defined as a so-called frame. For $a_0 > 1$, $b_0 > 0$, and for the Meyer wavelet$^{12}$ $\psi$ we consider the frame $(\mathrm{Support}(a_0^{-m/2}, n b_0 a_0^{m}, a_0^{m})(\psi))_{m,n \in \mathbb{Z}}$ of deformed versions of $\psi$. Then this is an orthonormal basis of $L^2(\mathbb{R})$ for $a_0 = 2$, $b_0 = 1$. This situation is analogous to the one encountered for Fourier series and their sinusoidal bases with the known trigonometric orthogonality relations in formulas (A.8). For a comparison of Fourier and wavelet analysis, see [560].
$^{11}$ Observe that the operator $\mathrm{Support}(|a|^{-1/2}, b, a)$ defines a linear action $\mathrm{Support} : GL(\mathbb{R}) \to GL(L^2(\mathbb{R})) : e^b a \mapsto \mathrm{Support}(|a|^{-1/2}, b, a)$ of the affine group $GL(\mathbb{R})$ on the space $L^2(\mathbb{R})$ of square integrable functions.
$^{12}$ See [308, 2.1.25].
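The isometry property of the affine deformation can be illustrated numerically. The sketch below assumes that $\mathrm{Support}(|a|^{-1/2}, b, a)(\psi)$ is the function $t \mapsto |a|^{-1/2}\psi((t - b)/a)$, as the labels of figure A.5 suggest; the function names, the integration interval, and the step count are hypothetical choices for this illustration.

```python
import math


def mexican_hat(t: float) -> float:
    """Mexican hat wavelet (1 - t^2) * exp(-t^2 / 2), as in figure A.4."""
    return (1.0 - t * t) * math.exp(-t * t / 2.0)


def deform(psi, a: float, b: float):
    """Assumed form of Support(|a|^(-1/2), b, a)(psi):
    t -> |a|^(-1/2) * psi((t - b) / a), with a != 0."""
    scale = abs(a) ** -0.5
    return lambda t: scale * psi((t - b) / a)


def l2_norm_squared(f, lo: float = -60.0, hi: float = 60.0, steps: int = 60000) -> float:
    """Midpoint-rule approximation of the integral of f(t)^2 over [lo, hi]."""
    h = (hi - lo) / steps
    return sum(f(lo + (k + 0.5) * h) ** 2 for k in range(steps)) * h
```

The factor $|a|^{-1/2}$ exactly compensates the change of variable $t \mapsto (t - b)/a$ in the integral of the squared function, so `l2_norm_squared(deform(mexican_hat, a, b))` should reproduce `l2_norm_squared(mexican_hat)` up to numerical error, as long as the deformed wavelet still fits into the integration interval.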
A.1.2.4
Some Remarks on Physical Modeling
The previous operators acted directly on the production of a sound function. Their poietic nature was a mathematical one: to construct a time function $p(t)$ by a mathematical procedure from a certain type of "atomic", i.e., basis functions. In contrast, physical modeling is one step more poietic in that it does not deal directly with sound, but with a physical system that produces sound. On the one hand, this is a strong restriction since sound does not care about the physical system that evokes it. It seems, however, that this drawback is compensated by the fact that musical expressivity is largely determined by the physical device which the artist manipulates when interpreting or improvising music. It is also an important argument that the simulation of the physical instrument, followed by a possible canonical extension, could yield more interesting sounds than just "abstract nonsense" procedures. For a general survey on physical modeling, see [460, chapter 7]; we restrict our discussion of this extensive topic to some systematic remarks. The idea is that one considers a physical model of a sound production device and then implements this model as software which, if sufficient calculation power is available, calculates the physical output on the level of the sound wave that is emitted by the modeled instrument. At present, there are three methodologies for such a modeling: mass-spring, modal synthesis, and waveguide. The mass-spring paradigm models a physical instrument (a string, a drum) by a finite spatial configuration of point masses that are related by springs and damping effects [225]. The modeling is built upon the classical mechanics of Newton's law and the corresponding dynamic behavior that eventually terminates in the air's vibration. The modal synthesis paradigm [77] reduces the vibrating physical system to a system of vibrating substructures, usually very small in number compared with the mass-spring components.
These substructures are characterized by their frequencies, damping coefficients, and parameters for the shape of the vibrating mode. This adds up to a sum of modal vibrations. Whereas in simple configurations these data can be obtained from the classical literature on equations for vibrating systems, the data of complex configurations must be extracted from experimental results. A prototypical implementation of this paradigm is MOSAIC, developed by Jean-Marie Adrien and Joseph Morrison, see [460, p. 276 ff.]. The waveguide paradigm has been implemented in commercial physical modeling synthesizers by YAMAHA and KORG, see [460, p. 282 ff.]. It has mainly been developed by Julius O. Smith III and collaborators [494, 495]. The waveguide model implements the traveling wave along a medium, such as a tube or a string. See [496] for an update of physical modeling strategies. Although physical modeling is a successful approach in the simulation of musical instrumental sounds, it is a step back from the neutral sound objects to their generators. This has not only been a technological requirement for performance theory (where physical modeling is a core approach), it is also a consequence of the failure in the understanding of the topological semantics of sound objects as exposed in section 12.3. In particular, it is an open problem to relate the physical modeling theory and technology to the other operators, such as Fourier, FM, and wavelets. The deeper question here is whether the neutral sound objects are really the most relevant ingredients of musical performance, i.e., how strongly the gestural components influence and characterize the sounding reality. It is not clear how deeply a sound conveys the generating gestures in its autonomous structure. To our knowledge, sound classification has not been directed towards a gestural coordinate in the neutral sound description, except, perhaps, in the straightforward envelope component.
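To make the mass-spring paradigm mentioned above concrete, here is a minimal sketch of a plucked string modeled as a chain of point masses coupled by springs, with viscous damping and fixed ends. It is not taken from [225]; all parameter values, the semi-implicit Euler integrator, and the function name are assumptions chosen for illustration only.

```python
def simulate_mass_spring_string(n_masses=20, stiffness=1.0e4, mass=1.0e-3,
                                damping=1.0e-3, dt=1.0e-5, n_steps=2000,
                                pluck_index=5, pickup_index=10):
    """Return the displacement of one mass over time (the 'sound' sample list).

    Newton's law for mass i: m * x_i'' = k * (x_{i-1} - 2*x_i + x_{i+1}) - c * x_i',
    with both ends of the chain attached to fixed walls (displacement 0).
    """
    x = [0.0] * n_masses          # displacements
    v = [0.0] * n_masses          # velocities
    x[pluck_index] = 1.0e-3       # initial pluck: displace a single mass
    output = []
    for _ in range(n_steps):
        acc = []
        for i in range(n_masses):
            left = x[i - 1] if i > 0 else 0.0
            right = x[i + 1] if i < n_masses - 1 else 0.0
            force = stiffness * (left - 2.0 * x[i] + right) - damping * v[i]
            acc.append(force / mass)
        # Semi-implicit Euler: update velocities first, then positions.
        for i in range(n_masses):
            v[i] += acc[i] * dt
            x[i] += v[i] * dt
        output.append(x[pickup_index])
    return output
```

The returned list plays the role of the sampled time function $p(t)$; a modal synthesis version of the same string would instead sum a small number of damped sinusoids, one per vibrating mode.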
A.2
Mathematical and Symbolic Spaces
The physical description of sounds is not what can be used for music theories. This is based on (1) the way humans perceive sounds, (2) the shape of music thinking, and (3) available instrumental technologies. Therefore physical parameter spaces must be transformed into spaces which essentially encode the same information, but do so in a way which is more adapted to music. Basically, we shall present mathematical structures where physical parameters are represented. Based upon these spaces, we shall explain a derived set of representations which encode different music-topographic aspects. Our discussion regards the geometric coordinate pairs (basis coordinate plus associated pianola coordinate) onset and duration (A.2.1), amplitude and crescendo (A.2.2), and frequency and glissando (A.2.3).
A.2.1
Onset and Duration
In music, one speaks about tempo, metronome, quavers, semiquavers, triplets, 3/4 meter, etc. The relation to the physical time parameters is as follows. To begin with, the physical onset time $e$ sec and duration $d$ sec are opposed to a musical onset time $E$ note and duration $D$ note, usually also in real values, and in units such as "note", meaning a whole note. However, in many practical contexts, the rational number field $\mathbb{Q}$ will do. In this latter context, we have the ratios of integer numbers $E = w/n$, $D = z/n$, $n > 0$, and the denominator is of the form $n = 2^r 3^s 5^t 7^u$ with natural exponents $r, s, t, u$, whereas the numerators $w, z$ are integers, i.e., we are working in the ring localization$^{13}$ $\mathbb{Z}[1/2, 1/3, 1/5, 1/7]$ at the primes 2, 3, 5, 7. Musically, this means that one is allowed to add, subtract, and multiply such numbers at will without leaving the domain, and also division of such numbers by 2, 3, 5, 7 leaves the domain invariant, i.e., the construction of duplets, triplets, quintuplets, and septuplets is possible without any restriction. Musical time has a mental reality and should not be confused with physical time. Genetically, musical time is an abstraction from physical time, but they are by no means equivalent. This abstraction is also a creation of an autonomous time quality where mental constructions such as a score can be positioned. The relation between these two time qualities is defined by the tempo, usually encoded as a metronomic indication of $x$ quarters per minute according to Mälzel, meaning a difference quotient (velocity) $\Delta E / \Delta e$ of musical time per physical time. This presupposes that we are given a one-to-one performance mapping $E \mapsto e(E)$ from musical time to physical time. It is common in mathematical music theory to write physical parameters in lower case letters, whereas musical parameters are written in upper case letters.
$^{13}$ Make the integers 2, 3, 5, 7 invertible, i.e., admitting fractions 1/2, 1/3, 1/5, 1/7. But see appendix E.4.1 for a formal definition.
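The membership condition for the localization and the simplest possible performance mapping can be sketched as follows. The function names are hypothetical, and the constant-tempo mapping is only the most elementary case of the mapping $E \mapsto e(E)$ described above (musical time in whole notes, tempo in quarters per minute).

```python
from fractions import Fraction


def in_localization(value: Fraction, primes=(2, 3, 5, 7)) -> bool:
    """True if the reduced denominator of value has no prime factors
    other than the given primes, i.e. value lies in Z[1/2, 1/3, 1/5, 1/7]."""
    d = value.denominator
    for p in primes:
        while d % p == 0:
            d //= p
    return d == 1


def physical_onset(E: Fraction, quarters_per_minute: float, e0: float = 0.0) -> float:
    """Constant-tempo performance mapping E -> e(E) in seconds.
    E is measured in whole notes; e0 is the physical onset of E = 0."""
    seconds_per_whole_note = 4 * 60.0 / quarters_per_minute
    return e0 + float(E) * seconds_per_whole_note
```

For example, a triplet quaver `Fraction(1, 12)` lies in the domain, whereas `Fraction(1, 11)` does not; at 60 quarters per minute, the onset `Fraction(1, 4)` is mapped to one second after `e0`.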
Let $x, y$ be positive integers. Then every onset $E$ can be written uniquely as $E = J + T \cdot x/y$, where $0 \leq J < x/y$ is in $\mathbb{Q}$, and $T \in \mathbb{Z}$. If $J = q/(ny)$, we say that in $x/y$ time$^{14}$, $E$ is on the $(q + 1)$st $ny$-tuplet in bar $T + 1$. The additive group $\mathbb{Z} \cdot 1/y$ is called the meter of the $x/y$ time. Evidently, this initializes a score at time $E = 0$, but this is pure convention. Take care not to view the symbol $x/y$ as a plain fraction, but as a pair $x, y$ giving rise to a fraction. In other words, in music notation, the symbol $x/y$ is the mathematical fraction $x/y$ plus the meter.
Example 61 Let $x = 3$, $y = 4$, $E = 15.375$. Then we have $E = 3/8 + 20 \cdot 3/4$, i.e., $E$ is on the fourth quaver of bar 21.
If we work in $\mathbb{Z}[1/2, 1/3, 1/5, 1/7]$, then with $E$ and $1/y$, the remainder $J$ and the meter are automatically in this domain. If we are given a duration $D$, then durations of form $D/n$ are also called $n$-tuplets (with respect to $D$). The reason why the concept of onset time is not common in music lies in the fact that a note's onset can be deduced from its position in the score. The time signature, bar-lines, and simultaneous or preceding notes help establish the time context of each note. This helps calculate onsets algorithmically by recursion from the first score onset. This algorithm is derived from the linear syntagm of written language, but it sometimes leads to ambiguities.
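The decomposition $E = J + T \cdot x/y$ and the data of example 61 can be reproduced with exact rational arithmetic. In this sketch the function names are hypothetical, and taking $q/n$ as the fraction $J \cdot y$ in lowest terms (so that $n$ is the smallest possible subdivision) is an assumption that the text does not state explicitly.

```python
from fractions import Fraction


def decompose_onset(E: Fraction, x: int, y: int):
    """Write E = J + T * x/y with 0 <= J < x/y and T an integer."""
    bar_length = Fraction(x, y)
    T = E // bar_length          # floor division of Fractions yields an int
    J = E - T * bar_length
    return J, T


def tuplet_position(J: Fraction, y: int):
    """Return (q, n) with J = q / (n * y), q/n in lowest terms (assumed)."""
    ratio = J * y
    return ratio.numerator, ratio.denominator


# Example 61: x = 3, y = 4, E = 15.375
J, T = decompose_onset(Fraction("15.375"), 3, 4)
q, n = tuplet_position(J, 4)
bar = T + 1          # bar number
position = q + 1     # position of the onset among the (n*y)-tuplets of the bar
```

With these values, `J` is 3/8 and `T` is 20, so the onset falls on the fourth 8-tuplet (quaver) of bar 21, in agreement with the example.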
A.2.2
Amplitude and Crescendo
The amplitude $A$ of a sound relates to the loudness sensation, i.e., to musical dynamics. However, the chain of transformations to the physiologically relevant measures is quite complex [462]. The first member of this chain is the transformation which associates $A$ with the sound pressure level (SPL)
$$l(A) = 20 \cdot \log_{10}(A/A_{threshold})\ \mathrm{dB}. \quad (A.15)$$
Here, $A_{threshold} = 2 \cdot 10^{-5}\ \mathrm{N\,m^{-2}}$ is close to the amplitude of the threshold pressure variation for a 1 kHz sound in a young, normal hearing human subject. The unit dB for $l$ is the decibel. Of course, these constants are pure convention. From the point of view of auditory physiology, the shape $l(A) = a\ln(A) + b$ is essential, i.e., $l(A)$ is a conventionally normed linear function of the logarithm of the amplitude. The physio-psychological motivation for this approach lies in the Weber-Fechner law according to which the sensation of a difference of sensory stimuli is proportional to the stimuli [242], yielding the logarithmic representation as an adequate encoding of this sensorial modality. Figure A.6 shows some environmental SPL values (the musical units ppp, pp, etc. will be discussed below). For example, the SPL $\approx 120$ dB of an air jet corresponds to a pressure amplitude one million times the threshold amplitude. Just as duration is the time interval from the onset time to the "offset" time of a sound event, the crescendo parameter $c$ is the difference between the onset SPL and the offset SPL. In normal piano sounds, this vanishes, but for violins, trombones, etc., this is a relevant quantity. In this context, one supposes that the sound representation $p = \mathrm{Support}(A, e, d)(w_{A_\cdot, Ph_\cdot}(f) \cdot H)$ has a time-dependent amplitude $A = A(t)$. At the offset time $e + d$, the amplitude has changed by the factor $A(e + d)/A(e)$, or in terms of loudness, $c = l(A(e + d)) - l(A(e))$. As with onset and duration, it is not specified what happens in the sound process between the onset and the offset; it is just the difference that matters. This difference $c$ is termed the (physical) crescendo.
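Formula (A.15) and the definition of the physical crescendo translate directly into code. The sketch below is only an illustration; the constant and function names are hypothetical, and amplitudes are assumed to be given in $\mathrm{N\,m^{-2}}$.

```python
import math

A_THRESHOLD = 2.0e-5  # threshold pressure amplitude in N/m^2, as in (A.15)


def spl(amplitude: float) -> float:
    """Sound pressure level l(A) = 20 * log10(A / A_threshold) in dB."""
    return 20.0 * math.log10(amplitude / A_THRESHOLD)


def crescendo(amplitude_at_onset: float, amplitude_at_offset: float) -> float:
    """Physical crescendo c = l(A(e + d)) - l(A(e)) in dB."""
    return spl(amplitude_at_offset) - spl(amplitude_at_onset)
```

An amplitude of one million times the threshold gives `spl(1.0e6 * A_THRESHOLD)`, i.e. 120 dB up to rounding, and doubling the amplitude between onset and offset gives a crescendo of about 6 dB, independently of the absolute level.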
$^{14}$ We stick to the continental terminology since "meter" will be reserved for a different concept.
Figure A.6: Some environmental sound pressure level values. (Scale $L$ (dB) from $-12$ to 130, with marks for the hearing threshold at 3 kHz, sleep, conversational speech, traffic, air jet, the limit of damage risk, and the threshold of pain, together with the musical dynamics signs.)
In the musical abstraction $L$ for loudness, one also works in the field $\mathbb{R}$ of real numbers, but the common score notation is not well defined for all real values. More precisely, the first, still physically motivated, codification is a conventional calibration such as, for example, $l : \mathbb{Z} \to \mathbb{R}$ with $l(L) = 10\ \mathrm{dB} \cdot L + 60\ \mathrm{dB}$ such that the values (with $0 \mapsto mf$) are arranged symmetrically around the mezzoforte sign. This looks like an identification of the score symbols with precise physical values. But this is wrong: it is a transformation from mental to physical reality. In fact, the affine transformation $l(L)$ is only a "default" assignment of a mental quantity symbolized by a dynamic symbol (which is codified by an integer). This fact is more evident if one recalls the velocity parameter in the MIDI code. This quantity is an integer in the interval [0, 127], but the physical meaning of a velocity value depends on the assignment by specific technological calibrations, in particular by the output chain where the loudspeakers can take values that have nothing to do with the velocities. Units on the mental level of loudness are not standard, but we could, for example, take vel for MIDI's velocity parameter. The extension of integer values to real or rational values (depending on the specific usage) is a reasonable procedure, for instance suggested by the MIDI velocity codification. It is also useful for finer loudness data management with relative loudness signs such as "crescendo" and "diminuendo", which make sense in a mental crescendo parameter which we shall denote by $C$. See also our discussion of performance transformations and primavista weights in section 39.2.
A.2.3
Frequency and Glissando
Sounds which share a frequency parameter $f$ are musically important, at least in the European tradition. However, the corresponding mathematics is a bit more involved than for time and amplitude. This is an expression of an intense interaction of traditional harmony and interval theories with an evolving instrumental technology. To begin with, $f$ behaves like amplitude: For reasons of auditory physiology (see also Appendix B), $f$ is transformed via the logarithmic formula $h(f) = u\ln(f) + v$, yielding the physical pitch $h$ of frequency $f$. The common unit of pitch is the Cent Ct; it corresponds to the logarithm of a relative frequency increase by the factor $2^{1/1200}$, i.e., one percent of a well-tempered semitone (see below for tuning types); this entails $u = 1200/\ln(2)$, but the constants $u, v$ are purely conventional. Presently, and in the Western framework, the relevant frequencies have the shape $f = 132\ \mathrm{Hz} \cdot 2^p \cdot 3^s \cdot 5^r$, where $p, s, r \in \mathbb{Q}$. It is based on the chamber pitch 440 Hz of the one-line a, as fixed in London 1939. Instead, we have chosen the (unlined) c with the frequency $132\ \mathrm{Hz} = 440\ \mathrm{Hz} \cdot 2^{-1} \cdot 3 \cdot 5^{-1}$ as a starting frequency in order to relate the examples with ease to c. Mathematically, the restriction to the first three primes 2, 3, 5 is not essential. One could as well take any sequence $p_1, p_2, \dots$ of mutually prime natural numbers (larger than 1), the rational powers of which are multiplied by 132 Hz. The natural logarithm of $f$ is
$$\ln(f) = p \cdot \ln(2) + s \cdot \ln(3) + r \cdot \ln(5) + \ln(132).$$
But music theory is rather interested in the relative pitch, i.e.,
$$\ln(f) - \ln(132) = p \cdot \ln(2) + s \cdot \ln(3) + r \cdot \ln(5).$$
Moreover, the passage to another logarithmic basis $b$ could be desirable, i.e., $\log_b(f) = \ln(f)/\ln(b)$, so that we are given the linear function $X \mapsto X/\ln(b)$ for $X = \ln(f)$. For music theory, the restriction to rational exponents is essential, since this hypothesis enables an unambiguous representation, see Appendix E.2.1. Every frequency $f$ can be replaced by a point $x = (p, s, r) \in \mathbb{Q}^3$. Such a point represents the frequency $f(x) = 132\ \mathrm{Hz} \cdot 2^p \cdot 3^s \cdot 5^r$, and from $f(x) = f(y)$, we conclude $x = y$. With this interpretation, $\mathbb{Q}^3$ is called the Euler space and a point $x = (p, s, r) \in \mathbb{Q}^3$ is called an Euler point. This means that a real number $p \cdot \log_b(2) + s \cdot \log_b(3) + r \cdot \log_b(5)$ is viewed as a vector which is a rational linear combination of linearly independent vectors $\log_b(2), \log_b(3), \log_b(5)$. The choice of the three first primes stems from the tradition of just tuning, where for two frequencies $f, g$, we have:
(O) $f/g = 2/1$: $f$ is the octave (frequency) for $g$.
(Q) $f/g = 3/2$: $f$ is the (just) fifth (frequency) for $g$.
(T) $f/g = 5/4$: $f$ is the (just) major third (frequency) for $g$.
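The correspondence between Euler points and frequencies, and the Cent measure of the intervals (O), (Q), (T), can be sketched as follows. The function names are hypothetical; coordinates may be integers or `Fraction` values, and the floating point evaluation is of course only an approximation of the exact rational exponents.

```python
import math
from fractions import Fraction

C_FREQUENCY = 132.0  # Hz, the reference frequency of the (unlined) c


def euler_frequency(x) -> float:
    """Frequency f(x) = 132 Hz * 2^p * 3^s * 5^r of the Euler point x = (p, s, r)."""
    p, s, r = x
    return C_FREQUENCY * 2.0 ** float(p) * 3.0 ** float(s) * 5.0 ** float(r)


def cents(f: float, g: float) -> float:
    """Pitch difference of f above g in Ct: 1200 * log2(f / g)."""
    return 1200.0 * math.log2(f / g)


just_fifth = euler_frequency((-1, 1, 0))        # ratio 3/2 above c
just_major_third = euler_frequency((-2, 0, 1))  # ratio 5/4 above c
chamber_pitch = euler_frequency((1, -1, 1))     # 132 * 2 * 5 / 3
tempered_semitone = euler_frequency((Fraction(1, 12), 0, 0))
```

Note that the just fifth 3/2 and the just major third 5/4 are the Euler points $(-1, 1, 0)$ and $(-2, 0, 1)$, since the ratios 3 and 5 have to be reduced by octaves.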
This is why 2 is associated with the octave, 3 with the fifth, and 5 with the major third. We therefore call $p$ the octave coordinate, $s$ the fifth coordinate, and $r$ the third coordinate (of the sound with frequency $f$ or of the corresponding Euler point). The Euler point $o = (1, 0, 0)$ is called the octave point, the point $q = (0, 1, 0)$ is called the fifth point, and $t = (0, 0, 1)$ is called the third point. Observe that the fifth and third points are not the fifth and the third themselves: they represent the frequency ratios 3 and 5, whereas the just fifth 3/2 and the just major third 5/4 are the points $(-1, 1, 0)$ and $(-2, 0, 1)$. The structural meaning of these points is explained in section 6.4.1. The minor just third frequency 6/5 does not add a new prime number. For a fixed chamber pitch, the 2-3-5 just tuning (short: just tuning) is the set of frequencies which are represented by integer coordinates, i.e., the subgroup $\mathbb{Z}^3 \subset \mathbb{Q}^3$ of the Euler space. This is the three-dimensional grid which was introduced by Leonhard Euler [143]. The group of those grid points whose third coordinate vanishes is the Pythagorean tuning. The Euler space is derived from the logarithmic representation of pitch, but the coefficients are beyond physical reality; this is why we view this space as a mental space. The same is valid if we consider the codomain space of pitch $h$, but interpreted with rational coefficients (written as $\mathbb{R}[\mathbb{Q}]$ instead of the usual real line $\mathbb{R}$, the one-dimensional real vector space$^{15}$). More precisely, one should name $H$ the pitch on the mental pitch space Pitch, and $h$ the pitch on the physical pitch space PhysPitch. Whereas the unit for physical pitch is Cent, the unit for mental pitch could be Semitone or MIDI's key(number), but no standard exists here. In performance, we have a transformation $\wp : \mathit{Pitch} \to \mathit{PhysPitch}$. Both spaces have the real numbers as underlying sets, but the meaning of the spaces is different. The mental pitch space encodes pitch as it is symbolized on the score or as a key number in MIDI code.
Ideally, performance transforms this abstract data into mathematical pitch $\mathit{Math}(H) = (p, s, r)$ in the Euler space, and this is transformed to the pitch $\ln(f) = p \cdot \ln(2) + s \cdot \ln(3) + r \cdot \ln(5) + \ln(132)$ in $\mathbb{R}[\mathbb{Q}]$. Only after forgetting about the coefficients are we in PhysPitch. And this is by no means a formal play: It is a dramatic change of reality if one views the reals as an infinite-dimensional space with linearly independent octave, fifth, and third logarithm vectors, or as a line, where everything shares the same direction! In analogy to onset and loudness, one introduces physical glissando $g$ and its symbolic counterpart $G$. In the common visualization of the Euler space, all grid cells are shown as cubes. This could suggest that angles and distances are relevant to this space. So far, however, there is no musicological reason for this, and it is nonsense to argue with angles and distances as Arthur von Oettingen [406], Carl Eitz [137], and Martin Vogel [547] have done. Evidently, the mathematical structure of just tuning is independent of the historical choice of the sequence 2, 3, 5. One could as well take any pairwise prime positive numbers $1 < p_1 < p_2 < p_3$ and would get a $p_1$-$p_2$-$p_3$ just tuning. If the pitch range in Euler space is described by non-integer coefficients, one speaks of tempered tunings. The most well known are defined by a uniform construction mode. For a natural number $w > 1$ one considers all pitches whose octave coordinate is a fraction of shape $p = x/w$, whereas the other coordinates vanish. This is called the $w$-tempered tuning. This defines a grid $\mathbb{Z} \cdot \frac{1}{w} \cdot o \cong \mathbb{Z}$ with step width $1/w$ of the octave point. By the same recipe, tempered tunings in fifth and third direction can be defined. The 12-tempered tuning is the famous "well-tempered" tuning. The 1200-tempered tuning is less interesting for conventional composers than for measurement techniques, where the unit step is the Ct step defined above.
$^{15}$ See Appendix E.2.1.
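As a small illustration of the $w$-tempered tunings, the following sketch computes the pitch of a grid point $(x/w, 0, 0)$ in Ct and compares the nearest tempered step with a just interval. The function names are hypothetical, and rounding to the nearest step is merely one plausible way of relating the two tunings.

```python
import math
from fractions import Fraction


def tempered_cents(x: int, w: int) -> Fraction:
    """Pitch in Ct of the w-tempered grid point (x/w, 0, 0) above the origin."""
    return Fraction(1200 * x, w)


def nearest_tempered_step(ratio: float, w: int) -> int:
    """Number x of steps of width 1/w octave closest to a frequency ratio."""
    return round(w * math.log2(ratio))


def deviation_cents(ratio: float, w: int) -> float:
    """Tempered pitch minus just pitch, in Ct, for the nearest w-tempered step."""
    x = nearest_tempered_step(ratio, w)
    return float(tempered_cents(x, w)) - 1200.0 * math.log2(ratio)
```

In the 12-tempered tuning, the just fifth 3/2 is approximated by 7 steps (700 Ct, slightly too low), whereas the just major third 5/4 is approximated by 4 steps (400 Ct, noticeably too high); in the 1200-tempered tuning, one step is exactly 1 Ct.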
More generally, one may define tunings which consist of tempered and just components. The procedure runs as follows. Take three positive integers $w_1, w_2, w_3$ and consider the grid
$$\mathbb{Z} \cdot \frac{1}{w_1} \cdot o + \mathbb{Z} \cdot \frac{1}{w_2} \cdot q + \mathbb{Z} \cdot \frac{1}{w_3} \cdot t$$
which specializes to the 2-3-5 just tuning as well as the tempered tunings as defined above. Call this construction the $w_1$-$w_2$-$w_3$ just-tempered tuning. Historically relevant is the mediante tuning, which is the 1-1-2 just-tempered tuning, and which includes the tempered whole-tone step in the major third. With respect to the auditory psycho-physiology (see Appendix B), we should consider the distribution of the just tuning grid vectors $x = (p, s, r)$ in $\mathbb{Z}^3$ with respect to the mathematical pitch $H(x) = p \cdot \log_b(2) + s \cdot \log_b(3) + r \cdot \log_b(5)$. More generally, take any vector $x \in \mathbb{R}^3$ and denote by $H_{prime} = (\log_b(2), \log_b(3), \log_b(5))$ the prime vector. This means that pitch is the usual scalar product
$$H(x) = (H_{prime}, x) \quad (A.16)$$
of the prime vector with the generalized Euler point $x$. Therefore,
Proposition 63 With the definition (A.16), two vectors $x, x' \in \mathbb{R}^3$ have the same pitch $H(x) = H(x')$ iff their difference is orthogonal to the prime vector $H_{prime}$.
Call $E = H_{prime}^{\perp}$ the plane orthogonal to the prime vector. Then the proposition means that for any generalized Euler point $x$, $x + E$ is the set of points with the same pitch as $x$. Now, according to what we know, $E$ lies so skew in $\mathbb{R}^3$ that for a point $x \in \mathbb{Q}^3$,
$$(x + E) \cap \mathbb{Q}^3 = \{x\}.$$
Nonetheless, every real number $\phi$ can be approached within any given error by points of the just tuning grid: For any positive bound $\delta$, there is $x \in \mathbb{Z}^3$ such that $|H(x) - \phi| < \delta$, see Appendix 73 for a proof. In particular, we have the following proposition which has dramatic consequences for theories of hearing of just tuned pitch (see section B.2):
Proposition 64 If $\phi = H(x_0)$ is a mathematical pitch of a grid point $x_0$, then for any positive bound $\delta$, there is an infinity of grid points $x$ such that $|H(x) - H(x_0)| < \delta$.
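The approximation statement can be explored by brute force. The sketch below takes the logarithmic basis $b = 2$, so that pitch is measured in octaves, and searches a bounded box of fifth and third coordinates; the function names and the search bound are assumptions, and the finite search of course only illustrates, and does not prove, propositions 63 and 64.

```python
import math

LOG2_3 = math.log2(3.0)
LOG2_5 = math.log2(5.0)


def pitch(x) -> float:
    """H(x) = p + s*log2(3) + r*log2(5), the scalar product with the
    prime vector for b = 2 (pitch in octaves)."""
    p, s, r = x
    return p + s * LOG2_3 + r * LOG2_5


def grid_points_near(phi: float, delta: float, bound: int = 12):
    """All grid points (p, s, r) with |s|, |r| <= bound and the best octave
    coordinate p such that |H(x) - phi| < delta."""
    hits = []
    for s in range(-bound, bound + 1):
        for r in range(-bound, bound + 1):
            p = round(phi - s * LOG2_3 - r * LOG2_5)
            x = (p, s, r)
            if abs(pitch(x) - phi) < delta:
                hits.append(x)
    return hits


near_origin = grid_points_near(0.0, 0.002)
```

With $\delta = 0.002$ octaves (2.4 Ct), the list `near_origin` contains, besides the origin, the grid point $(-15, 8, 1)$, i.e. the frequency ratio $3^8 \cdot 5 / 2^{15}$, which lies less than 2 Ct above the unison; enlarging `bound` and shrinking `delta` produces further such points.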
Appendix B
Auditory Physiology and Psychology
"Music listening" is a metonymy of understanding music: For all participants, the ear functions as an interface for perceiving music between physical, psychological, and mathematical reality. But a metonymy is not the matter as such. This is what deaf Beethoven teaches us impressively: his innermost ear was an organ of imagination that was uncoupled from the material ear. And the physiology of the hearing process teaches us that the neural coupling of the ear to the respective cortical regions is extremely complex and still hardly understood. Section B.1 gives an overview of auditory physiology. Beyond receptive processes, hearing also means an active shaping according to templates of esthesis and poiesis. One of the difficult basic problems concerning the activity of hearing deals with the compatibility of these templates and the physical input, and in particular the notorious "straightened out hearing" (German: "Zurechthören") which keeps alive a lot of wishful thinking in music theory. Basically, the problem is that we do not perceive physical sounds, but classes of indistinguishable sounds, which Werner Meyer-Eppler termed "valences" in [372]. Valence theory, which is sketched in section B.2, has dramatic consequences for the relation between mathematical theories and their semantic potential for music. Historically and materially, and in view of the genealogy of a mathematical theory, the subject pairing of "consonance-dissonance" is an excellent illustration of auditory physiology and psychology. We deal with the formal and some physiological aspects of this subject in part VII. In the following section B.3, we want to expose the stratification of the phenomenon of consonance and dissonance. Hereby, the problem setup as well as the approaches to its solution demonstrate a strong dependency on the addressed reality layer.
Methodologically, section B.3 is important since it makes evident that known approaches of the mathematical argumentation in musicology turn out to be too narrow with respect to the existing music, and too dogmatic and scientifically unfounded with respect to music thinking.
B.1
Physiology: From the Auricle to Heschl's Gyri
Two fundamentally different regions of sound processing in the auditory system can be distinguished. In the peripheral region, mechanical preprocessing takes place, especially in the fluid-filled inner ear. The sensory cells encode the preprocessed mechanical oscillations into electrical nerve action potentials. In the second region of the hearing system, neural processing of the sound information is conducted in ascending nuclei, which finally leads to auditory sensation. The hearing system, especially in mammals, has pushed its bandwidth up into a frequency range where the limits of neural processing in the time domain are exceeded by far. Instead of processing high frequencies in the time domain, evolution has developed the so-called frequency-place principle. In the inner ear, the frequency contents of sound signals are separated in the spatial domain. The sensitivity of our hearing system is remarkable: in its most sensitive region, the threshold of the hearing system is limited only by thermal noise.
B.1.1
Outer Ear
Sound energy is collected by the outer ear and transmitted through the outer ear canal to the ear drum (figure B.1).
Figure B.1: Schematic view of the outer, middle and inner ear, modified from [594].
The delicate structures of our hearing system are well protected inside the skull. For sound transmission, the outer ear canal acts like a pipe that is open at one end and closed by the ear drum at the other, with a length of about 20 to 30 mm. Its quarter-wave resonance is responsible for the high sensitivity of our hearing organ around 4 kHz, indicated by the dip of the threshold in quiet in this frequency range. This high sensitivity is, however, also the reason for the high susceptibility to noise-induced damage in the region around 4 kHz.
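The quarter-wave argument can be checked with a short calculation. The following is a minimal sketch, assuming an idealized rigid pipe closed at the ear drum and a speed of sound of 343 m/s (our assumption, not a value from the text); the function name is ours. For canal lengths of 20 to 30 mm it yields first resonances of roughly 2.9 to 4.3 kHz, in line with the sensitivity region around 4 kHz mentioned above.

```python
SPEED_OF_SOUND_M_PER_S = 343.0  # assumed: air at about 20 degrees Celsius


def quarter_wave_resonance_hz(length_m, speed=SPEED_OF_SOUND_M_PER_S):
    """First resonance of an ideal pipe open at one end and closed at the other."""
    return speed / (4.0 * length_m)


# Canal lengths quoted in the text: about 20 to 30 mm.
for length_mm in (20, 25, 30):
    f = quarter_wave_resonance_hz(length_mm / 1000.0)
    print(f"{length_mm} mm -> {f:.0f} Hz")
```

A real ear canal is neither rigid nor uniform, so these figures are only an order-of-magnitude check.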
B.1.2
Middle Ear
The fluids of the inner ear must be excited by the sound-induced vibrations of the air particles in front of the ear drum. The light but sturdy funnel-shaped ear drum (tympanic membrane) operates over a wide frequency range as a pressure receiver. It is firmly attached to the long arm of the hammer (malleus) (figure B.1). The motions of the ear drum are thus transmitted via the anvil (incus) to the stirrup (stapes). The stapes foot plate, together with a ring-shaped membrane called the oval window, forms the entrance to the inner ear. The middle ear optimizes the energy flow from air-borne sound in front of the ear drum to fluid motion in the inner ear by a mechanism called impedance transformation. One part of the impedance transformation is based on the lever ratio of about 1.5:1 produced by the different lengths of the arms of malleus and incus [72]. The lever ratio transforms oscillations of the ear drum with small forces into motions of the fluid with large forces. Energy conservation then implies that the tiny displacements of the ear drum are transformed into still smaller oscillations in the fluid. An even larger transformation of the pressure is due to the ratio of the area of the large ear drum to that of the small oval window. This ratio is about 17 [47]. Through the lever and area ratios, an almost perfect impedance match is reached in humans in the middle frequency range between 1 and 4 kHz. It optimizes the flow of energy into the inner ear, much of which would otherwise be reflected. The middle ear operates normally when it is filled with air at atmospheric pressure. The Eustachian tube, which connects the middle ear cavity to the upper throat, normally opens and closes periodically, thereby ensuring that the static pressure in the middle ear remains the same as atmospheric pressure. We experience a pressure difference when the Eustachian tube fails to open during ascent or descent in an elevator.
For an elevation of 8 m, the change in atmospheric pressure is about $100\ \mathrm{N\,m^{-2}}$, corresponding to a sound pressure level of about 130 dB relative to the $20\ \mu\mathrm{N\,m^{-2}}$ reference! This pressure causes a static deflection of the ear drum and increases the stiffness of the middle ear transmission system, so that sound transmission is attenuated.
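The numbers in this section can be combined in a few lines. The sketch below assumes an ideal, lossless transformer, so that the pressure gain is simply the product of the lever ratio and the area ratio; this idealization and all names are ours. It also converts the static pressure of 100 N/m² into a level relative to 20 µN/m², which comes out at about 134 dB, consistent with the rounded figure quoted above.

```python
import math

LEVER_RATIO = 1.5            # malleus/incus lever ratio quoted in the text
AREA_RATIO = 17.0            # ear drum area / oval window area quoted in the text
P_REF_PA = 20e-6             # reference sound pressure, 20 micro-N/m^2
STATIC_PRESSURE_PA = 100.0   # pressure change quoted for 8 m of elevation


def to_db(ratio):
    """Convert a pressure (amplitude) ratio to decibels."""
    return 20.0 * math.log10(ratio)


# Idealized pressure gain of the middle ear: lever gain times area gain.
pressure_gain = LEVER_RATIO * AREA_RATIO
pressure_gain_db = to_db(pressure_gain)

# Static pressure difference expressed as if it were a sound pressure level.
static_level_db = to_db(STATIC_PRESSURE_PA / P_REF_PA)

print(f"pressure gain: {pressure_gain:.1f} ({pressure_gain_db:.1f} dB)")
print(f"static pressure level: {static_level_db:.1f} dB")
```

The real middle ear is frequency dependent and lossy, so the computed gain should be read as an upper bound under these assumptions.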
B.1.3
Inner Ear (Cochlea)
The shape of the cochlea resembles that of a snail shell with two and one-half turns (in humans), hence its name (figure B.2). The central conical bony core of the cochlea is called the modiolus. The auditory nerve fibers run in this bone and exit the cochlea at its base. The outer wall of the modiolus forms the inner wall of a 30 mm long canal which spirals the full two and one-half turns around the central core. This canal is divided into three compartments called scalae. Scala tympani is separated from the other two scalae by the so-called cochlear partition, which is formed by a thin shelf of bone projecting from the modiolus (the osseous spiral lamina), which is connected by the basilar membrane and the spiral ligament to the outer wall of the cochlea. The sensory organ of hearing, the organ of Corti, is located on top of the basilar membrane. As can be seen in the cross section of the cochlear spiral, the cochlear scalae become smaller and smaller in cross-sectional area as the apex is approached. In contrast, the basilar membrane becomes progressively wider towards the apex (figure B.2). This is because the osseous spiral lamina is broadest at the cochlear base, where the basilar membrane is only about 0.16 mm wide (in humans); at the apex the basilar membrane has broadened to about 0.52 mm. Scala media and scala vestibuli are separated by a thin membrane, called Reissner's membrane (figure B.2). At the apical end of the cochlea, scala vestibuli and scala tympani are connected by an opening in the cochlear partition, called the helicotrema.
Figure B.2: Section through the human cochlea (left) and magnified view of the organ of Corti (right). IHC: inner hair cell, OHC: outer hair cell, BM: basilar membrane.
Scala tympani and scala vestibuli are filled with perilymph, which resembles other extracellular fluids in its chemical composition. Perilymph is characterized by a high sodium (Na+) concentration of about 140 to 150 mM and a low potassium (K+) content of only around 5 mM. Scala media is filled with endolymph, which is unlike any other extracellular fluid found in the body. In its chemical composition, it resembles intracellular fluids. Its predominant cation is potassium with a concentration of about 157 mM; sodium is very low (1.3 mM). In addition to its special chemical composition, the endolymphatic space exhibits a considerable positive electrical potential of about +80 mV relative to scala tympani and scala vestibuli, called the endocochlear potential. The chemical composition and the electrical potential of the endolymphatic space are sustained by active ion transport provided by the cellular layers of the stria vascularis. The organ of Corti lies just between the endolymphatic and perilymphatic spaces (figure B.2). Its surface is sealed by tight junctions to keep the fluids separate. The basilar membrane, on which the organ of Corti rests, is composed mainly of extracellular matrix material with embedded fibers. In contrast to the tight surface of the organ of Corti, the basilar membrane is thought to be permeable to perilymph.
In the organ of Corti, two types of sensory cells are embedded: one row of inner hair cells and three to four rows of outer hair cells. In humans, there are approximately 3,500 inner and 12,000 outer hair cells. The membrane potential of inner hair cells is about -40 mV, that of outer hair cells even as low as -70 mV. Both types display "hair bundles" or stereocilia, which project into the endolymphatic space. The stereocilia are arranged in several rows, which are graded in size. Stereocilia from different rows are connected by a fine filament, called a tip-link. The current theory of transduction assumes
that hair bundle deflection pulls on the tip-links, opening transducer channels which are close to the attachment points. This concept also explains why the hair bundle is only sensitive to mechanical stimulation in the direction of the tallest stereocilia. The transducer channel is a nonselective cation channel which is practically impermeable to anions. Therefore, the transduction current is mainly carried by potassium (K+) and calcium (Ca2+) cations, driven by the large electrical potential between the endolymphatic space and the receptor cells. The driving potential, the sum of the +80 mV endocochlear potential and the magnitude of the membrane potential, amounts to 120 mV for inner hair cells and to 150 mV for outer hair cells. Potassium leaves the sensory cells via potassium channels present in the basolateral cell membrane and diffuses into scala tympani. It is interesting to notice that the intracellular concentration of potassium is as high as that of endolymph. Potassium is driven into the cells by the electrical potential and, because of the concentration gradient, it can diffuse into scala tympani without requiring energy from the sensory cells. The inner and outer pillar cells, the phalangeal processes of the Deiters' cells and the cylindrical bodies of the outer hair cells build a complex, three-dimensional truss (see figure B.3). One peculiarity is that outer hair cells are not in contact with other cells along their lateral surface but are immersed in extracellular fluid.
Figure B.3: Scanning electron micrograph of the three-dimensional arrangement of the organ of Corti. IHC: inner hair cells, OHC: outer hair cells, DC: Deiters' cells [141].
Figure B.3 also shows the different morphology of the hair bundles: whereas the bundles of the inner hair cells are arranged in a straight line, those of the three rows of outer hair cells are W-shaped. The hair bundles of outer hair cells are excited by a shearing motion between the surface of the organ of Corti and the so-called tectorial membrane. The tectorial membrane is a gel-like structure composed of extracellular matrix material, and it is in direct contact with the longest row of outer hair cell stereocilia. In contrast, the hair bundles of the inner hair cells seem not to be in direct contact with the tectorial membrane. Their bundles are probably driven by fluid
forces. The transduction current flowing through the stereocilia is converted into a receptor potential in the cell body of the hair cell. At low frequencies, the receptor potential follows the stimulus cycle by cycle. Upon mechanical stimulation of the hair bundle in the excitatory direction, tip-links are stretched and the transduction channels open. As positive K+ ions are driven into the cell, the potential inside the cell becomes more positive. If the bundle is stimulated in the other direction, tip-links relax and transduction channels close. However, since only 20% of the transduction channels of inner hair cells are open at rest, the receptor potential is highly asymmetric when the cell is stimulated with a tone. About 10 to 30 afferent synapses are located on the basal pole of the inner hair cells. Upon depolarization of the cell membrane, voltage-sensitive Ca2+ channels located in the basal pole of the cell membrane open, and the increased Ca2+ level causes vesicles filled with transmitter to fuse with the lateral cell membrane and release transmitter into the synaptic cleft. This transmitter release triggers an action potential in the afferent nerve, and these electrical spikes transmitted to the brain finally lead to the hearing sensations. These mechanisms work well at low frequencies, where the events of neural processing can easily follow the sound stimulus. Because of its refractory period, a single auditory nerve fiber cannot respond to each successive cycle of a high-frequency sound. This problem is partially overcome by the fact that more than one nerve fiber contacts a single inner hair cell. Each fiber is incapable of responding to every cycle of the stimulus, but collectively, they can do so. The temporal structure of the sound is conserved, as each fiber responds in the depolarized state of the inner hair cell. We call this behavior "phase locking".
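The collective coverage of stimulus cycles by several fibers can be illustrated with a toy schedule. This is a deliberately idealized sketch: the refractory period of 1 ms is our assumption, fibers are taken to fire at a fixed phase without jitter, and the function name is hypothetical. It only shows how many fibers must take turns so that every cycle is marked by a spike.

```python
import math


def volley_schedule(tone_hz, refractory_s, n_cycles):
    """Assign each stimulus cycle to a fiber that has recovered from its last spike.

    Returns (fibers_needed, assignment), where assignment[i] is the index of
    the fiber firing on cycle i. Fibers take turns in round-robin order.
    """
    period = 1.0 / tone_hz
    # Smallest whole number of cycles a fiber must wait before it can fire again.
    cycles_between_spikes = max(1, math.ceil(refractory_s / period - 1e-9))
    assignment = [i % cycles_between_spikes for i in range(n_cycles)]
    return cycles_between_spikes, assignment


# Assumed refractory period of 1 ms, for illustration only.
for tone in (500.0, 2000.0):
    fibers, plan = volley_schedule(tone, 0.001, 6)
    print(f"{tone:.0f} Hz: {fibers} fiber(s), firing order {plan}")
```

Each individual fiber still fires at the same phase of the cycle, which is the sense in which the temporal structure is preserved.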
Even if the firing rate of a nerve fiber is too slow to follow the stimulus, basic features of its temporal structure, like the phase, are still coded in the neural pulse train. Still, the effect of phase locking is limited by several factors. The lateral membrane acts as a low-pass filter with a corner frequency of about 1 kHz, and above this frequency, the AC amplitude of the receptor potential decreases. Synaptic processes like vesicle release and the generation of the postsynaptic potential are also limited in speed and accuracy. This leads to a gradual loss of phase locking starting above 1 kHz, and above 3 kHz, phase locking is completely lost in humans. In the frequency region where the receptor potential can no longer follow the stimulus on a cycle-by-cycle basis, we see a depolarization of the inner hair cell's membrane potential during stimulation. This so-called DC component of the receptor potential follows the stimulus envelope. It arises from the asymmetry of the hair-bundle transduction: the depolarizing currents dominate upon sinusoidal stimulation, and the receptor potential, low-pass filtered by the cell membrane, causes a net depolarization of the cell membrane. All information about the sound's frequency is lost in this signal; therefore, a different way to code high frequencies had to be developed by evolution. This task was achieved by "sorting" sounds by frequency: sound signals are mechanically preprocessed in such a way that they are separated spatially. This concept is well known as the frequency-place principle, where high frequencies are located in the basal part of the cochlea only and low frequencies in the apical part. Different sets of auditory nerve fibers elicit different auditory sensations by virtue of their central connections. We will examine the mechanical frequency separation process in section B.1.4, and to do so, we will have to focus on the mechanical properties of the inner ear.
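The loss of the AC component of the receptor potential above about 1 kHz can be made quantitative under a simple assumption. The sketch below models the hair-cell membrane as a first-order low-pass filter; this model and the function name are our assumptions, the text only gives the limit of about 1 kHz. The DC component corresponds to frequency zero and passes unattenuated.

```python
import math

CORNER_HZ = 1000.0  # membrane limit of about 1 kHz quoted in the text


def ac_gain(frequency_hz, corner_hz=CORNER_HZ):
    """Magnitude response of a first-order low-pass filter (assumed model)."""
    return 1.0 / math.sqrt(1.0 + (frequency_hz / corner_hz) ** 2)


for f in (100.0, 1000.0, 3000.0, 10000.0):
    gain = ac_gain(f)
    print(f"{f:7.0f} Hz: AC gain {gain:.3f} ({20.0 * math.log10(gain):6.1f} dB)")
```

Under this assumption the AC component at 3 kHz is reduced to roughly a third of its low-frequency value, which fits the gradual rather than abrupt loss of phase locking described above.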
It is however not clear how specific excitations converge to yield well-defined pitch in general. There is increasing evidence that pitch is extracted in the time domain (periodicity analysis [286]). Moreover, pitch was found to be independent of the frequency-place mapping
of the components of complex tones [287].
B.1.4
Cochlear Hydrodynamics: The Travelling Wave
The cochlea consists of three fluid-filled scalae, but from a mechanical point of view, the elastic properties of the thin Reissner membrane can be neglected compared to the stiffness of the basilar membrane. We can therefore simplify our mechanical investigations to a fluid model with two chambers separated by the cochlear partition (compare figure B.4). The basilar membrane contributes a large part of the elastic properties of the partition. Its width increases from base to apex from about 0.16 mm to 0.52 mm (in humans). Its thickness, on the other hand, decreases along the cochlea. Thus, its stiffness decreases greatly along the length of the cochlea. At the basal end of the cochlea there are two openings to the cochlear ducts, one on each side of the cochlear partition, that are covered by membranes. One is called the oval window and, as we already mentioned, it is in contact with the stapes foot plate. The other, the round window, is just below the oval window. Because we can regard the cochlear fluids as incompressible, the round window has to move out of phase when the oval window is driven by stapes motion. If we consider very slow stapes motions, the fluid is pushed along the entire length of scala vestibuli, through the helicotrema and back along scala tympani. Thus, the helicotrema provides a shunt for extremely low frequencies. When the stapes moves into the cochlea, pressure builds up in the fluid, which deflects the cochlear partition. Pushing fluid requires overcoming inertial forces generated by the fluid mass. The elastic properties of the basilar membrane in combination with the mass of the surrounding fluid constitute second-order resonators. As the stiffness of the basilar membrane is very high at the basal end of the cochlea and much lower at its apical extreme, the resonant frequency of the cochlear partition decreases monotonically from base to apex.
Inversely, the time constant of each resonator increases from base to apex. However, the basilar membrane is not under tension, and it does not respond like a series of independent resonators such as the strings of a harp. Instead, each part of the cochlear partition is coupled to the next by the cochlear fluids, and due to the large inherent friction, the resonators are highly damped. For a periodic motion of the stapes, the cochlear partition is first set into motion at the basal extreme, where the mechanical time constant is smallest. Because the stiffness is very high there, the deflection of the basilar membrane is fairly small. The deflection propagates in the form of a wave in the apical direction. As the time constants of the partition increase and the stiffness decreases, the response is more and more delayed but its amplitude grows. At the location of resonance, the wave reaches its maximum and loses its energy, and its amplitude drops very rapidly. The location of "cochlear resonance", the place where the displacement (and therefore the excitation) of the cochlear partition reaches its maximum, depends on the stimulus frequency. Low frequencies travel along the basilar membrane and reach a maximum close to the cochlear apex, whereas high-frequency sounds exhibit their maximum response close to the cochlear base and then fade out. The exact form of the vibration response of the cochlear partition was investigated by Georg von Békésy in human cadaver ears [47], a feat which earned him the Nobel Prize. He found that the deformation of the basilar membrane is a traveling wave. The wave starts at the cochlear base, where the basilar membrane is stiffest. It propagates toward the apex with a time delay that depends upon its own mechanical properties and the properties of the surrounding
fluid. Its vibration amplitude increases until it reaches a maximum close to the location of cochlear resonance, and from then on, the wave diminishes rapidly. Because the stiffness gradient of the basilar membrane is approximately logarithmic, the peaks of the excitation patterns of sinusoidal tones are located on the basilar membrane with a logarithmic frequency spacing. Figure B.4 schematically illustrates traveling waves elicited by a stimulus composed of three frequencies, together with the envelope of the peak displacement. The peaks of the three waves are clearly separated along the cochlea; however, there is also considerable overlap between the waves, and recordings from the auditory nerve have been found to be much more frequency selective, especially close to threshold, than the mechanical responses observed in cadaver ears. Therefore, a second mechanism is required to boost the frequency selectivity of the vibration responses.
Figure B.4: Schematic illustration of the traveling waves elicited by three pure tones with frequencies of 500 Hz, 2 kHz and 5 kHz. The displacement of the basilar membrane is shown at the instant $T_0$ (solid line) and a quarter of a cycle later (dashed line). The dotted line indicates the envelope of the wave.
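The logarithmic frequency spacing can be made concrete with a toy frequency-place map. The sketch below assumes a purely logarithmic map over the 30 mm canal length given in section B.1.3, with end-point frequencies of 20 kHz at the base and 20 Hz at the apex; these end points and the function name are our assumptions, chosen only for illustration. The absolute positions should therefore not be read as physiological data; the point is that every octave occupies the same distance along the membrane.

```python
import math

COCHLEA_LENGTH_MM = 30.0   # canal length quoted in section B.1.3
F_BASE_HZ = 20000.0        # assumed resonance at the basal end
F_APEX_HZ = 20.0           # assumed resonance at the apical end


def place_mm(frequency_hz):
    """Distance from the base at which the assumed log map places a frequency."""
    octaves_total = math.log2(F_BASE_HZ / F_APEX_HZ)
    octaves_below_base = math.log2(F_BASE_HZ / frequency_hz)
    return COCHLEA_LENGTH_MM * octaves_below_base / octaves_total


# The three pure tones of figure B.4, ordered from base to apex.
for f in (5000.0, 2000.0, 500.0):
    print(f"{f:6.0f} Hz -> {place_mm(f):5.1f} mm from the base")
```

With this map the three tones of figure B.4 appear in the expected order, the highest frequency nearest to the base.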
B.1.5
Active Amplification of the Traveling Wave Motion
We have so far neglected the function of the outer hair cells, the second group of receptor cells within the organ of Corti. Although outer hair cells are about three to four times more numerous than inner hair cells, only 5% of the afferent fibers innervate outer hair cells. These so-called type II fibers are highly branched, and each fiber innervates dozens of outer hair cells. Little is known about these fibers because they are small and unmyelinated, which makes it difficult to record their activity. They are expected to be less sharply tuned, since they innervate a broad region of the cochlea, and, because of their lack of myelination, their conduction velocity is likely to be very slow. Most of the frequency-selective and time-critical auditory information must therefore be carried by the afferent fibers originating from the inner hair cells. Whereas inner hair cells are not directly innervated by the efferent system, myelinated fibers from the medial to the medial
superior olivary nuclei make direct synaptic contact with the outer hair cells. Outer hair cells thus seem to be under neural control, much more like muscles than sensory cells. A final indication of their "active" role was established when their ability to change their length upon electrical stimulation was detected. Modern research assumes that the outer hair cells indeed provide an active, mechanical amplification of the cochlear traveling wave. As neural processing is by far too slow to keep up with high-frequency hearing, current concepts assume that amplification relies on a local mechanical feedback process [107]: outer hair cells sense motions of the cochlear partition by converting the shearing motion between the surface of the organ of Corti and the tectorial membrane into an electrical receptor potential. Upon depolarization, the outer hair cell reacts with a contractile force, which is fed back into the motion of the basilar membrane. Theoretical calculations derived from measurements in the inner ear require that energy be pumped into the vibration of the cochlear partition in a region that starts before the traveling wave reaches its maximum and extends up to the place of cochlear resonance. The function of this amplification process is still unclear in its details, but the observations of active cochlear mechanics show that it has far-reaching consequences. Measurements in the basal part of the cochlea indicate that the amplification boosts cochlear sensitivity up to a thousand-fold. The amplification is limited to a narrow frequency range, covering only about half an octave. The active traveling-wave response becomes very sharp in the region of cochlear resonance. The amplification of the vibration response is required to achieve the extraordinary sensitivity of the hearing system. The second hallmark of amplification is non-linearity: the amplification process boosts only weak sounds and saturates at increasing levels.
This non-linearity greatly compresses the dynamic range of the mechanical responses. This is important because the dynamic range of the inner hair cell receptor potential is limited to a range certainly not exceeding 60 dB. The non-linearity also has unwanted side effects: if two sinusoidal tones are presented, the non-linearities of the hearing organ generate so-called distortion products, additional tones which we perceive under certain circumstances. In general, however, by virtue of its construction, artifacts due to the non-linearities of the inner ear are surprisingly small. Figure B.5 shows the excitation pattern expected on the human basilar membrane when stimulated with a 3 kHz sinusoidal tone. For high sound levels (i.e., 100 dB), the feedback amplifier is saturated and the traveling wave is almost purely passive. It is highly damped, and its envelope shows the characteristic shallow increase from base to apex and a sharp decay after the maximum is reached (footnote 1). The threshold of the auditory nerve is somewhere between a minimal basilar-membrane velocity of $50\ \mu\mathrm{m\,s^{-1}}$ and a displacement of 1 nm; exact values are still unknown. The broad traveling wave at high levels indicates that a large number of nerve fibers are stimulated, especially in the basal part of the cochlea. For faint sounds, the traveling-wave response becomes sharper and sharper, and its envelope is almost symmetrical for levels below 40 dB. The location of maximum amplitude for a sinusoidal tone at low levels is called its characteristic place. Only nerve fibers originating from a very narrow region around the characteristic place of the cochlea are stimulated. Note that for increasing levels, the maximum of the traveling wave shifts considerably in the basal direction. If we analyze the level dependence of the responses at various locations of the cochlea, we clearly see the effects of the non-linearities of the amplification. At the characteristic place (about 16 mm in figure B.5), the velocity amplitude increases from a value of $42\ \mu\mathrm{m\,s^{-1}}$ at 0 dB to $5\ \mathrm{mm\,s^{-1}}$ at 100 dB. Without amplification, the amplitude of the traveling-wave response would be expected to drop by a factor of $10^5$, that is, from $5\ \mathrm{mm\,s^{-1}}$ to $50\ \mathrm{nm\,s^{-1}}$, from 100 dB to 0 dB! The amplification therefore is almost a factor of 1000 or 60 dB. In addition to enabling the detection of weak signals, the amplification therefore also compresses the dynamic range, again by almost 60 dB. The inner hair cells at the characteristic place have to cope only with a stimulus ratio of a little more than 40 dB to cover a 100 dB level change of the sound stimulus.
[Footnote 1: The data in figure B.5 (dashed lines) indicate that further towards the apex, the wave does not die out completely in this experiment. It is still under debate whether this remaining response is also present in the intact human cochlea.]
Figure B.5: Reconstruction of the excitation pattern in the human cochlea for a 3 kHz sinusoidal tone (horizontal axis: place x in mm; vertical axis: velocity v in µm/s). Original measurements were recorded in a chinchilla cochlea at a distance of 3.5 mm from its most basal extreme [392]. The characteristic frequency of this location was 9.5 kHz. Data has been converted assuming a frequency-place map of 8 mm/octave to illustrate the excitation pattern in a human cochlea. The shaded area indicates excitation below neural threshold, which is expected between $50\ \mu\mathrm{m\,s^{-1}}$ and 1 nm.
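The decibel arithmetic behind these statements can be retraced in a few lines. The sketch uses only the velocities quoted in the text for the characteristic place; the variable names are ours, and the "passive" response is the linear extrapolation from the 100 dB value described above.

```python
import math

V_0DB = 42e-6            # m/s, velocity at the characteristic place at 0 dB
V_100DB = 5e-3           # m/s, velocity at the characteristic place at 100 dB
LEVEL_CHANGE_DB = 100.0  # change of the stimulus level


def db(ratio):
    """Convert an amplitude ratio to decibels."""
    return 20.0 * math.log10(ratio)


# Purely passive (linear) response extrapolated from 100 dB down to 0 dB.
passive_v_0db = V_100DB / 10 ** (LEVEL_CHANGE_DB / 20.0)

# Gain of the active process at 0 dB relative to the passive extrapolation.
amplification = V_0DB / passive_v_0db

# Range of velocities the inner hair cells actually have to cope with.
response_range_db = db(V_100DB / V_0DB)

print(f"passive velocity at 0 dB: {passive_v_0db * 1e9:.0f} nm/s")
print(f"amplification: {amplification:.0f}x ({db(amplification):.1f} dB)")
print(f"response range: {response_range_db:.1f} dB")
print(f"compression: {LEVEL_CHANGE_DB - response_range_db:.1f} dB")
```

The amplification and the compression come out as the same number of decibels, which is why the text can quote "almost 60 dB" for both.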
B.1.6
Neural Processing
The difficulty of understanding the neural processing of sound stems largely from the extremely complex innervation from the auditory nerve to the primary auditory cortex (Heschl's gyri) in the temporal lobe (see figure B.6). We have to stress that the image in this figure is considerably simplified, and that in particular, there are also connections from the cochlea to the ipsilateral auditory cortex, as well as efferent nerves from the auditory cortex down to the cochlea. The auditory system includes at least five "relay stations", whence it is clear that any particular functional decomposition (like Fourier's) will not be transferred unchanged to the auditory cortex. For example, the tonotopy of the latter is a multiply distributed one, see figure B.7.
Figure B.6: (With kind permission of the Hallwag-Verlag) This simplified image shows the six relevant relay stations of the auditory path from the cochlea to the Heschl gyri: (1) nervus cochlearis, (2) nucleus cochlearis, (3) nuclei superiores olivae, (4) colliculus inferior, (5) corpus geniculatum mediale, (6) gyri Heschl.
By tonotopy, one understands the spatial distribution of excitation patterns according to specific pitch. More recent research also demonstrates that outside the auditory cortex, i.e., in the limbic system (the hippocampal formation, to be precise, which plays an important role for emotional and memory tasks), one finds a refined processing of pitch information [336, 337, 570, 571]. The "template fitting model" of Julius Goldstein [187] shows how far we are from understanding neural pitch processing. In this model, the mathematical principle of a neural "central pitch detector" is proposed, by which the fundamental frequency of a periodic wave can be extracted if its Fourier components are known. The central pitch detector is, however, charged with solving local minima problems for functions in two variables, a rather heroic task for a small neural population. It seems hopeless to identify within the neural network the physical realization of a dynamic system that solves these differential conditions. We presently have no chance to verify the model physiologically, since ethics excludes adequate experiments in humans. In view of these facts, it is both logically erroneous and experimentally very delicate to infer the higher sound processing from the superficial auditory physiology of the ears. Ethically, too, such an attempt is problematic, since one runs the risk of "justifying" the discrimination of "degenerate music" against the acceptance of so-called "commonly accepted" music. It is also not clear how much such investigations reveal grown and trained configurations instead of biological inheritance.
Figure B.7: (With kind permission of the Thieme-Verlag) Left: tonotopy in the spiral of the cochlea: high pitches are thick points, low pitches are thin points. Right: Corresponding multiple tonotopy in the auditory cortex of the cat.
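To give an impression of what extracting a fundamental from known Fourier components involves, here is a crude harmonic-template search. It is not Goldstein's model, which is a statistical estimator; it is a one-variable stand-in of our own, with a hypothetical function name, search range and grid step. Even this simplified version has to cope with many equally good minima, since every subharmonic of the fundamental fits the partials just as well.

```python
def estimate_f0(partials_hz, f_min=50.0, f_max=500.0, step=0.5):
    """Grid-search the fundamental whose harmonic series best fits the partials."""
    n_steps = int(round((f_max - f_min) / step))
    candidates = [f_min + k * step for k in range(n_steps + 1)]

    def misfit(f0):
        # Sum of squared relative deviations from the nearest harmonic of f0.
        total = 0.0
        for p in partials_hz:
            harmonic = max(1, round(p / f0))
            total += ((p - harmonic * f0) / p) ** 2
        return total

    errors = [(misfit(f0), f0) for f0 in candidates]
    best = min(e for e, _ in errors)
    # Subharmonics of the true fundamental reach the same minimum, so we take
    # the highest candidate that attains the best misfit.
    return max(f0 for e, f0 in errors if e <= best + 1e-12)


# Partials 3, 4 and 5 of a 200 Hz tone whose fundamental itself is missing.
print(estimate_f0([600.0, 800.0, 1000.0]))
```

The tie-breaking rule in the last line is an ad hoc choice; a neural implementation would have to resolve the same ambiguity without an exhaustive search.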
B.2
Discriminating Tones: Werner Meyer-Eppler's Valence Theory
From our everyday experience and from specific experiments it follows that we do not really hear single tone events or chords, motives or rhythms as they have been parametrized on the level of physical or mathematical description. In fact, despite the very sensitive physiology of Corti's organ and its hair cells, we cannot distinguish all physically or mathematically distinct sound objects. For instance, sounds with a frequency above 20,000 Hz or a level below 0 dB are indistinguishable since you cannot hear them. Likewise, two sounds $p, q$ which differ only by a phase shift $\Delta$, i.e., $p(t + \Delta) = q(t)$, are indistinguishable. More important is that even under ideal conditions we cannot distinguish sounds with arbitrary precision. Every singer or violinist who has to adapt his/her pitch to a prescribed context knows this. Investigations on variations of instrumental intonation show a remarkable bandwidth [57]. The same is valid for listening to time values, loudness degrees and instrumental colors. It is mandatory but not easy to take these phenomena into account. Werner Meyer-Eppler [372, 373] has attempted to solve the problem by use of the concept of a "valence". According to this approach,
Definition 113 In a specific context, given two sound objects $s_1, s_2$, $s_1$ is metamere to $s_2$ with regard to a given predicate $P$ (short: $P$-metamere, in symbols: $s_1 \sim_P s_2$) iff $s_1$ cannot be distinguished from $s_2$ with respect to $P$ by human listeners. The set $\sim_P s$ of sound objects which are $P$-metamere to a given sound $s$ is called the valence of $s$. Since usually the relation $\sim_P$ is symmetric, one calls two sound objects $s_1, s_2$ metamere if $s_1 \sim_P s_2$. If a sound object $s$ is defined by a sequence $P_1, \ldots, P_k$ of predicates, one defines its valence as being the sequence of valences $\sim_{P_1}, \ldots, \sim_{P_k}$, and we may then reduce the total valence to those components which do not include all sound objects since these predicates are
not relevant to the distinction of sound objects. The union $\bigcup_{\text{supporting valences}} \sim_{P_{i_j}}$ of supporting valences is in fact the valence of the conjunction predicate $\bigwedge_i P_i$. Predicates which are relevant in this sense are called valence supporting by Meyer-Eppler. In practice, if we are given a sound whose sound color is described by partials with frequencies up to the limit frequency $f_a$ Hz, and which lasts $d$ sec, Meyer-Eppler deduces a maximal number of numerical predicates (dimensions) that are relevant to the valence, i.e., of valence supporting numerical predicates. This limit is called the maximal structure content, and its value is $K = 2 \cdot d \cdot f_a$. It is, however, an open fundamental question of sound color theory which and how many valence supporting predicates must be chosen in order to yield a differentiated perception of sound colors. Probably, these valences define a multiply connected topological space in the physical parameter space, but see also section 12.3. Although Meyer-Eppler's conceptualization is plausible, it hides two delicate problems. The first concerns the context in which valences take place, as related to the predicate in question. If the context's specification is neglected, the valence concept loses its meaning. Let us give two representative examples concerning pitch and onset. To begin with, we have to agree on who is accepted as a listener. In the sense of a statistical approach, Meyer-Eppler proposes that two pitches should be called metamere if at least 90% of the test subjects cannot distinguish them [373]. Moreover, one has to agree on the parameters (and parameter relations) of the test sounds. The result will depend essentially on the choice of instruments (flute, brass, etc.) and on conditions upon the duration and the onset distances of test sounds.
For example, the simultaneous presentation of sounds of several seconds duration will yield smaller valences because of beat effects, as compared to a comparison of non-overlapping sound events. Besides the parametric conditions, the context can also depend on the chosen music. Let us discuss this on the onset parameter, for example. In [373], the duration valence density is indicated by 50 to 60 per second. This means that sounds which are less than 1/50 to 1/60 sec apart are perceived as being simultaneous. However, the musical context of such a claim is relevant. Within a very slow piece with a small number of instruments, the temporal variation of 1/8 sec will scarcely be noticed, whereas in a rhythmically very dense and fast piece, 1/60 sec is known to define a quite coarse grid. The second problem is important for the theoretical significance of the valence concept. It relates to the fact that metamery is not an equivalence relation² in general. More precisely, any of the wanted properties (reflexivity, symmetry, and transitivity) can be violated. Reflexivity can be violated if the two presentations of a sound are separated so far in time, by several hours, say, that human memory fails to recognize one and the same sound. Symmetry is violated if the order of appearance of sound objects is relevant; for example, a very loud sound, immediately followed by a very soft one, can mask the latter's properties. The most dramatic failure is the absence of transitivity: $s \sim_P t$, $t \sim_P u$ does not always imply $s \sim_P u$. For example, if three pitches are such that the first is perceived as being equal to the second, and the second as being equal to the third, this does not entail perceived equality of the first and third pitch! Therefore, pitch valences are not equivalence classes, they may overlap. This means that the attempt to define pitch by an esthesic position in music psychology must fail.
The perceptional concept of pitch is a non-transitive relation among tones, and therefore is not an attribute of tones. You hear that two tones have the same pitch, but you do not hear the pitch. (² See appendix C.2.) Therefore corresponding
attempts such as [394] must fail. This has important consequences for musical practice and for theoretical aspects. In practice it is desirable to select grids of sounds such that their valences do not overlap. This need not be a grid which is fixed once and for all; it might be a time-dependent construction, but it must yield a locally disjoint valence set. With the common notation, such a grid is realized as an orientation device as well as an acoustic and performative scheme. The continuum of onsets, durations, pitches, and sound pressure values is quantized in the well-known way such that Boulez' "notched tone space" can be taken as a grid behind the realiter played or heard. The reality layer of a grid is a mental or psychological one. The blurredness of hearing, as it is expressed in valences, has to be subjected to a cognitive interpretation. Semiologically speaking, the valence is the significant, the expression for a meaning which relates to our understanding of music. By use of the grid which is superposed to the valence perception, it is possible to associate the valences of perceived sound to objects of our imagination. The psychological quality of pitch then results from this mapping as a grid object in our imagination. The semiotic power of this signification depends upon the definition of the actual grid. A music-theoretically fundamental grid is the selection of the pitch arsenal, the tuning, wherein tones may be played. Let us first look at the chromatic $w$-tempered scales (see section 7.2.1.1). If $w$ is not too large, valences of neighboring tones can be separated. From our experience with microtonal music, $w = 36$, i.e., tempered sixth-tone intervals, is not too large. So the postulate of an adjustment of pitch by inner grids is acceptable for $w$-tempered scales with $w \leq 36$. The situation of just intonation (see section 7.2.1.2 and appendix A.2.3) is much more delicate. For each pitch $H(x_0)$ of a point $x_0$ in the Euler grid
of just intonation, and for any positive real number $\varepsilon$, there are infinitely many points $x$ such that $|H(x - x_0)| < \varepsilon$, see proposition 64 in appendix A.2.3. In particular, infinitely many pitches fall into the valence of $x_0$. In the valence semiology, just intonation is infinitely homonymic. This problem can be solved by a restriction of the context where just intonation music is played. As soon as the local context is a small region of the Euler space, for example, a small neighborhood of a determined finite portion of the chromatic scale, valences can be used to distinguish just grid points. A second difficulty for just intonation relates to the perception of pitch differences. A classical argument of just music theory is Euler's substitution theory [144], according to which intervals are heard in a way such that the frequency relations of their tones form fractions $a/b$ (in reduced representation) with minimal numerator $a$ and denominator $b$. This is the basis of the classical consonance-dissonance theory which we address in the following section. In the grid of just intonation this means that the pitch difference of the interval is corrected/adapted to an interval which is realized by two points of the just intonation grid under the constraint that their (Euclidean or 1-norm) distance is minimal. Of course, this correction has to happen within a valence in order to be a reasonable process of human hearing. However, it is easy to see that in general, there are several solutions $a/b, a'/b', a''/b'', \ldots$ with minimal distance. So Euler's substitution theory would have to impose a contextual restriction for single tones as well as for intervals. These considerations should be taken for nothing more than they are: esthesic aspects of hearing. Nothing prevents us from doing music theory on neutral and poietic layers without bothering about valences. But then, one has to be conscious of the fact that highly differentiated
mathematical structures may be blurred by the semiology of valences from auditory psychology and physiology.
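The failure of transitivity of metamery described in section B.2 can be illustrated with a toy threshold model. This is only a sketch under stated assumptions: pitches are given in cents, two pitches count as metamere when they differ by less than a fixed threshold, and the threshold value and the names `metamere` and `valence` are hypothetical, not taken from Meyer-Eppler.

```python
# Hypothetical threshold model of P-metamery for pitch (values in cents).
# The threshold of 10 cents is an illustrative assumption, not a measured value.
THRESHOLD_CENTS = 10.0


def metamere(s, t, threshold=THRESHOLD_CENTS):
    """Return True if pitches s and t are closer than the threshold."""
    return abs(s - t) < threshold


def valence(s, sounds, threshold=THRESHOLD_CENTS):
    """Return the set of all given sounds that are metamere to s."""
    return {t for t in sounds if metamere(s, t, threshold)}


sounds = [0.0, 6.0, 12.0]
s, t, u = sounds

# s ~ t and t ~ u hold, but s ~ u fails: the relation is not transitive.
chain_holds = metamere(s, t) and metamere(t, u)
ends_metamere = metamere(s, u)

# The valences of s and t overlap without being equal, so they are
# not equivalence classes.
valence_s = valence(s, sounds)
valence_t = valence(t, sounds)
```

In this model the relation is reflexive and symmetric by construction, so only the transitivity failure is exhibited; the violations of reflexivity and symmetry mentioned in the text depend on memory and masking effects that the sketch does not model.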
B.3
Symbolic, Physiological, and Psychological Aspects of Consonance and Dissonance
For a long time, mathematical reflections in music were centered around the problematic concept couple of consonance and dissonance. This is based on the ancient Greek Pythagorean tradition where consonance and dissonance of intervals were grounded in the involved frequency relation. Perfect consonant intervals corresponded to ratios like 2/1 for the octave, 3/2 for the fifth, and 4/3 for the fourth. This simple arithmetic corresponded to the philosophy of the metaphysical tetractys. See [330, 394] for a historical discussion of these roots. Here, we simply want to recall that a unified mathematical foundation of musical thinking in the paradigm of simple consonant frequency ratios could not survive the differentiated development of theories in the contrapuntal setting [468], the psychological foundation of musical relations as introduced by René Descartes [126], and the discovery of physical partials by Marin Mersenne [310]. According to these more recent positions, the problem of consonances and dissonances changes as a function of the layer of reality where it is investigated, and on each layer it is not a minor one. In the present shorthand presentation, this result seems to be a provocative one since the conceptual unity seems violated. We want to make clear that we really are dealing with three different meanings of the sonance concept: Euler's gradus suavitatis on the mental layer, Helmholtz' beat model on the physical layer, and Plomp-Levelt's psychometrics on the psychological layer. In itself, each of these approaches is consistent; the problem only arises if one attempts to reduce one reality to another one. Following the knowledge about the neural processing of sounds (see the previous discussion in this chapter), it is hardly astonishing that psychological and physiological layers are not congruent: what the ear (in Helmholtz' model) does not "like" can very well be "agreeable" for the limbic system or the auditory cortex.
B.3.1
Euler's Gradus Function
Being a number theorist, Euler was interested in prime numbers. A priori, his gradus function $\Gamma$ [143] is a purely number-theoretic function; it is defined as follows: According to the prime factorization of integers (see appendix D.2), a positive integer $a$ is the unique product $a = p_1^{e_1} \cdot p_2^{e_2} \cdot \ldots p_n^{e_n}$ of positive powers of primes $p_1 < p_2 < \ldots p_n$ (with the singular case of zero factors and a product $= 1$ for $a = 1$). Euler's formula, the gradus suavitatis, is $\Gamma(a) = 1 + \sum_{k=1}^{n} e_k (p_k - 1)$.
[Figure B.8: plot of $10/\Gamma$ (vertical axis, ticks 0, 5, 10) against interval (horizontal axis).]
Figure B.8: The agreeableness of intervals within the octave in just intonation according to Vogel's chromatic [547], see section 7.2.1.2, is represented. It is reasonable to represent the reciprocal values $10/\Gamma$ instead of $\Gamma$ since the latter is rather a 'gradus dissuavitatis': small $\Gamma$ values are taken for the octave and the fifth, large ones for the second and tritone. The factor 10 is only a scaling constant. The order of Euler's valuation is this: Prime, fifth, fourth, major third/major sixth, minor third/minor sixth/major second, minor seventh, major seventh, minor second, tritone.

[…] $e, f, g$. We then have

$$\Gamma(\Delta) = \Gamma(2^e \cdot 3^f \cdot 5^g) = 1 + (2 - 1)e + (3 - 1)f + (5 - 1)g \qquad \text{(B.1)}$$

or as a scalar product

$$\Gamma(\Delta) = 1 + (\Phi, \Delta) \quad \text{with } \Phi = (1, 2, 4). \qquad \text{(B.2)}$$
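As a minimal sketch, the gradus function can be computed directly from the prime factorization. The function names below are hypothetical. One assumption goes beyond this section: for an interval with reduced frequency ratio $a/b$, the sketch follows Euler's classical convention of applying $\Gamma$ to the least common multiple of $a$ and $b$ (which is $a \cdot b$ for a reduced fraction). With this convention the values reproduce the order of intervals listed in the caption of figure B.8.

```python
from math import gcd


def prime_factorization(a):
    """Return {prime: exponent} for a positive integer a (empty dict for a = 1)."""
    factors = {}
    p = 2
    while p * p <= a:
        while a % p == 0:
            factors[p] = factors.get(p, 0) + 1
            a //= p
        p += 1
    if a > 1:
        factors[a] = factors.get(a, 0) + 1
    return factors


def gradus(a):
    """Euler's gradus suavitatis: 1 + sum of e_k * (p_k - 1) over the prime powers of a."""
    return 1 + sum(e * (p - 1) for p, e in prime_factorization(a).items())


def interval_gradus(numerator, denominator):
    """Gradus of an interval ratio (assumption: Gamma of the lcm of the reduced ratio)."""
    g = gcd(numerator, denominator)
    a, b = numerator // g, denominator // g
    return gradus(a * b)  # for coprime a, b the lcm equals the product


# Intervals of the chromatic scale named in the caption of figure B.8.
INTERVALS = {
    "fifth": (3, 2), "fourth": (4, 3), "major third": (5, 4),
    "major sixth": (5, 3), "minor third": (6, 5), "minor sixth": (8, 5),
    "major second": (9, 8), "minor seventh": (16, 9),
    "major seventh": (15, 8), "minor second": (16, 15), "tritone": (45, 32),
}
RANKING = sorted(INTERVALS, key=lambda name: interval_gradus(*INTERVALS[name]))
```

Sorting is stable, so intervals with equal gradus keep their listing order; the ties (major third with major sixth, and minor third with minor sixth and major second) match the groupings given in the caption.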
If we compare this formula with the pitch formula
where $H_{prime}$ is the prime vector from section A.2.3, then we observe a similar construction. The gradus function is something like a pitch function, but the 'direction' is $\Phi$ instead of $H_{prime}$, see figure B.9. The ranking of intervals by the gradus function was already criticized by Euler's contemporaries Mattheson, Mitzler, and Rameau [71]. But it is the merit of Euler to have defined a ranking by a linear expression which, together with pitch and octave coordinate, defines a coordinate system for the Euler space on the one hand, and considers the consonances and dissonances on the other. The disadvantage of Euler's approach is that it is based on the valence-theoretically invalid substitution hypothesis (see section B.2).

[Figure B.9: Euler space with axes labeled octave coord., fifth coord., third coord., and gradus coord.]
Figure B.9: The gradus function is something like a pitch function, but the 'direction' is $\Phi$ instead of $H_{prime}$. Both values are obtained from a scalar product of an Euler point $x$ (of a difference point $\Delta$, respectively) with $H_{prime}$ (with $\Phi$, respectively). However, the point is not uniquely determined by the 'gradus coordinate', in contrast to the 'pitch coordinate'.
B.3.2
von Helmholtz' Beat Model
Hermann von Helmholtz proceeds from the hypothesis that beats between partials of two tones are responsible for sonance phenomena. The fact that he uses partials is bound to Ohm's postulate that we have a cochlear Fourier analysis [217]. Hence Helmholtz' approach only regards the cochlear basis of music perception and not the higher limbic and cortical auditory processing. A beat is the periodic amplitude variation which results from the superposition of two sinusoidal waves which have a frequency difference $\Delta = f - g$, the beat frequency³, which is small with respect to their frequencies $f, g$. Helmholtz calculates the roughness, i.e., the degree of dissonance of an interval which consists of two tones $p, q$, as the sum of the beat intensities $I_{n,m}$, which are associated with the $n$th partial of $p$ and the $m$th partial of $q$, where $I_{n,m}$ is supposed to have a strong maximum for beat frequency $\Delta_{n,m} = 33$ Hz.
³ One uses the trigonometric equation $\sin(x) + \sin(y) = 2 \sin\left(\frac{x+y}{2}\right) \cos\left(\frac{x-y}{2}\right)$, see figure B.10.
Figure B.10: A beat between two superposed sinusoidal waves is characterized by a periodic amplitude variation whose frequency is twice (!) the frequency $(f - g)/2$ of the cosine factor, i.e., equal to the difference of the given frequencies.

Therefore Helmholtz' dissonance concept depends on the pitches and the involved sound colors. On the example of the violin, Helmholtz obtained good coincidences with Euler's gradus function. This model is impressive since it explains the experience according to which consonance is a function of the instrument and the absolute pitch of the interval tones. Its experimental verification is somewhat problematic. It essentially depends on the measurability of the beat intensities in the cochlea. Since non-linear distortions on the sound's way to the cochlea change spectra, one would have to perform ethically problematic invasive cochlear measurements. Moreover, individual statistical variations of the non-linear distortions would make the experiments even less robust. A fundamental doubt on the model's validity results from binaural experiments in hearing by Heinrich Husmann [242], where the interval tones are presented on separate left and right headphone inputs. In this case, no beats can intervene in the cochlea. Nonetheless, the experiments also revealed consonance as "happy moments of within the general (interval) disaster". The hypothesis that in the binaural experiments, Helmholtz beats must occur in the relay station of the corpus geniculatum mediale (see figure B.6) is speculative and demonstrates the limit of physiological models.
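The trigonometric identity behind the beat phenomenon in section B.3.2 can be checked numerically. The following is a small sketch with hypothetical function names and example frequencies (440 Hz and 444 Hz are assumptions chosen for illustration); it compares the superposition of two sine waves with the product form and evaluates the amplitude envelope $2|\cos(\pi (f - g) t)|$, which repeats every $1/|f - g|$ seconds.

```python
import math


def superposition(f, g, t):
    """Sum of two sine waves with frequencies f and g (in Hz) at time t (in sec)."""
    return math.sin(2 * math.pi * f * t) + math.sin(2 * math.pi * g * t)


def product_form(f, g, t):
    """The same signal written as 2 sin((x + y)/2) cos((x - y)/2)."""
    x = 2 * math.pi * f * t
    y = 2 * math.pi * g * t
    return 2 * math.sin((x + y) / 2) * math.cos((x - y) / 2)


def envelope(f, g, t):
    """Slowly varying amplitude of the superposition."""
    return abs(2 * math.cos(math.pi * (f - g) * t))


def max_identity_error(f, g, duration=1.0, steps=10000):
    """Largest deviation between both forms on a regular time grid."""
    times = (k * duration / steps for k in range(steps + 1))
    return max(abs(superposition(f, g, t) - product_form(f, g, t)) for t in times)


# Example: 440 Hz against 444 Hz; the envelope has period 1/4 sec.
error = max_identity_error(440.0, 444.0)
```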
B.3.3
Psychometric Investigations by Plomp and Levelt
In the psychological reality of interval perception, the judgment of interval qualities looks quite different. In their investigation of "pleasantness" of intervals, Reiner Plomp and Wilhelm Levelt [418] have presented pairs of sinusoidal tones and asked for pleasantness as a function of the given interval. The experiment was intentionally performed with musically untrained individuals in order to avoid judgments as a function of musical knowledge. Figure B.11 shows the resulting valuation curve. It is quite different from Euler's function as shown in figure B.9. Using this curve, Plomp and Levelt have tried to infer a description of Helmholtz' beat intensities, a procedure which was already recognized as being problematic in appendix B.3.2.
B.3.4
Counterpoint
There is however another, quite remarkable, point of view of the consonance-dissonance phenomenon, which has been poorly recognized within the psychoacoustic discussion, namely the prominent meaning of the concept pairing in the contrapuntal tradition, which was elaborated in the High and Late Middle Ages, and which was encoded in an exemplary way by Johann Joseph Fux' Gradus ad Parnassum [174]. Carl Dahlhaus [99] has rightly pointed out that the textural function of the contrapuntal consonance concept is not yet fully understood. Interestingly, in the framework of the core theory of counterpoint, the interval of the fourth is dissonant, in contradiction to the other theories. It is inconceivable that the mathematical and physiological theories of counterpoint never included this perspective. A mathematical model of counterpoint is discussed in chapters 29 through 31.

[Figure B.11: valuation curve with interval labels minor third, major third, fourth, fifth; axis labels: frequency of lower tone, frequency of higher tone (ticks 400, 450, 500, 550, 600).]
Figure B.11: (From [462], with permission of Springer-Verlag) The psychometric investigation [418] of Plomp and Levelt is based on sinusoidal tones and yields a valuation which differs significantly from Euler's $\Gamma$ function shown in figure B.9.
B.3.5
Consonance and Dissonance: A Conceptual Field
Even on the mental level, the concept of consonance and dissonance is multiply explained, without necessarily leading to contradictions. In fact, Euler's approach was a neutral mental (number-theoretic) one, whereas the contrapuntal approach is poietic. If we look at the status quo of the consonance-dissonance discussion and the fight for a valid final semantics, we are confronted with a disaster. The mathematically arbitrary ornaments which composers such as Klarenz Barlow [39] or neo-Pythagoreans such as Martin Vogel [549] add to Euler's formula do not interest psychoacousticians such as Ernst Terhardt [525], who would extrapolate cochlear findings into the auditory cortex, and the latter approach cannot shed light onto the esthetics of music. What is common to all these positions adds up to a concept of consonance and dissonance which is a conceptual field within the topography of music, a field with a quite ubiquitous presence. As a musical thought it results from a fundamental linear dynamic between polar extremals. It should be a main task of mathematical music theory to elaborate reliable and semantically reasonable, but esthetically undogmatic, models for such a way of thinking music.
Part XVI
Appendix: Mathematical Basics
Appendix C
Sets, Relations, Monoids, Groups
C.1
Sets
The language of sets describes mathematical facts in a classical way. An alternative foundation to sets is the language of categories, see appendix G. A set $M$ is an object which is defined as a collection of uniquely determined objects which are also sets. These objects are called the elements or points of $M$. Two sets are equal iff they have the same elements. Whenever we say that $M$ is a set, we mean that it is defined in a consistent way, i.e., without causing any contradiction. Existence of a mathematical object means that the object's definition causes no contradiction in classical logic ($A$ is identical to $A$, (exclusive) either $A$ or non $A$, and exclusion of a third). Then, for any set $m$ it is either an element of $M$ or it is not. One writes $m \in M$ for "$m$ is an element of $M$", or also "$m$ is a point of $M$". In order to define a set $M$ by its elements $m, m', \ldots$, one also writes $M = \{m, m', \ldots\}$. Observe that multiple enumeration does not change the set, for example $\{x, x, x, y\} = \{x, y\}$. A set $N$ whose elements are all elements of $M$ is called a subset of $M$, in signs: $N \subseteq M$, also sometimes $N \subset M$ iff $N \subseteq M$ and $N \neq M$. Two sets are equal iff they are mutually subsets of each other. The empty set $\emptyset$ is defined as having no elements. It is a subset of any set. A set is called finite if it is empty or its elements can be indexed¹ by a sequence² $0, 1, 2, 3, \ldots n$ of natural numbers. Otherwise it is called infinite.
¹ A mathematically correct definition of finiteness is this: A set is finite iff it is not in bijection with any proper subset.
² Recall that in this book, we make the logical, though not very common usage of the ellipsis symbol "...": it means that one has started with a sequence of symbol combinations which follows an evident law, such as 1, 2, ... n, or $a_1 + a_2 +$ ... $a_n$. The evidence is built upon the starting unit, such as "1," or "$a_1 +$" in our examples, and then the following unit, such as "2," or "$a_2 +$", and then inducing the following units to be denoted, such as "3," or "$a_3 +$", "4," or "$a_4 +$", etc., until the sequence is terminated by the last symbol, such as "n" or "$a_n$" in our examples. The ellipsis means that the building law is repeated, and as such, it is a meta-sign referring to the inductive offset. Therefore the more common notation 1, 2, ... , n, or $a_1 + a_2 +$ ... $+ a_n$ is not correct. In the limit, for $n = 3$, it would imply a notation such as 1, 2, , 3 or $a_1 + a_2 + + a_3$, which is nonsense. Moreover, in complicated indexing situations, the common notation would be overloaded.
C.1.1
Examples of Sets
Example 62 $\mathbb{Z} = \{0, \pm 1, \pm 2, \ldots \pm n, \ldots\}$, the set of integers; $\mathbb{N} = \{n \in \mathbb{Z}, n \geq 0\}$, the set of natural numbers; the set $\mathbb{Q} = \{p/q, p, q \in \mathbb{Z}, q \neq 0\}$ of rational numbers; the set $\mathbb{R}$ of decimal or real numbers, e.g., $x = -741.76$, $\pi = 3.1415926\ldots$. We have $\mathbb{N} \subset \mathbb{Z} \subset \mathbb{Q} \subset \mathbb{R}$, where integers $p$ are identified with rational numbers of form $p/1$, whereas rational numbers are identified with periodic real numbers.

Example 63 If $M, N$ are sets, their difference $M - N$ or the complement of $N$ in $M$ is the set of points of $M$ which are not in $N$. If $V = (M_i)_i$ is a family of sets, we denote by $\bigcup V$ the union of $V$, whose elements are precisely the elements collected from any of the $M_i$; for finite families $V = M_1, \ldots M_n$, one also writes $\bigcup V = M_1 \cup \ldots M_n$. In particular, $\bigcup \emptyset = \emptyset$. A covering of a set $X$ is a family $V$ such that $\bigcup V = X$. The intersection of a family $V$ is the set $\bigcap V$ consisting exactly of those points which are points in every member $M_i$ of the family. For finite families $V = M_1, \ldots M_n$, one also writes $\bigcap V = M_1 \cap \ldots M_n$. In particular, $\bigcap \emptyset = AllSet$, the set whose elements are all existing sets. In less universal contexts, one only takes the intersection with regard to a large superset of the family's members. Observe that there is no reason why $AllSet$ should not exist; however, not every one of its subcollections defined by predicates equally exists. For example, the subset of all sets not containing themselves as elements does not. A partition of a non-empty set $X$ is a covering whose members are non-empty and such that any two distinct members are disjoint, i.e., intersect in the empty set.
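The notions of example 63 can be sketched for finite sets as follows. The function names are hypothetical, and one choice is an assumption made for the sketch: the intersection is taken relative to an explicitly given superset, which corresponds to the "less universal contexts" mentioned above, so that the intersection of the empty family is that superset rather than $AllSet$.

```python
from itertools import combinations


def union(family):
    """Union of a finite family of sets; the empty family yields the empty set."""
    result = set()
    for member in family:
        result |= set(member)
    return result


def intersection(family, universe):
    """Intersection relative to a given superset of all members (an assumption)."""
    result = set(universe)
    for member in family:
        result &= set(member)
    return result


def is_covering(family, X):
    """A family covers X iff its union equals X."""
    return union(family) == set(X)


def is_partition(family, X):
    """A covering by non-empty members that are pairwise disjoint."""
    members = [set(m) for m in family]
    if not is_covering(members, X):
        return False
    if any(not m for m in members):
        return False
    return all(a.isdisjoint(b) for a, b in combinations(members, 2))
```

For instance, `is_partition([{1, 2}, {3}], {1, 2, 3})` holds, whereas `[{1, 2}, {2, 3}]` is a covering of the same set but not a partition.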
C.2
Relations
Definition 114 If $x, y$ are two sets, the ordered pair $(x, y)$ is defined to be the set

$$(x, y) = \begin{cases} \{\{x\}\} & \text{if } x = y, \\ \{\{x\}, \{x, y\}\} & \text{else.} \end{cases} \qquad \text{(C.1)}$$

Lemma 55 For any four sets $a, b, c, d$, we have $(a, b) = (c, d)$ iff $a = c, b = d$.

A triple $(a, b, c)$ is a pair of form $((a, b), c)$. Clearly, $(a, b, c) = (a', b', c')$ iff $a = a', b = b', c = c'$.

Definition 115 Given two sets $A, B$, their Cartesian product is the set

$$A \times B = \{(a, b) \mid a \in A, b \in B\}$$

consisting of all ordered pairs $(a, b)$, with the first coordinate $a$ an element in the first factor $A$, and the second coordinate $b$ an element in the second factor $B$. A relation from $A$ to $B$ is a subset $R \subseteq A \times B$. One writes $aRb$ for $(a, b) \in R$, and, if the relation is clear, more simply $a \sim b$. The inverse of a relation is the set $R^{-1} = \{(b, a) \mid aRb\}$. If $A = B$, one also speaks of a relation on $A$.
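Definition 114 and lemma 55 can be tried out with Python's `frozenset`, which models sets whose elements are again (hashable) sets. This is an illustrative sketch with hypothetical names; the brute-force check covers only a small finite domain and is not a proof. Note that for $x = y$ the set $\{x, y\}$ collapses to $\{x\}$, so the single expression below yields $\{\{x\}\}$ in that case, in agreement with formula (C.1).

```python
from itertools import product


def pair(x, y):
    """Ordered pair as a set: {{x}} if x == y, else {{x}, {x, y}}."""
    return frozenset({frozenset({x}), frozenset({x, y})})


def lemma_55_holds(domain):
    """Check (a, b) = (c, d) iff a = c and b = d for all a, b, c, d in the domain."""
    return all(
        (pair(a, b) == pair(c, d)) == (a == c and b == d)
        for a, b, c, d in product(domain, repeat=4)
    )


def cartesian_product(A, B):
    """A x B, here represented with Python tuples for readability."""
    return {(a, b) for a in A for b in B}


def inverse(R):
    """Inverse relation: all (b, a) with (a, b) in R."""
    return {(b, a) for (a, b) in R}
```

Using tuples in `cartesian_product` is a convenience of the sketch; by lemma 55 they behave like the set-theoretic pairs as far as equality is concerned.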
Definition 116 A graph $f : A \to B$ from $A$ to $B$ is a triple $(A, B, f)$ with $f$ a relation from $A$ to $B$; it is called

(i) total iff every $a \in A$ is the first factor of a pair in $f$;

(ii) functional iff $(a, b), (a, b') \in f$ implies $b = b'$; one writes $f(a)$ instead of the uniquely determined $b$, or also $f : a \mapsto b$;

(iii) a function or map iff it is a total functional graph.

The set $A$ is the domain of $f$, whereas $B$ is the function's codomain. If $f : A \to B$ and $g : B \to C$ are two functions, their composition or concatenation function $g \circ f : A \to C$ is defined by $g \circ f(a) = g(f(a))$. The function $f : A \to A$ with $f(a) = a$ for all $a \in A$ is called the identity on $A$ and is denoted by $Id_A$.
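The conditions of definition 116 can be sketched for finite sets, with a graph represented by its set of pairs. All function names are hypothetical; the composition is computed on pairs, which for functions agrees with $g \circ f(a) = g(f(a))$.

```python
def is_total(A, f):
    """Every a in A is the first factor of some pair in f."""
    return all(any(a == x for (x, _) in f) for a in A)


def is_functional(f):
    """(a, b) and (a, b') in f imply b == b'."""
    images = {}
    for a, b in f:
        if a in images and images[a] != b:
            return False
        images[a] = b
    return True


def is_function(A, B, f):
    """A total functional graph from A to B."""
    is_relation = set(f) <= {(a, b) for a in A for b in B}
    return is_relation and is_total(A, f) and is_functional(f)


def compose(g, f):
    """g o f: all (a, c) such that (a, b) in f and (b, c) in g for some b."""
    return {(a, c) for (a, b) in f for (b2, c) in g if b == b2}


def identity(A):
    """The identity function on A as a set of pairs."""
    return {(a, a) for a in A}


A, B, C = {1, 2}, {"x", "y"}, {"u"}
f = {(1, "x"), (2, "x")}
g = {("x", "u"), ("y", "u")}
```

With these data, `compose(g, f)` is again a function from `A` to `C`, and composing `f` with the identities on either side returns `f`, as stated in exercise 83.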
Exercise 83 Verify that $g \circ f$ is indeed a function. If $h : C \to D$ is a third function, we have $(h \circ g) \circ f = h \circ (g \circ f)$ and therefore we write $h \circ g \circ f$. Show that $Id_B \circ f = f \circ Id_A = f$.

Definition 117 A function $f$ is called

(i) injective iff $f(a) = f(a')$ always implies $a = a'$;

(ii) surjective iff for every $b \in B$, there is $a \in A$ such that $f(a) = b$;

(iii) bijective iff $f$ is injective and surjective.

Lemma 56 For a function $f : A \to B$, the following conditions are equivalent:

(i) There is a function $g : B \to A$ such that $g \circ f = Id_A$ and $f \circ g = Id_B$.

(ii) The function $f$ is bijective.

The $g$ in this lemma is uniquely determined by $f$ and is called the inverse function of $f$; it is denoted by $f^{-1}$. Bijections $f : A \to A$ are also called permutations of $A$. If there is a bijection $f : A \to B$ between two sets $A, B$, we say that they have the same cardinality, and write $card(A) = card(B)$. On $AllSet$, the cardinality relation is an equivalence relation, and one may define the cardinality $card(A)$ of a set $A$ as the equivalence class $[A]$ under this relation. A finite set is one whose cardinality is that of a set of form $\{1, 2, 3, \ldots n\}$, with a natural number $0 \leq n$, where we take the empty set for $n = 0$.

Definition 118 A binary relation $\leq$ on a set $S$ is said to be

(i) reflexive iff $x \leq x$ for all $x \in S$;

(ii) transitive iff $x \leq y$ and $y \leq z$ implies $x \leq z$ for all $x, y, z \in S$;

(iii) symmetric iff $x \leq y$ implies $y \leq x$ for all $x, y \in S$;

(iv) antisymmetric iff $x \leq y$ and $x \neq y$ excludes $y \leq x$ for all $x, y \in S$;

(v) total iff $x \leq y$ or $y \leq x$ for all $x, y \in S$.
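The five properties of definition 118 translate directly into checks on a finite relation given as a set of pairs. The following sketch uses hypothetical function names and three example relations on $\{1, \ldots 6\}$ chosen for illustration: divisibility, the usual order, and equality of parity.

```python
def is_reflexive(S, R):
    return all((x, x) in R for x in S)


def is_transitive(R):
    return all((x, z) in R for (x, y) in R for (y2, z) in R if y == y2)


def is_symmetric(R):
    return all((y, x) in R for (x, y) in R)


def is_antisymmetric(R):
    # x R y and x != y excludes y R x
    return all(x == y or (y, x) not in R for (x, y) in R)


def is_total(S, R):
    return all((x, y) in R or (y, x) in R for x in S for y in S)


S = set(range(1, 7))
divides = {(x, y) for x in S for y in S if y % x == 0}
leq = {(x, y) for x in S for y in S if x <= y}
same_parity = {(x, y) for x in S for y in S if (x - y) % 2 == 0}
```

Here `divides` is reflexive, transitive, and antisymmetric but not total, `leq` has all of these properties and is total, and `same_parity` is reflexive, transitive, and symmetric.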
Definition 119 A binary relation $\leq$ on a set $S$ is called an equivalence relation iff it is reflexive, transitive, and symmetric. In this case, the relation is usually denoted by "$\sim$" instead of "$\leq$".

Lemma 57 Let $\sim$ be an equivalence relation on $S$. Then the subsets $[s] = \{t \mid s \sim t\}$ are called equivalence classes of $\sim$. The set of equivalence classes is denoted by $S/\sim$. It defines a partition of $S$, i.e., it covers $S$, and for any two elements $s, t \in S$, either $[s] = [t]$ or $[s] \cap [t] = \emptyset$.

Definition 120 A binary relation $\leq$ on a set $S$ is called a partial ordering iff it is reflexive, transitive, and antisymmetric. A partial ordering is called linear iff it is total. A linear ordering is called well-ordered iff every non-empty subset $T \subset S$ contains a minimal element.

Lemma 58 Let $\leq$ be a binary relation on a set $S$. Denoting $x < y$ iff $x \leq y$ and $x \neq y$, the following two statements are equivalent:

(i) The relation $\leq$ is a partial ordering.

(ii) The relation $\leq$ is reflexive, the relation $<$ is transitive, and for all $x, y \in S$, $x < y$ excludes $y < x$.

If these equivalent properties hold, we have $x \leq y$ iff $x = y$ or else $x < y$. In particular, if we are given $<$ with the properties (ii), and if we define $x \leq y$ by the preceding condition, then the latter relation is a partial ordering.

Proof. (i) $\Rightarrow$ (ii) Clearly $\leq$ is reflexive. If $x < y$ and $y < z$, then $x \leq z$. If we had $x = z$, then we were in the second statement of (ii), and it suffices to prove this one. But $x < y$ and $y < x$ implies $x = y$ by the antisymmetry of $\leq$, a contradiction. (ii) $\Rightarrow$ (i) Transitivity: If $x \leq y$ and $y \leq z$, and either $x = y$ or $y = z$, we are done. Otherwise $x < y$ and $y < z$, whence $x < z$, therefore $x \leq z$. Antisymmetry: If $x \leq y$ and $y \leq x$, then $x \neq y$ is excluded by (ii), whence the claim. The last statement is clear, QED.

Example 64 Suppose that $(I, <)$ is a linearly ordered "index set", and that we are given a family $((T_i,$ […]
This is one of the many equivalent versions of the axiom of choice [281]. A particularly important variant is this:

Theorem 38 (Zermelo) If the axiom of choice holds, there is a well-ordering on every set. Conversely, if every set can be well ordered, the axiom of choice holds.

For a proof, see [281, p.261].

Proposition 65 Let $\leq$ be a partial ordering on a set $S$. Let $\preceq$ be the following binary relation on the set $Fin(S)$ of finite subsets of $S$. For $A, B \subset S$, let

$$A \preceq B \iff A = B, \text{ or } A \neq B \text{ and for all } x \in A - B, \text{ there is } y \in B - A \text{ with } x < y. \qquad \text{(C.2)}$$
Then $\preceq$ is a partial ordering. If $\leq$ is linear, so is $\preceq$.

Proof. We define $A \prec B$ as in lemma 58. If $\leq$ is total, then for any $A \neq B$ in $Fin(S)$, if both disjoint difference sets $A - B$ and $B - A$ are non-empty, they contain different maximal elements, and we are done. Next, we verify statement (ii) about $\prec$. Suppose that we have $A \prec B$ and $B \prec A$ for $A \neq B$. Hence, we may suppose $A - B \neq \emptyset$. Take a maximal $x \in A - B$. There is $y \in B - A$ with $x < y$. Then there is $z \in A - B$ with $y < z$, but by transitivity of $<$, $x < z$, a contradiction to maximality of $x$. To show transitivity, take $A \neq B \neq C$ in $Fin(S)$. Since $A - C = \emptyset$ is trivial, suppose that $x$ is a maximal element of $A - C$. Suppose first that $x \in B$. Then $x \in B - C$, and there is $x < z$ with $z \in C - B$; take a maximal element $z$ of this type. If $z \in A$, then $z \in A - B$, hence there is $z < w$, $w \in B - A$. If $w \in C$, $w \in C - A$, and $x < z < w$ which was required. Else, $w \notin C$ whence $w \in B - C$, and there is $u \in C - B$ with $w < u$, but $z < w < u$ contradicts maximality of $z$ in $C - B$. Now if $z \notin A$, we have $z \in C - A$, and we are also done. Suppose now that $x \notin B$, and therefore $x \in A - B$. There is $z \in B - A$ with $x < z$; take a maximal such element in $B - A$. If $z \in C$, then $z \in C - A$ and we are done. Otherwise $z \in B - C$, and there is $w \in C - B$ with $z < w$. If $w \notin A$, then $w \in C - A$ and $x < z < w$ which was required. If $w \in A$ we have $w \in A - B$, and there is $u \in B - A$ with $w < u$. But then $z < w < u$ contradicts maximality of $z$ in $B - A$, QED.
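Proposition 65 can be checked by brute force on a small example. The sketch below (hypothetical names) implements the relation $\preceq$ of formula (C.2) for finite sets of integers with their usual order and verifies the axioms of a linear ordering on all subsets of $\{0, 1, 2\}$. This is a finite sanity check, not a substitute for the proof.

```python
from itertools import combinations


def preceq(A, B):
    """A precedes-or-equals B as in (C.2), for finite sets of integers."""
    A, B = frozenset(A), frozenset(B)
    if A == B:
        return True
    return all(any(x < y for y in B - A) for x in A - B)


def finite_subsets(S):
    """All subsets of the finite set S."""
    items = sorted(S)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]


def is_linear_order(elements, leq):
    """Reflexive, antisymmetric, transitive, and total on the given elements."""
    reflexive = all(leq(a, a) for a in elements)
    antisymmetric = all(a == b or not (leq(a, b) and leq(b, a))
                        for a in elements for b in elements)
    transitive = all(leq(a, c) or not (leq(a, b) and leq(b, c))
                     for a in elements for b in elements for c in elements)
    total = all(leq(a, b) or leq(b, a) for a in elements for b in elements)
    return reflexive and antisymmetric and transitive and total


SUBSETS = finite_subsets({0, 1, 2})
```

For a linear order on non-negative integers, this ordering compares two distinct sets by the largest element in which they differ, so it coincides with comparing the binary numbers $\sum_{x \in A} 2^x$; the test below uses this observation as an independent check.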
Definition 121 For a finite partial ordering $Rel$ on a set $X$, the Hasse diagram $Rel(X)$ is the relation whose pairs are the uniquely determined minimal set of generating relations of $Rel$.

Proof of uniqueness. Suppose that two minimal generating sets $A, B$ are different. Then any relation of $A - B$ points at a decomposition with a factor in $B - A$, and so forth, vice versa, such that we obtain an infinite chain of relations, contradicting the finiteness of $X$.

For a finite partial ordering $Rel$ on $X$, the level function $lev : X \to \mathbb{N}$ is defined as follows. For minimal elements $x$ in the Hasse diagram $Rel(X)$, put $lev(x) = 0$; for any other element $x \in X$, set $pre(x) = \{y \mid y < x \text{ in } Rel(X)\}$. Then we put $lev(x) = 1 + Max(lev(y), y \in pre(x))$.
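A possible computation of the Hasse diagram and the level function for a finite partial ordering is sketched below. Two readings are assumptions of this sketch: the minimal generating relations are taken to be the covering pairs (those $x < y$ with no $z$ strictly between), and $pre(x)$ is taken to be the set of elements directly below $x$ in the Hasse diagram. The function names and the divisibility example are hypothetical illustrations.

```python
def hasse_diagram(X, leq):
    """Covering pairs (x, y): x < y with no z such that x < z < y."""
    less = {(x, y) for x in X for y in X if x != y and leq(x, y)}
    return {
        (x, y) for (x, y) in less
        if not any((x, z) in less and (z, y) in less for z in X)
    }


def level(X, leq):
    """Level function: 0 on minimal elements, else 1 + max level of predecessors."""
    cover = hasse_diagram(X, leq)
    lev = {}

    def compute(x):
        if x not in lev:
            pre = [y for (y, x2) in cover if x2 == x]
            lev[x] = 0 if not pre else 1 + max(compute(y) for y in pre)
        return lev[x]

    for x in X:
        compute(x)
    return lev


# Example: the divisors of 12, ordered by divisibility.
DIVISORS = {1, 2, 3, 4, 6, 12}
divides = lambda x, y: y % x == 0
```

Because the maximum is taken over predecessors, using all strictly smaller elements instead of only the directly covered ones would give the same levels.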
C.2.1
Universal Constructions
If $V = (M_i)_{i \in I}$ is a family of sets, the product set $\prod_I M_i$ is the set whose elements are the functions $f : I \to \bigcup V$ such that $f(i) \in M_i$ for all $i \in I$. Evidently, pairs, triples are special cases of such functions, but they are basic to the definition of functions, and therefore are treated separately. For each $j \in I$, we have the projection function $p_j : \prod_I M_i \to M_j$. Here is the universal³ property of the product⁴ set:

Lemma 59 With the above notation, if $(f_i : X \to M_i)_i$ is a family of functions, then there is exactly one function $f : X \to \prod_I M_i$ such that $f_i = p_i \circ f$ for all $i \in I$, i.e., $f(x)(i) = f_i(x)$.

For the given family $V$, the coproduct set $\coprod_I M_i$ is the union of the family $V' = (\{i\} \times M_i)_I$. For each index $j$, we have a function $\iota_j : M_j \to \coprod_I M_i$ defined by $\iota_j(m) = (j, m)$. The universal property of the coproduct⁵ is this:

Lemma 60 For every family $(f_i : M_i \to X)_i$ of functions, there is exactly one function $f : \coprod_I M_i \to X$ such that $f \circ \iota_i = f_i$, i.e., $f((i, m)) = f_i(m)$.

Lemma 61 If $f : A \to B$ is a function, then the image $Im(f) = \{b \mid b \in B, \text{there exists } a \in A \text{ such that } b = f(a)\}$, together with the inclusion function $i : Im(f) \to B$ and the surjective function $f' : A \to Im(f) : a \mapsto f(a)$ has the following universal property: We have $f = i \circ f'$, and for every factorization $f = u \circ v$, there is a unique factorization $u = i \circ h$, and $f' = h \circ v$.

Lemma 62 Let $A^B$ be a powerset, i.e., the set of functions $f : B \to A$. Then there is a natural bijection of adjunction $ad : C^{A \times B} \to (C^B)^A$, defined by $ad(g)(a)(b) = g(a, b)$.

For a function $f : A \to B$ and a subset $C \subseteq B$, we define by $f^{-1}(C) = \{x \mid x \in A, f(x) \in C\}$ the inverse image of $C$ under $f$; if $C = \{c\}$ is a singleton, we also write $f^{-1}(c)$ instead and call the set the fiber of $c$.

Lemma 63 Let $2 = \{0, 1\}$, $0 = \emptyset$, $1 = \{0\}$, and let $Sub(X)$ be the set of subsets of $X$ (also called the powerset of $X$). Then there is a natural bijection $\chi : Sub(X) \to 2^X$ defined by $\chi(Y)(z) = 0$ iff $z \in Y$, and $\chi(Y)(z) = 1$ iff $z \notin Y$. The function $\chi(Y)$ is called the characteristic function of $Y$. The inverse function maps $c : X \to 2$ to $Y_c = c^{-1}(0)$.
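Several of these constructions have direct finite counterparts, sketched here with hypothetical names. Functions on finite sets are represented as Python dictionaries or callables, which is a choice of the sketch; the characteristic function follows the convention of lemma 63, where the value 0 marks membership.

```python
def chi(Y, X):
    """Characteristic function of Y within X: 0 iff z in Y, 1 iff z not in Y."""
    return {z: (0 if z in Y else 1) for z in X}


def chi_inverse(c):
    """Inverse of chi: the fiber of 0 under c."""
    return {z for z, value in c.items() if value == 0}


def ad(g):
    """Adjunction of lemma 62: ad(g)(a)(b) = g(a, b)."""
    return lambda a: (lambda b: g(a, b))


def coproduct(family):
    """Coproduct of a family given as {index: set}: the union of {i} x M_i."""
    return {(i, m) for i, M in family.items() for m in M}


def copair(functions):
    """The unique f on the coproduct with f((i, m)) = f_i(m), as in lemma 60."""
    return lambda im: functions[im[0]](im[1])


X = {1, 2, 3}
family = {"a": {1, 2}, "b": {2}}
f = copair({"a": lambda m: m + 1, "b": lambda m: -m})
```

The element 2 occurs in both members of `family` but appears twice in the coproduct, once as `("a", 2)` and once as `("b", 2)`, which is the point of tagging members with their index.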
C.2.2
Graphs and Quivers
Definition 122 For a set X, denote by P2 (X) the set of subsets of cardinality one or two in X, i. e., the set of singletons and unordered pairs in X. Then a multigraph is a triple (L, V, G : L -+ P2 (V)). The elements of L are called the lines, the elements of V are called the vertexes of the multigraph. Often, we identify the multigraph with its map G. If G is injective, the multigraph is called a graph. A multigraph is finite, iff all involved sets are finite. 3This category 4This 5This
is a terminology from category theory, stating (in its most generic form) that a certain object in a is final, but see appendix G. is a special case of a limit set, see appendix G.2.1. is a special case of a colimit set, see appendix G.2.1.
C.2. RELATIONS
Definition 123 A quiver is a pair $G = (head, tail : A \rightrightarrows V)$ of set maps. The elements of V are called vertexes, the elements of A are called arrows. If every pair of vertexes is head and tail of at most one arrow, the quiver is called a directed graph. A path p in G is either a vertex $p = v$ (a 'lazy' path) or else a sequence $p = (a_0, a_1, \ldots a_n)$ of arrows with $head(a_i) = tail(a_{i+1})$ for each index $i = 0, 1, \ldots n - 1$. The length $l(p)$ of a path is 0 for a lazy path, and n for a general path. A closed path is a path with $head(a_n) = tail(a_0)$. A lazy path is also a closed path of length 0. A closed path of length 1 is also called a loop. A cycle is an equivalence class of closed paths which differ from each other by their start/end point. We also use the symbols head(p), tail(p) for the head, or tail, respectively, of the last, or first arrow of p, respectively (or just the unique vertex v for the lazy path).
Example 65 For a set V, a complete quiver is a quiver $G = (head, tail : A \rightrightarrows V)$ such that the map $f : A \to V^2 : a \mapsto (tail(a), head(a))$ is a bijection.
For two paths p, q with $head(q) = tail(p)$, we have a composed path p.q which is the evident juxtaposition of arrow sequences or the respectively other path if one path is lazy. As with functions, the composition of paths is associative if it is defined.
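The composition of paths can be sketched in a few lines. In this illustration a path is stored as a pair (start vertex, tuple of arrows), so that a lazy path has an empty tuple; this representation, the example quiver and all function names are assumptions made for the sketch only.

```python
# Hypothetical example quiver on the vertexes 1, 2, 3 with three arrows.
TAIL = {"a": 1, "b": 2, "c": 3}
HEAD = {"a": 2, "b": 3, "c": 1}


def lazy(v):
    """Lazy path at vertex v."""
    return (v, ())


def path(*arrows):
    """Non-lazy path; checks head(a_i) = tail(a_{i+1})."""
    for x, y in zip(arrows, arrows[1:]):
        if HEAD[x] != TAIL[y]:
            raise ValueError(f"arrows {x} and {y} do not match")
    return (TAIL[arrows[0]], tuple(arrows))


def tail(p):
    return p[0]


def head(p):
    start, arrows = p
    return HEAD[arrows[-1]] if arrows else start


def compose(p, q):
    """Composed path p.q, defined when head(q) = tail(p): first q, then p."""
    if head(q) != tail(p):
        raise ValueError("head(q) must equal tail(p)")
    return (tail(q), q[1] + p[1])


def is_closed(p):
    return head(p) == tail(p)


q = path("a")            # 1 -> 2
p = path("b", "c")       # 2 -> 3 -> 1
pq = compose(p, q)       # 1 -> 2 -> 3 -> 1, a closed path
```

Composing with a lazy path returns the other path unchanged, as required in the text.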
C.2.3
Monoids
Definition 124 A semigroup is a couple $(M, \mu)$ consisting of a set M and an associative binary operation $\mu : M \times M \to M : (m, n) \mapsto \mu(m, n) = m.n$. The semigroup M is a monoid if there exists a neutral element e, i.e., $e.m = m.e = m$ for all $m \in M$. Since e is uniquely determined by this property, it is called the neutral element. For a given subset $S \subset M$ of a semigroup, the semigroup $\langle S \rangle$ generated by S is the smallest sub-semigroup of M containing S. If M is a monoid, the submonoid with neutral element e generated by S is denoted by $\langle S \rangle_e$. If $m.n = n.m$ for all $m, n \in M$, the semigroup is called commutative. An element $x \in M$ such that there is y with $x.y = e$ is called invertible, its set $M^*$ is a submonoid. A monoid $(M, \mu)$ is finite iff the underlying set M is so. Given two monoids M, N, a monoid homomorphism is a set map $f : M \to N$ such that $f(e_M) = e_N$ and $f(m.m') = f(m).f(m')$ for all $m, m' \in M$. A monoid isomorphism is a bijective monoid homomorphism, its inverse set map is then automatically a monoid isomorphism.
Exercise 84 Show that the map $N \mapsto N_e = N \cup \{e\}$ defines a projection from the set of sub-semigroups of M onto the set of sub-monoids of M with $N_e = N$ iff N is a submonoid and $N_e = N'_e$ iff $N = N'$ for sub-semigroups N, N' which are not sub-monoids.
Example 66 For a set⁶ Alphabet, there is a monoid FM(Alphabet) and an injection $i : Alphabet \to FM(Alphabet)$ such that for any set map $f : Alphabet \to M$ into a monoid M, there is exactly one monoid homomorphism $FM(f) : FM(Alphabet) \to M$ such that $FM(f) \circ i = f$. One calls the monoid FM(Alphabet) the free monoid over Alphabet and the above property is the universal property that characterizes the free monoid up to isomorphism. The free monoid consists of all "word" expressions $b_1 \ldots b_k$ for $b_i \in Alphabet$, $k > 0$, and the empty word $\emptyset$. The product is defined by juxtaposition of words, $\emptyset$ is neutral. The free monoid is also called "word monoid over the alphabet Alphabet". Consider the equivalence relation on FM(Alphabet) with $b_1 \ldots b_k \sim b_{\pi(1)} \ldots b_{\pi(k)}$ for any permutation $\pi$, then the quotient space $FCM(Alphabet) = FM(Alphabet)/\sim$ with the induced multiplication is a commutative monoid and is called the free commutative monoid over Alphabet.
⁶ E.g. Alphabet = ASCII, UNICODE.
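The universal property of the free monoid has a familiar computational reading: a map on letters extends to words by folding along the target monoid. The sketch below assumes that words are Python tuples and that the target monoid is passed as an operation together with its neutral element; `extend` and `commutative_class` are names chosen for the illustration.

```python
from collections import Counter
from functools import reduce


def extend(f, op, unit):
    """Return FM(f): the extension of a letter map f to words.
    A word is a tuple of letters; (op, unit) is the target monoid."""
    return lambda word: reduce(op, (f(b) for b in word), unit)


def commutative_class(word):
    """Class of a word in FCM(Alphabet): only letter multiplicities count."""
    return Counter(word)


# Target monoid (int, +, 0); the letter map sends a letter to its code point.
FMf = extend(ord, lambda m, n: m + n, 0)

u = ("a", "b")
v = ("c",)
```

The empty word is sent to the neutral element, and `FMf(u + v)` equals `FMf(u) + FMf(v)`, which is the homomorphism property. In the commutative quotient, two words are identified as soon as their `Counter` values agree.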
Definition 125 In a semigroup M, an idempotent is an element x such that $x = x.x = x^2$. We denote $Idempot(M) = \{x \mid x$ is idempotent in $M\}$. For a subset $X \subset M$, we define its radical by $\sqrt{X} = \{y \mid \exists n, 0 < n$, such that $y^n \in X\}$. In particular, given an idempotent x, the radical $\sqrt{x}$ of x is defined by
$\sqrt{x} = \{y \mid \exists n, 0 < n$, such that $x = y^n\}$.
Since for every $y \in \sqrt{x}$, $\langle y \rangle \subset \sqrt{x}$, we also have
$\sqrt{x} = \bigcup_{y \in \sqrt{x}} \langle y \rangle$.
Example 67 For example, the neutral element in a monoid is idempotent.
Example 68 If Q is a left module⁷ over a ring R, the set of idempotents Idempot(End(Q)) of the linear endomorphism ring of Q is in bijection with the set Dir(Q) of direct decompositions $Q = U \oplus V$ of Q. In fact, the bijection is set up by
$Idempot(End(Q)) \xrightarrow{\sim} Dir(Q) : x \mapsto (Im(x), Im(Id_Q - x))$
whose inverse defines the projection $p : Q \to U$ onto the factor U of $Q = U \oplus V$. The image of an idempotent x is a direct decomposition by the equation $Id_Q = x + (Id_Q - x)$ which yields $Q = Im(x) \oplus Im(Id_Q - x)$ with $x = Id_{Im(x)} \oplus 0_{Im(Id_Q - x)}$. Conversely, the projection $x = pr_U$ yields $U = Im(x)$, $V = Im(Id_Q - x)$. So the above map is a bijection.
Lemma 64 The idempotents of the monoid Q@Q of affine endomorphisms⁸ are the elements $e^q.y$ such that $y \in Idempot(End(Q))$ and $q \in Im(Id_Q - y) = Ker(y)$.
Example 69 For example, the idempotents of $\mathbb{Z}_{12}$ are 0, 1, 4, 9, and $Ker(0) = \mathbb{Z}_{12}$, $Ker(1) = 0$, $Ker(4) = 3.\mathbb{Z}_{12}$, $Ker(9) = 4.\mathbb{Z}_{12}$, so we have a total of 20 affine idempotents here.
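The count in example 69 is small enough to be confirmed by brute force. The following sketch assumes that an affine endomorphism $e^q.y$ of $\mathbb{Z}_{12}$ is encoded as the pair `(q, y)` standing for the map $x \mapsto y.x + q$; all names are chosen for the illustration.

```python
N = 12

# Linear idempotents: y with y*y = y in Z_12 (acting by multiplication).
linear_idempotents = [y for y in range(N) if (y * y) % N == y]

# Kernel of each linear idempotent.
kernels = {y: [q for q in range(N) if (y * q) % N == 0]
           for y in linear_idempotents}


def is_affine_idempotent(q, y):
    """Check (e^q.y)(e^q.y) = e^q.y pointwise, where e^q.y : x -> y*x + q."""
    f = lambda x: (y * x + q) % N
    return all(f(f(x)) == f(x) for x in range(N))


affine_idempotents = [(q, y) for y in range(N) for q in range(N)
                      if is_affine_idempotent(q, y)]
```

The kernels have 12, 1, 4 and 3 elements for y = 0, 1, 4, 9, respectively, and every affine idempotent found by the pointwise test satisfies the two conditions of lemma 64.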
Lemma 65 With the previous notation, for two idempotent elements x, y of a semigroup M, $\sqrt{x} \cap \sqrt{y} \neq \emptyset$ iff $x = y$. Hence we have a partition of radicals in M, viz., $\sqrt{Idempot(M)} = \coprod_{x \in Idempot(M)} \sqrt{x}$, the idempotent components⁹ of M.
⁷ See appendix E.
⁸ See appendix E.
⁹ Terminology of Noll [400].
Clearly, if $f : M \to M$ is an automorphism of the semigroup M, we have $f(\sqrt{x}) = \sqrt{f(x)}$. In particular, if M = Q@Q for a module Q, and if $y \in End(Q)$ is an idempotent linear endomorphism of Q, the conjugation by a translation $e^q$, $q \in Q$ gives the formula
$e^q.\sqrt{y}.e^{-q} = \sqrt{e^q.y.e^{-q}} = \sqrt{e^{(1-y)q}.y}$ (C.3)
which means that the translation exponent $(1 - y)q$ is a general element of Ker(y). Therefore:
Proposition 66 For a module Q, the orbits of idempotent components under the conjugation action of the translation group $e^Q$ are the sets of idempotent components associated with the linear idempotent endomorphisms of Q.
Lemma 66 The canonical surjective linear factor projection $p : Q@Q \to End(Q)$ is compatible with the idempotent partitions, and the fiber of every idempotent component in End(Q) is the orbit of the translation group action under conjugation (see proposition 66).
Proposition 67 If in the above example the module Q is finite, the respective idempotent components define partitions
$\sqrt{Idempot(End(Q))} = End(Q)$ and $\sqrt{Idempot(Q@Q)} = Q@Q$.
Proof. Let $y \in End(Q)$. By Fitting's lemma (appendix 77), there is a positive power $z = y^n$ and a direct decomposition $Q = U \oplus V$ such that $z|_U \in GL(U)$, $Ker(z) = V$. So by finiteness of U, there is a positive power $(z|_U)^m = Id_U$. Therefore, $z^m = y^{nm}$ is an idempotent. As to the affine case, if $y = e^t.y_0 \in Q@Q$, we know by the preceding that a positive power of $y_0$ is idempotent. So WLOG, we may suppose that $y_0$ is idempotent. Set $e^{t_k}.y_0 = (e^t.y_0)^k = y^k$; the positive powers of y must have recurrent values, so take $y^u = y^{u+k}$, for positive k. Then $e^{t_u}.y_0 = e^{t_u}.y_0.e^{t_k}.y_0 = e^{t_u + y_0(t_k)}.y_0$, i.e., $t_k \in Ker(y_0)$, whence $y^k$ is an idempotent by lemma 64, QED.
Lemma 67 For the subgroup¹⁰ $M^*$ of invertible elements of the monoid M, we have the intersection formula $M^* \cap \sqrt{Idempot(M)} = \sqrt{1_M}$.
Proof. Clearly, $\sqrt{1_M} \subset M^*$. On the other hand, if $x \in \sqrt{t} \cap M^*$, we have $x^k = t = t^2 = x^{2k}$, whence $1_M = x^k = t$.
In the special case of a total partition $M = \sqrt{Idempot(M)}$, we get
$M^* = \sqrt{1_M}$.
This is the case for an affine endomorphism monoid Q@Q of a finite module Q.
¹⁰ See section C.3.
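For the finite module $Q = \mathbb{Z}_{12}$, proposition 67 and lemma 67 can be checked exhaustively over the 144 affine endomorphisms. As before, the encoding of $e^q.y$ as a pair `(q, y)` and all function names are assumptions of this sketch.

```python
from math import gcd

N = 12
ONE = (0, 1)                 # identity map, written as (q, y) for x -> y*x + q
M = [(q, y) for q in range(N) for y in range(N)]


def compose(f, g):
    """f.g (first g, then f) for affine maps x -> y*x + q on Z_12."""
    (q1, y1), (q2, y2) = f, g
    return ((y1 * q2 + q1) % N, (y1 * y2) % N)


def idempotent_power(f):
    """The unique idempotent among the positive powers of f."""
    p = f
    while compose(p, p) != p:
        p = compose(p, f)
    return p


# Idempotent components: group the elements by their idempotent power.
components = {}
for f in M:
    components.setdefault(idempotent_power(f), set()).add(f)

invertible = {(q, y) for (q, y) in M if gcd(y, N) == 1}
```

The loop in `idempotent_power` terminates because the powers of an element of a finite semigroup eventually run through a cycle that contains an idempotent. The dictionary `components` then has 20 keys whose radicals cover all of M, and the component of the identity is exactly the set of invertible elements.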
C.3
Groups
Definition 126 A monoid $(M, \mu)$ with the neutral element e such that every element m has a left inverse n, i.e., $n.m = e$, is called a group. A left inverse of m is also a right inverse, $m.n = e$, and the inverse is uniquely determined by m, it is denoted by $m^{-1}$. A subgroup of a group M is a submonoid N which is a group. A commutative group is also called abelian. If $n \in \mathbb{N}$, then we write $m^n$ for the n-fold product $m.m. \ldots m$; if n is a negative integer, then we write $m^n = (m^{-1})^{|n|}$, we also set $m^0 = e$. Evidently $m^n.m^l = m^{n+l}$. If a group is abelian, the product is usually written additively, i.e., $m + n$ instead of $m.n$, further $m - n$ instead of $m.n^{-1}$, and the neutral element is noted by 0 instead of e.
Exercise 85 Observe that the neutral element e of a group is the only idempotent element x.x = x. Therefore, a subgroup has necessarily the same neutral element as the supergroup.
C.3.1
Homomorphisms of Groups
If $(G, \cdot)$, $(H, \cdot)$ are two groups, a set map $f : G \to H$ is called a group homomorphism iff $f(g.g') = f(g).f(g')$ (products in the respective groups). We have $f(e_G) = e_H$ and $f(x^{-1}) = f(x)^{-1}$. Clearly, the set-theoretic composition of two group homomorphisms is also a group homomorphism. The set of group homomorphisms $f : G \to H$ is denoted by Hom(G, H). If f is a bijection, its inverse is also a group homomorphism; f is then called a group isomorphism. G, H are said to be isomorphic iff there is an isomorphism between them. Clearly, being isomorphic is an equivalence relation, whence the term "isomorphism classes". The explicit description of isomorphism classes is the main task of group theory. For finite groups, this is essentially solved¹¹. For finite commutative groups, the classification is described in appendix C.3.4. The set Aut(G) of isomorphisms of a group G onto itself (the automorphisms of G) is a group under the composition of group homomorphisms, the identity $Id_G$ being the neutral element of Aut(G). The group $\mathfrak{S}_M$ of permutations of a set M is called the symmetric group of M.
Example 70 A homomorphism $\mu : G \to \mathfrak{S}_M$ is called a left action of G on M. Let $G^{opp}$ denote the opposite group to G where products are interchanged: $g ._{opp} h = h.g$. Then a right action is a homomorphism $\mu : G^{opp} \to \mathfrak{S}_M$. A left action is equivalent to a map $\mu : G \times M \to M$ (same notation for $\mu$) with $\mu(e, m) = m$, $\mu(g, \mu(h, m)) = \mu(g.h, m)$, for all $g, h \in G$, $m \in M$. If the action is clear, we write g.m instead of $\mu(g, m)$. We usually mean left actions when we speak of actions. A group action $\mu$ is called effective iff it is an injective homomorphism. For an element $m \in M$, the group $G_m = \{g \in G, g.m = m\}$ is called the stabilizer or isotropy or fixpoint group of m. If all stabilizers are trivial, the action is called (fixpoint) free or faithful. For a group action $\mu$, an orbit is a set of form $G.m = \{g.m \mid g \in G\}$. The orbit space $G \backslash M$ defines a partition of M, with the canonical map $\pi : M \to G \backslash M : m \mapsto G.m$. The action is transitive iff the orbit space is a singleton. Clearly
$card(G.m).card(G_m) = card(G)$.
¹¹ The classification of all finite simple groups is one of the main results in group theory of the 20th century, see [188].
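The orbit formula $card(G.m).card(G_m) = card(G)$ is easy to observe on a musical example. The sketch below takes, as an assumption of the illustration, the group of the 24 maps $x \mapsto \pm x + t$ on $\mathbb{Z}_{12}$ acting on sets of pitch classes; the function names are hypothetical.

```python
N = 12
# Hypothetical example group: the 24 maps x -> s*x + t on Z_12 with s = 1 or -1.
G = [(s, t) for s in (1, -1) for t in range(N)]


def act(g, chord):
    """Left action of g = (s, t) on a set of pitch classes."""
    s, t = g
    return frozenset((s * x + t) % N for x in chord)


def orbit(m):
    return {act(g, m) for g in G}


def stabilizer(m):
    return [g for g in G if act(g, m) == m]


major = frozenset({0, 4, 7})       # C major triad
augmented = frozenset({0, 4, 8})   # augmented triad
```

The major triad has a trivial stabilizer and an orbit of 24 chords (the major and minor triads), whereas the augmented triad has a stabilizer of order 6 and an orbit of 4 chords. In both cases the product is card(G) = 24.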
If $\mu_1 : G \times M \to M$, $\mu_2 : G \times N \to N$ are two group actions of a group G, and if $h : M \to N$ is a set map, we say that h is equivariant iff it commutes with these group actions, i.e., if for all $m \in M$, $g \in G$, we have $h(\mu_1(g, m)) = \mu_2(g, h(m))$.
Example 71 For a set Alphabet, there is a group FG(Alphabet) and an injection $i : Alphabet \to FG(Alphabet)$ such that for any set map $f : Alphabet \to G$ into a group G, there is exactly one group homomorphism $FG(f) : FG(Alphabet) \to G$ such that $FG(f) \circ i = f$. The group FG(Alphabet) is called the free group over Alphabet and the above property is the universal property that characterizes the free group up to isomorphism. The free group consists of all "word" expressions $b_1^{n_1} \ldots b_k^{n_k}$, where $b_i \in Alphabet$, $n_i \in \mathbb{Z}$ which are reduced, i.e., $b_i \neq b_{i+1}$, $n_i \neq 0$, and the empty word $\emptyset$. The product is defined by juxtaposition and cancelling of powers of adjacent letters.
Lemma 68 For a group homomorphism $f : G \to H$, Im(f) is a subgroup of H. For a subgroup $I \subseteq H$, the inverse image $f^{-1}(I)$ is a subgroup of G. The inverse image of the trivial group (the singleton consisting of the neutral element $e_H$) in H is called the kernel of f and is denoted by Ker(f).
Example 72 For every element $g \in G$ of a group, we have a special group automorphism
$Int_g : G \to G : h \mapsto g.h.g^{-1}$
the conjugation with g. This yields a group homomorphism
$Int : G \to Aut(G)$.
Proposition 68 For a group G, a subgroup H and an element $g \in G$, we write $gH = \{g.h \mid h \in H\}$, $Hg = \{h.g \mid h \in H\}$ for the left and right cosets of H. The set of left, right cosets of G is denoted by $G/H$, $H \backslash G$, respectively. Denoting $(G : H) = card(G/H)$, we have the Lagrange equation¹²: $card(G) = card(H).(G : H)$ and in particular, if G is finite, any subgroup cardinality divides the order card(G) of G.
Proposition 69 If H is a subgroup of G, the following statements are equivalent:
(i) For all $g \in G$, $gH = Hg$.
(ii) There is a group homomorphism $f : G \to K$ with $Ker(f) = H$.
If H has the properties of proposition 69, it is called a normal subgroup of G, in symbols: $H \lhd G$.
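Cosets, the Lagrange equation and condition (i) of proposition 69 can be inspected in the smallest non-abelian case. The sketch assumes that permutations of {0, 1, 2} are written as tuples of images; the subgroup names `H` and `A3` are our own.

```python
from itertools import permutations

G = list(permutations(range(3)))          # symmetric group on {0, 1, 2}


def mul(g, h):
    """Product g.h as composition: first h, then g."""
    return tuple(g[h[i]] for i in range(3))


def left_cosets(H):
    return {frozenset(mul(g, h) for h in H) for g in G}


def right_cosets(H):
    return {frozenset(mul(h, g) for h in H) for g in G}


H = [(0, 1, 2), (1, 0, 2)]                # generated by a transposition
A3 = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]    # the rotations
```

For `H` there are three left cosets of two elements each, in accordance with card(G) = card(H).(G : H), but left and right cosets differ, so `H` is not normal. For `A3` both coset decompositions coincide.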
Proposition 70 Let $H \lhd G$, and let $\pi : G \to G/H$ be the canonical homomorphism. For a group K, the map
$Hom(G/H, K) \xrightarrow{\sim} \{f \in Hom(G, K) \mid H \subseteq Ker(f)\} : t \mapsto t \circ \pi$
is a canonical bijection. If $H = Ker(f)$, then the corresponding morphism $G/H \to K$ is an isomorphism onto Im(f).
Exercise 86 If $f : G \xrightarrow{\sim} H$ is an isomorphism of groups, then the map
$Int_f : Aut(G) \to Aut(H) : t \mapsto f.t.f^{-1}$
is an isomorphism of groups.
Here are the basic isomorphism theorems:
Theorem 39 (First isomorphism theorem) Let G, H, N be groups with $N \lhd G$, $H \subseteq G$. Then:
(i) The product set H.N is a subgroup.
(ii) We have $N \lhd H.N$ and $N \cap H \lhd H$.
(iii) We have an isomorphism $H/(N \cap H) \xrightarrow{\sim} H.N/N : h(N \cap H) \mapsto hN$.
Theorem 40 (Second isomorphism theorem) Let $N \lhd G$ be a normal subgroup.
(i) The normal subgroups of $G/N$ are the quotients $M/N$ of the normal subgroups $M \lhd G$ containing N, and
(ii) for such M, we have an isomorphism $G/M \xrightarrow{\sim} (G/N)/(M/N) : gM \mapsto (gN).M/N$.
C.3.2
Direct, Semi-direct, and Wreath Products
Given a family $(G_i)_{i \in I}$ of groups, the set-theoretic product $\prod_I G_i$ becomes a (direct) product group via the coordinate-wise product, i.e., $(g_i).(h_i) = (g_i.h_i)$. Given two groups H, N and a group homomorphism $\phi : H \to Aut(N)$, the semidirect product $N \rtimes_\phi H$ is the group defined on the set $N \times H$ by the product
$(n, g).(m, h) = (n.{}^g m, g.h)$
where ${}^g m = \phi(g)(m)$. We have two group injections $i : N \to N \rtimes_\phi H$ and $j : H \to N \rtimes_\phi H$, and a projection $\pi : N \rtimes_\phi H \to H$, which define a sequence
$e \to N \xrightarrow{i} N \rtimes_\phi H \xrightarrow{\pi} H \to e$
which is split, i.e., $\pi \circ j = Id_H$. Conversely, any such split sequence
$e \to N \to G \underset{j}{\overset{\pi}{\rightleftarrows}} H \to e$
identifies the middle group G to $N \rtimes H$.
Example 73 For two groups G, H, we have the direct product group $G^H = \prod_H G$ and an action $\phi : H \to Aut(G^H)$ via $\phi(h)(f)_k = f_{k.h}$, $k \in H$. Then the wreath product is $G \wr H = G^H \rtimes H$.
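The semidirect product law can be compared with the composition of affine maps. In the following sketch we assume $N = \mathbb{Z}_{12}$ (written additively), $H = \mathbb{Z}_{12}^\times$ and $\phi(g)(m) = g.m$, so that the pair `(n, g)` stands for the affine automorphism $x \mapsto g.x + n$; the names `product`, `as_map`, `pi` and `j` are chosen for the illustration.

```python
from math import gcd

N = 12
UNITS = [u for u in range(N) if gcd(u, N) == 1]   # H = invertible elements of Z_12
ELEMENTS = [(n, g) for n in range(N) for g in UNITS]


def product(a, b):
    """(n, g).(m, h) = (n + g*m, g*h); the normal factor is written additively."""
    (n, g), (m, h) = a, b
    return ((n + g * m) % N, (g * h) % N)


def as_map(a):
    """The affine map x -> g*x + n, tabulated on Z_12."""
    n, g = a
    return tuple((g * x + n) % N for x in range(N))


def compose_maps(F, G):
    """First G, then F."""
    return tuple(F[G[x]] for x in range(N))


def pi(a):
    """Projection onto the factor H."""
    return a[1]


def j(h):
    """Section of pi, embedding H."""
    return (0, h)
```

The map `as_map` turns the semidirect product law into composition of maps, and `pi(j(h)) == h` expresses that the sequence is split.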
C.3.3
Sylow Theorems on p-groups
Proposition 71 (Sylow's proposition) If G is a finite group of order n, p is prime, and $p^k \mid n$, then there is a subgroup $H \subseteq G$ with order $p^k$.
A finite group G has the order $card(G) = p^k$ for a prime p iff all its elements have a power of p as their orders. In this case, G is called a p-group. A maximal p-subgroup in a group G is called a p-Sylow group in G.
Theorem 41 (Sylow's Theorem) Let G be a finite group, p a prime. Then:
(i) The p-Sylow groups in G are the p-subgroups S with $p \nmid (G : S)$.
(ii) Any two p-Sylow groups are conjugate to each other.
(iii) Let $\sigma_p$ be the cardinality of the set of p-Sylow groups in G. Then we have $\sigma_p \mid card(G)$ and $\sigma_p \equiv 1 \pmod{p}$.
See [21] for a proof.
C.3.4
Classification of Groups
C.3.4.1
Classification of Cyclic Groups
If $S \subset G$ is a subset of a group G, the smallest subgroup in G containing S is denoted by $\langle S \rangle$ and is called the group generated by S. A finitely generated group is one that admits a finite set of generators. A cyclic group is one that is generated by one element, $G = \langle s \rangle$. For such a group, the group homomorphism $s^? : \mathbb{Z} \to G : n \mapsto s^n$ is a surjection with kernel $Ker(s^?) = O(s).\mathbb{Z}$, where $s^{O(s)} = e$ is the smallest positive power of s which yields e, the order of s, or $O(s) = 0$ if no positive power of s yields e, in which case the order of s is said to be infinite. This means that $\langle s \rangle \cong \mathbb{Z}/O(s).\mathbb{Z}$, and we have classified all cyclic groups: They are isomorphic to the quotient groups $\mathbb{Z}_n = \mathbb{Z}/n.\mathbb{Z}$ with $card(\mathbb{Z}_n) = n$ for positive n, and $\infty$ for $\mathbb{Z}_0 = \mathbb{Z}$. Among these groups, the groups $\mathbb{Z}_p$ of prime order p are simple. For abelian groups, simplicity means being cyclic of prime order. The number of generators of a finite cyclic group $\mathbb{Z}_n$ is the number of numbers $0 < t < n$ prime to n, i.e., the Euler function $\phi(n) = card(\mathbb{Z}_n^\times)$, where $\mathbb{Z}_n^\times$ denotes these numbers modulo n; this is in fact the group of invertible elements of the ring $\mathbb{Z}_n$, see appendix D.1.
Proposition 72 If $h = r.s$ is a factorization by natural numbers $1 < r, s$, and $(r, s) = 1$, then $\phi(h) = \phi(r).\phi(s)$. If $h = q^k$ is a positive power of a prime number q, then $\phi(q^k) = q^{k-1}(q - 1)$. Therefore, if $h = \prod_i q_i^{t_i}$ is the prime decomposition of h, then
$\phi(h) = h.\prod_i (1 - 1/q_i)$.
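Both descriptions of the Euler function, as the number of generators of $\mathbb{Z}_n$ and via the product formula of proposition 72, can be compared numerically. The helper names below are assumptions of this sketch, and the product formula is evaluated in integer arithmetic to avoid rounding.

```python
def generators(n):
    """Elements t of Z_n whose multiples exhaust Z_n."""
    return [t for t in range(n) if len({(k * t) % n for k in range(n)}) == n]


def prime_divisors(h):
    """The distinct primes dividing h, in increasing order."""
    primes, q = [], 2
    while q * q <= h:
        if h % q == 0:
            primes.append(q)
            while h % q == 0:
                h //= q
        q += 1
    if h > 1:
        primes.append(h)
    return primes


def phi(h):
    """Euler function via phi(h) = h * prod(1 - 1/q_i), in integer arithmetic."""
    result = h
    for q in prime_divisors(h):
        result = result // q * (q - 1)
    return result
```

For instance, $\mathbb{Z}_{12}$ has the four generators 1, 5, 7, 11, in accordance with $\phi(12) = 12.(1 - 1/2).(1 - 1/3) = 4$.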
C.3.4.2
Classification of Finite Abelian Groups
Let p be a prime integer (p = 2, 3, 5, 7, 11, ...). Take a weakly increasing sequence $u_\bullet = u_1 \leq u_2 \leq \ldots u_w$ of positive integers. We set
$S(p, u_\bullet) = \mathbb{Z}_{p^{u_1}} \times \mathbb{Z}_{p^{u_2}} \times \ldots \mathbb{Z}_{p^{u_w}}$.
If we are given an increasing sequence $p_\bullet = (p_1 < p_2 < \ldots p_t)$ of primes and for each such $p_i$ a sequence $u_\bullet^i$, i.e., a sequence of sequences $u_\bullet^\bullet$, then we set
$T(p_\bullet, u_\bullet^\bullet) = S(p_1, u_\bullet^1) \times \ldots S(p_t, u_\bullet^t)$. (C.4)
Theorem 42 For every finitely generated abelian group G, there is a natural number f and a system of primes $p_\bullet$ and positive, weakly increasing sequences $u_\bullet^\bullet$ such that
$G \cong \mathbb{Z}^f \times T(p_\bullet, u_\bullet^\bullet)$.
All the numbers f (the torsion-free rank of G), and $p_\bullet$, $u_\bullet^\bullet$ are uniquely determined. The length of the sequence $u_\bullet^i$ is called the $p_i$-rank of G. The image T(G) of the factor $T(p_\bullet, u_\bullet^\bullet)$ in G is the torsion group of G. The subgroups in G corresponding to $S(p_i, u_\bullet^i)$ are called the $p_i$-Sylow groups. They are the subgroups of all elements with an order equal to a power of $p_i$. For a proof, see [540, Vol.II].
Corollary 23 If G is a finite abelian group, and if $m \mid card(G)$, then there is a subgroup H of G with card(H) = m.
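For a finite abelian group (f = 0), theorem 42 says that the isomorphism types of order n correspond to the choices of one weakly increasing sequence $u_\bullet^i$ for each prime $p_i$ dividing n, the entries of $u_\bullet^i$ summing to the exponent of $p_i$ in n. A sketch of this enumeration follows; the function names and the output format are assumptions of the illustration.

```python
from itertools import product


def prime_factorization(n):
    """Return {prime: exponent} for n >= 1."""
    factors, q = {}, 2
    while q * q <= n:
        while n % q == 0:
            factors[q] = factors.get(q, 0) + 1
            n //= q
        q += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def weakly_increasing(k, smallest=1):
    """All weakly increasing sequences of positive integers with sum k."""
    if k == 0:
        yield ()
        return
    for first in range(smallest, k + 1):
        for rest in weakly_increasing(k - first, first):
            yield (first,) + rest


def abelian_types(n):
    """Isomorphism types of abelian groups of order n, as tuples of
    (p_i, u^i) pairs with the primes in increasing order."""
    factors = sorted(prime_factorization(n).items())
    choices = [[(p, u) for u in weakly_increasing(e)] for p, e in factors]
    return list(product(*choices))
```

For n = 12, the two types returned are $\mathbb{Z}_2 \times \mathbb{Z}_2 \times \mathbb{Z}_3$ and $\mathbb{Z}_4 \times \mathbb{Z}_3$, the latter being the cyclic group $\mathbb{Z}_{12}$.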
C.3.5
General Affine Groups
For a commutative ring R, we denote by $GL_n(R)$ the group $M_{n,n}(R)^\times$ of invertible $n \times n$-matrices over R, see appendix D.1; we also denote $GL(n, p) = GL_n(\mathbb{Z}_p)$ and $\overrightarrow{GL}(n, p) = e^{\mathbb{Z}_p^n}.GL(n, p)$, the affine automorphism group of $\mathbb{Z}_p^n$.
Theorem 43 (Minkowski) Suppose that G is a finite subgroup of GLn(Z), and let q E N. Consider the canonical projection homomorphism
--t
GLn(Zq).
If 3 ::; q, then K er(
Ei
= Diag(EI, ... En) diagonal matrix,
= ±1 for all N E Ker(
For a proof, see [264, Satz 5.1]. For a prime number p and a power n, we have the following cardinalities:
$card(GL(n, p)) = (p^n - 1)(p^{n-1} - 1) \ldots (p - 1)p^{n(n-1)/2}$
and
$card(\overrightarrow{GL}(n, p)) = p^n card(GL(n, p))$.
C.3.6
Permutation Groups
Proposition 73 (Cayley) Every group G is isomorphic to a subgroup of permutations.
Proof. In fact, the left regular representation map $l^? : G \to \mathfrak{S}_G$ with $l^?(g) = l^g : h \mapsto g.h$ is such an embedding, QED.
A permutation group $G \subseteq \mathfrak{S}_P$ by definition acts on the underlying set P. Suppose that P is finite. For $g \in G$, the orbits of $\langle g \rangle$ are called the cycles of g. Then these finite sets are arranged as sequences $C = (x, g.x, g^2.x, \ldots g^{k(x)}.x)$ of pairwise different elements, i.e., $g^{k(x)+1}$ is the generator of the stabilizer of x. The permutation g can be represented as sequence $(C_1, C_2, \ldots C_r)$ of orbits in cycle representation.
Definition 127 If $G \subseteq \mathfrak{S}_P$ is a permutation group on a finite set P of cardinality p, the cycle index of an element $g \in G$ is the polynomial
$X^{cyc(g)} = X_1^{c_1} \cdot X_2^{c_2} \cdots X_p^{c_p}$
with the cycle type of g
$cyc(g) = (c_1, c_2, \ldots c_p)$, and $c_i = card(\{C \mid C$ cycle of $g$, $card(C) = i\})$.
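Cycle types are readily computed. The sketch below assumes that a permutation of {0, ..., p−1} is stored as a tuple of images and renders the monomial $X^{cyc(g)}$ as a string; the two example permutations of $\mathbb{Z}_{12}$ and all names are our own choices.

```python
def cycles(g):
    """Cycles of a permutation g of {0, ..., p-1}, given as a tuple of images."""
    seen, result = set(), []
    for x in range(len(g)):
        if x not in seen:
            cycle = [x]
            seen.add(x)
            y = g[x]
            while y != x:
                cycle.append(y)
                seen.add(y)
                y = g[y]
            result.append(tuple(cycle))
    return result


def cycle_type(g):
    """cyc(g) = (c_1, ..., c_p), c_i = number of cycles of length i."""
    c = [0] * len(g)
    for cycle in cycles(g):
        c[len(cycle) - 1] += 1
    return tuple(c)


def cycle_index_monomial(g):
    """The monomial X^cyc(g) as a string such as 'X_1^4 X_2^4'."""
    return " ".join(f"X_{i}^{c}"
                    for i, c in enumerate(cycle_type(g), start=1) if c)


times5 = tuple((5 * x) % 12 for x in range(12))   # x -> 5x on Z_12
shift4 = tuple((x + 4) % 12 for x in range(12))   # x -> x + 4 on Z_12
```

Multiplication by 5 has the four fixed points 0, 3, 6, 9 and four cycles of length 2, whereas the transposition by 4 splits $\mathbb{Z}_{12}$ into four cycles of length 3.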
Here is Fripertinger's cycle index formula [170] $Z(\overrightarrow{GL}(\mathbb{Z}_{12}^2))(1 + x, 1 + x^2, \ldots 1 + x^{144})$ for orbits of zero-addressed local compositions in $\mathbb{Z}_{12}^2$:
x^{144} + x^{143} + 5x^{142} + 26x^{141} + 216x^{140} + 2024x^{139} + 27806x^{138} + 417209x^{137} +
6345735x^{136} + 90590713x^{135} + 1190322956x^{134} + 14303835837x^{133} +
157430569051x^{132} + 1592645620686x^{131} + 14873235105552x^{130} +
128762751824308x^{129} + 1037532923086353x^{128} + 7809413514931644x^{127} +
55089365597956206x^{126} + 365290003947963446x^{125} + 2282919558918081919x^{124} +
13479601808118798229x^{123} + 75361590622423713249x^{122} +
399738890367674230448x^{121} + 2015334387723540077262x^{120} +
9673558570858327142094x^{119} + 44275002111552677715575x^{118} +
193497799414541699555587x^{117} + 808543433959017353438195x^{116} +
3234171338137153259094292x^{115} + 12397650890304440505241198x^{114} +
45591347244850943472027532x^{113} + 160994412344908368725437163x^{112} +
546405205018625434948486100x^{111} + 1783852127215514388216575524x^{110} +
5606392061138587678507139578x^{109} + 16974908597922176404758662419x^{108} +
49548380452249950392015617673x^{107} + 139517805378058810895892716876x^{106} +
379202235047824659955968634895x^{105} + 995405857334028240446249995969x^{104} +
2524931913311378421460541875013x^{103} + 6192094899403308142319324646830x^{102} +
14688225057065816000841247153422x^{101} + 33716152882551682431054950635828x^{100} +
74924784036765597482162224697378x^{99} + 161251165409134463248992354275261x^{98} +
336225833888858733322982932904265x^{97} + 679456372086288422448712466252503x^{96} +
1331179830182151403666404596530852x^{95} + 2529241676111626447928668220456264x^{94} +
4661739558127027290220867616981880x^{93} + 8337341899567786249391103289453916x^{92} +
14472367067576451752984797361008304x^{91} + 24388618572337747341932969998362288x^{90} +
39908648567034355259311114115744392x^{89} + 63426245036529210051949169850308102x^{88} +
97921220397909924969018620386852352x^{87} + 146881830585458073270850321720445928x^{86} +
214098939483879341610433150629060274x^{85} + 303306830919747863651620555026700930x^{84} +
417668422888061171460770548484103836x^{83} + 559136759653084522330064385877590780x^{82} +
727765306194069123565702210626823392x^{81} + 921077965629957077012552741715036692x^{80} +
1133634419214796834928853170296724314x^{79} + 1356926047220511677349073201120481570x^{78} +
1579704950475555411914967237903930342x^{77} + 1788783546844376088722000995922467990x^{76} +
1970254341437213013502048964983877090x^{75} + 2110986794386177596749436553816924660x^{74} +
2200183419494435885449671402432366956x^{73} + 2230741522540743033415296821609381912x^{72} +
2200183419494435885449671402432366956x^{71} + 2110986794386177596749436553816924660x^{70} +
1970254341437213013502048964983877090x^{69} + 1788783546844376088722000995922467990x^{68} +
1579704950475555411914967237903930342x^{67} + 1356926047220511677349073201120481570x^{66} +
1133634419214796834928853170296724314x^{65} + 921077965629957077012552741715036692x^{64} +
727765306194069123565702210626823392x^{63} + 559136759653084522330064385877590780x^{62} +
417668422888061171460770548484103836x^{61} + 303306830919747863651620555026700930x^{60} +
214098939483879341610433150629060274x^{59} + 146881830585458073270850321720445928x^{58} +
97921220397909924969018620386852352x^{57} + 63426245036529210051949169850308102x^{56} +
39908648567034355259311114115744392x^{55} + 24388618572337747341932969998362288x^{54} +
14472367067576451752984797361008304x^{53} + 8337341899567786249391103289453916x^{52} +
4661739558127027290220867616981880x^{51} + 2529241676111626447928668220456264x^{50} +
1331179830182151403666404596530852x^{49} + 679456372086288422448712466252503x^{48} +
336225833888858733322982932904265x^{47} + 161251165409134463248992354275261x^{46} +
74924784036765597482162224697378x^{45} + 33716152882551682431054950635828x^{44} +
14688225057065816000841247153422x^{43} + 6192094899403308142319324646830x^{42} +
2524931913311378421460541875013x^{41} + 995405857334028240446249995969x^{40} +
379202235047824659955968634895x^{39} + 139517805378058810895892716876x^{38} +
49548380452249950392015617673x^{37} + 16974908597922176404758662419x^{36} +
5606392061138587678507139578x^{35} + 1783852127215514388216575524x^{34} +
546405205018625434948486100x^{33} + 160994412344908368725437163x^{32} +
45591347244850943472027532x^{31} + 12397650890304440505241198x^{30} +
3234171338137153259094292x^{29} + 808543433959017353438195x^{28} +
193497799414541699555587x^{27} + 44275002111552677715575x^{26} +
9673558570858327142094x^{25} + 2015334387723540077262x^{24} +
399738890367674230448x^{23} + 75361590622423713249x^{22} +
13479601808118798229x^{21} + 2282919558918081919x^{20} + 365290003947963446x^{19} +
55089365597956206x^{18} + 7809413514931644x^{17} + 1037532923086353x^{16} +
128762751824308x^{15} + 14873235105552x^{14} + 1592645620686x^{13} +
157430569051x^{12} + 14303835837x^{11} + 1190322956x^{10} + 90590713x^{9} +
6345735x^{8} + 417209x^{7} + 27806x^{6} + 2024x^{5} + 216x^{4} + 26x^{3} + 5x^{2} + x + 1.
Appendix D
Rings and Algebras
D.1
Basic Definitions and Constructions
Definition 128 A (unitary) ring is a triple $(R, \alpha, \mu)$ where $(R, \alpha)$ is an abelian group whose operation $\alpha$ is written additively ($\alpha(r, s) = r + s$) with neutral element $0_R$, and $(R, \mu)$ is a monoid, written multiplicatively ($\mu(r, s) = r \cdot s$) with multiplicative neutral element $1_R$ such that these operations are coupled by distributivity, i.e., $(r + s) \cdot t = r \cdot t + s \cdot t$, $t \cdot (r + s) = t \cdot r + t \cdot s$ for all $r, s, t \in R$. A ring is commutative iff its multiplicative monoid is commutative. A set map $f : R \to S$ of rings R, S is a ring homomorphism iff it is a homomorphism of the underlying additive groups and a homomorphism of the underlying multiplicative monoids. The set of ring homomorphisms from R to S is again denoted by Hom(R, S) if no confusion is likely. An element x of a non-zero ring R is called invertible iff there is a multiplicative (left) inverse y, i.e., $y \cdot x = 1_R$. The subset $R^\times$ of invertible elements is a multiplicative group. A skew field is a ring such that $R^\times = R - \{0_R\}$. A (commutative) field is a commutative ring which is a skew field. The subring Z(R) of all elements in a ring R which commute with all of R is called the center of R. For a commutative R, a ring homomorphism $\varphi : R \to S$ whose image is in Z(S) is called an R-algebra. If $\varphi$ is clear, one says that "S is an R-algebra", and $\varphi$ is called the "structural homomorphism". If $\varphi : R \to S$, $\psi : R \to T$ are two R-algebras, a ring homomorphism $f : S \to T$ is an R-algebra homomorphism iff $\psi = f \circ \varphi$, i.e., iff the 'R-elements are conserved under f'. One also often writes r instead of $\varphi(r)$ if the algebra structure is clear.
The set-theoretic composition $g \circ f$ of two ring homomorphisms $f : R \to S$, $g : S \to T$ is again a ring homomorphism. A bijective ring homomorphism automatically has an inverse ring homomorphism, i.e., this is a ring isomorphism, and a ring endomorphism is a homomorphism with domain equal to its codomain, whereas an automorphism is an endomorphism which is an isomorphism. The corresponding concepts for R-algebras are evident: The homomorphisms have to be algebra homomorphisms, so, for example, an R-algebra automorphism is an automorphism which conserves the structural homomorphism.
APPENDIX D. RINGS AND ALGEBRAS
Example 74 Classical examples of rings are the rings $\mathbb{Z}, \mathbb{Q}, \mathbb{R}, \mathbb{C}, \mathbb{H}$ of integers, rational numbers, real numbers, complex numbers and Hamilton quaternions. Except the integers, these rings are also skew fields, and $\mathbb{H}$ is not commutative. The conjugation $z = a + i.b \mapsto \bar{z} = a - i.b$ is an automorphism of the $\mathbb{R}$-algebra $\mathbb{C}$. To every ring R, one has the opposite ring $R^{opp}$ which is the same additive group, but multiplication is defined by reversing the given multiplication, i.e., $r \cdot_{opp} s = s \cdot r$. An antihomomorphism of rings is a homomorphism into the opposite codomain ring. Every ring R is a $\mathbb{Z}$-algebra in a unique way by $\varphi(z) = z.1_R$; the latter means that $z.1_R = 1_R + 1_R + \ldots 1_R$ if $z > 0$, it is $0_R$ if $z = 0$, and it is $-(-z).1_R$ if $z < 0$. By the natural inclusions, $\mathbb{R}$ is a $\mathbb{Q}$-algebra, $\mathbb{C}$ is an $\mathbb{R}$-algebra, and $\mathbb{H}$ is an $\mathbb{R}$-algebra, but not a $\mathbb{C}$-algebra. The $2 \times 2$ matrices over a commutative ring R with coefficient-wise addition and usual matrix multiplication form a non-commutative ring. They are an R-algebra by the diagonal map $r \mapsto \begin{pmatrix} r & 0 \\ 0 & r \end{pmatrix}$.
Proposition 74 If I is a subgroup of a ring R, the following statements are equivalent:
(i) For all $r \in R$, $r \cdot I \subseteq I$, i.e., I is a left ideal in R, and $I \cdot r \subseteq I$, i.e., I is a right ideal in R.
(ii) There is a ring homomorphism $f : R \to S$ with $Ker(f) = I$.
If I has the equivalent properties of proposition 74, we call it an ideal or a two-sided ideal in R. So ideals correspond to normal subgroups. The quotient ring $R/I$ (and the associated projection $\pi : R \to R/I$) is just the quotient group (with respect to the additive structure), together with the well-defined multiplication $(r + I)(s + I) = rs + I$, and we have $Ker(\pi) = I$. If G is any subset of a ring R, the smallest ideal containing G is denoted by (G) and is called the ideal generated by G.
Proposition 75 Let I be an ideal in R, $\pi : R \to R/I$ the canonical homomorphism, and S any ring. Then
$Hom(R/I, S) \xrightarrow{\sim} \{f \in Hom(R, S) \mid I \subseteq Ker(f)\} : t \mapsto t \circ \pi$
is a canonical bijection. If $I = Ker(f)$, then the corresponding morphism $R/I \to S$ is an isomorphism onto Im(f).
Example 75 By the Euclidean algorithm (D.3), every subgroup of $\mathbb{Z}$ is of the form $(n) = n.\mathbb{Z}$ for a uniquely determined non-negative n. Such a subgroup is also an ideal. The quotient ring $\mathbb{Z}_n = \mathbb{Z}/n.\mathbb{Z}$ is the ring of integers modulo n. It is the so-called "prime field of characteristic p" iff the Euler function $\phi(n) = n - 1$, i.e., $n = p$ is a prime (see also proposition 72 in appendix C.3.4.1). For any skew field F, we have the unique $\mathbb{Z}$-algebra structure $p : \mathbb{Z} \to F$ whose kernel must be either (0) or a prime ideal $p.\mathbb{Z}$. In the latter case, the image Im(p) is the smallest subfield in F; in the former case, the smallest subfield of F is evidently isomorphic to $\mathbb{Q}$. The smallest subfield of a skew field is called its prime field, whereas the generator (zero or a positive prime) of Ker(p) is called the characteristic char(F) of the skew field.
D.l. BASIC DEFINITIONS AND CONSTRUCTIONS
A skew field F has only two ideals: (0), F, and conversely, a ring which has only these ideals is called simple. The ring Mn,n(F) of n x n-matrices (see E.1.1) over a skew field F is simple (but not a skew field for 1 < n).
D.1.1
Universal Constructions
If M is a multiplicative monoid and R is a commutative ring, we have a ring $R\langle M \rangle$, the monoid algebra, by the following construction. The underlying set is the set $R\langle M \rangle$ of functions $f : M \to R$ with $f(m) = 0$ except for a finite number of arguments. This is an additive group under the addition $(f + g)(m) = f(m) + g(m)$. The product is defined by $(f \cdot g)(m) = \sum_{n.n' = m} f(n) \cdot g(n')$, which is reasonable since the summands which do not vanish are finite in number. The multiplicative neutral element $1_{R\langle M \rangle}$ is the function with $1_{R\langle M \rangle}(e_M) = 1_R$ and zero else. The elements $f : M \to R$ of the monoid algebra are also written as a formal sum $\sum_{f(m) \neq 0} f(m)m$. The algebra structure is given by $r \mapsto r e_M$. Here is the universal property of the monoid algebra:
Proposition 76 Every monoid homomorphism $\varphi : M \to (S, \mu)$ into the multiplicative monoid of an R-algebra S can be extended in a unique way to a homomorphism $S\langle M \rangle \to S$ of R-algebras. If $\sum f_m.m \in S\langle M \rangle$, its image is $\sum f_m \cdot \varphi(m)$.
A special case of a monoid algebra is for the free monoid FM(Alphabet) over Alphabet, see example 66 in appendix C.2.3. One usually writes $S\langle Alphabet \rangle$ instead of $S\langle FM(Alphabet) \rangle$. For example, if $Alphabet = \{X_1, X_2, \ldots X_n, \ldots\}$, we have the algebra of non-commutative polynomials in the indeterminates $X_1, X_2, \ldots X_n, \ldots$. If instead we take the free commutative monoid FCM(Alphabet), we get the monoid algebra $S[Alphabet] = S\langle FCM(Alphabet) \rangle$ of commutative polynomials in the indeterminates $\{X_1, X_2, \ldots X_n, \ldots\}$ of Alphabet.
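The product of the monoid algebra is a convolution over the monoid law, which the following sketch makes explicit. We assume integer coefficients, store an element as a dictionary from monoid elements to non-zero coefficients, and model words as strings; `multiply`, `concat` and `sorted_concat` are names chosen for the illustration.

```python
from collections import defaultdict


def multiply(f, g, op):
    """Product in the monoid algebra: (f.g)(m) = sum of f(n)*g(n') over n.n' = m.
    f and g are dicts {monoid element: nonzero coefficient}; op is the monoid law."""
    out = defaultdict(int)
    for n, a in f.items():
        for n2, b in g.items():
            out[op(n, n2)] += a * b
    return {m: c for m, c in out.items() if c != 0}


def concat(u, v):
    """Free monoid: words as strings, product by juxtaposition."""
    return u + v


def sorted_concat(u, v):
    """Free commutative monoid: words up to permutation, kept in sorted form."""
    return "".join(sorted(u + v))


X, Y = {"X": 1}, {"Y": 1}
one_plus_X = {"": 1, "X": 1}
one_minus_X = {"": 1, "X": -1}
```

With `concat`, the products of `X` and `Y` in the two orders are different words, so the polynomials do not commute; with `sorted_concat` they coincide. In both cases $(1 + X)(1 - X) = 1 - X^2$.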
Example 76 For the R-algebra of polynomials in one variable R[X], the ideal $(X^2)$ defines the R-algebra of dual numbers $R[\varepsilon] = R[X]/(X^2)$ with $\varepsilon = X + (X^2)$ the class of X. An element of $R[\varepsilon]$ is uniquely described as a linear polynomial $a + \varepsilon.b$ in $\varepsilon$, and the multiplication is $(a + \varepsilon.b)(c + \varepsilon.d) = ac + \varepsilon.(ad + bc)$. The group $R[\varepsilon]^\times$ consists of those elements $a + \varepsilon.b$ with $a \in R^\times$.
Definition 129 Let $(R_i)_{i \in I}$ be a family of rings, then the product $\prod_{i \in I} R_i$ of this family is the following ring: As a set, it is the product of the underlying sets, addition and multiplication are defined coordinate-wise¹, i.e.,
$(x_i) + (y_i) = (x_i + y_i)$ and $(x_i) \cdot (y_i) = (x_i \cdot y_i)$,
and the unity is the family $(1_i)$ of unities $1_i$ in the respective rings $R_i$. For each index $j \in I$, we have a canonical projection ring homomorphism
$p_j : \prod_{i \in I} R_i \to R_j$.
¹ Addition and multiplication are taken in the respective rings.
The product and its canonical projections share the universal property of a product in the category of rings (see appendix G.2.1 for the definition of a product in a category):
Lemma 69 Let $S$ be a ring, and let $(R_i)_{i \in I}$ be a family of rings. For each family $(f_i : S \to R_i)_{i \in I}$ of ring homomorphisms, there is a unique ring homomorphism $f : S \to \prod_{i \in I} R_i$ such that $f_i = p_i \cdot f$ for all $i \in I$.
Clearly, the map $f(s) = (f_i(s))$ solves the problem. More generally, consider a diagram$^2$ $D$ of rings with vertex set $I$. Then we have a subring $\lim(D)$ of the product $\prod_{i \in I} R_i$ consisting of all families $(x_i)$ such that $f(x_j) = x_k$ for any homomorphism $f : R_j \to R_k$ corresponding to a $D$-arrow from vertex $j$ to vertex $k$.
Proposition 77 With the above notation, the ring $\lim(D)$, together with the induced canonical projections $p_i : \lim(D) \to R_i$, is the limit of the diagram $D$ of rings, i.e., for any family $(f_i : S \to R_i)_{i \in I}$ of ring homomorphisms such that $f \cdot f_j = f_k$ whenever $f : R_j \to R_k$ is a homomorphism corresponding to a $D$-arrow from vertex $j$ to vertex $k$, there is a unique ring homomorphism $g : S \to \lim(D)$ such that $f_i = p_i \cdot g$, all $i \in I$.
In particular, if we have a fiber product$^3$ diagram $A \xrightarrow{f} C \xleftarrow{g} B$ of rings, there is a limit, the fiber product of this diagram, usually denoted by $A \times_C B$ and inserted in the commutative "pullback" square
$$\begin{array}{ccc} A \times_C B & \longrightarrow & B \\ \downarrow & & \downarrow{\scriptstyle g} \\ A & \xrightarrow{\ f\ } & C \end{array} \qquad (D.1)$$
of ring homomorphisms. For non-commutative rings, fiber sums do not exist in general. However, if the diagram's homomorphisms are algebras over a commutative ring, we have the following well-known result [63, ch.III, No.2]:
Theorem 44 Let $A \xleftarrow{f} C \xrightarrow{g} B$ be a fiber sum diagram of algebras over the commutative ring $C$. Then the tensor product $A \otimes_C B$ defines a fiber sum
$$\begin{array}{ccc} C & \xrightarrow{\ f\ } & A \\ {\scriptstyle g}\downarrow & & \downarrow{\scriptstyle i_1} \\ B & \xrightarrow{\ i_2\ } & A \otimes_C B \end{array} \qquad (D.2)$$
of $C$-algebras with $i_1(a) = a \otimes 1_B$ and $i_2(b) = 1_A \otimes b$.
2 See definition 151, appendix G.1.2, for a formal definition of a diagram in a category.
3 See appendix G.2.1 for the concept of a fiber product and the dual one of a fiber sum.
D.1.1.1
Quiver Algebras
Definition 130 Suppose that we are given a quiver $Q = (head, tail : A \rightrightarrows V)$ (see appendix C.2.2, definition 123). The path category of $Q$ is the set $P(Q)$ of paths of $Q$ (the morphisms of the category, the lazy paths being the objects of the category), together with the composition $p.q$ of two paths $p, q$, defined if $head(p) = tail(q)$. In that case, $p.q$ is just the evident composed path of length $l(p.q) = l(p) + l(q)$.
Definition 131 Suppose that we are given a finite quiver $Q = (head, tail : A \rightrightarrows V)$ and a commutative ring $R$. The quiver algebra $R\langle Q\rangle$ of $Q$ with coefficients in $R$ is the free left $R$-module whose basis is the path set $P(Q)$, together with the $R$-bilinear extension of the composition in the path category, i.e., the product of two paths is their composition, if possible, and zero else. This means:
$$p \cdot q = \begin{cases} p.q & \text{if } head(p) = tail(q), \\ 0 & \text{else.} \end{cases}$$
The unity $1_{R\langle Q\rangle} = \sum_{v \in V} v$ is the sum of all vertexes.
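The product in a quiver algebra can be sketched as follows, under illustrative assumptions: the quiver is a hypothetical one with two arrows a : 1 -> 2 and b : 2 -> 3, coefficients are integers, and a path is encoded as a triple (tail, head, arrows). All names below are hypothetical.

```python
from collections import defaultdict

# Hypothetical finite quiver: arrow name -> (tail, head)
ARROWS = {"a": (1, 2), "b": (2, 3)}
VERTICES = (1, 2, 3)

def lazy(v):
    """Lazy path of length 0 at the vertex v."""
    return (v, v, ())

def arrow(name):
    """Path of length 1 given by a single arrow."""
    tail, head = ARROWS[name]
    return (tail, head, (name,))

def compose(p, q):
    """Composed path p.q, defined iff head(p) == tail(q); None otherwise."""
    if p[1] != q[0]:
        return None
    return (p[0], q[1], p[2] + q[2])

def multiply(f, g):
    """Bilinear extension: composable pairs compose, all others give zero."""
    out = defaultdict(int)
    for p, a in f.items():
        for q, b in g.items():
            pq = compose(p, q)
            if pq is not None:
                out[pq] += a * b
    return {p: c for p, c in out.items() if c != 0}

UNITY = {lazy(v): 1 for v in VERTICES}  # sum of all vertexes
a = {arrow("a"): 1}
b = {arrow("b"): 1}
ab = multiply(a, b)   # the path of length 2 from vertex 1 to vertex 3
ba = multiply(b, a)   # zero, since head(b) = 3 differs from tail(a) = 1
```

The empty dict stands for the zero element, and multiplying by UNITY from either side leaves an element unchanged because exactly one lazy path is composable with each path on each side.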
Definition 132 Let $Q$ be a quiver, and $R\langle Q\rangle$ its quiver algebra over the coefficient ring $R$. A sub-path of a path $p$ in $R\langle Q\rangle$ is a triple $(u, w, v)$ of paths such that $p = u.w.v$. If the external factors $u, v$ are clear, the sub-path is identified with the middle member $w$, and we write $w \subset p$.
For example, if $l(p) = 0$ (a vertex), the only sub-path of $p$ is $p$; in fact, $p = p.p.p$ is the only factorization of this kind. If $p$ is a loop at the vertex $x$, the sub-paths of $p$ are $x.x.p$, $p.x.x$, and $x.p.x$. So the middle member $x$ appears two times since it is in different positions in the factorization. If $p = u_1.w_1.v_1 = u_2.w_2.v_2$ are two sub-paths of $p$ with $l(w_1) = l(w_2)$, we write $(u_1, w_1, v_1) < (u_2, w_2, v_2)$ iff $l(u_1) < l(u_2)$. Clearly this relation is transitive and antisymmetric. Among all sub-paths $(u, w, v)$ of $p$ with fixed middle length $l(w) = const.$, the relation $\leq$ corresponding to the relation $<$ is a linear ordering. In fact, if $(u_1, w_1, v_1), (u_2, w_2, v_2)$ are two sub-paths of $p$ such that $l(u_1) = l(u_2)$ and $l(w_1) = l(w_2)$, then they are identical, so the ordering relation is total. We shall use this total ordering to define linear endomorphisms, so-called sub-path operators of the quiver algebras, as follows.
We shall define an endomorphism $\varepsilon^i_\lambda$ for each natural index $i = 0, 1, 2, \ldots$ and for each system $\lambda^i$ of coefficients in the following sense. We set $\lambda^i = (\lambda^i_{l,j})_{i \leq l,\, 1 \leq j \leq l-i+1}$ with $\lambda^i_{l,j} \in R$. The endomorphisms run as follows. Let $w$ be a path of length $l(w) = l$. Then
$$\varepsilon^i_\lambda(w) = \begin{cases} 0 & \text{if } l < i, \\ \sum_{v_j \subset w,\ l(v_j) = i} \lambda^i_{l,j} v_j & \text{else,} \end{cases}$$
with the indexes of the sub-paths of $w$ referring to their total order defined above. In the musical applications, we shall encounter linear combinations $\Phi = \sum_i M_i \varepsilon^i_\lambda$ of such weighted sub-path operators.
D.2
Prime Factorization
An integer $1 < p$ is called prime iff $p = u \cdot v$ with positive factors implies $u = 1$ or $v = 1$.
Theorem 45 Every non-zero integer $x$ is a product
$$x = \pm 1 \cdot p_1^{n_1} \cdot p_2^{n_2} \cdots p_k^{n_k}$$
with an increasing sequence of primes $p_1 < p_2 < \ldots p_k$ and positive exponents $n_i$ which are all uniquely determined.
Corollary 24 Let $p_1 < p_2 < \ldots p_k$ be an increasing sequence of primes. For two sequences of rational numbers $q_1, q_2, \ldots q_k$ and $r_1, r_2, \ldots r_k$, the equation
$$p_1^{q_1} \cdot p_2^{q_2} \cdots p_k^{q_k} = p_1^{r_1} \cdot p_2^{r_2} \cdots p_k^{r_k}$$
implies the equality of the sequences of rational numbers. This implies that the logarithms $\ln(p)$ of primes are linearly independent (see E.2.1) in the rational vector space $\mathbb{R}_{[\mathbb{Q}]}$ (see E.1.1), a central fact for the construction of the Euler module of pitch systems (see section A.2.3).
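The factorization of theorem 45 can be computed by trial division. The following is a minimal sketch; the function name factorize and the returned pair (sign, list of prime-exponent pairs) are illustrative choices, and trial division is only practical for small integers.

```python
def factorize(x):
    """Return (sign, [(p1, n1), ..., (pk, nk)]) with
    x = sign * p1**n1 * ... * pk**nk, primes increasing, exponents positive."""
    if x == 0:
        raise ValueError("zero has no prime factorization")
    sign = 1 if x > 0 else -1
    x = abs(x)
    factors = []
    p = 2
    while p * p <= x:
        n = 0
        while x % p == 0:   # divide out the full power of p
            x //= p
            n += 1
        if n:
            factors.append((p, n))
        p += 1
    if x > 1:               # the remaining cofactor is itself prime
        factors.append((x, 1))
    return sign, factors

sign, factors = factorize(-360)   # -360 = -1 * 2^3 * 3^2 * 5
```

Composite trial divisors never divide the remaining cofactor, because their prime factors have already been divided out, so only primes are recorded.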
D.3
Euclidean Algorithm
Proposition 78 Euclidean algorithm: Given a non-zero integer $d$, every integer $x$ has a unique representation $x = a.d + b$, $0 \leq b < |d|$.
Lemma 70 Let $n > 1$ be an integer. Then every positive integer $x$ has a representation
$$x = \sum_{i=0}^{t} x_i.n^i$$
with $0 \leq x_i < n$, and $x_t \neq 0$. The $x_i$ are uniquely determined. The representation is usually written as
$$x = x_t x_{t-1} \ldots x_1 x_0$$
and known as the $n$-adic representation.
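The digits of the n-adic representation are obtained by repeated division with remainder, as in proposition 78. A minimal sketch, with the hypothetical function name n_adic_digits:

```python
def n_adic_digits(x, n):
    """Digits x_t, ..., x_1, x_0 (most significant first) of the positive
    integer x in base n > 1, so that x = sum of x_i * n**i."""
    if n <= 1 or x <= 0:
        raise ValueError("need n > 1 and x > 0")
    digits = []
    while x > 0:
        x, remainder = divmod(x, n)   # x = a.n + b with 0 <= b < n
        digits.append(remainder)      # b is the next digit x_i
    return digits[::-1]

binary = n_adic_digits(11, 2)   # 11 = 1*8 + 0*4 + 1*2 + 1*1
```

The loop stops exactly when the quotient becomes zero, so the leading digit is non-zero, as required by the lemma.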
D.4
Approximation of Real Numbers by Fractions
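Lemma 71 Let $L = \log_2(3)$. Then for every real number $\delta > 0$, there is a pair $n, m$ of integers such that
$$0 < n + m \cdot L < \delta.$$
Proof. We construct a sequence $(n_1, m_1), (n_2, m_2), \ldots (n_s, m_s), \ldots$ of pairs such that $0 < n_s + m_s \cdot L < 1/2^s$. We may start by $(n_1, m_1) = (2, -1)$ since $L \approx 1.58$. Suppose that we have found $(n_s, m_s)$ such that $0 < n_s + m_s \cdot L < 1/2^s$. There is a maximal positive integer $k$ such that $k \cdot (n_s + m_s \cdot L) < 1$. Then $(k + 1) \cdot (n_s + m_s \cdot L) > 1$ since $L$ is not rational by corollary 24. For the same reason, either
$$(k + 1) \cdot (n_s + m_s \cdot L) - 1 < 1/2 \cdot 1/2^s = 1/2^{s+1}$$
or
$$1 - k \cdot (n_s + m_s \cdot L) < 1/2^{s+1},$$
since these two positive numbers add up to $n_s + m_s \cdot L < 1/2^s$. Then either $(n_{s+1}, m_{s+1}) = ((k + 1) \cdot n_s - 1, (k + 1) \cdot m_s)$ or $(n_{s+1}, m_{s+1}) = (1 - k \cdot n_s, -k \cdot m_s)$ solves the problem, QED.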
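The construction in the proof is effective and can be followed step by step. The sketch below is an illustration in floating-point arithmetic, so it is only reliable for moderately small bounds $\delta$; the function name small_combination is hypothetical.

```python
import math

L = math.log2(3)

def small_combination(delta):
    """Integers (n, m) with 0 < n + m*L < delta, following the proof:
    start with (2, -1) and shrink the value by at least one half per step."""
    n, m = 2, -1
    a = n + m * L
    while a >= delta:
        k = math.ceil(1 / a) - 1      # maximal k with k*a < 1
        above = (k + 1) * a - 1       # excess of (k+1)*a over 1
        below = 1 - k * a             # gap between k*a and 1
        if above < below:
            n, m = (k + 1) * n - 1, (k + 1) * m
        else:
            n, m = 1 - k * n, -k * m
        a = n + m * L
    return n, m

n, m = small_combination(0.02)
```

For the bound 0.02 the procedure stops at the pair (-19, 12), i.e., at $12 \cdot \log_2(3) - 19 \approx 0.0195$, which is the Pythagorean comma measured in octaves: twelve fifths exceed seven octaves by this amount.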
D.5
Some Special Issues
D.5.1
Integers, Rationals, and Real Numbers
Definition 133 The index function index : IR
----+
Z is defined by if 0
< x,
if 0
> x,
if 0
=x
(D.3)
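for $x \in \mathbb{R}$.
A real number $x$ has a unique representation $x = bottom(x) + x_+$ with $bottom(x) = floor(x) \in \mathbb{Z}$, $0 \leq x_+ < 1$; we set $top(x) = floor(x) + 1$.
Definition 134 The rounding function $round : \mathbb{R} \to \mathbb{Z}$ is defined by
$$round(x) = \begin{cases} floor(x) & \text{if } x_+ \leq 0.5, \\ top(x) & \text{else} \end{cases} \qquad (D.4)$$
for $x \in \mathbb{R}$.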
Appendix E
Modules, Linear, and Affine Transformations
E.1
Modules and Linear Transformations
Definition 135 Let $R$ be a ring, then a (left)$^1$ $R$-module is a triple $(R, M, \mu : R \times M \to M)$ where $M$ is an additively written abelian group and $\mu$ is the scalar multiplication, usually written as $\mu(r, m) = r.m$ if $\mu$ is clear ($R$ is also called the ring of scalars, and $M$ the group of vectors), with the properties:
1. We have $1_R.m = m$ for all $m \in M$.
2. For all $r, s \in R$ and $m, n \in M$, we have $(r + s).m = r.m + s.m$, $r.(m + n) = r.m + r.n$, and $r.(s.m) = (r \cdot s).m$.
If $(R, M, \mu : R \times M \to M)$, $(R, N, \nu : R \times N \to N)$ are two $R$-modules, a group homomorphism $f : M \to N$ is called $R$-linear (or a module homomorphism if the rest is clear) iff it is "homogeneous", i.e., $f(r.m) = r.f(m)$, for all $r \in R, m \in M$, with the respective scalar multiplications. The set of $R$-linear homomorphisms from $M$ to $N$ is denoted by $Lin_R(M, N)$. It is an additive group under the pointwise addition $(f + g)(m) = f(m) + g(m)$. The set-theoretic composition $g \circ f$ of two module homomorphisms $f : M \to N$, $g : N \to L$ is also a module homomorphism, and we have distributivity, i.e., $(g_1 + g_2) \circ f = g_1 \circ f + g_2 \circ f$ for $g_i : N \to L$, $f : M \to N$ and $g \circ (f_1 + f_2) = g \circ f_1 + g \circ f_2$ for $f_i : M \to N$, $g : N \to L$. By the distributivity of composition of module homomorphisms, the group $End_R(M) = Lin_R(M, M)$ is a ring, the endomorphism ring of $M$, which contains the multiplicative automorphism group $Aut_R(M) = End_R(M)^\times$ of $M$.
An $R$-linear homomorphism $f : M \to N$ has a group-theoretic kernel $Ker(f)$ and an image $Im(f)$ which are also submodules. For a submodule $N \subseteq M$, the quotient group $M/N$ is also an $R$-module by the scalar multiplication $r.(m + N) = r.m + N$. The group-theoretic results proposition 70 and the isomorphism theorems 39, 40 in appendix C are valid literally if we replace the respective groups by modules (the normality of subgroups is automatic here).
1 Right modules are defined in complete analogy, the scalar multiplication being written $m.r$ instead of $r.m$.
E.1.1
Examples
Abelian groups $G$ are canonically identified with $\mathbb{Z}$-modules by $z.g = g + g + \ldots g$, $z$ times for $z > 0$, $(-z).g = -(z.g)$, and $0.g = 0_G$. If $M$ is an $R$-module, a submodule $N \subseteq M$ is a subgroup that is stable under scalar multiplication, i.e., $R.N = N$. If $S \subset M$ is a subset, the smallest submodule containing $S$ is denoted by $\langle S\rangle$ and consists of all linear combinations $\sum_i r_i.s_i$, $r_i \in R$, $s_i \in S$. If there is a finite set $S$ such that $\langle S\rangle = M$, $M$ is called finitely generated. Finitely generated $\mathbb{Z}$-modules are completely classified, see C.3.4.2. If $(M_i)_{i \in I}$ is a family of submodules of $M$, we denote by $\sum_I M_i$ the module $\langle\bigcup_I M_i\rangle$. It consists of all finite sums $x_{i_1} + \ldots x_{i_k}$, $x_{i_j} \in M_{i_j}$.
Every ring $R$ is a left $R$-module ${}_RR$ and a right $R$-module $R_R$ by the given multiplication. For commutative rings these structures coincide. A left, right ideal in $R$ identifies with a submodule of ${}_RR$, $R_R$, respectively.
If $\varphi : S \to R$ is a ring homomorphism and $M$ is an $R$-module, we have an $S$-module structure $M_{[\varphi]}$ on $M$ via $s.m = \varphi(s).m$, the module defined by restriction of scalars. If $\varphi$ is clear, one also writes $M_{[S]}$. For the $\mathbb{Z}$-algebra structure of every ring $R$, $M_{[\mathbb{Z}]}$ gives back the underlying structure of an abelian group $M$. In particular, an $R$-algebra $S$ is an $R$-module via $({}_SS)_{[R]}$.
A dilinear homomorphism from an $S$-module $M$ to an $R$-module $N$ is a pair $(\varphi : S \to R, f : M \to N_{[\varphi]})$ consisting of a scalar restriction $\varphi$ and an $S$-linear homomorphism $f$. If $(\varphi : S \to R, f : M \to N_{[\varphi]})$, $(\psi : T \to S, g : L \to M_{[\psi]})$ are two dilinear homomorphisms, their composition is defined by $(\varphi \circ \psi : T \to R, f \circ g : L \to N_{[\varphi \circ \psi]})$. The set of dilinear homomorphisms from $M$ to $N$ is denoted by $Dil(M, N)$. If the scalar restriction is fixed by $\varphi$, we denote the corresponding set by $Dil_\varphi(M, N)$, and the special case $Dil_{Id_R}(M, N)$ is just $Lin_R(M, N)$ as above.
For any family $(M_i)_{i \in I}$ of $R$-modules, we have the product module $\prod_I M_i$. This is the product of the underlying groups, together with coordinatewise scalar multiplication $r.(m_i) = (r.m_i)$. The submodule of those $(m_i)$ with only finitely many $m_i \neq 0$ is the direct sum module $\bigoplus_I M_i$. For every index $j$, one has the canonical (linear) projections $\pi_j : \prod_I M_i \to M_j$ and $\pi_j : \bigoplus_I M_i \to M_j$, via $\pi_j((m_i)) = m_j$, as well as the canonical (linear) injections $\iota_j : M_j \to \bigoplus_I M_i$ with $\iota_j(m)$ having zero coordinates except for coordinate index $j$ where the value is $m$.
Lemma 72 (Universal limit property of direct products of modules) For every family $(f_i : X \to M_i)_i$ of linear homomorphisms there is exactly one linear homomorphism $f : X \to \prod_I M_i$ such that $f_j = \pi_j \circ f$ for all $j \in I$.
(Universal colimit property of direct sums of modules) For every family $(f_i : M_i \to X)$ of linear homomorphisms there is exactly one linear homomorphism $f : \bigoplus_I M_i \to X$ such that $f_j = f \circ \iota_j$ for all $j \in I$.
A sum $\sum_I M_i$ of submodules $M_i$ of a module $M$ is called (inner) direct iff the linear homomorphism $\bigoplus_I M_i \to M$ which is induced by the inclusions $M_i \subseteq M$ is an isomorphism. For two positive integers $m, n$ denote by $m \times n = [1, m] \times [1, n]$ the set of all pairs $(i, j)$, $1 \leq i \leq m$, $1 \leq j \leq n$. For a ring $R$, we have the direct sum $M_{m,n}(R) = \bigoplus_{m \times n} R$ whose elements are written in the matrix notation
$$(r_{i,j}) = \begin{pmatrix} r_{1,1} & \ldots & r_{1,n} \\ \vdots & r_{i,j} & \vdots \\ r_{m,1} & \ldots & r_{m,n} \end{pmatrix}$$
whose rows or columns are the submatrices with constant first or second index, respectively. If $(r_{i,j}) \in M_{m,n}(R)$ and $(s_{j,k}) \in M_{n,l}(R)$, we have the matrix product $(r_{i,j}) \cdot (s_{j,k}) = (t_{i,k}) \in M_{m,l}(R)$ with $t_{i,k} = \sum_j r_{i,j} \cdot s_{j,k}$. Whenever defined, the product is associative. It is also distributive, i.e., $((r_{i,j}) + (r'_{i,j})) \cdot (s_{j,k}) = (r_{i,j}) \cdot (s_{j,k}) + (r'_{i,j}) \cdot (s_{j,k})$, and $(r_{i,j}) \cdot ((s_{j,k}) + (s'_{j,k})) = (r_{i,j}) \cdot (s_{j,k}) + (r_{i,j}) \cdot (s'_{j,k})$. For $m = n$, one has the identity matrix $E_m = (\delta_{ij})$ with the Kronecker delta $\delta_{ii} = 1$, $\delta_{ij} = 0$ for $i \neq j$. With this identity and the matrix addition and multiplication, $M_{m,m}(R)$ is a ring. With the matrix multiplication as scalar multiplication, $M_{m,n}(R)$ becomes a left $M_{m,m}(R)$-module and a right $M_{n,n}(R)$-module. If $R$ is commutative, $M_{m,m}(R)$ is an $R$-algebra via $r \mapsto r.E_m$, i.e., the scalar multiplication of the $R$-module $M_{m,m}(R)$ coincides with the multiplication with $R$-elements from the algebra embedding.
E.2
Module Classification
E.2.1
Dimension
For any set $C$ and ring $R$, we have the free $R$-module $\bigoplus_C R = R^C$ of rank $card(C)$ which is the direct sum of $C$ copies of ${}_RR$ (for $C = \emptyset$, we take the zero module $0_R$). A free module $M$ is one that is isomorphic to a free module $R^C$. It is well known that the rank $card(C)$ is then uniquely determined and called the dimension $dim(M)$ of the free module $M$. If $R = F$ is a skew field, an $F$-module is called a vector space (over $F$), and we have the main fact of linear algebra:
Theorem 46 Every vector space $M$ over the skew field $F$ is free, and the dimensions are a complete system of invariants of isomorphism classes of vector spaces.
The proof of this theorem is based on the concept of linear (in)dependence in a module. A family $(m_i)_i$ of elements $m_i \in M$ is called linearly independent iff any (finite) linear combination $0 = \sum_{j=1,\ldots k} r_j.m_{i_j}$ implies $r_j = 0$, for all $j$. Otherwise the family is called linearly dependent. A base of a module is a family of linearly independent elements which generates the module. The main theorem 46 is proved by the exchange theorem which states that any family $(m_i)_i$ of linearly independent vectors can be inserted in a given basis by exchanging some of its elements with the $(m_i)_i$.
Example 77 $dim(\mathbb{R}_{[\mathbb{Q}]}) = card(\mathbb{R}) = 2^{\aleph_0}$, and the sequence of $b$-logarithms $(\log_b(p))_{p = prime}$ is linearly independent by corollary 24 in appendix D.2. This means that for any finite increasing sequence $p. = (p_1, p_2, \ldots p_k)$ of primes and the corresponding sequence $H_{p.} = (\log_b(p_1), \log_b(p_2), \ldots \log_b(p_k))$, the scalar product map
$$H : \mathbb{Q}^k \to \mathbb{R}_{[\mathbb{Q}]}$$
with $H(x) = (H_{p.}, x) = \sum_i \log_b(p_i) x_i$ is a linear injection. The special case of the first three primes and $H_{prime} = (\log_b(2), \log_b(3), \log_b(5))$ was discussed in section A.2.3. Here we can prove the result needed for proposition 64:
Lemma 73 For any positive real bound $\delta$ and every real number $\phi$, there is $x \in \mathbb{Z}^3$ such that
$$|H(x) - \phi| < \delta.$$
Proof. WLOG, we can work with logarithm basis $b = 2$. We know from lemma 71 in appendix D.4 that there is $x' = (n, m, 0) \in \mathbb{Z}^3$ such that $0 < H(x') < \delta$. Clearly, there is an integer multiple $x = z.x'$ which does the job, QED.
Corollary 25 (of theorem 46) If $f : N \xrightarrow{\sim} R^n$ and $g : M \xrightarrow{\sim} R^m$ are two free modules of finite ranks $n, m$, isomorphic to the free modules $R^n, R^m$ via isomorphisms $f, g$, then the linear homomorphisms are described by matrices: If $h : N \to M$ is a linear homomorphism, there is a uniquely determined matrix $H \in M_{m,n}(R)$ such that for $x \in N$, we have $h(x) = g^{-1}(H \cdot f(x))$, where $f(x)$ is written as a column matrix in $M_{n,1}(R)$ which canonically identifies with $R^n$. And conversely, each such matrix defines a linear homomorphism. In other words, there is an isomorphism
$$Lin_R(N, M) \xrightarrow{\sim} M_{m,n}(R) \qquad (E.1)$$
of additive groups. If $R$ is a commutative ring, then an $R$-algebra $S$ is also an $R$-module. We have an injective homomorphism of $R$-algebras
$$\lambda : S \to End_R(S), \quad \lambda(s)(s') = s \cdot s' \qquad (E.2)$$
which is called the left regular representation of $S$. If $S$ is a free $R$-module of dimension $n$, then the isomorphism (E.1) induces the left regular representation in matrices:
$$\lambda : S \to M_{n,n}(R). \qquad (E.3)$$
More generally, a linear representation of an $R$-algebra $A$ is an algebra homomorphism $f : A \to End_R(M)$ into the $R$-algebra of endomorphisms of an $R$-module $M$.
Definition 136 The points of a $k$-element local composition $K$ in $\mathbb{R}^n$ are in general position iff $dim(RK) = k - 1$.
Theorem 47 Let $v_1, v_2, \ldots v_n$ be $n$ vectors in a $\mathbb{Q}$-module, and take two submodules $G, H$ of dimensions $g = dim(G)$, $h = dim(H)$. Suppose that the module which is generated by the vectors $v_i$, $G$, and $H$ has dimension $n + g + h$. Then we can have at most $n + g + h$ points in general position in the union $\bigcup_i v_i + G + H$.
Proof. WLOG, one may suppose $v_1 = 0$ after a shift. Take bases $x_1, \ldots x_g$, $y_1, \ldots y_h$ of $G, H$, respectively. Then, the vectors $0, v_2, \ldots v_n, x_1, \ldots x_g, y_1, \ldots y_h$ are in general position. Conversely, if the vectors $z_1, z_2, \ldots z_m$ in the union $\bigcup_i v_i + G + H$ are in general position, then $dim(\langle z_1, z_2, \ldots z_m\rangle) = m - 1$. But $\langle z_1, z_2, \ldots z_m\rangle$ is contained in the module which is spanned by $v_2, \ldots v_n$, $G$, and $H$, whose dimension is $n + g + h - 1$, i.e., $m - 1 \leq n + g + h - 1$, QED.
E.2.2
Endomorphisms on Dual Numbers
For a commutative ring $R$, we have the commutative $R$-algebra $R[\varepsilon]$ of dual numbers (see example 76 in appendix D.1.1). As an $R$-module, it has dimension 2 and is isomorphic to $R^2$ under the map $a + \varepsilon.b \mapsto (a, b)$. By the above corollary 25, the $R$-linear endomorphism ring of $R[\varepsilon]$ identifies with the four-dimensional matrix ring $M_{2,2}(R)$. In this situation, the left regular representation of $R[\varepsilon]$ is the homomorphism of $R$-algebras
$$\lambda : R[\varepsilon] \to M_{2,2}(R)$$
with
$$\lambda(a + \varepsilon.b) = \begin{pmatrix} a & 0 \\ b & a \end{pmatrix},$$
which represents the linear endomorphism of multiplication by $a + \varepsilon.b$. We have shown in section 29.6 that $M_{2,2}(R)$ is generated by the four $R$-linear basis elements $\lambda(1_R), \lambda(\varepsilon), \alpha_+, \alpha_+ \cdot \lambda(\varepsilon)$, where $\alpha_+$ is the sweeping orientation
and that the $R$-algebra $M_{2,2}(R)$ identifies with the quotient
of the polynomial $R$-algebra in the two non-commuting variables $\lambda(\varepsilon), \alpha_+$.
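The matrix form of the left regular representation on dual numbers can be verified on examples. The sketch below assumes integer coefficients and encodes $a + \varepsilon.b$ as the pair (a, b), read as a column vector; the names dual_mul, regular_matrix, mat_mul, mat_vec are hypothetical.

```python
def dual_mul(x, y):
    """(a + eps.b)(c + eps.d) = ac + eps.(ad + bc), pairs encode a + eps.b."""
    a, b = x
    c, d = y
    return (a * c, a * d + b * c)

def regular_matrix(x):
    """Matrix of multiplication by a + eps.b on column vectors (c, d)."""
    a, b = x
    return ((a, 0), (b, a))

def mat_mul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def mat_vec(A, v):
    """Product of a 2x2 matrix with a column vector."""
    return tuple(sum(A[i][k] * v[k] for k in range(2)) for i in range(2))

x, y = (2, 3), (5, 7)
product = dual_mul(x, y)                    # (2 + 3eps)(5 + 7eps) = 10 + 29eps
via_matrix = mat_vec(regular_matrix(x), y)  # the same product, computed by the matrix
```

That regular_matrix turns products of dual numbers into products of matrices is the homomorphism property of the representation; the test values below check it for one pair of arguments only.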
E.2.3
Semi-Simple Modules
A module $M \neq 0_R$ which has no submodules except $0_R, M$ is called simple. A module $M$ is called semi-simple iff it has the following equivalent properties:
Lemma 74 Let $M$ be an $R$-module. The following statements are equivalent:
(i) Every submodule of $M$ is a sum of simple submodules.
(ii) $M$ is the sum of simple submodules.
(iii) $M$ is the direct sum of simple submodules.
(iv) Every submodule $N$ of $M$ is a direct summand (i.e., there is a submodule $N'$ of $M$ such that $M = N \oplus N'$).
The following is immediate:
Lemma 75 A linear homomorphism between simple $R$-modules is either an isomorphism or zero. Hence the endomorphism ring $End_R(M)$ of a simple module $M$ is a skew field.
For example, the left $M_{m,m}(F)$-module $M_{m,n}(F)$ and the right $M_{n,n}(F)$-module $M_{m,n}(F)$ of $m \times n$-matrices over a skew field $F$ are semi-simple. For the left module, the $n$ columns are the simple submodules, whereas for the right module, the $m$ rows are the simple submodules. Moreover, the only left and right submodules of $M_{m,n}(F)$ are the zero module and $M_{m,n}(F)$. In particular, the only two-sided ideals in the ring $M_{m,m}(F)$ are $0$ and $M_{m,m}(F)$, i.e., this ring is simple.
Theorem 48 (Wedderburn) The semi-simple rings are the finite products $\prod M_{m_i,m_i}(F_i)$ of matrix rings $M_{m_i,m_i}(F_i)$ over skew fields $F_i$.
If $G$ is a finite group, and if $K$ is a commutative field, we have defined the monoid algebra $K\langle G\rangle$ in D.1.1; it is called the group algebra in this case. Here are the semi-simple group algebras over commutative fields:
Theorem 49 (Maschke) The group algebra $K\langle G\rangle$ is semi-simple iff $char(K) \nmid card(G)$.
E.2.4
Jacobson Radical and Socle
Definition 137 The intersection of all maximal submodules of an $R$-module $M \neq 0$ is called the Jacobson radical of $M$, it is denoted by $Rad(M)$.
Sorite 12 Let $M, N$ be $R$-modules. Then:
(i) For $f \in Lin_R(M, N)$, we have $f(Rad(M)) \subseteq Rad(N)$.
(ii) We have $Rad(M \oplus N) = Rad(M) \oplus Rad(N)$.
(iii) We have $Rad(M/Rad(M)) = 0$.
(iv) We have $Rad({}_RR).M \subseteq Rad(M)$.
(v) If $M$ is semi-simple, then $Rad(M) = 0$.
(vi) If a submodule $N$ of $M$ has $M/N$ semi-simple, then $Rad(M) \subseteq N$.
For a ring $R$, one may look at its left radical $Rad({}_RR)$, or at its right radical $Rad(R_R)$. Fortunately, there is no difference in that:
Proposition 79 The left and right radicals of a ring $R$ coincide, $Rad({}_RR) = Rad(R_R)$, and this (two-sided) ideal $Rad(R)$ is the maximal ideal $I$ which annihilates every semi-simple module $M$, i.e., $I.M = 0$.
For a ring $R$, we set
$$J_r = \{r \in R \mid 1_R \notin r \cdot R\}, \quad J_l = \{r \in R \mid 1_R \notin R \cdot r\}.$$
Proposition 80 For a ring $R$ the following conditions are equivalent:
(i) The quotient ring $R/Rad(R)$ is a skew field.
(ii) We have $J_r = Rad(R)$.
(iii) We have $J_l = Rad(R)$.
(iv) The set $J_r$ is additively closed.
(v) The set $J_l$ is additively closed.
Definition 138 A ring with the equivalent properties of proposition 80 is called local.
In particular, a commutative ring $R$ is local iff it has a unique maximal ideal. The length $l(M)$ of a module $M$ is the maximum length $l$ of finite chains $0 \subsetneq N_1 \subsetneq N_2 \subsetneq \ldots N_l = M$ of submodules (if that maximum is $\infty$, we set $l(M) = \infty$). A module is called indecomposable iff it is not the direct sum of two proper submodules.
Lemma 76 If $M$ is an $R$-module with local endomorphism ring, then $M$ is indecomposable. Conversely, let $M$ be an indecomposable module of finite length $l$. Then $End_R(M)$ is a local ring, and the radical $S = Rad(End_R(M))$ is nilpotent, namely $S^l = 0$.
Proposition 81 Let $X \subset M$, $Y \subset N$ be two non-zero submodules of indecomposable $R$-modules $M, N$ of finite lengths. If $f : M \to N$, $g : N \to M$ are linear maps which induce mutually inverse isomorphisms $f|X : X \xrightarrow{\sim} Y$, $g|Y = (f|X)^{-1} : Y \xrightarrow{\sim} X$, then $f, g$ are isomorphisms.
Proof. Since $g \cdot f$ restricts to the identity of $X$, so does any of its positive powers. So none of them can be the zero endomorphism of $M$. But from lemma 76 we know that $End(M)$ is local and that a non-nilpotent endomorphism must be invertible. A symmetric argument yields invertibility of $f \cdot g$ and therefore of both, $f$ and $g$, QED.
Lemma 77 (Fitting's lemma) Let $f : M \to M$ be a linear endomorphism of a module $M$ of finite length. Then there is a positive power $f^n$ and a direct decomposition $M = N \oplus Ker(f^n)$ such that $f^n$ is an automorphism on $N$.
Definition 139 The socle $Soc(M)$ of a module $M$ is the sum of all its simple submodules.
Theorem 50 For a module $M$ of finite length, the following three statements are equivalent:
(i) $M$ is semi-simple.
(ii) $Rad(M) = 0$.
(iii) $Soc(M) = M$.
E.2.5
Theorem of Krull-Remak-Schmidt
Theorem 51 (Krull-Remak-Schmidt) Let $M_1, \ldots M_k, M'_1, \ldots M'_l$ be modules with local endomorphism rings (in particular these modules are all non-zero). Suppose that the direct sums $\bigoplus_{i=1,\ldots k} M_i$ and $\bigoplus_{j=1,\ldots l} M'_j$ are isomorphic $R$-modules. Then $k = l$, and there is a permutation $\sigma$ of the indices such that we have isomorphisms $M_i \cong M'_{\sigma(i)}$ for all $i = 1 \ldots k$.
Corollary 26 A module of finite length is a direct sum of indecomposable submodules in a unique way up to permutation and isomorphisms of the summands.
This follows from theorem 51 in view of lemma 76, QED.
E.3
Categories of Modules and Affine Transformations
See appendix G for a reference to category theory. For an additive group $M$ and an element $m \in M$, the translation by $m$ is the set map $e^m : M \to M : x \mapsto e^m(x) = m + x$. The exponential notation is chosen because $e^? : M \to \mathfrak{S}_M$ is an injective group homomorphism. We denote by $e^M$ the group of translations on $M$, a group which is isomorphic to $M$.
Definition 140 For two rings $R, S$, an $R$-module $M$ and an $S$-module $N$, a diaffine homomorphism $f$ is a map of form $e^n \cdot f_0$, where $e^n$ is a translation on $N$ and $f_0 \in Dil(M, N)$. The set of diaffine homomorphisms $f : M \to N$ is denoted by $M@N$. If we fix the underlying scalar restriction $\varphi : R \to S$ and only take $f_0 \in Dil$
(i) If $f = e^n \cdot f_0 \in M@N$, then $n = f(0)$, and $f_0 = e^{-n} \cdot f$. So the translation part $e^n$ and the dilinear part $f_0$ are uniquely determined.
(ii) If $f = e^n \cdot f_0 \in M@N$ and $g = e^l \cdot g_0 \in N@L$, then the set-theoretic composition $g \cdot f \in M@L$ and
$$g \cdot f = e^{l + g_0(n)} \cdot g_0 \cdot f_0. \qquad (E.4)$$
(iii) The diaffine $f = e^n \cdot f_0 \in M@N$ is an isomorphism iff its dilinear part $f_0$ is so, and then the inverse is $f^{-1} = e^{-f_0^{-1}(n)} \cdot f_0^{-1}$.
(iv) If $f, g \in M@$
(v) For $M = N$, $\varphi = Id_R$, the general affine group $GL(M)$, i.e., the group of affine automorphisms is isomorphic to the semidirect product $M >$
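The composition rule (E.4) and the inverse formula can be checked numerically in the simplest situation. The sketch below is restricted, as an assumption, to $\mathbb{Z}$-affine maps on $\mathbb{Z}^2$ (all scalar restrictions are identities); an affine map $e^n \cdot f_0$ is encoded as a pair (n, f0) of a translation vector and a matrix, and all function names are hypothetical.

```python
def mat_vec(A, v):
    """Matrix times column vector."""
    return tuple(sum(a * x for a, x in zip(row, v)) for row in A)

def mat_mul(A, B):
    """Matrix product of nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(len(B)))
              for j in range(len(B[0])))
        for i in range(len(A))
    )

def vec_add(u, v):
    return tuple(x + y for x, y in zip(u, v))

def apply(f, x):
    """Evaluate e^n . f0 at x, i.e., n + f0(x)."""
    n, f0 = f
    return vec_add(n, mat_vec(f0, x))

def compose(g, f):
    """g . f = e^(l + g0(n)) . g0 . f0 for g = e^l . g0 and f = e^n . f0."""
    (l, g0), (n, f0) = g, f
    return (vec_add(l, mat_vec(g0, n)), mat_mul(g0, f0))

def inverse(f, f0_inv):
    """Inverse e^(-f0^-1(n)) . f0^-1, given the inverse matrix f0_inv of f0."""
    n, _ = f
    return (tuple(-c for c in mat_vec(f0_inv, n)), f0_inv)

f = ((1, 2), ((1, 1), (0, 1)))
g = ((3, -1), ((2, 0), (1, 1)))
x = (4, 5)
direct = apply(g, apply(f, x))       # g(f(x)) evaluated step by step
composed = apply(compose(g, f), x)   # the same value via formula (E.4)
```

The translation part of an affine map is its value at zero, so apply(f, (0, 0)) returns the first component of f, in accordance with item (i).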
The category of modules and diaffine homomorphisms is denoted by Mod, whereas the subcategory of (left) $R$-modules and $R$-affine homomorphisms is denoted by $Mod_R$. If $f : S \to R$ is a ring homomorphism, we have the scalar extension functor
(E.5)
which acts by scalar extension of the linear parts of morphisms and by the canonical map $M \to S \otimes_R M : x \mapsto 1 \otimes x$ on the translation part. See [63, II.5.1] for details.
In Mod, we have to add an additional object $\emptyset_R$ for each ring $R$. This is the empty set plus the unique possible scalar multiplication. This is not a module in the usual sense since it is not even a group! But there are important category-theoretic reasons to introduce these objects. Observe that for an $S$-module $N$, $\emptyset_R@N$ is in bijection with the set of ring homomorphisms $Hom(R, S)$, whereas $N@\emptyset_R$ is empty if $N$ is not empty.
By $Mod^@$, we denote the category of set-valued presheaves on Mod, i.e., the contravariant set-valued functors $F : Mod \to Sets$. In particular, the Yoneda embedding $Mod \to Mod^@$ yields the representable presheaf $@M$ for a module $M$, with $@M(X) = X@M$ for $X \in Mod$. This is one reason why we also write $X@F$ for the evaluation $F(X)$, even if $F$ is not representable. By $M@$ we denote the covariant functor with $M@(X) = M@X$. In the context of presheaves, we often call a module $X$ that is an argument of such presheaves an address; the reasons for this wording are made explicit in the musicological chapter 6 on forms and denotators.
E.3.1
Direct Sums
Proposition 82 Let $A$ be an $R$-module, and $n$ a natural number. Then there is a canonical isomorphism $@A^{\oplus n} \xrightarrow{\sim} (@A)^n$, i.e., $A^{\oplus n}$ represents the $n$-fold product functor.
Proof. Let $X$ be any module. Then every affine homomorphism $f = e^t \cdot f_0 : X \to A^{\oplus n}$ projects to the $n$ factors $f_i = p_i \cdot f$ via the respective projections $p_i : A^{\oplus n} \to A$. Also, the dilinear part $f_0$ projects to the $n$ dilinear factors $f_{0,i} : X \to A$. Let $t_i$ be the $i$th component of $t$. Then we have $f_i = e^{t_i} \cdot f_{0,i}$. This yields the desired bijection $X@A^{\oplus n} \xrightarrow{\sim} (X@A)^n$, and this is functorial in $X$. QED.
E.3.2
Affine Forms and Tensors
In this section we suppose that all modules have a commutative coefficient ring $R$, i.e., we work in the category $Mod_R$. Tensor products$^2$ are automatically taken over $R$. By $X^*$ we denote the $R$-linear dual $Lin_R(X; R)$ of the $R$-module $X$. For the $R$-module $A^\star$ of affine forms on an $R$-module (address) $A$ and an $R$-module $M$, we have a canonical linear injection $M \rightarrowtail A^\star \otimes M$. In fact, there is an $R$-linear isomorphism $A^\star \xrightarrow{\sim} R \oplus A^* : e^r \cdot x \mapsto r + x$, and we deduce an $R$-linear isomorphism $A^\star \otimes M \xrightarrow{\sim} M \oplus A^* \otimes M$, whence the above injection; it maps $m \in M$ to $e^{1_R} \cdot 0 \otimes m$.
2 See [63] for tensor products.
With the above notation, fix an $R$-module $A$ (an address). We have the subfunctor
(E.6)
of $A@$, and induced on the subcategory $Mod_R$ of Mod. We further have the functor
$$A^\star \otimes : Mod_R \to Sets : M \mapsto A^\star \otimes M. \qquad (E.7)$$
This functor acts as follows on affine morphisms $F = e^n \cdot F_0 : M \to N$. Identify $n$ with the canonically associated element of $A^\star \otimes N$. Then, we define
$$A^\star \otimes F = e^n \cdot A^\star \otimes F_0, \qquad (E.8)$$
in other words, $(A^\star \otimes F)_0 = A^\star \otimes F_0$. For the composition $e^{t + G_0(n)} \cdot G_0 F_0 : M \to N \to T$ of affine morphisms $F = e^n \cdot F_0$ and $G = e^t \cdot G_0$ this implies
$$\begin{aligned} A^\star \otimes GF &= e^{t + G_0(n)} \cdot A^\star \otimes G_0 F_0 \\ &= e^t \cdot e^{G_0(n)} \cdot A^\star \otimes G_0 \cdot A^\star \otimes F_0 \\ &= e^t \cdot A^\star \otimes G_0 \cdot e^n \cdot A^\star \otimes F_0 \\ &= A^\star \otimes G \cdot A^\star \otimes F, \end{aligned}$$
whence the claimed functoriality.
Lemma 78 With the above notation, there is a natural transformation
$$\theta : A^\star \otimes \to A@. \qquad (E.9)$$
If $M$ is an $R$-module, it is defined by its action on pure tensors $x \otimes m \in A^\star \otimes M$ by
$$\theta(x \otimes m) : A \to M : a \mapsto x(a)m. \qquad (E.10)$$
The natural transformation $\theta$ is an isomorphism if $A$ is a finitely generated projective module.
Proof. The formula (E.10) is an extension of a classical formula in the linear case, see [63, II.74]. In fact, write $A^\star \otimes M \xrightarrow{\sim} M \oplus A^* \otimes M$, and then $A@M \xrightarrow{\sim} M \oplus Lin_R(A, M)$. Then the classical formula $\theta_0 : A^* \otimes M \to Lin_R(A, M) : x \otimes m \mapsto \theta_0(x \otimes m)$ with $\theta_0(x \otimes m)(a) = x(a)m$ extends to the linear map $\theta'$ which for special pure tensor arguments $x \otimes m = e^r \cdot x_0 \otimes m$ yields
$$\theta'(rm + x_0 \otimes m) = rm + \theta_0(x_0 \otimes m), \text{ corresponding to } e^{rm} \cdot \theta_0(x_0 \otimes m),$$
and the latter evaluates to
$$(e^{rm} \cdot \theta_0(x_0 \otimes m))(a) = rm + x_0(a)m = x(a)m$$
which means $\theta' = \theta$. Let us then prove the naturality of $\theta$. Let $F = e^n \cdot F_0 : M \to N$ be an affine morphism. We have to show that the diagram
$$\begin{array}{ccc} A^\star \otimes M & \xrightarrow{\ \theta\ } & A@M \\ {\scriptstyle A^\star \otimes F}\downarrow & & \downarrow{\scriptstyle A@F} \\ A^\star \otimes N & \xrightarrow{\ \theta\ } & A@N \end{array} \qquad (E.11)$$
is commutative. It suffices to verify it for pure tensors $x \otimes m \in A^\star \otimes M$. Take $a \in A$. Then
$$\begin{aligned} A@F(\theta(x \otimes m))(a) &= F(\theta(x \otimes m)(a)) \\ &= F(x(a)m) \\ &= n + F_0(x(a)m) \\ &= n + x(a)F_0(m) \\ &= \theta(n)(a) + \theta(x \otimes F_0(m))(a) \\ &= \theta(n + x \otimes F_0(m))(a) \\ &= \theta((A^\star \otimes F)(x \otimes m))(a) \end{aligned}$$
and we are done with diagram (E.11). If $A$ is finitely generated projective, the classical linear map $\theta_0$ is iso, and hence so is $\theta$. QED.
Observe that the special case $A = 0$ of zero address is included in the lemma and that in this case, $\theta$ identifies with the identity transformation on the forgetful functor.
E.3.3
Biaffine Maps
In this section we again suppose that all modules have a commutative coefficient ring $R$, i.e., we work in the category $Mod_R$. In classical module theory, the tensor product is a universal construction relating to bilinear maps. The extension to biaffine maps runs as follows.
Definition 141 Let $U, V, W$ be modules in $Mod_R$. A map
$$f : U \times V \to W$$
is called biaffine if it is affine in each variable, i.e., if
$$f_u : V \to W : v \mapsto f_u(v) = f(u, v) \quad\text{and}\quad f^v : U \to W : u \mapsto f^v(u) = f(u, v)$$
are all affine, i.e., $f_u \in V@_RW$ and $f^v \in U@_RW$. The set of all biaffine maps $f$ is denoted by $A_2(U, V; W)$.
Lemma 79 For $R$-modules $U, V, W$ in Mod, there is a canonical bijection
$$A_2(U, V; W) \xrightarrow{\sim} U@_R(V@_RW). \qquad (E.12)$$
If these sets are given their canonical structure of $R$-modules, bijection E.12 is an isomorphism of $R$-modules.
Proof. Let f : U x V -+ W be a biaffine map. Then the associated map fu, u E U by definition stays in V@RW. So we have a map h : U -+ V@RW. Let us show that this map is affine. Set >.(v) = f(O,v), this affine map is the constant part of our candidate h, i.e., we claim that fu - >. is linear in u. But fu(v) - >.(v) = r(u) - >.(v) = r(u) - r(O) is linear in u and we are done. Conversely, each g E U@R(V@RW) defines g(u, v) = g(u)(v) with 9 E A2(U, V; W), and this clearly is an inverse to the map f f-7 /? That we have a module isomorphism is clear. QED. Proposition 83 Let U, V, W be R-modules in Mod, and define the affine tensor product U ~ V = U ® V E9 U E9 V. Then we have a canonical bijection (E.13)
i.e., $U \boxtimes V$ is a universal object in the affine category $Mod_R$ like the tensor product is for R-linear maps. If $f \in A_2(U, V; W)$, then its image $\boxtimes f$ applies a typical element $u \otimes v + r + s$ to
$$\boxtimes f(u \otimes v + r + s) = \otimes f(u \otimes v) + f^0(r) + f_0(s) - f(0, 0) \tag{E.14}$$
where $\otimes f$ is the linear map associated with the bilinear map
$$f_0(u, v) = f(u, v) - f_0(v) - f^0(u) + f(0, 0). \tag{E.15}$$
The universal map $i : U \times V \to U \boxtimes V$ is defined by $i(u, v) = u \otimes v + u + v$.
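The factorization of a biaffine map through the universal map can be checked numerically in the simplest case. The following is a minimal sketch, assuming U = V = W = Z of rank one, so that the tensor component is identified with Z via u ⊗ v -> u*v; the names `make_biaffine`, `box` and `universal` are hypothetical helpers, not notation from the text.

```python
from itertools import product

def make_biaffine(a, b, c, d):
    """A biaffine map f(u, v) = a*u*v + b*u + c*v + d on U = V = W = Z."""
    return lambda u, v: a * u * v + b * u + c * v + d

def box(f):
    """Affine map on (U tensor V) + U + V attached to f, following (E.14).

    Elements of the affine tensor product are written as triples (t, r, s),
    with t in the tensor component (identified with Z), r in U and s in V.
    """
    f00 = f(0, 0)

    def bilinear(u, v):
        # bilinear part of f as in (E.15)
        return f(u, v) - f(0, v) - f(u, 0) + f00

    def boxed(t, r, s):
        # the linear map on the tensor component is fixed by its value on 1 (x) 1
        return bilinear(1, 1) * t + f(r, 0) + f(0, s) - f00

    return boxed

def universal(u, v):
    """The universal map i(u, v) = u (x) v + u + v, written as a triple."""
    return (u * v, u, v)

f = make_biaffine(2, -3, 5, 7)
bf = box(f)
# box(f) composed with the universal map gives back f
assert all(bf(*universal(u, v)) == f(u, v)
           for u, v in product(range(-4, 5), repeat=2))
```

The check only samples a finite grid of integer arguments; it illustrates the identity used in the proof rather than proving it.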
Proof. The proposition follows directly from lemma 79, the definition of the affine tensor product and the universal property of the linear tensor product. We then have $\boxtimes f(i(u, v)) = \boxtimes f(u \otimes v + u + v) = \otimes f(u \otimes v) + f^0(u) + f_0(v) - f(0, 0) = f_0(u, v) + f^0(u) + f_0(v) - f(0, 0) = f(u, v) - f_0(v) - f^0(u) + f(0, 0) + f^0(u) + f_0(v) - f(0, 0) = f(u, v)$. QED.
Definition 142 For modules U, V, W, X in $Mod_R$ and affine maps $f : U \to W$, $g : V \to X$, the affine tensor product map
$$f \boxtimes g : U \boxtimes V \to W \boxtimes X \tag{E.16}$$
is defined as the canonical affine map $\boxtimes h$ according to proposition 83 which is associated with the biaffine map $h : U \times V \to W \boxtimes X : (u, v) \mapsto i(f(u), g(v))$.
Sorite 14 For modules U, V, W, X in $Mod_R$ and with the notation of section E.3.8, we have:
(i) $U \boxtimes V \cong V \boxtimes U$.
(ii) $U \boxtimes 0_R \cong 0_R \boxtimes U \cong U$.
(iii) $U \boxtimes (V \boxtimes W) \cong (U \boxtimes V) \boxtimes W$, i.e., we can identify these products and write $U \boxtimes V \boxtimes W$.
(iv) $(U \sqcup V) \boxtimes W \cong U \boxtimes W \sqcup V \boxtimes W$.
For a module M in $Mod_R$, the functor $@_RM : Mod_R \to Sets : B \mapsto B@_RM$ contains redundant structure in B since there are elements in B which are annihilated by all linear maps into M. We want to reduce B to a module where this annihilator set is the zero submodule.
Definition 143 With the above notation, set $An(B, M) = \bigcap_{k \in B@M} Ker(k_0)$, denote $B/M = B/An(B, M)$ and write $/M : B \to B/M$ for the canonical projection. The module B/M is called the M-reduction of B.
The following lemma is clear:
Lemma 80 The assignment $?/M : B \mapsto B/M$ defines a functor on $Mod_R$. The projection $/M : B \to B/M$ and the uniquely defined commutative diagrams
$$\begin{array}{ccc} C & \xrightarrow{\;f\;} & B \\ \downarrow & & \downarrow \\ C/M & \xrightarrow{\;f/M\;} & B/M \end{array} \tag{E.17}$$
which are associated with affine homomorphisms $f : C \to B$ define a natural transformation $Id_{Mod_R} \to ?/M$.
Proposition 84 Let $@_R^{red}M = @_RM \cdot ?/M$. Then we have a natural isomorphism $@_R^{red}M \cong @_RM$.
Proof. In fact, the natural transformation $/M : B \to B/M$ induces an isomorphism (of R-modules) $B@_R^{red}M \cong B@_RM$. QED.
Corollary 27 With the above notation, the functors $B@M$ and $B@^{red}M = B/M@M$ are canonically isomorphic.
Proof. If $A \in Mod_R$, we have
$$A@B@^{red}M = A@B/M@M = A \boxtimes B/M@_RM \cong A@_R(B/M@_RM) \cong A@_R(B@_RM) \cong A \boxtimes B@_RM = A@B@M.$$
Let M, A be modules in $Mod_R$. Then we have
Lemma 81 There is an isomorphism of R-modules
$$Lin_R(M, A@_RR) \cong A@_RM^* \tag{E.18}$$
which is functorial in both A and M.
Proof. We have these functorial isomorphisms: $Lin_R(M, A@_RR) \cong Lin_R(M, A^* \oplus R) \cong Lin_R(M, A^*) \oplus M^* \cong (M \otimes A)^* \oplus M^* \cong (A \otimes M)^* \oplus M^* \cong Lin_R(A, M^*) \oplus M^* \cong A@_RM^*$.
Proposition 85 Let M be as above, and $A = R^n$, $0 \le n$, then we have canonical R-linear maps
$$u : A@_RM^* \to (A@_RM)^*, \tag{E.19}$$
$$d : A@_RM \to (A@_RM^*)^*, \tag{E.20}$$
which are isomorphisms for M finitely generated and projective.
Proof. As to the first map, a linear map $v : Lin_R(A, M^*) \to Lin_R(A, M)^*$ is defined as follows: For $g : A \to M^*$ and $f : A \to M$, we have the composition $f^* \cdot g : A \to A^*$, which is a bilinear form on A, and we may set $v(g)(f) = tr(f^* \cdot g)$, a linear function of g, calculated in the canonical bases of A and $A^*$. This map is canonically extended by the identity on M and we are done for u. For the second map, we have the canonical bidual linear map $i : A@_RM \to A@_RM^{**}$, and we may apply the first map to the bidual of M. The statement concerning finitely generated projective modules is standard. QED
E.3.4
Symmetries of the Affine Plane
We consider symmetries, i.e., affine transformations $D = e^t \cdot H$ on $\mathbb{R}^2$. From the geometric point of view, the set $\mathbb{R}^2@\mathbb{R}^2$ of these maps is described as follows. Fix a (zero-addressed) local composition $\Delta = \{u, v, w\}$ in the real plane $\mathbb{R}^2$, with three points in general position (see appendix E.2.1). Then we know from section 15.2.1 that the map
$$\mathbb{R}^2@\mathbb{R}^2 \to (\mathbb{R}^2)^3 : D \mapsto (D(u), D(v), D(w)) \tag{E.21}$$
is a bijection. We use this bijection to describe some special transformations:
Shearings. Let G be a straight line in $\mathbb{R}^2$ (not necessarily through the origin), and let u, v, w be in general position such that u, v lie on G, whereas w does not. A shearing S relating to G is a symmetry which leaves both u and v fixed and transforms w into $w + r.(v - u)$, $r \in \mathbb{R}$. Then G remains fixed identically, and w is shifted in parallel motion with respect to G. The nth power $S^n$ of S is the shearing which fixes G and transforms w into $w + nr.(v - u)$.
Dilatations. For a point $u \in \mathbb{R}^2$, and two scalars $\delta, \alpha \in \mathbb{R}$, a dilatation D by factors $\delta, \alpha$ and centered in u is defined by the prescription that D(u) = u, and that for two other points v, w such that u, v, w are in general position, we have $D(v) = u + \delta.(v - u)$, $D(w) = u + \alpha.(w - u)$. A dilatation with $\delta = \alpha = -1$ is called point reflection with center u; it corresponds to a rotation by 180° around u.
Glide Reflections. Let G be a straight line, and take u, v, w as in the above paragraph E.3.4 about shearing. A glide reflection P is a symmetry such that P(u) lies on G, and $P(v) - P(u) = v - u$, $P(w) - P(u) = u - w$. Therefore, P(v) lies also on G, and P(w) lies on the "opposite side" of w on the line through w, u, i.e., P is a translation by $P(u) - u$, followed by a 'skew' reflection in G in the direction of the line through w and u. Especially for the diagonal $G = \mathbb{R}(1, 1)$ and $u = P(u) = 0$, $w = (-1, 1)$, we obtain the exchange of coordinate axes: the parameter exchange.
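The description of a plane symmetry by the images of three points in general position can be made concrete. The sketch below is an illustration under stated assumptions: it uses exact rational arithmetic, represents a symmetry as a pair (H, t) with D(x) = Hx + t, and the helper names `affine_from_triangle`, `apply` and `iterate` are hypothetical. It rebuilds a shearing and the parameter exchange from their defining prescriptions.

```python
from fractions import Fraction as Fr

def affine_from_triangle(src, dst):
    """Return (H, t) with D(x) = H x + t sending the three points src to dst."""
    (u, v, w), (u2, v2, w2) = src, dst
    # columns of B are the edge vectors v - u and w - u
    b11, b21 = v[0] - u[0], v[1] - u[1]
    b12, b22 = w[0] - u[0], w[1] - u[1]
    det = Fr(b11 * b22 - b12 * b21)
    if det == 0:
        raise ValueError("source points are not in general position")
    # columns of C are the image edge vectors
    c11, c21 = v2[0] - u2[0], v2[1] - u2[1]
    c12, c22 = w2[0] - u2[0], w2[1] - u2[1]
    # H = C * B^{-1}
    H = (((c11 * b22 - c12 * b21) / det, (-c11 * b12 + c12 * b11) / det),
         ((c21 * b22 - c22 * b21) / det, (-c21 * b12 + c22 * b11) / det))
    t = (u2[0] - H[0][0] * u[0] - H[0][1] * u[1],
         u2[1] - H[1][0] * u[0] - H[1][1] * u[1])
    return H, t

def apply(D, x):
    """Evaluate the affine map D = (H, t) at the point x."""
    H, t = D
    return (H[0][0] * x[0] + H[0][1] * x[1] + t[0],
            H[1][0] * x[0] + H[1][1] * x[1] + t[1])

def iterate(D, n, x):
    """Apply D to x n times, i.e., evaluate the nth power of D at x."""
    for _ in range(n):
        x = apply(D, x)
    return x

# shearing along the line G through u and v, shifting w by r.(v - u)
u, v, w, r = (0, 1), (2, 3), (1, 0), 3
S = affine_from_triangle((u, v, w),
                         (u, v, (w[0] + r * (v[0] - u[0]), w[1] + r * (v[1] - u[1]))))
n = 4
assert iterate(S, n, w) == (w[0] + n * r * (v[0] - u[0]), w[1] + n * r * (v[1] - u[1]))

# glide reflection with G the diagonal, u = P(u) = 0, w = (-1, 1)
P = affine_from_triangle(((0, 0), (1, 1), (-1, 1)), ((0, 0), (1, 1), (1, -1)))
assert apply(P, (5, -2)) == (-2, 5)  # exchange of coordinates
```

Solving for H through the edge vectors mirrors the bijection between symmetries and triples of image points: the general position of u, v, w is exactly the invertibility of B.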
E.3.5
Symmetries on $\mathbb{Z}^2$
Theorem 52 Every symmetry $f \in \mathbb{Z}^2@\mathbb{Z}^2$ is the product of some of the following symmetries:
1. a translation $T = e^{(0,1)}$,
2. a shearing S which leaves the first axis $\mathbb{Z}.(1, 0)$ fixed and transforms (0, 1) to (1, 1),
3. the parameter exchange P,
4. the reflection K at the second axis,
5. the dilatations $D_m$, $0 \le m$, in the direction of the first axis by factor m.
The general affine group $\overrightarrow{GL}(\mathbb{Z}^2)$ is generated by T, S, P, K.
Proof. The statement concerning the general affine group is immediate from the first part of the theorem. To show the latter, we say that a symmetry X is "good" if X can be written as a product of symmetries of the required shape. Here are the matrices for the generators:
$$S = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}, \quad P = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad K = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}, \quad D_m = \begin{pmatrix} m & 0 \\ 0 & 1 \end{pmatrix}.$$
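We first show that all $2 \times 2$-matrices, i.e., all linear maps $X = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$, are products of symmetries of type S, K, $D_m$, P. Observe that $P^2 = K^2 = E_2$.
1. If $P \cdot X$ is good, then so is $X = P^2 \cdot X = P \cdot (P \cdot X)$. The same is valid for $X \cdot P$. Here, $P \cdot X$ is the exchange of rows in X, whereas $X \cdot P$ is the exchange of columns in X.
2. We have $S^{-1} = K \cdot S \cdot K = \begin{pmatrix} 1 & -1 \\ 0 & 1 \end{pmatrix}$. Therefore the powers $S^n = \begin{pmatrix} 1 & n \\ 0 & 1 \end{pmatrix}$ are good for integer n. Further, $P \cdot S^{\pm 1} \cdot P = \begin{pmatrix} 1 & 0 \\ \pm 1 & 1 \end{pmatrix}$ and therefore $(P \cdot S \cdot P)^n = \begin{pmatrix} 1 & 0 \\ n & 1 \end{pmatrix}$ is good.
3. For $m \ge 0$, $P \cdot D_m \cdot P = \begin{pmatrix} 1 & 0 \\ 0 & m \end{pmatrix}$ and $P \cdot K \cdot D_m \cdot P = \begin{pmatrix} 1 & 0 \\ 0 & -m \end{pmatrix}$ are good.
4. If a coefficient of X vanishes, one can enforce c = 0 by a row or column exchange. Because of
$$\begin{pmatrix} a & b \\ 0 & d \end{pmatrix} = \begin{pmatrix} 1 & 0 \\ 0 & d \end{pmatrix} \cdot \begin{pmatrix} 1 & b \\ 0 & 1 \end{pmatrix} \cdot \begin{pmatrix} a & 0 \\ 0 & 1 \end{pmatrix}$$
and the preceding results, such an X is also good.
5. If no coefficient of X vanishes, one can apply $\pm E_2$ and an exchange of columns to achieve $c \ge d > 0$. Applying the Euclidean algorithm (section D.3), we can write $c = nd + r$, $0 \le r < d$. This yields
$$X = \begin{pmatrix} a - nb & b \\ r & d \end{pmatrix} \cdot (P \cdot S \cdot P)^n,$$
and repeating this reduction on the first factor eventually produces a vanishing coefficient, which is the case settled in 4.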
For translations, observe the following identities:
$$e^{(1,0)} = P \cdot T \cdot P, \quad e^{(-1,0)} = K \cdot e^{(1,0)} \cdot K, \quad e^{(0,-1)} = P \cdot e^{(-1,0)} \cdot P.$$
Therefore all translations are good since we have natural numbers x, y such that $e^{(\pm x, \pm y)} = (e^{(\pm 1, 0)})^x \cdot (e^{(0, \pm 1)})^y$. This settles all the cases, QED.
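The matrix and translation identities used in this proof are easy to confirm mechanically. The following sketch assumes the generator matrices implied by the verbal definitions in theorem 52 (S fixes (1, 0) and sends (0, 1) to (1, 1), K negates the first coordinate, $D_m$ scales it by m); affine maps are stored as pairs (H, t), and the helper names are hypothetical.

```python
E2 = ((1, 0), (0, 1))
S = ((1, 1), (0, 1))      # shearing fixing the first axis
P = ((0, 1), (1, 0))      # parameter exchange
K = ((-1, 0), (0, 1))     # reflection at the second axis

def D(m):
    """Dilatation by factor m in the direction of the first axis."""
    return ((m, 0), (0, 1))

def mul(X, Y):
    """Product of two 2x2 integer matrices."""
    return tuple(tuple(sum(X[i][k] * Y[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))

def prod(*Ms):
    """Product of several matrices, written left to right."""
    out = E2
    for M in Ms:
        out = mul(out, M)
    return out

def act(H, x):
    """Matrix H applied to the vector x."""
    return (H[0][0] * x[0] + H[0][1] * x[1], H[1][0] * x[0] + H[1][1] * x[1])

def compose(*maps):
    """Composite of affine maps (H, t); the rightmost map is applied first."""
    H, t = E2, (0, 0)
    for H2, t2 in maps:
        # x -> H(H2 x + t2) + t
        t = tuple(a + b for a, b in zip(act(H, t2), t))
        H = mul(H, H2)
    return H, t

def lin(M):
    """A linear map viewed as an affine map without translation part."""
    return (M, (0, 0))

T = (E2, (0, 1))          # translation e^(0,1)

assert prod(P, P) == E2 == prod(K, K)
assert prod(K, S, K) == ((1, -1), (0, 1)) and prod(K, S, K, S) == E2
assert prod(P, S, P) == ((1, 0), (1, 1))
assert prod(P, D(3), P) == ((1, 0), (0, 3))
assert prod(P, K, D(3), P) == ((1, 0), (0, -3))

e10 = compose(lin(P), T, lin(P))
em10 = compose(lin(K), e10, lin(K))
e0m1 = compose(lin(P), em10, lin(P))
assert (e10, em10, e0m1) == ((E2, (1, 0)), (E2, (-1, 0)), (E2, (0, -1)))
```

Only the stated identities are tested; the sketch does not implement the full Euclidean reduction of an arbitrary integer matrix.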
E.3.6
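Symmetries on $\mathbb{Z}^n$
For integers $n \ge 2$ and $1 \le i < j \le n$, we have the diagonal embedding $\Delta_{i,j} : GL(\mathbb{Z}^2) \rightarrowtail GL(\mathbb{Z}^n)$ which sends $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$ to the matrix $(x_{u,v})$ defined by
$$x_{i,i} = a, \quad x_{i,j} = b, \quad x_{j,i} = c, \quad x_{j,j} = d, \quad x_{u,u} = 1 \text{ for } u \ne i, j, \text{ and } x_{u,v} = 0 \text{ else}.$$
Theorem 53 For an integer $n \ge 2$ the group $GL(\mathbb{Z}^n)$ is generated by the diagonal embedded groups $\Delta_{1,j}GL(\mathbb{Z}^2)$, $1 < j \le n$.
The proof goes by induction on n and uses the Euclidean algorithm; we leave it as an exercise.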
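The diagonal embedding can be written down directly from its defining entries. This is a small sketch with 1-based indices as in the text; the function names `diagonal_embedding` and `matmul` are hypothetical, and the final check only illustrates multiplicativity on one pair of matrices.

```python
def diagonal_embedding(X, i, j, n):
    """n x n matrix obtained from the 2x2 matrix X = ((a, b), (c, d)).

    The entries a, b, c, d are placed at positions (i,i), (i,j), (j,i), (j,j);
    the remaining diagonal entries are 1 and all other entries are 0.
    """
    if not (1 <= i < j <= n):
        raise ValueError("need 1 <= i < j <= n")
    (a, b), (c, d) = X
    M = [[1 if r == s else 0 for s in range(n)] for r in range(n)]
    M[i - 1][i - 1], M[i - 1][j - 1] = a, b
    M[j - 1][i - 1], M[j - 1][j - 1] = c, d
    return M

def matmul(X, Y):
    """Product of two square integer matrices given as lists of rows."""
    n = len(X)
    return [[sum(X[r][k] * Y[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

X = [[1, 1], [0, 1]]
Y = [[0, 1], [1, 0]]
# the embedding respects products
assert diagonal_embedding(matmul(X, Y), 1, 3, 4) == matmul(
    diagonal_embedding(X, 1, 3, 4), diagonal_embedding(Y, 1, 3, 4))
```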
E.3.7
Complements on the Module of a Local Composition
Lemma 82 Let A be an address module over the commutative coefficient ring R and (K, A@M) a commutative local composition. Then:
(i) $R.K \subset \langle K \rangle$.
(ii) $R.K = \langle K \rangle$ iff K is embedded.
(iii) If $f : K \to L$ is a morphism of embedded commutative local compositions at the same address A, then any underlying symmetry $F : M \to N$ restricts to R.K and R.L, i.e., $F(R.K) \subset R.L$, and this restriction is uniquely determined by f. We denote this affine map by $R@f : R.K \to R.L$.
Proof. The first statement is clear since $R.K = \langle x - x_0 \mid x \in K \rangle$. If $R.K = \langle K \rangle$, then obviously $K \subset \langle K \rangle = R.K$, and K is embedded. Conversely, if $K \subset R.K$, then also $\langle K \rangle \subset R.K$ and equality follows from (i). As to (iii), observe that we have a linear application $R.f : R.K \to R.L$ which is induced by the linear part $F_0$ of F, and which is only a function of f, by lemma 6 of chapter 8. Further, if $F = e^n \cdot F_0$, and if $k \in K$, we have $n = F(k) - F_0(k) = f(k) - R.f(k)$ since $K \subset R.K$; in other words, $n = n_f$ is only a function of f, and not of the underlying F. Therefore, since both f(k) and R.f(k) are elements of R.L, $n_f \in R.L$. This means that for $x \in R.K$, $F(x) = n_f + F_0(x) = n_f + R.f(x) = R@f(x) \in R.L$, QED.
E.3.8
Fiber Products and Fiber Sums in Mod
Theorem 54 The category Mod of modules and diaffine transformations has arbitrary fiber products.
Proof. We are given a fiber product diagram
$$K \xrightarrow{\;f\;} M \xleftarrow{\;g\;} L \tag{E.22}$$
of modules over the fiber product diagram
$$A \to C \leftarrow B \tag{E.23}$$
of corresponding coefficient rings. If any of these modules K, L or M is empty, or if the intersection $Im(f) \cap Im(g)$ is empty, then the empty module over the fiber product $A \times_C B$ of coefficient rings does the job. So we may suppose that none of these four spaces is empty. Consider the dilinear parts $f_0$ and $g_0$ of f and g. Then we have the dilinear homomorphism $d : K \oplus L \to M : (k, l) \mapsto f_0(k) - g_0(l)$ with regard to the fiber product ring homomorphism $A \times_C B \to C$. Take any couple $(k, l) \in K \oplus L$ with f(k) = g(l). Then the set-theoretic fiber product $\Delta \subset K \oplus L$ equals $Ker(d) + (k, l)$. This implies that the diaffine embedding $e^{(k,l)} : Ker(d) \to \Delta \subset K \oplus L$, followed by the projections to K and L, defines a fiber product
$$\begin{array}{ccc} Ker(d) & \xrightarrow{\;p_2 \cdot e^{(k,l)}\;} & L \\ {\scriptstyle p_1 \cdot e^{(k,l)}} \downarrow & & \downarrow {\scriptstyle g} \\ K & \xrightarrow{\;f\;} & M \end{array} \tag{E.24}$$
of modules and diaffine transformations. QED.
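The key step of this proof, that the set-theoretic fiber product is a translate of Ker(d), can be observed on a finite example. The sketch below makes simplifying assumptions that are not in the text: all three modules are $\mathbb{Z}_n$ over the single coefficient ring $\mathbb{Z}$, and an affine map is given by a hypothetical pair (a, b) standing for x -> a*x + b mod n.

```python
def fiber_product_Zn(n, f, g):
    """Compare the fiber product of two affine maps on Z_n with Ker(d) + (k, l).

    f = (a, b) stands for k -> a*k + b and g = (c, e) for l -> c*l + e (mod n).
    Returns the pair (delta, translated); translated is None if delta is empty.
    """
    a, b = f
    c, e = g
    # set-theoretic fiber product: all (k, l) with f(k) = g(l)
    delta = {(k, l) for k in range(n) for l in range(n)
             if (a * k + b - c * l - e) % n == 0}
    # kernel of d(k, l) = f_0(k) - g_0(l), built from the linear parts
    kernel = {(k, l) for k in range(n) for l in range(n)
              if (a * k - c * l) % n == 0}
    if not delta:
        return delta, None
    k0, l0 = min(delta)  # any couple with f(k0) = g(l0) will do
    translated = {((k + k0) % n, (l + l0) % n) for (k, l) in kernel}
    return delta, translated

delta, translated = fiber_product_Zn(12, (3, 1), (4, 2))
assert delta and delta == translated
```

When the images of f and g do not meet, the first component is empty, matching the case of the empty module in the proof.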
Theorem 55 The category Mod has fiber sums for all pushout diagrams of modules over a fixed coefficient ring, i.e., where the scalar restrictions are the identity.
Proof. We are given a fiber sum diagram
$$K \xleftarrow{\;f\;} M \xrightarrow{\;g\;} L \tag{E.25}$$
of modules over coefficient ring A. If M is empty we have to construct the sum $K \sqcup L$, and we may suppose that both summands are nonvoid, the other cases being trivial. Consider the direct sum $S = K \oplus L \oplus A$ and the affine injections
$$i_1 : K \rightarrowtail S : k \mapsto (k, 0, 1) \quad \text{and} \quad i_2 : L \rightarrowtail S : l \mapsto (0, l, 0).$$
Suppose we are given two diaffine transformations $f : K \to X$, $g : L \to X$, with factorizations $f = e^x \cdot f_0$ and $g = e^y \cdot g_0$ and scalar restriction $s : A \to B$. Define a dilinear map $h_0 : S \to X : (k, l, t) \mapsto f_0(k) + g_0(l) + s(t)(x - y)$. Then we have a diaffine transformation $h = e^y \cdot h_0$ which does the job, in fact, $h \cdot i_1 = f$, and $h \cdot i_2 = g$. Since $i_2$ is linear, we have $h(0) = h(i_2(0)) = g(0) = y$. Hence the affine part of h is uniquely determined. If we had two candidates h and $h^*$ for universal arrows, they would only differ in their dilinear parts $h_0$ and $h_0^*$. But then, their difference $d = h_0 - h_0^*$ would vanish on all elements of shape (0, l, 0), $l \in L$, and on all (k, 0, 1), $k \in K$. The latter implies that d(0, 0, 1) = 0, and by dilinearity of d, $d(k, 0, 0) = d((k, 0, 1) - (0, 0, 1)) = 0$, whence the uniqueness of h. On the other hand, if M is non-empty, so are K and L. We then have two arrows $u = i_1 \cdot f$, $v = i_2 \cdot g : M \rightrightarrows K \sqcup L$ from the diagram
$$\begin{array}{ccc} M & \xrightarrow{\;f\;} & K \\ {\scriptstyle g} \downarrow & & \downarrow {\scriptstyle i_1} \\ L & \xrightarrow{\;i_2\;} & K \sqcup L \end{array} \tag{E.26}$$
and we are done if we can show that there is a coequalizer$^3$ of the couple u and v. If we have the factorizations $u = e^t \cdot u_0$ and $v = e^s \cdot v_0$, take the quotient module $E = K \sqcup L / (A(t - s) + Im(u_0 - v_0))$. Clearly, the projection $p : K \sqcup L \to E$ equalizes u and v. If $r : K \sqcup L \to X$ is any diaffine transformation with scalar restriction $s : A \to B$ and equalizing the couple u and v, then r has a unique factorization through E. In fact, we may suppose without loss of generality that r is dilinear. In this case, r has the required factorization since it annihilates $A(t - s) + Im(u_0 - v_0)$. With this construction we define $K \sqcup_M L = E$ and obtain a commutative diagram
$$\begin{array}{ccc} M & \to & K \\ \downarrow & & \downarrow \\ L & \to & K \sqcup_M L \end{array} \tag{E.27}$$
which is the required pushout diagram in Mod. Observe that this proof technique (build the sum and then the coequalizer) is a special case of the fact that existence of fiber sums is equivalent to existence of sums and coequalizers provided that we have an initial object, see appendix G.2.1. QED.
Proposition 86 A dilinear morphism $f : M \to N$ over scalar restriction $g : A \to B$ is mono iff f is diinjective, i.e., iff f, g are both injective.
Proof. If both f and g are injective, then clearly the dilinear morphism is mono. If f is not injective, there are two different affine morphisms $k_i : 0_{\mathbb{Z}} \to M$, i = 1, 2, which are equalized by f. If the scalar restriction g is not injective, there are two different ring homomorphisms $r_1, r_2 : \mathbb{Z}[X] \to A$ on the polynomial ring $\mathbb{Z}[X]$ with $r_i(X) \in ker(g)$, i = 1, 2, and the zero morphism $0_{\mathbb{Z}[X]} \to M$ for these two scalar restrictions does the job.
E.4
Complements of Commutative Algebra
In this section, all coefficient rings are commutative.
E.4.1
Localization
See also [64, II] for concepts and facts described in this section. Let S be a multiplicative subset of a ring A, i.e., $st \in S$ for all $s, t \in S$, and $1 \in S$. The localization $S^{-1}A$ is the set of equivalence classes of $A \times S$ modulo the relation $(a, s) \sim (a', s')$ iff there is $t \in S$ such that $t(as' - a's) = 0$. The equivalence class of (a, s) is denoted by the fraction a/s or $\frac{a}{s}$. It is a ring by the well-defined addition $a/s + a'/s' = (as' + a's)/ss'$ and multiplication $a/s.a'/s' = aa'/ss'$. The canonical map $i_S : A \to S^{-1}A : a \mapsto a/1$ is a ring homomorphism with the universal property that for any ring homomorphism $f : A \to B$ such that $f(S) \subset B^\times$, there is a unique ring homomorphism $j : S^{-1}A \to B$ such that $f = j \circ i_S$. The ring $S^{-1}A$ is called the localization of A in S. Classical example: A is a domain (no zero divisors), $S = A - \{0\}$, whence $S^{-1}A$ is the classical field fr(A) of fractions over A.
$^3$ See appendix G.2.1.
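The role of the extra factor t in the equivalence relation becomes visible when A has zero divisors. The following sketch, an illustration for the assumed special case $A = \mathbb{Z}_n$ with S generated by one element s, enumerates the equivalence classes of $A \times S$ by brute force; `localize_Zn` is a hypothetical helper name.

```python
def localize_Zn(n, s):
    """Equivalence classes of Z_n x S for S = {1, s, s^2, ...} taken mod n.

    Two pairs (a, t) and (a2, t2) are equivalent iff some u in S satisfies
    u * (a * t2 - a2 * t) = 0 in Z_n.
    """
    S, t = [], 1
    while t not in S:          # powers of s eventually repeat mod n
        S.append(t)
        t = t * s % n

    def equivalent(p, q):
        (a, t1), (a2, t2) = p, q
        return any(u * (a * t2 - a2 * t1) % n == 0 for u in S)

    classes = []
    for pair in ((a, t) for a in range(n) for t in S):
        for cls in classes:
            # comparing with one representative suffices for an equivalence relation
            if equivalent(pair, cls[0]):
                cls.append(pair)
                break
        else:
            classes.append([pair])
    return classes

# inverting 2 in Z_6 kills the element 3, since 2 * 3 = 0
assert len(localize_Zn(6, 2)) == 3
```

In this example the canonical map a -> a/1 is not injective: the pairs (0, 1) and (3, 1) fall into the same class.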
If M is an A-module, the localization $S^{-1}M$ is the set of equivalence classes of $M \times S$ for the equivalence relation $(m, s) \sim (m', s')$ iff there is $t \in S$ such that $t(ms' - m's) = 0$. The addition $m/s + m'/s' = (ms' + m's)/ss'$ and the scalar multiplication $r/s.m/t = rm/st$ make $S^{-1}M$ an $S^{-1}A$-module. One has the canonical dilinear homomorphism $i_M : M \to S^{-1}M : m \mapsto m/1$ with respect to the homomorphism $i_S$. It has this universal property: For every homomorphism $f : M \to N$ of A-modules such that every dilatation $s? : N \to N : n \mapsto s.n$, $s \in S$, is bijective, there is a unique homomorphism of A-modules $j : S^{-1}M \to N$ such that $f = j \circ i_M$. It is easily seen that the tensor product $S^{-1}A \otimes_A M$, together with the canonical homomorphism of A-modules $M \to S^{-1}A \otimes_A M : m \mapsto 1 \otimes m$, is isomorphic to the localization $S^{-1}M$. For the multiplicative set $S_s = \{1, s, s^2, s^3, \ldots\}$, $s \in A$, one writes $A_s, M_s$ instead of $S_s^{-1}A, S_s^{-1}M$. For a prime ideal $\mathfrak{q} \subseteq A$, the complement $S = A - \mathfrak{q}$ is multiplicative by definition. And one then writes $A_\mathfrak{q}, M_\mathfrak{q}$ instead of $S^{-1}A, S^{-1}M$. By the universal property of localization, if $f : A \to B$ is a ring homomorphism and if $S \subseteq A$, $T \subseteq B$ are multiplicative sets such that $f(S) \subseteq T$, then there is a canonical ring homomorphism $f_{S,T} : S^{-1}A \to T^{-1}B$ which extends f. If $g : M \to N$ is a dilinear homomorphism over f, it extends uniquely to a dilinear homomorphism $g_{S,T} : S^{-1}M \to T^{-1}N$ over $f_{S,T}$. For a multiplicative set $S \subseteq A$ let $S' = \{t \in A \mid \text{there exists } a \in A, s \in S \text{ such that } s = at\}$ be the saturation of S. Then the canonical homomorphisms $S^{-1}A \to S'^{-1}A$ and $S^{-1}M \to S'^{-1}M$ are isomorphisms and we may identify the two corresponding localizations. In particular, if $s \in A$, we identify $A_s, M_s$ with $S_s'^{-1}A, S_s'^{-1}M$.
So if for $s, t \in A$, one has $S_s' \subseteq S_t'$, one has canonical homomorphisms $A_s \to A_t$, $M_s \to M_t$.
Proposition 87 If $i_S : A \to S^{-1}A$ is the localization homomorphism, the inverse image $\mathfrak{q} \mapsto i_S^{-1}(\mathfrak{q})$ is an order preserving bijection from the set of maximal (resp. prime) ideals in $S^{-1}A$ to the set of maximal (resp. prime) ideals in A which are disjoint from S. In particular, if $S = A - \mathfrak{p}$ for a prime ideal $\mathfrak{p}$, the localization $A_\mathfrak{p}$ is a local ring with maximal ideal $\mathfrak{m}_\mathfrak{p} = \mathfrak{p}_\mathfrak{p}$ and the residue field $\kappa_\mathfrak{p} = A_\mathfrak{p}/\mathfrak{m}_\mathfrak{p}$ is isomorphic to the field of fractions $fr(A/\mathfrak{p})$.
E.4.2
Projective Modules
Definition 144 An A-module P is projective iff it is a direct summand of a free A-module. Equivalently, it is projective if for each pair of homomorphisms $u : P \to N$, $v : M \to N$, v an epimorphism, there is a homomorphism $w : P \to M$ such that $u = v \circ w$.
Let $U \mapsto U^{**}$ be the bidual functor on A-modules. Then a direct summand $U \subset V$ is mapped into a direct summand $U^{**}$ of $V^{**}$. Now, if U is projective, it is a direct summand of a free module $R^{(n)}$. It is easily seen that the bidual map $R^{(n)} \to (R^{(n)})^{**}$ is injective. Therefore, if U is projective, the bidual map $U \to U^{**}$ is also injective. If U is finitely generated and projective, the bidual map is an isomorphism.
Let $i : N \to M$ be the inclusion of a submodule N of an R-module M. For any $x \in M$ and positive exponent r, let
$$\wedge_N(x) : \bigwedge^r N \to \bigwedge^{r+1} M : y \mapsto \bigwedge^r i(y) \wedge x$$
be the linear map defined by the rth exterior power of i and the wedge product with x in the exterior algebra of M. This defines a linear map
$$\wedge_N : M \to Lin_R(\bigwedge^r N, \bigwedge^{r+1} M) \tag{E.28}$$
which has the following property:
Lemma 83 With the above notation, if N is a direct factor of M which is locally free of rank r, then $Ker(\wedge_N) = N$.
Proof. Let $x \in M$. In the special case where N is free of rank r, if $\bigwedge^r N = R.u$ for a basis vector u of $\bigwedge^r N$, the claim $x \in Ker(\wedge_N)$ is equivalent to $u \wedge x = 0$. But this follows from [63, ch.III, §7, no.9, Prop.13]. In the general case, clearly the condition is sufficient. Conversely, suppose $\wedge_N x = 0$. Recall from [64, ch.II, §5 no.3, Th.2] that an R-module is projective of rank $r \in \mathbb{N}$ iff it is locally free of rank r. Notice that localizing commutes with exterior powers ([64, ch.II, §2 no.8]), so we may localize in $f \in R$ such that for the localized element $x_f \in M_f$, we have $\wedge_{N_f} x_f = 0$ and $N_f$ is free of rank r, whence $x_f \in N_f$. As there is a cover of Spec(R) with basic open sets $D(f_i)$ associated with localizations $R_{f_i}$ such that $N_{f_i}$ is free, we deduce that $x \in N$. QED.
E.4.3
Injective Modules
Proposition 88 ([63, ch.II, §2, exercise 11]) For an R-module M, the following properties are equivalent:
(i) The functor $Lin_R(?, M)$ is exact.
(ii) The functor $Lin_R(?, M)$ is exact on short exact sequences.
(iii) For every R-module E and every linear injection $F \rightarrowtail E$, every linear map $F \to M$ extends to a linear map $E \to M$.
(iv) For every ideal $\mathfrak{a} \subset R$ and every linear map $f : \mathfrak{a} \to M$, there is $m \in M$ such that $f(a) = a.m$ for all $a \in \mathfrak{a}$.
(v) M is a direct factor of every module which contains it.
(vi) For every R-module E which is sum of M and of a module I with one generator, M is a direct factor of E.
Definition 145 An R-module M is said to be injective iff it has the equivalent properties of proposition 88. The ring R is said to be self-injective iff it is injective as a (left) module over itself.
Exercise 87 Show the following statement: A direct sum of R-modules is injective iff each factor is.
Example 78 Let 1 < N be a natural number. Then the ring $\mathbb{Z}_N$ is self-injective. In fact, let $N = p_1^{n_1} \cdots p_r^{n_r}$ be its prime factorization (see appendix D.2). Then $\mathbb{Z}_N \cong \mathbb{Z}_{p_1^{n_1}} \times \cdots \times \mathbb{Z}_{p_r^{n_r}}$, and injectivity can be checked on each factor, so suppose $N = p^n$. We apply criterion (iv) of proposition 88. An ideal $\mathfrak{a}$ in $\mathbb{Z}_{p^n}$ is generated by $p^m$, $m \le n$, and we have the isomorphism of $\mathbb{Z}_{p^n}$-modules $\mathfrak{a} \cong \mathbb{Z}_{p^{n-m}}$. Then a linear map $f : \mathbb{Z}_{p^{n-m}} \to \mathbb{Z}_{p^n}$ evaluates to $f(a) = f(a.1_{\mathbb{Z}_{p^{n-m}}}) = a.f(1_{\mathbb{Z}_{p^{n-m}}})$, and we are done. In particular, every free $\mathbb{Z}_N$-module $\mathbb{Z}_N^n$ is injective. The self-injectiveness of $\mathbb{Z}_{p^n}$ also follows from this:
Proposition 89 ([139, proposition 21.5]) Let R be a zero-dimensional local ring. The following are equivalent:
(i) R is self-injective.
(ii) The socle of R is simple.
In fact, the socle of $\mathbb{Z}_{p^n}$ is isomorphic to $\mathbb{Z}_p$, the simple group of order p, and it is well known that $\mathbb{Z}_{p^n}$ is zero-dimensional.
Proposition 90 A finitely generated $\mathbb{Z}_{p^n}$-module M is injective iff it is free of finite rank.
Proof. Clearly, by example 78, a free module of finite rank is injective. Conversely, by the main theorem on finitely generated abelian groups E.2 we have $M \cong \mathbb{Z}_{p^{n_1}} \times \cdots \times \mathbb{Z}_{p^{n_k}}$, $n_i \le n$. By exercise 87 above, it suffices to see that $\mathbb{Z}_{p^m}$ cannot be injective if m < n. In fact, if the injection $\mathbb{Z}_{p^m} \rightarrowtail \mathbb{Z}_{p^n} : x \mapsto p^{n-m}.x$ had a left inverse h, we would have $x = h(p^{n-m}.x) = p^{n-m}.x$ which only works for x = 0, a contradiction. QED.
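Criterion (iv) of proposition 88 can be tested exhaustively for the rings $\mathbb{Z}_N$ of example 78. The sketch below is a brute-force illustration, not the argument of the text: it uses the fact that every ideal of $\mathbb{Z}_N$ is generated by a divisor g of N and that a linear map on this ideal is determined by the image y of g, subject to (N/g).y = 0. The function name is hypothetical.

```python
def is_self_injective_Zn(N):
    """Check criterion (iv) for the ring Z_N by brute force.

    For every ideal (g), g a divisor of N, and every linear map f: (g) -> Z_N,
    look for an element m with f(a) = a*m on the ideal, i.e., with g*m = f(g).
    """
    for g in range(1, N + 1):
        if N % g:
            continue                      # only divisors generate distinct ideals
        for y in range(N):                # candidate value y = f(g)
            if ((N // g) * y) % N:
                continue                  # f would not be well defined
            if not any((g * m) % N == y for m in range(N)):
                return False              # f is not multiplication by any m
    return True

assert is_self_injective_Zn(8)
assert is_self_injective_Zn(12)
```

The search succeeds for every N because the condition (N/g).y = 0 forces y to be a multiple of g, which is the content of the computation in example 78.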
E.4.4
Lie Algebras
Definition 146 For a module L over commutative ring R, a Lie algebra structure is an R-bilinear multiplication $[\ ] : L \times L \to L$, the Lie bracket, such that [xx] = 0 identically, and the Jacobi identity $[x[yz]] + [y[zx]] + [z[xy]] = 0$ holds for all $x, y, z \in L$. A homomorphism of Lie algebras $f : L_1 \to L_2$ is a linear homomorphism such that $f([xy]) = [f(x)f(y)]$ for all $x, y \in L_1$. The corresponding category of Lie algebras over R is denoted by $Lie_R$.
Example 79 If $L \cong R^n$ is free, and if $(x_i)$ is a basis, a Lie algebra structure on L is defined by bilinearity and skew symmetry (which follows from [xx] = 0) of the Lie bracket if the Lie brackets $\sum_k a_{ij}^k x_k = [x_i, x_j]$, i < j, are known. The condition for such a bracket to generate a Lie algebra is
$$a_{ii}^k = 0, \quad \text{all } i, k, \tag{E.29}$$
$$a_{ij}^k + a_{ji}^k = 0, \quad \text{all } i < j, k, \tag{E.30}$$
$$\sum_k \left( a_{ij}^k a_{kl}^m + a_{jl}^k a_{ki}^m + a_{li}^k a_{kj}^m \right) = 0, \quad \text{all } i, j, l, m. \tag{E.31}$$
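Conditions of this kind are straightforward to test for a given table of constants. The sketch below assumes the standard form of the three conditions (vanishing of $a_{ii}^k$, skew symmetry, and the cyclic sum encoding the Jacobi identity) and stores the constants as a nested list with `a[i][j][k]` standing for $a_{ij}^k$; the example data (the cross product on $\mathbb{Z}^3$ and a deliberately broken table) are illustrations chosen here, not taken from the text.

```python
def is_lie_structure(a):
    """Check the three conditions on constants a[i][j][k] = a_{ij}^k."""
    R = range(len(a))
    alternating = all(a[i][i][k] == 0 for i in R for k in R)
    skew = all(a[i][j][k] + a[j][i][k] == 0 for i in R for j in R for k in R)
    jacobi = all(
        sum(a[i][j][k] * a[k][l][m]
            + a[j][l][k] * a[k][i][m]
            + a[l][i][k] * a[k][j][m] for k in R) == 0
        for i in R for j in R for l in R for m in R)
    return alternating and skew and jacobi

def eps(i, j, k):
    """Levi-Civita symbol on the indices 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

# cross product on Z^3: [x_i, x_j] = sum_k eps(i, j, k) x_k
cross = [[[eps(i, j, k) for k in range(3)] for j in range(3)] for i in range(3)]
assert is_lie_structure(cross)

# skew symmetric table that violates the Jacobi identity:
# [x_0, x_1] = x_2, [x_0, x_2] = x_0, [x_1, x_2] = 0
bad = [[[0] * 3 for _ in range(3)] for _ in range(3)]
bad[0][1][2], bad[1][0][2] = 1, -1
bad[0][2][0], bad[2][0][0] = 1, -1
assert not is_lie_structure(bad)
```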
The coefficients $a_{ij}^k$ are called structural constants of the Lie algebra in the given basis.
Example 80 For any module L, the R-algebra of linear endomorphisms End(L) becomes the general linear algebra $\mathfrak{gl}(L)$ by the bracket $[xy] = x \circ y - y \circ x$. A sub-Lie-algebra of a general linear algebra is called a linear Lie algebra. If $L \cong R^n$ is free of rank n, we also write $\mathfrak{gl}(L) = \mathfrak{gl}(n, R)$. Its subalgebra of endomorphisms with vanishing trace (check that it is a sub-Lie-algebra!) is called the special linear algebra and denoted by $\mathfrak{sl}(L)$ or $\mathfrak{sl}(n, R)$.
Example 81 Let L be any R-module with a bilinear product $x \cdot y$ (no other conditions required). A derivation is a linear endomorphism $D : L \to L$ such that $D(x \cdot y) = x \cdot D(y) + D(x) \cdot y$. The set Der(L) is a submodule of End(L), and in fact a Lie subalgebra of the general linear algebra $\mathfrak{gl}(L)$. In particular, if we take the Lie algebra structure on L, Der(L) is another Lie algebra. Observe that for $x \in L$, the left multiplication $ad(x) = [x?] : y \mapsto [xy]$ is a derivation by the Jacobi identity. One has the Lie algebra homomorphism of adjunction
$$ad : L \to Der(L), \tag{E.32}$$
a representation of L in the general linear algebra $\mathfrak{gl}(L)$. A derivation ad(x) is called inner, any other is called outer derivation.
Proposition 91 If x is a nilpotent endomorphism in a linear algebra, then its adjoint ad(x) is also a nilpotent endomorphism.
See [239, p.12] for the easy proof. Suppose that L is a linear algebra in End(V) for an R-module V, and that the conjugation $Int_e$, $e \in GL(V)$, leaves L invariant. Then evidently the conjugation is an automorphism of L. Suppose now that R is a $\mathbb{Q}$-algebra. If ad(x) is nilpotent, then the exponential
$$\exp(ad(x)) = 1 + ad(x) + ad(x)^2/2! + \ldots + ad(x)^k/k! + \ldots \tag{E.33}$$
is defined. We have:
Lemma 84 If ad(x) is nilpotent, then $\exp(ad(x))$ is an automorphism of the Lie algebra L. Moreover, if x is nilpotent, exp(x) is defined and we have $Int_{\exp(x)} = \exp(ad(x))$.
See [239, p.9] for the proof.
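The last identity of lemma 84 can be tried out on matrices. The sketch below is an illustration under assumptions made here: it works in the general linear algebra of 3 x 3 matrices over the rationals, takes $Int_e$ to be the conjugation $y \mapsto e\,y\,e^{-1}$, and uses hypothetical helper names; both exponential series stop after finitely many terms because x, and hence ad(x), is nilpotent.

```python
from fractions import Fraction as Fr

def matmul(X, Y):
    n = len(X)
    return [[sum(X[r][k] * Y[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def add(X, Y):
    return [[a + b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def scale(c, X):
    return [[c * a for a in r] for r in X]

def is_zero(X):
    return all(a == 0 for r in X for a in r)

def identity(n):
    return [[1 if r == c else 0 for c in range(n)] for r in range(n)]

def bracket(x, y):
    """The Lie bracket [xy] = x.y - y.x of the general linear algebra."""
    return add(matmul(x, y), scale(-1, matmul(y, x)))

def exp_nilpotent(x, max_terms=50):
    """exp(x) = 1 + x + x^2/2! + ... for a nilpotent matrix x."""
    total = term = identity(len(x))
    for k in range(1, max_terms):
        term = scale(Fr(1, k), matmul(term, x))       # x^k / k!
        if is_zero(term):
            return total
        total = add(total, term)
    raise ValueError("x does not appear to be nilpotent")

def exp_ad(x, y, max_terms=50):
    """exp(ad(x)) applied to y, following the series (E.33)."""
    total = term = y
    for k in range(1, max_terms):
        term = scale(Fr(1, k), bracket(x, term))      # ad(x)^k (y) / k!
        if is_zero(term):
            return total
        total = add(total, term)
    raise ValueError("ad(x) does not appear to be nilpotent")

def conjugate(x, y):
    """Int_{exp(x)}(y) = exp(x) . y . exp(-x)."""
    return matmul(matmul(exp_nilpotent(x), y), exp_nilpotent(scale(-1, x)))

x = [[0, 1, 2], [0, 0, 3], [0, 0, 0]]     # strictly upper triangular, hence nilpotent
y = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
assert exp_ad(x, y) == conjugate(x, y)
```

Exact fractions are used so that the comparison is an equality and not a floating-point approximation.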
Appendix F
Algebraic Geometry
For this chapter, we refer to [64, 123, 198, 199, 140].
F.1
Locally Ringed Spaces
Given a topological space X, its system of open sets $Open_X$ is viewed as a category with inclusions as morphisms. If $f : X \to Y$ is a continuous map, the inverse image map $U \mapsto f^{-1}U$ defines a functor $Open_f : Open_Y \to Open_X$. This defines a functor $Open_? : Top \to Cat$ into the category of categories and functors$^1$. Let C be a category of sets with some additional algebraic structure, such as the categories Mod, Mon, Gr, Ab, Rings, ComRings of modules, monoids, groups, abelian groups, rings, or commutative rings, respectively. A contravariant functor $F : Open_X \to C$ is called a C-space (this is a presheaf plus the algebraic morphism conditions). For example, a ringed space is just a Rings-space. In the present context, we always suppose that a ringed space is one with values in ComRings, i.e., a commutatively ringed space. The set of C-spaces on X is denoted by $C_X^{spaces}$. The contravariant functor $Open_?$ induces a functor $C_?^{spaces} : X \mapsto C_X^{spaces}$. It maps the continuous map $f : X \to Y$ to the set map $C_f^{spaces} : F \mapsto F \circ Open_f$. The image $F \circ Open_f$ is denoted by $f_*F$ and is called the direct image of F. If $F : Open_X \to C$, $G : Open_Y \to C$ are C-spaces and $f : X \to Y$ is continuous, then an f-morphism $h : F \to G$ is a natural transformation $h : G \to f_*F$. These morphisms define an evident category, the category $C^{spaces}$ of C-spaces. Suppose that the category C has colimits for filtered diagrams$^2$, such as Rings, ComRings, Mod, Gr, Ab, and take a C-space F. For each point $x \in X$, the filtered system $Open_{X,x}$ of open neighborhoods of x defines an object $F_x = colim_{U \in Open_{X,x}} F(U)$, the stalk of F at x. So for a (commutatively) ringed space, this is a (commutative) ring. Let $h : F \to G$ be an f-morphism for $f : X \to Y$. For $x \in X$, we have the restriction $Open_{f,x} : Open_{Y,f(x)} \to Open_{X,x}$ of $Open_f$ to the neighborhood systems $Open_{Y,f(x)}$ and $Open_{X,x}$. This induces a C-morphism $h_x : G_{f(x)} \to F_x$. The subcategory LocRgSpaces of ComRings-spaces consists of all ringed spaces F which have a local ring $F_x$ with maximal ideal $\mathfrak{m}_x$ in each point $x \in X$, and of those morphisms h which induce local morphisms $h_x$ in all stalks, i.e., $h_x^{-1}(\mathfrak{m}_x) = \mathfrak{m}_{f(x)}$. It is called the category of locally ringed spaces. The residue field in a point x of such a space F is the field $\kappa(x) = F_x/\mathfrak{m}_x$. For a section $s \in F(U)$ over the open set U, we denote by s(x) the canonical image of s in $\kappa(x)$. For the category ComMod of modules over commutative rings and dilinear homomorphisms, a ComMod-space F can also be described by the underlying ringed space R and the abelian group space F (same notation), together with a scalar multiplication $R(U) \times F(U) \to F(U)$ in each open set U, and the evident dilinear transition maps. One therefore also says that F is an R-module. Same wording, mutatis mutandis, for an R-algebra or for an R-ideal. If we are given a C-space F on X which is a sheaf, and if $\mathfrak{B}$ is a topological base for X, the restriction $F|\mathfrak{B}$ to this subcategory of $Open_X$ completely determines F. If $U \in Open_X$, $U = colim(B \in \mathfrak{B} \mid B \subseteq U)$, and
$$F(U) = \lim(F(B), B \in \mathfrak{B}, B \subseteq U).$$
Conversely, if we are given a contravariant functor $F : \mathfrak{B} \to C$, we obtain a C-space F' by
$$F'(U) = \lim(F(B), B \in \mathfrak{B}, B \subseteq U)$$
and by the universally given transition morphisms. This presheaf is a sheaf if F is a sheaf on $\mathfrak{B}$, i.e., if for every covering $(B_i)$ of $B \in \mathfrak{B}$ by elements of the base, the canonical application $F(B) \to \prod_i F(B_i)$ is a bijection $x \mapsto (x|B_i)$ onto the tuples $(x_i)$ such that for every i, j and base element $B' \subseteq B_i \cap B_j$, we have $x_i|B' = x_j|B'$.
$^1$ Restricted to a universe, if the limitless collection bothers the reader, or even the category of partially ordered sets with order preserving maps, to stick to reality.
$^2$ Meaning that for any two objects in the diagram quiver, there are two arrows with these domains targeting at a common codomain.
F.2
Spectra of Commutative Rings
Definition 147 The (prime) spectrum is a contravariant functor
$$Spec : ComRings \to LocRgSpaces$$
which is defined as follows: Let A, B be commutative rings, and let $f : A \to B$ be a ring homomorphism.
1. The topological space consists of the set $Spec(A) = \{\mathfrak{p} \text{ a prime ideal in } A\}$. The closed sets are the sets of the form $V(E) = \{\mathfrak{p} \mid E \subseteq \mathfrak{p}\}$ for a subset $E \subseteq A$. Equivalently, a base of open sets is given by the system $D_f = \{\mathfrak{p} \mid f \notin \mathfrak{p}\}$, $f \in A$, and we have $D_f \cap D_g = D_{fg}$.
2. For the base $\mathfrak{D} = \{D_f \mid f \in A\}$, we have a sheaf on $\mathfrak{D}$, which is defined by $D_f \mapsto A_f$, the localization at the saturated multiplicative set S(f) defined by f, a well-defined setup since $D_f = D_g$ iff S(f) = S(g), see [198, 1.1.3.2]. This presheaf is a sheaf on $\mathfrak{D}$, and the associated sheaf on Spec(A) is denoted by $\tilde A$ and called the ring sheaf associated with A. If $\mathfrak{p} \in Spec(A)$ is a prime ideal, we have $(\tilde A)_\mathfrak{p} \cong A_\mathfrak{p}$, i.e., $\tilde A \in LocRgSpaces$.
3. For the homomorphism $f : A \to B$, the inverse image map on prime ideals $Spec(f) : Spec(B) \to Spec(A) : \mathfrak{p} \mapsto f^{-1}\mathfrak{p}$ is defined, and we have $Spec(f)^{-1}(D_g) = D_{f(g)}$, $Spec(f)^{-1}(V(E)) = V(f(E))$, i.e., Spec(f) is continuous. Furthermore, we have a canonical map $f_g : A_g \to B_{f(g)}$ which is natural and therefore induces a morphism $Spec(f) : \tilde A \to \tilde B$ over the continuous (synonymous) map Spec(f). The stalk homomorphism $f_\mathfrak{p} : A_\mathfrak{p} \to B_{Spec(f)(\mathfrak{p})}$, colimit of the natural homomorphisms $f_g : A_g \to B_{f(g)}$, is local.
One therefore has a contravariant functor Spec as announced, and one often denotes Spec(A), when meaning the locally ringed space $\tilde A$ over Spec(A).
Theorem 56 The functor Spec is fully faithful, the inverse global section functor $\Gamma$ of a LocRgSpaces-morphism $u : \tilde B \to \tilde A$ is given by the ring homomorphism $u(Spec(A)) : A = \tilde A(Spec(A)) \to \tilde B(Spec(B)) = B$.
See [198, 1.1.6.3] for a proof. Since one often writes $F(U) = \Gamma(U, F)$ and calls the elements sections above U, the theorem's notation is justified by the global section notation $\Gamma(\tilde A) = \Gamma(Spec(A), \tilde A)$. Let Aff be the full subcategory of LocRgSpaces consisting of the objects which are isomorphic to prime spectra. These spaces are called affine schemes. We therefore have that the map $Spec : ComRings \to Aff$ is an equivalence of categories. If M is an A-module, we have an $\tilde A$-module $\tilde M$, whose sections on the base $\mathfrak{D}$ are defined by $\Gamma(D_g, \tilde M) = A_g \otimes M = M_g$, the localization of M at the multiplicative set S(g).
Proposition 92 The map $M \mapsto \tilde M$ is an exact$^3$ and fully faithful functor from the category of A-modules $Mod_A$ to the category $Mod_{\tilde A}$ of $\tilde A$-modules. It also commutes with colimits of modules, with tensor products, Hom-modules, with sums and intersections of submodules. The inverse to this functor is the global section functor $\tilde M \mapsto \Gamma(Spec(A), \tilde M)$, which is also exact. See [198, 1.1.3] for a proof.
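The basic open and closed sets of a spectrum can be listed explicitly for a small ring. The following sketch assumes the special case $A = \mathbb{Z}_n$, whose prime ideals are the ideals (p) for the prime divisors p of n; a prime ideal is represented by the number p, and the function names `spec`, `D` and `V` are hypothetical stand-ins for the notation of definition 147.

```python
def spec(n):
    """Prime ideals of Z_n, each represented by its generating prime p | n."""
    return [p for p in range(2, n + 1)
            if n % p == 0 and all(p % q for q in range(2, int(p ** 0.5) + 1))]

def D(n, f):
    """Basic open set D_f: the primes (p) that do not contain f."""
    return {p for p in spec(n) if f % p != 0}

def V(n, E):
    """Closed set V(E): the primes (p) that contain every element of E."""
    return {p for p in spec(n) if all(e % p == 0 for e in E)}

n = 60
assert spec(n) == [2, 3, 5]
# D_f and D_g intersect in D_fg
assert all(D(n, f) & D(n, g) == D(n, (f * g) % n)
           for f in range(n) for g in range(n))
# V({f}) is the complement of D_f
assert all(V(n, {f}) == set(spec(n)) - D(n, f) for f in range(n))
```

Membership of a residue f in (p) is tested by divisibility of a representative, which is well defined because p divides n.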
The modules in $Mod_{\tilde A}$ which are hit by this tilding process are the quasi-coherent ones: A module M over a ringed space A over a topological space X is quasi-coherent iff there is a covering $X_i$ of X such that each restriction $M_i = M|X_i$ is the cokernel of a homomorphism $h : A_i^{(I)} \to A_i^{(J)}$, where $A_i = A|X_i$.
Theorem 57 An $\tilde A$-module M is isomorphic to a module $\tilde M$ iff it is quasi-coherent.
See [198, 1.1.4.1] for a proof. This means that we have an equivalence of categories of quasi-coherent modules over $\tilde A$ and $Mod_A$ (with linear homomorphisms). For a ring element $f \in A$, one has $Spec(A_f) \cong D_f$. When restricting the associated ring and module sheaves $\tilde A$, $\tilde M$ to basic open sets $D_f$, this yields ring and module sheaves which are isomorphic to $\tilde{A_f}$, $\tilde{M_f}$.
$^3$ A sequence $K \xrightarrow{\;f\;} M \xrightarrow{\;g\;} L$ of linear homomorphisms of modules is exact in M iff Im(f) = Ker(g). Such exact sequences are preserved by the functor.
Theorem 58 Let M be an A-module. The following conditions are equivalent:
(i) M is projective$^4$ and finitely generated.
(ii) There is a finite family $(f_i)$ of elements of A which generate the ideal A, i.e., $Spec(A) = \bigcup_i D_{f_i}$, such that the localizations $M_{f_i} = \Gamma(D_{f_i}, \tilde M)$ are free of finite rank over $A_{f_i}$.
This is why one can also define a finitely generated projective module as being a locally free module of (locally defined) finite ranks. If the locally constant rank is constant n, the module is said to be locally free of rank n.
F.2.1
Sober Spaces
A topological space X is irreducible iff every non-empty open subset is dense, or, equivalently, iff any two non-empty open sets have a non-empty intersection. A subset of a topological space is called irreducible if it is so with its relative topology. A point x of an irreducible space X is said to be generic iff its (always irreducible) closure $\overline{\{x\}}$ is X. We say that a point x dominates a point y, in signs $x > y$, iff $\overline{\{y\}} \subseteq \overline{\{x\}}$. This is a partial order relation on X. An irreducible component of a space X is a maximal irreducible subset. Sorite 15 These are the sorite properties concerning irreducibility:
(i) A subset of a topological space is irreducible iff its closure is. (ii) Irreducible components are closed. (iii) Every irreducible subset is contained in an irreducible component, in particular, a topological space is the union of its irreducible components. (iv) The image f(E) of an irreducible subset $E \subseteq X$ under a continuous map $f : X \to Y$ is irreducible.
Definition 148 A topological space X is sober iff each closed irreducible subset has a unique generic point. Call Sob the full subcategory of the category Top of topological spaces consisting of sober spaces. If A is a commutative ring, and if $E \subseteq Spec(A)$, then we denote $J(E) = \bigcap_{p \in E} p$, and $\overline{E} = V(J(E))$. This ideal is prime iff E is irreducible. In this case, $\overline{E} = \overline{\{J(E)\}}$. In fact, for two points p, q in Spec(A), $p > q$ iff $p \subseteq q$. In particular, Spec(A) is a sober space. Its irreducible components correspond to the minimal prime ideals.
Proposition 93 The canonical injection $j : Sob \to Top$ has a left adjoint $?^s : Top \to Sob$.
4 See definition 144 in appendix E.4.2.
Proof idea. This adjoint associates with any X a sober space $X^s$ which is defined as follows. Its points are the irreducible closed sets in X. The open sets are the sets $V^s = \{Y \in X^s \mid Y \cap V \neq \emptyset\}$, where V varies over all open sets in X. Clearly, $(V \cap W)^s = V^s \cap W^s$ and $(\bigcup_i W_i)^s = \bigcup_i W_i^s$ for any family $(W_i)$ of open sets. On continuous maps $f : X \to Y$, the functor acts via $f^s : X^s \to Y^s : E \mapsto \overline{f(E)}$. One has a canonical continuous map $q_X : X \to X^s : x \mapsto \overline{\{x\}}$ and a commutative diagram of continuous maps:
$$\begin{array}{ccc} X & \xrightarrow{\;f\;} & Y \\ {\scriptstyle q_X}\downarrow & & \downarrow{\scriptstyle q_Y} \\ X^s & \xrightarrow{\;f^s\;} & Y^s \end{array} \tag{F.1}$$
The map $q_X : X \to X^s$ is a homeomorphism if X is sober. The adjunction is given by the mutually reciprocal maps $Top(X, j(Y)) \to Sob(X^s, Y) : f \mapsto q_Y^{-1} \circ f^s$ and $Sob(X^s, Y) \to Top(X, j(Y)) : g \mapsto g \circ q_X$. Lemma 85 The canonical continuous map $q_X : X \to X^s$ is a quasi-homeomorphism, i.e., the inverse image map $2^{X^s} \to 2^X$ is a bijection between the open sets of $X^s$ and those of X.
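For a finite topological space, the construction of $X^s$ and the statement of lemma 85 can be checked by brute force. The following Python lines are only a sketch under our own assumptions: the function names are hypothetical, the empty set is not counted as irreducible, and the example is the indiscrete space on two points, which is not sober.

```python
def closed_sets(points, opens):
    """Closed sets are the complements of the open sets."""
    return {frozenset(points) - v for v in opens}

def is_irreducible(c, closed):
    """A closed set is irreducible iff it is non-empty (our convention)
    and not the union of two proper closed subsets."""
    proper = [d for d in closed if d < c]
    return bool(c) and all(a | b != c for a in proper for b in proper)

def closure(x, closed):
    """Closure of the point x: intersection of all closed sets containing x."""
    return frozenset.intersection(*[c for c in closed if x in c])

def soberification(points, opens):
    """Return (points of X^s, the assignment V -> V^s, the map q_X)."""
    closed = closed_sets(points, opens)
    xs = {c for c in closed if is_irreducible(c, closed)}
    vs = {v: frozenset(y for y in xs if y & v) for v in opens}
    q = {x: closure(x, closed) for x in points}
    return xs, vs, q

# Indiscrete space on two points: both points have the same closure.
points = {"a", "b"}
opens = {frozenset(), frozenset(points)}
xs, vs, q = soberification(points, opens)
```

In this example $X^s$ has a single point, $q_X$ is not injective, but the inverse image map still matches the open sets of $X^s$ with those of X.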
F.3
Schemes and Functors
A scheme $(X, O_X)$ is a ringed space $O_X$ on X which locally is isomorphic to a spectrum of a commutative ring, i.e., there is an open covering $(X_i)$ of X and a family $A_i$ of rings such that $(X_i, O_X|X_i) \cong Spec(A_i)$. The category Schemes of schemes is the subcategory of LocRgSpaces whose objects are schemes. By Yoneda, we have a fully faithful functor $Y : Schemes \to Schemes@$.
Proposition 94 The restriction $Y_{Aff} : Schemes \to Aff@$ is fully faithful. Equivalently, the corresponding functor $Y_{ComRings} : Schemes \to ComRings@$ into the category ComRings@ of covariant set-valued functors on ComRings is fully faithful. This means that we may consider schemes as special covariant functors on the category of commutative rings. The functors which correspond to schemes are characterized by a sheaf condition:
Property 3 We are given a functor $G \in ComRings@$. If, for every ring $A \in ComRings$ and every finite family $(f_i)$ of elements of A which generate A as an ideal, the diagram
$$G(A) \to \prod_i G(A_{f_i}) \rightrightarrows \prod_{i,j} G(A_{f_i f_j})$$
is exact, we say (by abuse of language, but theoretically justifiable) that G is a sheaf.
Then the full subcategory of ComRings@ consisting of sheaves G in the sense of property 3, together with the property that there is a family of rings $R_t$ and morphisms$^5$ $a_t : R_t@ \to G$ such that for every field K, $G(K) = \bigcup_t R_t@K$, comprises the functors which are isomorphic to images of schemes under the Yoneda map $Y_{ComRings}$, see [198, 1.2.3.6] and [140, VI.2] for details. This means that schemes are characterized without any reference to the geometry of ringed spaces. See also [314, III.3] for the relation of this setup to the systematic topos-theoretic restatement of the schemes in terms of the Zariski site. The most important universal property of the category of schemes is that it has fiber products. In the affine case, we have $Spec(A) \times_{Spec(C)} Spec(B) \cong Spec(A \otimes_C B)$.
F.4
Algebraic and Geometric Structures on Schemes
If a scheme is viewed as a set-valued functor on rings, the sets may also be enriched by algebraic structures, such as groups, monoids, etc., to yield a category C. We then view a scheme as a functor $S : ComRings \to C$, and say that S is a C-scheme, for example, an abelian group scheme if C = Ab.
Example 82 For $0 \le n$, we have the additive group scheme $A^n$ whose functor is $A^n(R) = R^n$, with the canonical addition of this free module, and the canonical transitions $A^n(R) \to A^n(S)$ for a ring homomorphism $f : R \to S$. Example 83 The n-dimensional linear group scheme is given by the functor $R \mapsto GL(n, R) \subset M_{n,n}(R) \cong A^{n^2}(R)$, together with the canonical map $GL(n, R) \to GL(n, S)$ for a ring homomorphism $f : R \to S$. The set GL(n, R) is defined as the set of $n \times n$-matrices M with invertible determinant: $det(M) \in R^\times$. The functor is represented by the affine scheme $GL_n = Spec(\mathbb{Z}[X_{ij}, 1 \le i, j \le n]_{det})$, where $det = Det(X_{ij})$. The group structure is the multiplication of invertible matrices.
F.4.1
The Zariski Tangent Space
For a field K, we are given a K-scheme X, i.e., a scheme $s : X \to Spec(K)$ in the comma category Schemes/Spec(K) (see section G.2.1). Suppose that we have a K-rational point $x : Spec(K) \to X$, i.e., a section of s. This means that the corresponding K-algebra $O_{X,x}$ has an isomorphism $K \cong \kappa_{X,x} = O_{X,x}/m_{X,x}$. The Zariski tangent space in x is the linear K-dual $T_{X,x} = (m_{X,x}/m_{X,x}^2)^*$. Consider the K-scheme $D_K = Spec(K[\varepsilon])$ over the dual numbers $K[\varepsilon]$, see example 76 in appendix D.1.1. It has the K-rational point $\varepsilon : Spec(K) \to D_K$ corresponding to the projection $K[\varepsilon] \to K : \varepsilon \mapsto 0$. Proposition 95 With the above hypotheses and notation, there is a bijection between the elements t of the Zariski tangent space $T_{X,x}$ and the morphisms $\tau : D_K \to X$ of K-schemes which map the K-rational point $\varepsilon$ to the K-rational point x.
5 $R_t@$ is the covariant functor on rings, i.e., $R_t@R = Hom_{ComRings}(R_t, R)$.
See [140, VI.1.3] for a proof. In particular, if the scheme X is given by its functor on rings, this means that the tangents are special elements of the evaluation of the functor in dual numbers $X(K[\varepsilon])$. For example, if $X = A_K = A^1 \times_{\mathbb{Z}} K$, we have tangents $x + \varepsilon.\tau$, $\tau \in K$, over the rational point $x \in K \cong A_K(K)$.
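The description of tangents as points with values in the dual numbers can be tried out numerically. The sketch below is ours, not the text's: it models $K[\varepsilon]$ for $K = \mathbb{Q}$ by a small hypothetical class `Dual`, takes as an assumed example the plane curve $y^2 = x^2(x + 1)$, and calls a vector (s, t) tangent at (x, y) iff $(x + s\varepsilon, y + t\varepsilon)$ satisfies the equation in $K[\varepsilon]$.

```python
from fractions import Fraction

class Dual:
    """Element a + b*eps of K[eps] with eps*eps = 0; here K = Q."""
    def __init__(self, a, b=0):
        self.a, self.b = Fraction(a), Fraction(b)

    @staticmethod
    def lift(v):
        return v if isinstance(v, Dual) else Dual(v)

    def __add__(self, other):
        o = Dual.lift(other)
        return Dual(self.a + o.a, self.b + o.b)

    def __sub__(self, other):
        o = Dual.lift(other)
        return Dual(self.a - o.a, self.b - o.b)

    def __mul__(self, other):
        o = Dual.lift(other)
        # (a + b eps)(c + d eps) = ac + (ad + bc) eps, since eps^2 = 0
        return Dual(self.a * o.a, self.a * o.b + self.b * o.a)

    def is_zero(self):
        return self.a == 0 and self.b == 0

def nodal_cubic(x, y):
    """Assumed example curve: y^2 = x^2 (x + 1)."""
    return y * y - x * x * (x + 1)

def is_tangent(curve, point, vector):
    """(s, t) is tangent at (x, y) iff (x + s eps, y + t eps) lies on the curve."""
    (x, y), (s, t) = point, vector
    return curve(Dual(x, s), Dual(y, t)).is_zero()
```

At the smooth point (-1, 0) only the vertical direction passes the test, whereas at the singular point (0, 0) every vector does, so the tangent space there is two-dimensional.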
F.5
Grassmannians
A subfunctor $G \subseteq F$ in ComRings@ is open iff for every morphism $a : R@ \to F$ (corresponding to an element $a \in F(R)$ via Yoneda), the fiber product projection $G \times_a R@ \to R@$ is isomorphic to the functor of an open subscheme of Spec(R). Clearly, then, if $b : X@ \to F$ is a morphism from a representable functor X@ of a scheme X, then the projection $G \times_b X@ \to X@$ is isomorphic to an open subscheme of X. An open covering of a functor F in ComRings@ is a family $(g_i : G_i \to F)$ of open subfunctors of F such that the fiber product projections $G_i \times_b X@ \to X@$ for morphisms $b : X@ \to F$ from the representable functor of a scheme X define an open covering of X. For example, the open subfunctors of an affine scheme Spec(R) are the functors $F_I : ComRings \to Sets$ of form $F_I(S) = \{f : R \to S \mid f(I)S = S\}$, where I is an ideal in R. The Grassmann scheme $Grass_{r,n}$ is defined for any couple $0 \le r \le n$ of natural numbers by the functor $Grass_{r,n}(R) = \{V \subseteq R^n \mid R^n/V \text{ locally free of rank } r\}$, which for a ring homomorphism $R \to S$ maps the exact sequence
$$0 \to V \to R^n \to R^n/V \to 0$$
to the exact sequence
$$0 \to Im(S \otimes_R V) \to S^n \to S \otimes_R (R^n/V) \to 0,$$
where the image of the tensorized space $S \otimes_R V$ is the image of V under this map. The locally free quotient remains locally free since the localization on R carries over to a localization over S: For $f \in R$ and its image $f' \in S$, we have $(S \otimes_R R^n/V)_{f'} \cong S_{f'} \otimes_{R_f} (R^n/V)_f$. The functor $Grass_{r,n}$ is covered by the following open subfunctors. Let $i_\bullet = i_1, i_2, \ldots, i_r$ be an increasing subsequence $1 \le i_1 < i_2 < \ldots < i_r \le n$. We have the affine open subfunctors $Grass_{n,r,i_\bullet}(R)$ of those submodules $V \subseteq R^n$ such that the factor $R^{i_\bullet} = \bigoplus_{j=1,\ldots,r} R.e_{i_j}$ of $R^n$ projects isomorphically onto the quotient $R^n/V$. If $i'_\bullet$ denotes the complementary increasing sequence, $Grass_{n,r,i_\bullet}(R)$ identifies to the set of graphs $\Gamma_f$ of linear maps in $Lin_R(R^{i'_\bullet}, R^{i_\bullet})$, i.e., to $n \times (n - r)$-matrices with columns $(e_{i'_k}, f(e_{i'_k}))^t$. In fact, the isomorphism $R^{i_\bullet} \cong R^n/V$ corresponds to an isomorphism $V \cong R^{i'_\bullet}$, and this makes V a graph of a linear map in $Lin_R(R^{i'_\bullet}, R^{i_\bullet})$. The fact that these open subfunctors (represented by affine schemes $A^{r \times (n-r)}$) cover the Grassmannian results from the situation over a field, where the covering is evident.
Proposition 96 Let n be a positive natural number. Then the subfunctor
$$B_n : R \mapsto \{x \in R^n \mid x \text{ is part of a basis of } R^n\} \tag{F.2}$$
of the affine n-space $A^n$ over $\mathbb{Z}$ is an open subscheme.
Proof. Consider the open subscheme $GL_n$ of $A^{n^2}$. Then $B_n$ is the image of $GL_n$ under the projection onto the first column (nlj) which by [199, IV/2, 2.4.7] is (universally) open. QED. Lemma 86 If X is an S-scheme, E a quasi-coherent $O_X$-module, then any section $s : S \to Grass_r(E)$ is a closed immersion.
Proof. By [198, Proposition (9.7.7)], $Grass_r(E)$ is separated over S, and by [198, Corollaire (5.2.4)], a section of such a structural morphism is a closed immersion. QED. Lemma 87 If R is a product of local rings of finite length, then for two elements x, y of an R-module M, $R^\times x = R^\times y$ iff these elements generate the same space, i.e., R.x = R.y.
Proof. Without loss of generality, we may suppose that R is local with nilpotent maximal ideal m. Clearly, the condition is sufficient. Suppose now that $R^\times x \neq R^\times y$, and therefore $R^\times x \cap R^\times y = \emptyset$. Then, R.x = R.y implies $R^\times x \subset m.y$, since $R = R^\times \cup m$. But then we have $x \in m.y$, and symmetrically $y \in m.x$, which gives $x \in m^k.y$ for all powers k, and m being nilpotent yields x = y = 0, a contradiction. QED.
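Lemma 87 can be tested by exhaustion in a small case. The sketch below assumes the special situation where the module is the ring itself, $M = R = \mathbb{Z}/n$, for instance $n = 72$ with $\mathbb{Z}/72 \cong \mathbb{Z}/8 \times \mathbb{Z}/9$, a product of two local rings of finite length. The function names are hypothetical.

```python
from math import gcd

def units(n):
    """The group of units of Z/n."""
    return [u for u in range(n) if gcd(u, n) == 1]

def unit_orbit(x, n):
    """The set R^x . x inside M = R = Z/n."""
    return frozenset(u * x % n for u in units(n))

def submodule(x, n):
    """The submodule R.x generated by x inside M = R = Z/n."""
    return frozenset(r * x % n for r in range(n))

def lemma_holds(n):
    """Check: the unit orbits of x and y agree iff R.x = R.y, for all x, y."""
    orbits = [unit_orbit(x, n) for x in range(n)]
    spans = [submodule(x, n) for x in range(n)]
    return all((orbits[x] == orbits[y]) == (spans[x] == spans[y])
               for x in range(n) for y in range(n))
```

Such a check says nothing about general modules M; it only illustrates the statement in the cyclic case.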
F.6
Quotients
If G is a finite group, and if $(X, O_X)$ is a scheme, a group action of G on X can be given by a group homomorphism $a : G \to Aut(X)$. This can also be seen as a morphism of schemes $a' : G_{\mathbb{Z}} \times_{Spec(\mathbb{Z})} X \to X$ with the functorially described axioms of group actions associated with the functors of the schemes $G_{\mathbb{Z}}$, X. The scheme $G_{\mathbb{Z}} = Spec(\mathbb{Z}^G)$ is a group scheme whose multiplication is associated with the group multiplication $\mu : G \times G \to G$ via the ring homomorphism $\mu' : \mathbb{Z}^G \to \mathbb{Z}^{G \times G} \cong \mathbb{Z}^G \otimes_{\mathbb{Z}} \mathbb{Z}^G$. The scheme $G_{\mathbb{Z}}$ is finite and locally free over $\mathbb{Z}$. The set-theoretic orbits of the action a are the equivalence classes defined by the relation on the product set $X \times X$, image of the set map $G \times X \to X \times X : (g, x) \mapsto (a(g).x, x)$. If we use the scheme-theoretic map $a' : G_{\mathbb{Z}} \times_{Spec(\mathbb{Z})} X \to X$, the cokernel functor of the pair $pr_2, a' : G_{\mathbb{Z}} \times_{Spec(\mathbb{Z})} X \rightrightarrows X$ of functor morphisms, if it exists, is called the scheme functor of orbits of X under the action of G. We have this particular case of [123, III, 2.6.1]: Theorem 59 With the above notation, if G is a finite group and $a' : G_{\mathbb{Z}} \times_{Spec(\mathbb{Z})} X \to X$ the group action associated with an 'abstract' action $a : G \to Aut(X)$ on the scheme X, such that every set-theoretic orbit is contained in an affine open subscheme of X, then there is a scheme-functor of orbits $Y = coker(pr_2, a')$ and the associated diagram of schemes (qua locally ringed spaces)
$$G_{\mathbb{Z}} \times_{Spec(\mathbb{Z})} X \rightrightarrows X \to Y$$
is exact.
Appendix G
Categories, Topoi, and Logic
For a comprehensive introduction to category theory, see [313]. For topos theory and sheaves see [314], for topos theory and logic, see [186].
G.1
Categories Instead of Sets
One may rebuild mathematics from categories rather than from sets. In this framework, the most radical approach is the arrow-only definition of a category$^1$: Definition 149 A category C is a collection of objects f, g, h, ... which are called morphisms, together with a partial composition $f \circ g$ which yields morphisms of C. An identity is a morphism e such that, whenever defined, we have $e \circ f = f$ and $g \circ e = g$. We have these axioms:
1. Whenever one of the two compositions $(f \circ g) \circ h$, $f \circ (g \circ h)$ is defined, both are defined and they are equal; we denote the resulting morphism by $f \circ g \circ h$.
2. If $f \circ g$, $g \circ h$ are both defined, $(f \circ g) \circ h$ is defined.
3. For every morphism f there are two identities, a 'left' identity $e_L$ and a 'right' identity $e_R$, such that $e_L \circ f$, $f \circ e_R$ are defined (and necessarily equal to f).
It is easily seen that two right (left) identities of a morphism f are necessarily equal; they are called the domain of f (codomain of f) and are denoted by dom(f) (codom(f)). To make domain and codomain evident, one also writes $f : a \to b$ with a = dom(f), b = codom(f) instead of f. For two morphisms a, b, the collection of those f with dom(f) = a, codom(f) = b is denoted by Hom(a, b), $Hom_C(a, b)$, C(a, b), ... according to the specific situation. Evidently, no morphism can be a member of Hom(a, b) and of Hom(a', b') if either $a \neq a'$ or $b \neq b'$, i.e., the Hom collections form a partition of C (in the non-set-theoretic common sense).
1 Mac Lane calls this type of set-less categories "metacategories", and reserves the proper term "category" for metacategories which are built upon sets. We do however preconize the foundational character of metacategories and therefore omit the "meta" prefix. However, we then should provide a germ for existing categories, in order to get off the ground as with axiomatic set theory. See [314, VI.10] for a discussion of the foundation of mathematics via topoi.
Exercise 88 Two identities e, e' of C can be composed iff they are equal, and then $e \circ e = e$ (identities are idempotent).
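The arrow-only point of view of definition 149 can be made concrete for a finite category. In the following sketch (our own encoding, not the text's) a category is a set of morphism names together with a partial composition table, where `comp[(f, g)]` stands for $f \circ g$; identities, domains and codomains are then computed from the table alone. The example is the category with one non-identity arrow $f : a \to b$.

```python
def identities(mors, comp):
    """Morphisms e with e∘f = f and g∘e = g whenever these are defined."""
    return {e for e in mors
            if all(comp[(e, f)] == f for f in mors if (e, f) in comp)
            and all(comp[(g, e)] == g for g in mors if (g, e) in comp)}

def dom(f, mors, comp):
    """The unique right identity of f."""
    (e,) = [e for e in identities(mors, comp) if (f, e) in comp]
    return e

def codom(f, mors, comp):
    """The unique left identity of f."""
    (e,) = [e for e in identities(mors, comp) if (e, f) in comp]
    return e

def associative(mors, comp):
    """Axiom 1: (f∘g)∘h and f∘(g∘h) are defined together and then equal."""
    for f in mors:
        for g in mors:
            for h in mors:
                left = comp.get((comp[(f, g)], h)) if (f, g) in comp else None
                right = comp.get((f, comp[(g, h)])) if (g, h) in comp else None
                if left != right:
                    return False
    return True

# Objects are identified with their identities "a" and "b".
mors = {"a", "b", "f"}
comp = {("a", "a"): "a", ("b", "b"): "b", ("f", "a"): "f", ("b", "f"): "f"}
```

Note that the two distinct identities "a" and "b" have no entry in the table, in accordance with exercise 88.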
In a more conservative understanding of categories, the identities are associated with the "objects" of a category, which are a second type of concepts, but do not enrich the category except in the way it is constructed. The identification of objects and identities is carried out as often as possible in our present text. In either case, the collection of identities (qua objects) is denoted by $_0C$ or Ob(C), whereas the morphisms are denoted by $_1C$ or Mor(C). To stress the morphic character of an identity e (in contrast to the underlying object in the conservative understanding), one also writes $Id_e$ instead of e. In a category, a morphism f is mono, a monomorphism, iff for any two compositions $f \circ g$, $f \circ g'$, the equality $f \circ g = f \circ g'$ implies g = g'. The morphism f is epi, an epimorphism, iff for any two compositions $g \circ f$, $g' \circ f$, the equality $g \circ f = g' \circ f$ implies g = g'. The morphism f is called a section if there is a left inverse g, i.e., $g \circ f = dom(f)$; f is called a retraction if it has a right inverse h, i.e., $f \circ h = codom(f)$. A morphism f that is a section and a retraction is iso, an isomorphism. If dom(f) = codom(f), the morphism is called endo, an endomorphism. An endomorphism which is an isomorphism is called auto, an automorphism. The collection of endomorphisms for a domain c is denoted by End(c), whereas the collection of automorphisms for c is denoted by Aut(c). If these collections are sets, they define monoids End(c) and groups Aut(c) with the identity $Id_c$ as unit. Exercise 89 The composition of two monomorphisms, epimorphisms, isomorphisms, endomorphisms, or automorphisms, if defined, again has the respective property.
G.1.1
Examples
Example 84 The category Sets of all sets. The morphisms are the set maps between existing sets, and the composition is the usual composition of set maps. Remark 28 Usually, the delicate comprehension axiom which can cause contradictory constructions of sets, is avoided by a strong restriction of the available sets. One takes a very large set U, which has the properties of a "universe", i.e., it is stable in the following sense:
• If $x \in U$, then $x \subset U$;
• If $x, y \in U$, then $\{x, y\} \in U$;
• If $x \in U$, then $2^x \in U$ (the set of all subsets, the powerset);
• A set $\mathbb{N}$ of all natural$^2$ numbers is an element of U;
• If $f : x \to y$ is a surjective function with $x \in U$, $y \subset U$, then $y \in U$.
One then restricts the Sets objects to the elements of the universe U and says that these are small sets. We denote such a category of small sets by $Sets_U$.
2 For example the set of finite ordinals $0 = \emptyset, 1 = \{0\}, 2 = \{0, 1\}, \ldots, n, n^+ = n \cup \{n\}, \ldots$
Example 85 Given a quiver $Q = (head, tail : A \rightrightarrows V)$ (see section C.2.2), the path category P(Q) has the paths as morphisms, the identities are the lazy paths, and the composition is the path composition. Here, the vertexes are separate concepts which can be identified (and in fact are identified in our construction) with the lazy paths. All paths are mono and epi, but only the identities are isomorphisms. The terminology "quiver" stems from algebra; in category theory, a quiver is better known as a "diagram scheme". Relations among paths give rise to quotient categories as follows: Suppose that we are given any binary relation $\sim$ between some paths of equal domain and codomain. Consider the smallest equivalence relation $\sim'$ among paths which contains $\sim$ and is a 'two-sided ideal' in the sense that for $f \sim' g$ with dom(f) = dom(g) = d, codom(f) = codom(g) = c and h, k with dom(h) = c, codom(k) = d, we have $h \circ f \sim' h \circ g$ and $f \circ k \sim' g \circ k$. Then we obtain a new category, the quotient category $P(Q)/\sim$, and its morphisms are the equivalence classes of paths, while the composition is the composition of representatives of these classes. In the language of category theory, the relation $\sim$ is called a commutativity relation of the given diagram scheme. Example 86 Fix a ring R. The matrix category over R is the collection $M_R$ of all $m \times n$-matrices $M = (m_{i,j})$ with coefficients in R, for any row and column numbers m, n, together with the usual matrix multiplication $M \cdot N$ as composition. The identities are the identity matrices $E_n$, n = 1, 2, ... (over R). We evidently have $Hom_{M_R}(E_n, E_m) = M_{m,n}(R)$. In particular, the vectors in $R^n$ are identified with the morphisms in $Hom_{M_R}(E_1, E_n) = M_{n,1}(R)$. Example 87 Given a category C, the isomorphism classes of C-objects define a skeleton category C/iso: For each isomorphism class, select a representative and then consider the full subcategory$^3$ of C on these representative objects.
Clearly C/iso is defined up to isomorphism of categories (see below G.1.2) and no two skeleton objects are isomorphic. Example 88 Common examples of categories are the categories Mon of monoids, Gr of groups, Rings of rings with ring homomorphisms, $LinMod_R$ of R-modules with linear homomorphisms, LinMod of modules with dilinear homomorphisms, $Mod_R$ of R-modules with affine homomorphisms, Mod of modules with diaffine homomorphisms, or Top of topological spaces with continuous maps. Example 89 For every category C we have the opposite category $C^{opp}$. Its morphisms are the same, but composition works via $f \circ^{opp} g = g \circ f$, i.e., it is defined iff the composition with opposite factors is defined in C. This opposite construction exchanges the domains and codomains of morphisms. Intuitively, an arrow $f : x \to y$ in C becomes an arrow $f : y \to x$ in $C^{opp}$.
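The matrix category of example 86 lends itself to a quick experiment. The sketch below is an assumption-laden illustration over $R = \mathbb{Z}$ with matrices stored as lists of rows; the function names are ours. Composition is the matrix product and is only partially defined, and the domain and codomain of a matrix are identity matrices.

```python
def identity(n):
    """The identity matrix E_n."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def compose(m, n):
    """m ∘ n = m · n; defined iff the column number of m is the row number of n."""
    if len(m[0]) != len(n):
        return None  # composition not defined
    return [[sum(m[i][k] * n[k][j] for k in range(len(n)))
             for j in range(len(n[0]))]
            for i in range(len(m))]

def dom(m):
    """Right identity of m: E_n for a matrix with n columns."""
    return identity(len(m[0]))

def codom(m):
    """Left identity of m: E_m for a matrix with m rows."""
    return identity(len(m))

# A morphism E_3 -> E_2, i.e. an element of M_{2,3}(Z).
M = [[1, 2, 3],
     [4, 5, 6]]
```

Here `compose(M, M)` is undefined, since M is not an endomorphism, and two different identity matrices cannot be composed either.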
G.1.2
Functors
Functors are the morphisms between categories:
Definition 150 If C, D are categories, a functor $F : C \to D$ is a function which assigns to every morphism c in C a morphism F(c) in D such that
(i) F(c) is an identity if c is so, (ii) if $c \circ d$ is defined in C, then $F(c) \circ F(d)$ is defined and $F(c \circ d) = F(c) \circ F(d)$.
3 See example 90 below.
In particular, functors carry isomorphisms to isomorphisms. Moreover, the composition $G \circ F : C \to E$ of two functors $F : C \to D$, $G : D \to E$ is a functor. Two categories C, D are called isomorphic if there exists a functor isomorphism, i.e., two functors $F : C \to D$, $F^{-1} : D \to C$ such that $F^{-1} \circ F = Id_C$, $F \circ F^{-1} = Id_D$. A functor is called full iff $F(Hom_C(x, y)) = Hom_D(F(x), F(y))$ for all object pairs x, y. It is called faithful iff $F : Hom_C(x, y) \to Hom_D(F(x), F(y))$ is injective for all pairs x, y. It is called fully faithful iff it is full and faithful, i.e., the map $F : Hom_C(x, y) \to Hom_D(F(x), F(y))$ is a bijection. Functors are also called "covariant" since they are opposed to functors $F : C^{opp} \to D$, which are called "contravariant" but then also denoted by $F : C \to D$. One often considers systems of morphisms in a category C, which are defined by a graphical approach: diagrams. Here is the precise definition. Definition 151 A diagram in a category C is a functor $\Delta : P(Q) \to C$, where Q is a quiver. The diagram $\Delta$ is said to commute with respect to a relation $\sim$ among Q-paths, iff $\Delta$ factorizes through $P(Q)/\sim$. If the relation is maximal (it identifies all paths having common domain and codomain), then the diagram is said to be commutative without specification of $\sim$. By the very definition of a path category, diagrams are given by systems of morphisms in C which cope with the domain-codomain configuration in the underlying quiver (i.e., diagram scheme). Example 90 If C is a category, a subcategory is a sub-collection C' of C such that for each morphism f in C', its domain and codomain are also in C', and such that for any two f, g in C' such that $f \circ g$ is defined in C, the composition is also a morphism in C'. A subcategory can be defined by an arbitrary selection of objects (identities) out of C and the full collections of morphisms having these identities as domains and codomains. Such a subcategory is called a full subcategory of C.
A subcategory obviously induces an embedding functor $C' \to C$ by the identity on the morphisms in C'. For any collection S of morphisms in C, the smallest subcategory of C containing S is denoted by $\langle S \rangle$ and called the subcategory generated by S. Example 91 If C, D are two categories, the product category $C \times D$ consists of all ordered pairs (c, d) of morphisms c in C and d in D. The composition $(c, d) \circ (c', d')$ is possible iff it is possible in each component and then evaluates to $(c, d) \circ (c', d') = (c \circ c', d \circ d')$. One has the canonical projection functors $p_1 : C \times D \to C$, $p_2 : C \times D \to D$ with $p_1(c, d) = c$, $p_2(c, d) = d$. The same procedure allows the definition of any finite product of categories.
G.1.3
Natural Transformations
Natural transformations are the morphisms between functors. Definition 152 If $F, G : C \to D$ are two functors, a natural transformation $t : F \to G$ is a system of morphisms $t(c) : F(c) \to G(c)$ in D, for each object c in C, such that for every morphism $f : x \to y$ in C, we have $G(f) \circ t(x) = t(y) \circ F(f)$. One can also rephrase this property by requiring the following commutative diagram in D:
$$\begin{array}{ccc} F(x) & \xrightarrow{\;t(x)\;} & G(x) \\ {\scriptstyle F(f)}\downarrow & & \downarrow{\scriptstyle G(f)} \\ F(y) & \xrightarrow{\;t(y)\;} & G(y) \end{array} \tag{G.1}$$
Natural transformations can be composed in an evident way, and the composition is associative. For every functor F we have the natural identity $Id_F$. We therefore have the category Func(C, D) of functors $F : C \to D$ and natural transformations Nat(F, G) between two functors $F, G : C \to D$. Properties between such functors are said to be natural if they relate to the category Func(C, D), for example, $F \cong G$ is a natural isomorphism iff it is an isomorphism among the natural transformations from F to G. If two categories C, D satisfy the following properties, they are called equivalent; equivalence is an equivalence relation which is weaker than isomorphism. Lemma 88 For categories C, D the following properties are equivalent:
(i) There are two functors $F : C \to D$, $G : D \to C$ such that $G \circ F \cong Id_C$ and $F \circ G \cong Id_D$, where these isomorphisms are natural.
(ii) There is a functor $F : C \to D$ which is fully faithful and essentially surjective, i.e., every object (identity) in $_0D$ is isomorphic to an image F(c) of an object of C. Example 92 If C is a category with sets as hom collections Hom(x, y), we have two types of hom functors as follows: For fixed object x, we have the functor $Hom(x, ?) : C \to Sets : y \mapsto Hom(x, y)$, which sends a morphism $f : y \to z$ to $Hom(x, f) : Hom(x, y) \to Hom(x, z) : u \mapsto f \circ u$. We further have the contravariant functor $Hom(?, y) : C^{opp} \to Sets : x \mapsto Hom(x, y)$, which sends a morphism $f : x \to z$ to $Hom(f, y) : Hom(z, y) \to Hom(x, y) : u \mapsto u \circ f$. The category $Func(C^{opp}, Sets)$ of contravariant set-valued functors on C is denoted by C@; its elements are called (set-valued) presheaves over C. In the theory of denotators, one works with Mod@ and for a module M, we have the notation $Hom_{Mod}(M, ?) = M@$, whereas the contravariant hom functor is $Hom_{Mod}(?, M) = @M$. Example 93 For two categories C and D and an object S of D, we have the constant functor $[S] : C \to D$ with [S](X) = S and $[S](f) = Id_S$ for all $X \in Ob(C)$ and all $f \in Mor(C)$. In particular, if S is a set, then we write [S] for the constant functor in Mod@ if the contrary is not stressed. Given a quiver G, if we fix an object c in a category C, we have the constant diagram $\Delta_c = [c]$. It associates every vertex of G with c and every arrow with $Id_c$. For a diagram $\Delta$ in C, a natural transformation $[c] \to \Delta$ is called a cone on $\Delta$, whereas a natural transformation $\Delta \to [c]$ is called a cocone on $\Delta$. In a cone, all arrows starting from c must commute with the arrows of the diagram, whereas in a cocone all arrows arriving at c must commute with the arrows of the diagram.
G.2
The Yoneda Lemma
For a given category C with sets as hom collections Hom(x, y), the Yoneda embedding Y is the functor
$$Y : C \to C@ : x \mapsto Y(x) = Hom(?, x) \tag{G.2}$$
with the natural transformations $Y(f : x \to y) : Y(x) \to Y(y)$ being defined by $u \mapsto f \circ u$ for $u : z \to x \in Y(x)(z) = Hom(z, x)$. For C = Mod, we also write Y = @?, i.e., Y(M) = @M for a module M. A functor F in C@ is called representable iff there is an object c in $_0C$ such that $F \cong Y(c)$. Yoneda's lemma states that the full subcategory of representable functors in C@ is equivalent to C, and that such an equivalence is given by the Yoneda embedding. More precisely: Lemma 89 For every functor F in C@ and object c in $_0C$, the map
$$\epsilon : Nat(Y(c), F) \to F(c) : h \mapsto h(c)(Id_c) \tag{G.3}$$
is a bijection. The proof is an easy exercise, but see also [198, 313]. In particular, if F = Y(d), we have a bijection $\epsilon : Nat(Y(c), Y(d)) \cong Hom(c, d)$. More precisely, this means that the Yoneda functor Y is fully faithful, so we obtain an equivalence of categories as announced. For C = Mod, we also write F(A) = A@F, even if F is not representable. We then have the bijection $Nat(@A, F) \cong A@F$. This means that the evaluation of F at "address" A is the same as the calculation of the morphisms from @A to F. This is a justification of the name "address" for the argument A: Evaluating F at A means "observing F under all morphisms" when being "positioned on (the functor @A of) A". And the Yoneda philosophy means that F is known, when it is known while observed from all addresses.
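For a very small category, the bijection of lemma 89 can be verified by enumerating all natural transformations. The sketch below is purely illustrative and rests on our own encoding: C is assumed to be the arrow category with objects 0, 1 and one non-identity morphism $u : 0 \to 1$, and the presheaf F (the dictionaries `F_OBJ`, `F_MOR`) is a hypothetical example chosen by us.

```python
from itertools import product

# Arrow category 0 --u--> 1: each morphism with (domain, codomain).
MORS = {"id0": (0, 0), "id1": (1, 1), "u": (0, 1)}
# COMP[(f, g)] = f∘g, defined iff dom(f) = codom(g).
COMP = {("id0", "id0"): "id0", ("id1", "id1"): "id1",
        ("u", "id0"): "u", ("id1", "u"): "u"}
IDS = {0: "id0", 1: "id1"}

# A presheaf F: sets F(0), F(1) and, for m : x -> y, a map F(m) : F(y) -> F(x).
F_OBJ = {0: ["p", "q"], 1: ["r", "s", "t"]}
F_MOR = {"id0": {"p": "p", "q": "q"},
         "id1": {"r": "r", "s": "s", "t": "t"},
         "u": {"r": "p", "s": "p", "t": "q"}}

def hom(z, c):
    """The set Hom(z, c) = Y(c)(z)."""
    return [m for m, (d, e) in MORS.items() if (d, e) == (z, c)]

def natural_transformations(c):
    """All natural h : Y(c) -> F, stored as dicts {v : z -> c  ->  element of F(z)}."""
    arrows = [v for z in F_OBJ for v in hom(z, c)]
    result = []
    for values in product(*(F_OBJ[MORS[v][0]] for v in arrows)):
        h = dict(zip(arrows, values))
        # Naturality: h(v∘m) = F(m)(h(v)) for every composable pair (v, m).
        if all(h[COMP[(v, m)]] == F_MOR[m][h[v]]
               for v in arrows for m in MORS if (v, m) in COMP):
            result.append(h)
    return result

def yoneda_map(c):
    """The values h(c)(Id_c) for all natural transformations h : Y(c) -> F."""
    return [h[IDS[c]] for h in natural_transformations(c)]
```

In this example `yoneda_map(c)` lists every element of F(c) exactly once, for both objects c.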
G.2.1
Universal Constructions: Adjoints, Limits, and Colimits
Definition 153 We suppose that for two categories C, D, the hom collections are sets. Given two functors $F : C \to D$, $G : D \to C$, we say that F is left adjoint to G or (equivalently) that G is right adjoint to F, in signs $F \dashv G$, iff the functors $Hom_D(F(?), ?) : C^{opp} \times D \to Sets$ and $Hom_C(?, G(?)) : C^{opp} \times D \to Sets$ are isomorphic. One also writes this fact in these symbols:
$$\frac{c \to G(d)}{F(c) \to d}$$
meaning that morphisms in the numerator correspond one-to-one to morphisms in the denominator. In particular, if we are given an adjoint pair of functors $F \dashv G$, when fixing the variable d in D, the adjointness isomorphism means that the contravariant functor $c \mapsto Hom_D(F(c), d)$ is representable by the object G(d).
Example 94 For C = D = Sets, fix a set A. We have the functors $A \times ? : Sets \to Sets : X \mapsto A \times X$ and $?^A : Sets \to Sets : X \mapsto X^A$, which are an adjoint pair $A \times ? \dashv ?^A$ via the isomorphism that sends $f : A \times X \to B$ to $f^{\dashv} : X \to B^A : f^{\dashv}(x)(a) = f(a, x)$. This adjointness property is crucial in the definition of exponential objects in topoi. The "exponential set" $B^A$ represents the functor $X \mapsto Hom(A \times X, B)$. See section G.3.2 for this subject. A terminal object 1 in a category C is one that admits exactly one morphism, denoted by $! : x \to 1$, from each object x of C. An initial object 0 is a terminal object in the opposite category. For example, in Sets, every singleton, such as 1 = {0}, is a terminal object, while the empty set $\emptyset$ is initial.
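The adjunction of example 94 is nothing but currying, and for small finite sets the bijection $Hom(A \times X, B) \cong Hom(X, B^A)$ can be checked by listing all maps. In this sketch (all names are our own) a set map is a dictionary, and an element of $B^A$ is stored as a tuple of pairs (a, b) so that it can serve as a value to be compared.

```python
from itertools import product

def all_maps(src, dst):
    """All set maps src -> dst, each stored as a dict."""
    src = list(src)
    return [dict(zip(src, values)) for values in product(dst, repeat=len(src))]

def transpose(f, A, X):
    """Send f : A x X -> B to the map X -> B^A with x |-> (a |-> f(a, x))."""
    return {x: tuple((a, f[(a, x)]) for a in A) for x in X}

def untranspose(g, A, X):
    """Inverse direction: send g : X -> B^A to (a, x) |-> g(x)(a)."""
    return {(a, x): dict(g[x])[a] for a in A for x in X}

A, X, B = [0, 1], ["x", "y"], [0, 1, 2]
maps = all_maps(list(product(A, X)), B)
images = {tuple(sorted(transpose(f, A, X).items())) for f in maps}
```

Both hom sets have $3^4 = 9^2 = 81$ elements here, and `transpose` hits each map $X \to B^A$ exactly once.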
Example 95 A terminal object in C@ is defined by the constant functor $1_{C@} = [1]$ (of the set 1). For a presheaf $P \in C@$, a global section $\gamma$ is a natural transformation $\gamma : 1_{C@} \to P$. In other words, the global sections correspond to the hom functor $\Gamma(P) = Nat(1_{C@}, P)$. The global section functor $\Gamma : C@ \to Sets$ is right adjoint to the constant functor $[\ ] : Sets \to C@$. Universal objects in categories, such as limits and colimits, are related to terminal or initial objects as follows. Given a "basis" object b of C, the comma category C/b has all morphisms $f : x \to b$ as objects, and for two objects $f : x \to b$, $g : y \to b$, we have
$$Hom_{C/b}(f, g) = \{u \mid g \circ u = f\},$$
the set of "commutative triangles above b" with the evident composition. The cocomma category $C/^{opp}b$ is the comma category $(C^{opp}/b)^{opp}$, in other words, for two objects $f : b \to x$, $g : b \to y$, we have
$$Hom_{C/^{opp}b}(f, g) = \{u \mid u \circ f = g\}.$$
Given a quiver Q and a category C, we have the category Func(P(Q), C) of diagrams in C, and given such a diagram $\Delta$, the comma category $Func(P(Q), C)/\Delta$. In this category, take the full subcategory $cones(\Delta)$ of cones $[c] \to \Delta$. Then a limit of $\Delta$ is a terminal object $lim(\Delta)$ in $cones(\Delta)$. Since terminal objects are evidently unique up to isomorphisms, a limit is also unique up to isomorphism. A colimit $colim(\Delta)$ of a diagram $\Delta$ is an initial object in the subcategory $cocones(\Delta)$ of cocones on $\Delta$ in the cocomma category $Func(P(Q), C)/^{opp}\Delta$. If the diagram is a pair $f : a \to c$, $g : b \to c$, the limit is called the fiber product or pullback of f, g, or (more sloppily) of a and b if f, g are clear; it is denoted by $a \times_c b$. If the diagram is a pair $f : c \to a$, $g : c \to b$, the colimit is called the fiber sum or pushout of f, g, or (more sloppily) of a and b if f, g are clear; it is denoted by $a \sqcup_c b$. The limit of two isolated objects a, b (discrete diagram with two points) is called the (cartesian) product of a, b and denoted by $a \times b$. The colimit of two isolated objects a, b is called the (disjoint) sum of a, b and denoted by $a \sqcup b$.
Theorem 60 The category of sets Sets has arbitrary limits and colimits. For a category $\mathbf{C}$ with sets as hom collections, the category $\mathbf{C}^@$ of presheaves over $\mathbf{C}$ has arbitrary limits and colimits. Proof. If $\Delta$ is a diagram of set morphisms $f_{i,j,k} : X_i \to X_j$, the limit is the subset of $\prod_i X_i$ consisting of all families $(x_i)$ such that for any pair $(x_i, x_j) \in X_i \times X_j$ of coordinates and any $f_{i,j,k}$, we have $f_{i,j,k}(x_i) = x_j$. The projections $\lim(\Delta) \to X_i$ are the restrictions of the canonical projections from the product to $X_i$. The colimit is the set $\mathrm{colim}(\Delta) = \coprod_i X_i/\sim$ of equivalence classes defined by the equivalence relation generated by the relation $x_i \sim x_j$ iff there is $f_{i,j,k}$ with $f_{i,j,k}(x_i) = x_j$. The morphisms $X_i \to \mathrm{colim}(\Delta)$ are the injections $X_i \to \coprod_i X_i$, followed by the quotient map $\coprod_i X_i \to \mathrm{colim}(\Delta)$. The universal properties are immediate and left as an exercise. For a diagram $\Delta$ of presheaves $F_i$, we take for each argument $c$ in $\mathbf{C}$ the set-theoretic limit or colimit, respectively, of the set diagram $\Delta(c)$ of the sets $F_i(c)$ and the corresponding maps to define the limit or colimit of $\Delta$, respectively, QED.
The following proposition makes sure that the category of presheaves over $\mathbf{C}$ is not too large with respect to its Yoneda embedding of $\mathbf{C}$: Proposition 97 Every presheaf $F$ in $\mathbf{C}^@$ is a colimit of representable presheaves. See [314, pp.41/42] for a proof. The idea of this proof uses the so-called category of elements $\int_{\mathbf{C}} F$ of a functor $F$. Its objects are all pairs $(C, p)$ where $C$ is an object of $\mathbf{C}$, and $p \in C@F$. The morphisms $(C, p) \to (C', p')$ are the morphisms $u : C \to C'$ in $\mathbf{C}$ such that $p' \cdot u = p$.
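The set-theoretic constructions in the proof of theorem 60 can be tried out on small finite diagrams. The following is only a minimal sketch under stated assumptions: sets are finite Python sets, maps are dictionaries, and the names `limit` and `colimit` are illustrative choices, not notation from the text. It computes a pullback as the set of compatible families and a pushout as the quotient of the disjoint union.

```python
from itertools import product

def limit(sets, maps):
    """Limit of a finite diagram of finite sets.

    sets: dict vertex -> set; maps: list of (source, target, dict) triples.
    Returns all families (one element per vertex) compatible with every map.
    """
    vertices = sorted(sets)
    families = []
    for values in product(*(sorted(sets[v]) for v in vertices)):
        family = dict(zip(vertices, values))
        if all(f[family[i]] == family[j] for i, j, f in maps):
            families.append(family)
    return families

def colimit(sets, maps):
    """Colimit: disjoint union modulo the equivalence generated by x ~ f(x)."""
    parent = {(v, x): (v, x) for v in sets for x in sets[v]}

    def find(p):
        while parent[p] != p:
            p = parent[p]
        return p

    for i, j, f in maps:
        for x in sets[i]:
            parent[find((i, x))] = find((j, f[x]))
    classes = {}
    for p in parent:
        classes.setdefault(find(p), set()).add(p)
    return [frozenset(c) for c in classes.values()]

# Pullback of f : a -> c and g : b -> c.
f = {0: 0, 1: 1, 2: 0}
g = {0: 0, 1: 0}
pullback = limit({"a": {0, 1, 2}, "b": {0, 1}, "c": {0, 1}},
                 [("a", "c", f), ("b", "c", g)])

# Pushout of u : c -> a and v : c -> b.
u = {0: 0}
v = {0: 1}
pushout = colimit({"a": {0, 1}, "b": {0, 1}, "c": {0}},
                  [("c", "a", u), ("c", "b", v)])
```

In the pullback, each family also records its (forced) coordinate in $c$; in the pushout, the class of the element of $c$ contains its two images, so the two copies are glued along $c$.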
Definition 154 A category is called (finitely) (co)complete iff it has (co)limits for all (finite) diagrams (diagrams with finitely many objects and arrows). Proposition 97 turns out to make the Yoneda embedding into a universal device for making a category $\mathbf{C}$ cocomplete: Proposition 98 For each functor $f : \mathbf{C} \to \mathbf{E}$ to a cocomplete category $\mathbf{E}$, there exists an essentially unique colimit preserving functor $L : \mathbf{C}^@ \to \mathbf{E}$ such that $f = L \circ Y$. See [314, p.43] for a proof.
G.2.2
Limit and Colimit Characterizations
Proposition 99 For any category C, the following statements are equivalent:
(i) $\mathbf{C}$ is finitely complete. (ii) $\mathbf{C}$ has finite products and equalizers⁴. (iii) $\mathbf{C}$ has a terminal object and fiber products. For a proof, see [481, I, 7.8.8]. Proposition 100 For any category $\mathbf{C}$, the following statements are equivalent:
(i) $\mathbf{C}$ is finitely cocomplete. (ii) $\mathbf{C}$ has finite sums and coequalizers⁵. (iii) $\mathbf{C}$ has an initial object and fiber sums.
This is just the dual statement of proposition 99.
4 An equalizer is a limit of a pair $f, g : x \rightrightarrows y$ of arrows.
5 A coequalizer is a colimit of a pair $f, g : x \rightrightarrows y$ of arrows.
Proposition 101 Let $\mathbf{C}$ be a finitely complete category. A morphism $f : A \to B$ in $\mathbf{C}$ is mono iff the canonical projections $p_1$ and $p_2$ in the pullback
$$A \xleftarrow{\;p_1\;} X \xrightarrow{\;p_2\;} A, \qquad f \cdot p_1 = f \cdot p_2, \qquad (G.4)$$
of $f$ along itself coincide and are isomorphisms. Proof. Clearly, if $f$ is mono, then $X = A$ and $p_1 = p_2 = 1_A$ define a fiber product. Conversely, if $p_1 = p_2$ is an isomorphism, then any couple $u, v : Z \to A$ with $f \cdot u = f \cdot v$ creates factorizations $u = p_1 \cdot t$ and $v = p_2 \cdot t$ through $t : Z \to X$ which therefore also coincide. QED. Therefore we have the dual result:
Corollary 28 Let $\mathbf{C}$ be a finitely cocomplete category. A morphism $f : A \to B$ in $\mathbf{C}$ is epi iff the canonical morphisms $i_1$ and $i_2$ in the pushout
$$B \overset{i_1}{\underset{i_2}{\rightrightarrows}} X \qquad (G.5)$$
of $f$ along itself coincide and are isomorphisms.
Proposition 102 For any category $\mathbf{C}$, let $f : H \to G$ be a morphism in $\mathbf{C}^@$. Then:
(i) The morphism $f$ is mono iff $A@f : A@H \to A@G$ is injective for all objects $A$ of $\mathbf{C}$.
(ii) The morphism $f$ is epi iff $A@f : A@H \to A@G$ is surjective for all objects $A$ of $\mathbf{C}$.
(iii) The morphism $f$ is iso iff it is mono and epi iff $A@f : A@H \to A@G$ is bijective for all objects $A$ of $\mathbf{C}$.
Proof. Observe that $\mathbf{C}^@$ is finitely complete and cocomplete and that limits and colimits are calculated pointwise. Let us first look at point (iii). Clearly, $f$ is iso iff its evaluations $A@f : A@H \to A@G$ are all bijective. Further, we know from proposition 101 that $f$ is mono iff the fiber product projections $p_1$ and $p_2$ coincide and are iso. But with (iii) this is true iff it is true for all evaluations at objects $A$ of $\mathbf{C}$, i.e., iff it is true set-theoretically, and this means having an injection for every object $A$ of $\mathbf{C}$, and (i) is done; the dual argument shows (ii). Finally, iso always implies mono and epi; conversely, mono and epi means being injective and surjective, i.e., bijective at every object $A$ of $\mathbf{C}$, whence $f$ is iso. QED.
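The set-theoretic step used in this proof, proposition 101 evaluated in Sets, can be checked directly on finite sets. This is a small sketch, assuming maps are given as dictionaries; the function names are hypothetical. The fiber product of $f$ with itself is computed as a set of pairs, and the criterion "both projections coincide and are bijective" is compared with injectivity.

```python
def kernel_pair(A, f):
    """Fiber product of f with itself: all pairs (a, b) with f(a) = f(b)."""
    return {(a, b) for a in A for b in A if f[a] == f[b]}

def is_mono_via_pullback(A, f):
    """Criterion of proposition 101 in Sets: p1 = p2 and both are bijections."""
    X = kernel_pair(A, f)
    p1 = {x: x[0] for x in X}
    p2 = {x: x[1] for x in X}
    coincide = all(p1[x] == p2[x] for x in X)
    bijective = len(X) == len(A) and {p1[x] for x in X} == set(A)
    return coincide and bijective

def is_injective(A, f):
    return len({f[a] for a in A}) == len(A)

A = {0, 1, 2}
injective_map = {0: "u", 1: "v", 2: "w"}
collapsing_map = {0: "u", 1: "u", 2: "w"}
```

For the injective map the fiber product is the diagonal of $A$; for the collapsing map it contains the off-diagonal pair $(0, 1)$, on which the two projections differ.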
G.2.2.1
Special Results for Mod@
Lemma 90 Let $H$ be a functor in $\mathbf{Mod}^@$, $M, N$ be two addresses, and $f : N \to M$ a morphism of addresses. Then the map
$$M \mapsto M@\mathrm{Fin}(H) := \{F \subset M@H,\ \mathrm{card}(F) < \infty\}, \qquad (G.6)$$
together with the maps
$$f@\mathrm{Fin}(H) : M@\mathrm{Fin}(H) \to N@\mathrm{Fin}(H) : X \mapsto f@H(X), \qquad (G.7)$$
defines a functor $\mathrm{Fin}(H)$ in $\mathbf{Mod}^@$. Lemma 91 For any functor $H$ in $\mathbf{Mod}^@$ and address $M$, the maps
$$\mathrm{sing}_H(M) : M@H \to M@\mathrm{Fin}(H) : x \mapsto \{x\} \qquad (G.8)$$
define a monomorphism $\mathrm{sing}_H : H \rightarrowtail \mathrm{Fin}(H)$ of functors.
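Lemma 92 The map
$$\mathrm{Fin} : \mathbf{Mod}^@ \to \mathbf{Mod}^@ : H \mapsto \mathrm{Fin}(H) \qquad (G.9)$$
defines an endofunctor on $\mathbf{Mod}^@$, and the monomorphism sing defines a natural transformation
$$\mathrm{sing} : Id_{\mathbf{Mod}^@} \rightarrowtail \mathrm{Fin}. \qquad (G.10)$$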
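The content of lemmas 90 to 92 can be illustrated at the level of plain finite sets, i.e., for a single evaluation of a functor. The sketch below is an assumption-laden toy model: a dictionary `h` stands in for one transition map, and the names `fin`, `fin_map`, and `sing` are illustrative. It checks that taking images of finite subsets is compatible with composition and that the singleton map commutes with it.

```python
from itertools import combinations

def fin(S):
    """All subsets of a finite set S, as frozensets."""
    items = sorted(S)
    return {frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)}

def fin_map(h):
    """Map induced by h on finite subsets: X |-> h(X) (image of X)."""
    return lambda X: frozenset(h[x] for x in X)

def sing(x):
    """Singleton map x |-> {x}."""
    return frozenset({x})

S = {0, 1, 2}
h = {0: "a", 1: "a", 2: "b"}        # a map S -> {"a", "b"}
g = {"a": True, "b": False}         # a second map, to test composition
g_after_h = {x: g[h[x]] for x in S}

naturality = all(fin_map(h)(sing(x)) == sing(h[x]) for x in S)
functoriality = all(fin_map(g)(fin_map(h)(X)) == fin_map(g_after_h)(X)
                    for X in fin(S))
```

Naturality here is the commutation of the singleton map with the induced maps, which is the set-level shadow of sing being a natural transformation.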
Lemma 93 Let $D = H_0 \xrightarrow{f_0} H_1 \xrightarrow{f_1} H_2 \cdots$ be a natural sequence diagram in $\mathbf{Mod}^@$. Then we have
$$\mathrm{colim}(\mathrm{Fin}(D)) \xrightarrow{\sim} \mathrm{Fin}(\mathrm{colim}(D)). \qquad (G.11)$$
This yields an important proposition for the construction of circular forms and denotators. Proposition 103 Let H be a functor in Mod@. Then there are functors X and Y in Mod@ such that
$$X \xrightarrow{\sim} \mathrm{Fin}(H \times X) \qquad (G.12)$$
and
$$Y \xrightarrow{\sim} H \times \mathrm{Fin}(Y). \qquad (G.13)$$
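Proof. For the first isomorphism, let $(X_n)_{0 \le n}$ be the sequence of functors defined by
$$X_n = \begin{cases} \emptyset & \text{for } n = 0, \\ \mathrm{Fin}(H \times X_{n-1}) & \text{for } n > 0. \end{cases} \qquad (G.14)$$
Then we have a diagram of subfunctors
$$X_n \hookrightarrow X_{n+1} \qquad (G.15)$$
for all $0 \le n$. In fact, clearly $X_0 \hookrightarrow X_1$. Now, let $0 < n$ and take an address $M$. We have $M@X_n = M@\mathrm{Fin}(H \times X_{n-1}) = \mathrm{Fin}(M@H \times M@X_{n-1})$. Since by induction $X_{n-1} \hookrightarrow X_n$, we have $M@X_{n-1} \subset M@X_n$ and hence $M@X_n \subset M@X_{n+1}$. Now, we know from [481] that the product commutes with the colimit over a sequence diagram. Taking the diagram $D = X_0 \xrightarrow{f_0} X_1 \xrightarrow{f_1} X_2 \cdots$ and setting $X = \mathrm{colim}(D)$, lemma 93 yields
$$\mathrm{Fin}(H \times X) = \mathrm{Fin}(H \times \mathrm{colim}(D)) \cong \mathrm{Fin}(\mathrm{colim}(H \times D)) \qquad (G.16)$$
$$\cong \mathrm{colim}(\mathrm{Fin}(H \times D)) \cong \mathrm{colim}(D) = X \qquad (G.17)$$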
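and we are done for the first isomorphism. For the second, take
$$Y_n = \begin{cases} \emptyset & \text{for } n = 0, \\ H \times \mathrm{Fin}(Y_{n-1}) & \text{for } n > 0. \end{cases} \qquad (G.18)$$
We again have a diagram $E = Y_0 \xrightarrow{g_0} Y_1 \xrightarrow{g_1} Y_2 \cdots$ of subfunctors
$$Y_n \hookrightarrow Y_{n+1} \qquad (G.19)$$
for all $0 \le n$. Setting $Y = \mathrm{colim}(E)$, our second isomorphism results:
$$H \times \mathrm{Fin}(Y) = H \times \mathrm{Fin}(\mathrm{colim}(E)) \cong H \times \mathrm{colim}(\mathrm{Fin}(E)) \qquad (G.20)$$
$$\cong \mathrm{colim}(H \times \mathrm{Fin}(E)) \cong \mathrm{colim}(E) = Y \qquad (G.21)$$
and we are done.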
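The recursion (G.14) can be followed step by step on a finite set standing in for one evaluation $M@H$. The following sketch makes that simplifying assumption (a one-element set `H` rather than a functor) and uses illustrative names; it only computes the first few stages and checks the inclusions $X_n \subset X_{n+1}$, since the colimit itself is infinite.

```python
from itertools import combinations, product

def fin(S):
    """All finite subsets of a finite collection S, as frozensets."""
    items = list(S)
    return {frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)}

def x_sequence(H, steps):
    """X_0 = empty set, X_n = Fin(H x X_{n-1}) for n > 0."""
    X = [frozenset()]
    for _ in range(steps):
        X.append(frozenset(fin(product(H, X[-1]))))
    return X

H = {"h"}                       # toy stand-in for M@H
X = x_sequence(H, 4)
sizes = [len(stage) for stage in X]
nested = all(X[n] <= X[n + 1] for n in range(len(X) - 1))
```

Each element of $X_n$ is a finite set of pairs whose second components lie in $X_{n-1}$, which is the circular shape described by $X \cong \mathrm{Fin}(H \times X)$.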
G.3
Topoi
Topoi are special categories which imitate the crucial constructions of set theory, such as cartesian products, disjoint unions, power sets, and characteristic maps. In our context, topoi play two roles: (1) the role of basic mathematical realities which are instantiated to get off the ground in denotator theory, i.e., to build compound concept spaces and their points; (2) the more technical role of topoi of sheaves associated with presheaves for Grothendieck topologies.
G.3.1
Subobject Classifiers
Definition 155 Given a category $\mathbf{C}$ which is finitely complete, with the terminal object $1$, a monomorphism $true : 1 \rightarrowtail \Omega$ in $\mathbf{C}$ is called a subobject classifier iff given any monomorphism $\sigma : S \rightarrowtail X$ in $\mathbf{C}$, there is a unique morphism $\chi_\sigma : X \to \Omega$ such that the diagram
$$\begin{array}{ccc} S & \longrightarrow & 1 \\ {\scriptstyle \sigma}\downarrow & & \downarrow{\scriptstyle true} \\ X & \xrightarrow{\;\chi_\sigma\;} & \Omega \end{array} \qquad (G.22)$$
is a pullback. Subobject classifiers are unique up to isomorphism. If a subobject classifier exists, the morphism $\chi_\sigma$ must be the same if we replace $\sigma$ by $\sigma \circ q$ for any isomorphism $q : S' \xrightarrow{\sim} S$, since an object isomorphic to a pullback is also a pullback. A subobject of $X$ is an equivalence class of monomorphisms $\sigma : S \rightarrowtail X$ under the relation $\sigma \sim \sigma'$ iff there is an isomorphism $q$ such that $\sigma' = \sigma \circ q$. Suppose that the collection of subobjects of $X$ is a set $Sub_{\mathbf{C}}(X)$ for each object $X$ in $\mathbf{C}$. Then this is a presheaf in $\mathbf{C}^@$ by this map: take a morphism $f : Y \to X$. Then we define
$$Sub_{\mathbf{C}}(f) : Sub_{\mathbf{C}}(X) \to Sub_{\mathbf{C}}(Y) : \sigma \mapsto \sigma_f$$
where $\sigma_f : S \times_X Y \to Y$ is the canonical projection of the pullback under $f, \sigma$. It is straightforward that this is a monomorphism. Then we have:
Proposition 104 A category $\mathbf{C}$ which is finitely complete and such that the subobject presheaf $Sub_{\mathbf{C}}$ is defined has a subobject classifier iff $Sub_{\mathbf{C}}$ is representable, $Sub_{\mathbf{C}}(X) \cong Hom(X, \Omega)$ for all $X$. If so, the subobject classifier can be set to the inverse image $E \rightarrowtail \Omega$ of $Id_\Omega$ in $Sub_{\mathbf{C}}(\Omega)$. See [314, p.33] for a proof.
Example 96 In the category Sets, the ordinal number inclusion $true : 1 \rightarrowtail 2 = \{0, 1\} : 0 \mapsto 0$ is a subobject classifier. Since the subobjects of a set $X$ in Sets identify to the subsets $S \subseteq X$, we have the classical result that subsets $S$ of $X$ are characterized by their characteristic maps $\chi_S : X \to 2$, a fact that is also traced in the notation $2^X$ for the set of subsets, the powerset of $X$.
Example 97 The equivalence classes of monomorphisms of presheaves $S \rightarrowtail X$ in $\mathbf{C}^@$ are defined by their images $Im(S) \subseteq X$ (take everything pointwise). So $Sub_{\mathbf{C}^@}(X) \cong \{S \subseteq X\}$, the set of subfunctors of $X$ (supposing that it exists as a set). By the Yoneda lemma, if a subobject classifier in $\mathbf{C}^@$ exists, we must have⁶ $Sub_{\mathbf{C}^@}(@Y) \cong Hom(@Y, \Omega) \cong \Omega(Y)$. So the functor $Y \mapsto Sub_{\mathbf{C}^@}(@Y)$ is a canonical candidate for $\Omega$, and it in fact does the job, see [314, pp.37/38]. The final presheaf being the constant presheaf $1_{\mathbf{C}^@} : X \mapsto 1$, we get the true morphism (natural transformation) $true(0) = @Y$. A subfunctor of $@Y$ is called a sieve in $Y$, so a candidate for the subobject classifier is the functor of sieves (verify that it is a functor!).
6 Writing the shorter $@Y$ instead of $Hom(?, Y)$.
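Example 96 can be made concrete for finite sets. The sketch below follows the convention of that example, where true picks the element $0$ of $2 = \{0, 1\}$; everything else (the function names and the toy sets) is an illustrative assumption. It shows that a subset is recovered as the pullback of true along its characteristic map, and that pulling subsets back along a map $f : Y \to X$ corresponds to composing characteristic maps with $f$.

```python
TRUE = 0            # image of true : 1 -> 2 = {0, 1}, as in example 96
FALSE = 1

def character(X, S):
    """Characteristic map chi_S : X -> 2 of a subset S of X."""
    return {x: TRUE if x in S else FALSE for x in X}

def pullback_of_true(X, chi):
    """Inverse image of true under chi, the pullback of diagram (G.22)."""
    return {x for x in X if chi[x] == TRUE}

def sub(f, Y, S):
    """Sub(f) for f : Y -> X (a dict): the pullback of the subset S along f."""
    return {y for y in Y if f[y] in S}

X = {"a", "b", "c"}
S = {"a", "c"}
chi = character(X, S)

Y = {1, 2, 3, 4}
f = {1: "a", 2: "b", 3: "c", 4: "b"}
pulled_back = sub(f, Y, S)
```

The equality of `character(Y, pulled_back)` with the composite of `f` and `chi` is the naturality behind $Sub(X) \cong Hom(X, 2)$ in Sets.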
Exercise 90 The categories $\mathbf{Ab}$ of abelian groups or $\mathbf{Mod}_R$ of $R$-modules have no subobject classifiers. In denotator theory, sieves and more general subfunctors replace local compositions (which are essentially subsets of ambient modules) in the functorial setup. This is also necessary because module categories are not topoi (they have no subobject classifiers, see definition 156 in appendix G.3.3), so the passage to the presheaves over modules, i.e., to the category $\mathbf{Mod}^@$, is mandatory in order to recover the subobject classifier structure.
G.3.2
Exponentiation
Recall example 94 in appendix G.2.1 of exponential sets. More generally, a category $\mathbf{C}$ is called cartesian closed iff it has finite products⁷ and each object $A$ is exponentiable, which means that the functor $A \times ?$ has a right adjoint $?^A$, i.e., we have an adjoint pair of functors $A \times ? \dashv ?^A$. Example 98 The category of sets Sets is cartesian closed. And any product of cartesian closed categories is cartesian closed. Example 99 A category of presheaves $\mathbf{C}^@$ is cartesian closed by the following discussion. Again, we use the Yoneda lemma to find a canonical candidate of the exponentiation $X^Y$ of two presheaves $X, Y$. If the exponential $X^Y$ exists, we must have $U@X^Y \cong Nat(@U, X^Y) \cong Nat(@U \times Y, X)$. So one canonical definition must be
$$U@X^Y = Nat(@U \times Y, X) \qquad (G.23)$$
for any object U in C, which is evidently a presheaf. The proof that this formula does the job is found in [314, p.47]. In every cartesian closed category C one has these standard formulas
which follow from the universal adjointness property of exponentiation.
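For the category Sets, the adjunction $A \times ? \dashv ?^A$ is ordinary currying, and it can be verified exhaustively on small sets. This is a minimal sketch with hypothetical helper names: maps are dictionaries, and an element of $C^A$ is encoded as a tuple of argument-value pairs so that it can serve as a dictionary value and be compared.

```python
from itertools import product

def all_maps(dom, cod):
    """All maps dom -> cod between finite sets, as dictionaries."""
    dom = sorted(dom)
    return [dict(zip(dom, values))
            for values in product(sorted(cod), repeat=len(dom))]

def curry(h, A, B):
    """Turn h : A x B -> C into the map B -> C^A, b |-> (a |-> h(a, b))."""
    return {b: tuple((a, h[(a, b)]) for a in sorted(A)) for b in B}

def uncurry(k):
    """Inverse direction: rebuild h : A x B -> C from k : B -> C^A."""
    return {(a, b): c for b, graph in k.items() for a, c in graph}

A, B, C = {0, 1}, {"x", "y"}, {False, True}
maps = all_maps(set(product(A, B)), C)
curried = {tuple(sorted(curry(h, A, B).items())) for h in maps}
round_trip = all(uncurry(curry(h, A, B)) == h for h in maps)
```

Both sides have the same cardinality, $|C|^{|A| \cdot |B|} = (|C|^{|A|})^{|B|}$, and `curry` and `uncurry` are mutually inverse, which is the bijection $Hom(A \times B, C) \cong Hom(B, C^A)$.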
G.3.3
Definition of Topoi
There are several equivalent definitions of a topos, which we first summarize in the following proposition: Proposition 105 For a category $\mathbf{C}$, the following groups of properties are equivalent:
1. (a) $\mathbf{C}$ is cartesian closed, (b) $\mathbf{C}$ has a subobject classifier $1 \rightarrowtail \Omega$.
2. (a) $\mathbf{C}$ is cartesian closed, (b) $\mathbf{C}$ is finitely cocomplete, (c) $\mathbf{C}$ has a subobject classifier $1 \rightarrowtail \Omega$.
3. (a) $\mathbf{C}$ has a terminal object and pullbacks, (b) $\mathbf{C}$ has exponentials, (c) $\mathbf{C}$ has a subobject classifier $1 \rightarrowtail \Omega$.
4. (a) $\mathbf{C}$ has a terminal object and pullbacks, (b) $\mathbf{C}$ has an initial object and pushouts, (c) $\mathbf{C}$ has exponentials, (d) $\mathbf{C}$ has a subobject classifier $1 \rightarrowtail \Omega$.
5. (a) $\mathbf{C}$ is finitely complete, (b) $\mathbf{C}$ has power objects⁸.
7 Equivalently: binary products and a terminal object.
Definition 156 A category $\mathbf{C}$ which has the equivalent groups of properties in proposition 105 is called an (elementary) topos. Here are some general properties and examples of topoi:
Proposition 106 For a topos $\mathbf{C}$, a comma category $\mathbf{C}/b$ is also a topos. Proposition 107 For a category $\mathbf{C}$, the presheaf category $\mathbf{C}^@$ is a topos. This is immediate from the previous discussion of the presheaf category.
Proposition 108 Let $\mathbf{C}$ be a topos. Then we have these properties: (i) Every morphism $f$ has an image, i.e., factors as $f = i \circ e$ with $i$ mono and $e$ epi. For any two such factorizations $f = i \circ e$, $f = i' \circ e'$, there is an isomorphism $t$ such that $e' = t \circ e$, $i = i' \circ t$.
(ii) A morphism is iso iff it is mono and epi. (iii) The pullback of an epi is an epi. (iv) Every arrow $X \to 0$ is iso.
(v) Every arrow $0 \to X$ is mono. Definition 157 Logical morphisms between topoi are functors which preserve (up to isomorphism) finite limits, exponentials, and subobject classifiers. For example, the canonical base change functor $\mathbf{C}/b \to \mathbf{C}/c$ of comma topoi for a base change morphism $c \to b$ is logical, see [314, p.193].
8 See [186, p.106] for this group of properties.
G.4
Grothendieck Topologies
Grothendieck topologies and associated topoi of sheaves are a classical example for the geometric aspects of topoi. Here is the context. We are given a finitely complete category $\mathbf{C}$ (a small one for those who like universes) with the subobject classifier functor of sieves $X@\Omega = Sub_{\mathbf{C}^@}(@X)$ (see example 97). Recall that, given a morphism $f : Y \to X$, the functor maps a sieve $S \subseteq @X$ to the pullback sieve $f^*(S) = S \times_X Y$. Definition 158 A Grothendieck topology on a category $\mathbf{C}$ is a function $J$ which assigns to each object $X$ a subset $X@J \subseteq X@\Omega$ of sieves in $X$ with these properties:
(i) $@X \in X@J$,
(ii) (Stability) If $S \in X@J$, then for $f : Y \to X$, $f^*S \in Y@J$,
(iii) (Transitivity) If $S \in X@J$ and $R \in X@\Omega$ with $f^*R \in Y@J$ for all $f : Y \to X$ in $S$, then $R \in X@J$.
A site is a pair $(\mathbf{C}, J)$ of a category $\mathbf{C}$ and a Grothendieck topology $J$ on $\mathbf{C}$. A sieve in $X@J$ is called a covering sieve; one also says that "it covers $X$". The first two requirements mean that $J$ is a subfunctor of $\Omega$ through which the true arrow factorizes. Very often, Grothendieck topologies are not given directly, but via a so-called basis: Definition 159 For a finitely complete category $\mathbf{C}$, a basis (for a Grothendieck topology) is a function $K$ which assigns to each object $X$ a collection $K(X)$ of families of morphisms with codomain $X$ such that:
(i) For every isomorphism $f : X' \xrightarrow{\sim} X$, the singleton $\{f\}$ is in $K(X)$;
(ii) (Stability) If $(f_i : X_i \to X) \in K(X)$, and $h : Y \to X$, then $(h^* f_i : X_i \times_X Y \to Y) \in K(Y)$;
(iii) (Transitivity) If $(f_i : X_i \to X) \in K(X)$ and, for each index $i$, $(f_{ij} : X_{ij} \to X_i) \in K(X_i)$, then $(f_i \circ f_{ij} : X_{ij} \to X) \in K(X)$.
A pair $(\mathbf{C}, K)$ is again called a site (see below for a justification!), whereas the families in the sets $K(X)$ are called covering families. Here is the relation to Grothendieck topologies: Given a basis $K$ as above, one defines
$$J_K(X) = \{S \mid \text{there is } R \in K(X) \text{ with } R \subseteq S\}, \qquad (G.24)$$
where $R \subseteq S$ means that $R$ is in the union of the evaluations $\bigcup_Z Z@S$ of $S$. And the converse: Given a family $R$ of morphisms with codomain $X$, we denote by $(R)$ the sieve generated by $R$, i.e., the smallest sieve in $X$ containing all arrows of $R$. Then a Grothendieck topology $J$ can be defined by the following basis $K$, which is this set at $X$:
$$K(X) = \{R \subseteq @X \mid (R) \in X@J\}. \qquad (G.25)$$
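The three axioms of definition 158 can be verified by brute force in the simplest geometric situation. The sketch below is an illustration under stated assumptions and not the general construction: the category is the poset of open sets of the discrete two-point space, a sieve on $U$ is encoded as a downward closed family of opens contained in $U$, and a sieve is declared covering iff the union of its members is $U$ (the usual open cover topology). All names are hypothetical.

```python
from itertools import combinations

# Open sets of the discrete space {1, 2}, ordered by inclusion.
OPENS = [frozenset(), frozenset({1}), frozenset({2}), frozenset({1, 2})]

def below(U):
    """All opens contained in U, i.e., all arrows V -> U of the poset."""
    return [V for V in OPENS if V <= U]

def sieves(U):
    """All sieves on U: downward closed families of opens contained in U."""
    result = []
    candidates = below(U)
    for r in range(len(candidates) + 1):
        for family in combinations(candidates, r):
            S = frozenset(family)
            if all(W in S for V in S for W in below(V)):
                result.append(S)
    return result

def pull(S, V):
    """Pullback sieve f*(S) along the inclusion f : V -> U."""
    return frozenset(W for W in S if W <= V)

def J(U):
    """Covering sieves of U: those whose members have union U."""
    return [S for S in sieves(U) if frozenset().union(*S) == U]

maximal_ok = all(frozenset(below(U)) in J(U) for U in OPENS)
stable_ok = all(pull(S, V) in J(V)
                for U in OPENS for S in J(U) for V in below(U))
transitive_ok = all(R in J(U)
                    for U in OPENS for S in J(U) for R in sieves(U)
                    if all(pull(R, V) in J(V) for V in S))
```

On the whole space there are six sieves, of which two are covering: the maximal sieve and the sieve generated by the two points.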
G.4.1
Sheaves
Definition 160 Given a site $(\mathbf{C}, J)$, a presheaf $P$ in $\mathbf{C}^@$ is a sheaf for $J$ iff for every covering sieve $S \subseteq @X$, the inclusion induces a bijection $Nat(@X, P) \xrightarrow{\sim} Nat(S, P)$. This condition can be rephrased for a basis $K$ of $J$ in a more effective and classical way. To this end recall that a (co)equalizer of a pair $f, g : x \rightrightarrows y$ of parallel arrows is the (co)limit of this diagram. Proposition 109 A presheaf $P$ on $\mathbf{C}$ is a sheaf for the topology $J$ iff for any covering family $(f_i : X_i \to X) \in K(X)$, the canonical diagram
$$X@P \to \prod_i X_i@P \rightrightarrows \prod_{i,j} (X_i \times_X X_j)@P \qquad (G.26)$$
is an equalizer. Here the two arrows to the right stem from the two projections from $X_i \times_X X_j$ to $X_i$ and to $X_j$, whereas the arrow to the left stems from the covering morphisms $f_i$. Definition 161 A Grothendieck topos is a category which is equivalent to the full subcategory $Sh(\mathbf{C}, J)$ of sheaves in $\mathbf{C}^@$. The justification of this terminology lies in the following theorem: Theorem 61 A Grothendieck topos is an elementary topos. The proof evidently splits into the verification of finite completeness and of the existence of exponentials and of a subobject classifier. For details, see [314, pp.128-144]. Finite completeness is easy since: Lemma 94 Limits of sheaves are sheaves. Lemma 95 If $P$ is a presheaf and $F$ is a $J$-sheaf in $\mathbf{C}^@$, then the presheaf exponential $F^P$ is a sheaf and therefore an exponential in $Sh(\mathbf{C}, J)$. Definition 162 A sieve $L \subseteq @X$ is closed with respect to $(\mathbf{C}, J)$ iff $f : Y \to X$ with $f^*(L) \in Y@J$ implies $f \in Y@L$.
Proposition 110 The function $X \mapsto X@\Omega_{Sh} = \{\text{closed sieves in } X\} \subset X@\Omega$ contains $@X$, defines a subpresheaf of the subobject classifier of $\mathbf{C}^@$, and is a sheaf. Together with the morphism $true : 1_{\mathbf{C}^@} \to \Omega_{Sh}$ it defines a subobject classifier of $Sh(\mathbf{C}, J)$. The topos $Sh(\mathbf{C}, J)$ is a subtopos of $\mathbf{C}^@$ with the natural inclusion $i : Sh(\mathbf{C}, J) \to \mathbf{C}^@$. This inclusion functor has a left adjoint, the sheafification, which we shall discuss now. If $P$ is a presheaf over $\mathbf{C}$, the sheafification operator $P \mapsto P^+$ evaluates as follows. For an object $X \in \mathbf{C}$ and a sieve $S \in X@J$, consider the limit $Match_P(S) = \lim_{f : Y \to X \in S} Y@P$. Consider the diagram $(Match_P(S))_{S \in X@J}$ with canonical restriction maps $Match_P(S) \to Match_P(T)$ for $T \subseteq S$, and define $X@P^+ = \mathrm{colim}_{S \in X@J} Match_P(S)$. For a morphism $g : X_1 \to X_2$, we have a map $P^+(g) : X_2@P^+ \to X_1@P^+$. It takes a "matching family" $(x_f)_{f \in S}$ to the matching family $(x_{g \cdot h})_{h \in g^*S}$. This evidently defines a presheaf, and we have a canonical morphism $\eta : P \to P^+$. Then: Theorem 62 With the above notation, we have
(i) The presheaf $P^+$ is separated⁹. (ii) The presheaf $P$ is separated iff $\eta$ is mono.
(iii) The presheaf $P$ is a sheaf iff $\eta$ is iso. For a proof, see [314, III.5]. In particular, the double application $aP = ((P)^+)^+$ yields a sheaf and a natural presheaf morphism $P \to aP$. We have
Theorem 63 The map $P \mapsto aP$ defines a left adjoint of the inclusion $i$: $a \dashv i$. The composition $a \circ i$ is isomorphic to the identity on the sheaf category $Sh(\mathbf{C}, J)$.
For a proof, see [314, III.5, Theorem 1] and [314, III.5, Corollary 6]. Corollary 29 If $f : F \to G$ is a morphism of sheaves, $f$ is mono iff it is an injection for each argument.
The proof follows from the fact that this is true for presheaves, and that by the adjunction theorem 63, $i$ preserves and reflects¹⁰ monomorphisms, QED.
G.5
Formal Logic
Formal logic does not replace absolute logic which is built upon the non-formalizable theorem of identity (A is identical to A), of contradiction (A and non-A exclude each other), and of the excluded third (there is no third choice except A or non-A). It does however model the way a specific domain of knowledge can handle its formal truth mechanisms.
G.5.1
Propositional Calculus
Sentences in propositional calculus are defined from a set
Property 4 Given the symbol sets $\Phi, \Xi, \Delta$, we require:
(i) $\Phi \subset S(EX)$;
(ii) if $\alpha \in S(EX)$, then $(!\alpha) \in S(EX)$; (iii) if $\alpha, \beta \in S(EX)$, then $(\alpha)\&(\beta) \in S(EX)$; (iv) if $\alpha, \beta \in S(EX)$, then $(\alpha)|(\beta) \in S(EX)$;
(v) if $\alpha, \beta \in S(EX)$, then $(\alpha) \to (\beta) \in S(EX)$. Clearly, in $S(EX)$, the building blocks of a sentence are uniquely determined, so it makes sense to define set-valued functions $E : S(EX) \to A$ on such sentences by recursion on the building blocks. Suppose that $A$ is a lattice, i.e., a partially ordered set $(A, \le)$ with a join operation $\vee : A \times A \to A$, a meet operation $\wedge : A \times A \to A$, a minimum (False) $\bot$, a maximum (True) $\top$, further a unary negation operation $\neg : A \to A$, and a binary implication operation $\Rightarrow : A \times A \to A$. Call such an $A$ a logical algebra. Then any set function $E_0 : \Phi \to A$ extends in a unique way to the evaluation $E = E(E_0) : S(EX) \to A$ by these rules: Property 5 For all sentences $\alpha, \beta$, we set
(i) $E(!\alpha) = \neg E(\alpha)$; (ii) $E((\alpha)\&(\beta)) = E(\alpha) \wedge E(\beta)$;
(iii) $E((\alpha)|(\beta)) = E(\alpha) \vee E(\beta)$; (iv) $E((\alpha) \to (\beta)) = E(\alpha) \Rightarrow E(\beta)$.
Propositional calculus deals with the evaluation map on special logical algebras. A sentence $\alpha$ is called $A$-valid, in symbols: $A \models \alpha$, iff $E(\alpha) = \top$ for all evaluations $E_0 : \Phi \to A$ on the propositional variables. It is called classically valid or a tautology iff it is 2-valid for the well-known Boolean algebra $2 = \{0, 1\}$ of classical truth values, where we set $\top = 1, \bot = 0$. The symbol for classical validity is $\models \alpha$. Here are typical classes of logical algebras:
Boolean Algebras. A Boolean algebra is a distributive logical algebra such that $x \vee \neg x = \top$ and $x \wedge \neg x = \bot$. Distributivity means that $x \wedge (y \vee z) = (x \wedge y) \vee (x \wedge z)$ and $x \vee (y \wedge z) = (x \vee y) \wedge (x \vee z)$. Further, implication is defined by $x \Rightarrow y = \neg x \vee y$. In a Boolean algebra (BA), one has these properties: $\neg\neg x = x$, $x \wedge y = \bot$ iff $y \le \neg x$, $x \le y$ iff $\neg y \le \neg x$, $\neg(x \wedge y) = \neg x \vee \neg y$, $\neg(x \vee y) = \neg x \wedge \neg y$.
Heyting Algebras. A Heyting algebra $A$ is a partially ordered set which, as a category whose morphisms $x \to y$ are the pairs $x \le y$, has all finite products and coproducts, and which has exponentials, so it is cartesian closed. In other words, a Heyting algebra is a lattice with minimum $\bot$ and maximum $\top$ which has exponentials $x^y$. One writes the product as meet ($\wedge$) and the coproduct as join ($\vee$). The exponential $x^y$ is written as $y \Rightarrow x$, and the adjunction property of exponentiation reads
$$z \le (y \Rightarrow x) \text{ iff } z \wedge y \le x. \qquad (G.27)$$
For a Heyting algebra, we define a negation $\neg x = x \Rightarrow \bot$, which is equivalent to $y \le \neg x$ iff $y \wedge x = \bot$.
Proposition 111 For a Heyting algebra, we have these identities: $x \le \neg\neg x$, $x \le y$ implies $\neg y \le \neg x$, $\neg x = \neg\neg\neg x$, $\neg\neg(x \wedge y) = \neg\neg x \wedge \neg\neg y$, $(x \Rightarrow x) = \top$, $x \wedge (x \Rightarrow y) = x \wedge y$, $y \wedge (x \Rightarrow y) = y$, $x \Rightarrow (y \wedge z) = (x \Rightarrow y) \wedge (x \Rightarrow z)$. Proposition 112 A Heyting algebra is distributive, and it is Boolean iff $x = \neg\neg x$ for all $x$, or iff $x \vee \neg x = \top$ for all $x$.
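A standard source of Heyting algebras that are not Boolean is the lattice of open sets of a topological space. The following sketch checks the adjunction (G.27) and some identities of proposition 111 on one small example; the chosen space, the names, and the encoding of opens as frozensets are assumptions made for illustration only.

```python
# Open sets of a topology on {1, 2, 3}, ordered by inclusion (illustrative).
OPENS = [frozenset(s) for s in ([], [1], [2], [1, 2], [1, 2, 3])]
BOT, TOP = OPENS[0], OPENS[-1]

def imp(y, x):
    """y => x, the exponential x^y: union of all opens z with z ∧ y <= x."""
    return frozenset().union(*(z for z in OPENS if z & y <= x))

def neg(x):
    """Negation, defined as x => bottom."""
    return imp(x, BOT)

adjunction_holds = all((z <= imp(y, x)) == (z & y <= x)
                       for x in OPENS for y in OPENS for z in OPENS)
below_double_negation = all(x <= neg(neg(x)) for x in OPENS)
triple_negation = all(neg(x) == neg(neg(neg(x))) for x in OPENS)
is_boolean = all(x | neg(x) == TOP for x in OPENS)
```

In this example the open set $\{1, 2\}$ has negation $\emptyset$ and double negation the whole space, so $x = \neg\neg x$ fails, in line with proposition 112.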
Proposition 113 For a presheaf category $\mathbf{C}^@$, the partially ordered set $Sub_{\mathbf{C}^@}(P)$ of an object $P$ is a Heyting algebra. The connectives are defined as follows. If $S, T$ are two subfunctors of $P$, then:
(i) $X@(S \vee T) = X@S \cup X@T$; (ii) $X@(S \wedge T) = X@S \cap X@T$; (iii) $(S \Rightarrow T)(X) = \{x \in X@P \mid$ for every morphism $f : Y \to X$, if $x \cdot f \in Y@S$, then $x \cdot f \in Y@T\}$; (iv) $X@(\neg S) = \{x \in X@P \mid$ for every morphism $f : Y \to X$, $x \cdot f \notin Y@S\}$. More generally (see [314, IV.8] for details): Theorem 64 For every topos $\mathbf{C}$, the partially ordered set $Sub(X) \xrightarrow{\sim} Hom(X, \Omega)$ of subobjects of $X$ is a Heyting algebra.
To the left, this structure stems from the canonical Heyting algebra structure on the subobjects of $X$. To the right, this structure is induced by the following operations on $\Omega$:
1. Negation $\neg : \Omega \to \Omega$ is the characteristic map of the false arrow $false : 1 \rightarrowtail \Omega$, which is the characteristic map of the zero arrow $0 \rightarrowtail 1$.
2. Conjunction $\wedge : \Omega \times \Omega \to \Omega$ is the character of the diagonal morphism $\Delta(true, true) : 1 \to \Omega \times \Omega$.
3. Disjunction $\vee : \Omega \times \Omega \to \Omega$ is the character of the image of the universal morphism $a \sqcup b : \Omega \sqcup \Omega \to \Omega \times \Omega$, where $a = \Delta(Id_\Omega, true) : \Omega \to \Omega \times \Omega$ and $b = \Delta(true, Id_\Omega) : \Omega \to \Omega \times \Omega$.
4. Implication $\Rightarrow : \Omega \times \Omega \to \Omega$ is the character of the equalizer of $p_1, \wedge : \Omega \times \Omega \rightrightarrows \Omega$.
If a sentence $\alpha$ is valid for all Boolean algebras, we write $BA \models \alpha$. If it is valid for all Heyting algebras, we write $HA \models \alpha$. If it is valid in the Heyting algebra $1@\Omega$ of a topos $\mathbf{C}$, we write $\mathbf{C} \models \alpha$. Validity is also described by a recursive construction process of valid sentences. One gives a set $AX$ of sentences, called axioms, and defines theorems as those sentences $s$ which are at the end of proof chains, i.e., finite sequences of sentences $(s_0, s_1, \ldots, s_n, s)$ such that each member of this sequence is either an axiom or can be inferred from earlier members by a set RULES of rules. The classical setup is this. $AX$ consists of 12 types of sentences:
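Axiom 7 Axioms of classical logic (CL):
(i) $\alpha \to (\alpha \& \alpha)$
(ii) $(\alpha \& \beta) \to (\beta \& \alpha)$
(iii) $(\alpha \to \beta) \to ((\alpha \& \gamma) \to (\beta \& \gamma))$
(iv) $((\alpha \to \beta) \& (\beta \to \gamma)) \to (\alpha \to \gamma)$
(v) $\beta \to (\alpha \to \beta)$
(vi) $(\alpha \& (\alpha \to \beta)) \to \beta$
(vii) $\alpha \to (\alpha | \beta)$
(viii) $(\alpha | \beta) \to (\beta | \alpha)$
(ix) $((\alpha \to \beta) \& (\beta \to \gamma)) \to ((\alpha | \beta) \to \gamma)$
(x) $(!\alpha) \to (\alpha \to \beta)$
(xi) $((\alpha \to \beta) \& (\alpha \to (!\beta))) \to (!\alpha)$
(xii) $\alpha | (!\alpha)$
The system CL has one single rule of inference: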
Principle 29 (Modus ponens) From $\alpha$ and $\alpha \to \beta$, $\beta$ may be derived. The property of a sentence $\alpha$ of being a CL-theorem is denoted by $\vdash_{CL} \alpha$. Following Heyting, the intuitionistic logic (IL) is the system CL with the axiom (xii) omitted, and the same inference rule.
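The difference between CL and IL can be observed by evaluating the twelve axiom schemes, with the evaluation rules of property 5, in two small logical algebras. The sketch below is a hedged illustration: sentences are encoded as nested tuples, the algebras are the finite chains $\{0, \ldots, top\}$ (the chain with $top = 1$ is the Boolean algebra 2, the chain with $top = 2$ is a three-element Heyting algebra), and all names are hypothetical.

```python
from itertools import product

def imp(x, y, top):
    """Heyting implication on the chain {0, ..., top}."""
    return max(z for z in range(top + 1) if min(z, x) <= y)

def evaluate(s, env, top):
    """Evaluate a sentence by recursion on its building blocks."""
    tag = s[0]
    if tag == "var":
        return env[s[1]]
    if tag == "not":
        return imp(evaluate(s[1], env, top), 0, top)
    left, right = evaluate(s[1], env, top), evaluate(s[2], env, top)
    if tag == "and":
        return min(left, right)
    if tag == "or":
        return max(left, right)
    if tag == "imp":
        return imp(left, right, top)
    raise ValueError(tag)

def valid(s, top, names=("a", "b", "c")):
    """True iff s evaluates to the maximum for all values of the variables."""
    return all(evaluate(s, dict(zip(names, vals)), top) == top
               for vals in product(range(top + 1), repeat=len(names)))

a, b, c = ("var", "a"), ("var", "b"), ("var", "c")
Not = lambda s: ("not", s)
And = lambda s, t: ("and", s, t)
Or = lambda s, t: ("or", s, t)
Imp = lambda s, t: ("imp", s, t)

AXIOMS = [
    Imp(a, And(a, a)),                                        # (i)
    Imp(And(a, b), And(b, a)),                                # (ii)
    Imp(Imp(a, b), Imp(And(a, c), And(b, c))),                # (iii)
    Imp(And(Imp(a, b), Imp(b, c)), Imp(a, c)),                # (iv)
    Imp(b, Imp(a, b)),                                        # (v)
    Imp(And(a, Imp(a, b)), b),                                # (vi)
    Imp(a, Or(a, b)),                                         # (vii)
    Imp(Or(a, b), Or(b, a)),                                  # (viii)
    Imp(And(Imp(a, b), Imp(b, c)), Imp(Or(a, b), c)),         # (ix)
    Imp(Not(a), Imp(a, b)),                                   # (x)
    Imp(And(Imp(a, b), Imp(a, Not(b))), Not(a)),              # (xi)
    Or(a, Not(a)),                                            # (xii)
]

classical = [valid(ax, 1) for ax in AXIOMS]
three_valued = [valid(ax, 2) for ax in AXIOMS]
```

All twelve schemes are tautologies in 2, whereas in the three-element chain only (xii) fails, namely at the middle value.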
Theorem 65 The following validity statements are equivalent:
(i) $\vdash_{CL} \alpha$. (ii) $\models \alpha$. (iii) There exists a Boolean algebra $B$ such that $B \models \alpha$. (iv) $BA \models \alpha$.
See [186] for details. We have this weaker relation:
Theorem 66 The topos validity $\mathbf{C} \models \alpha$ implies classical validity $\vdash_{CL} \alpha$.
Definition 163 A topos is Boolean iff the Heyting algebra $Sub(X)$ of each object $X$ is Boolean. Theorem 67 The following statements for a topos $\mathbf{C}$ are equivalent:
(i) $\mathbf{C}$ is Boolean. (ii) $Sub(\Omega)$ is a Boolean algebra. (iii) $true : 1 \to \Omega$ has a complement in $Sub(\Omega)$. (iv) $false : 1 \to \Omega$ is the complement of $true$ in $Sub(\Omega)$.
(v) $true \cup false = Id_\Omega$ in $Sub(\Omega)$. (vi) $\mathbf{C}$ is classic, i.e., $true \cup false : 1 + 1 \to \Omega$ is iso. (vii) The first inclusion $\iota_1 : 1 \to 1 + 1$ is a subobject classifier.
For a proof, see [186, p.156 ff.]. Theorem 68 If the topos $\mathbf{C}$ is Boolean, then $\mathbf{C} \models \alpha | !\alpha$ for all sentences $\alpha$. Theorem 69 For a topos $\mathbf{C}$, the following are equivalent:
(i) $\mathbf{C} \models \alpha$ iff $\vdash_{CL} \alpha$ for all $\alpha$.
(ii) $\mathbf{C} \models \alpha | !\alpha$ for all $\alpha$. (iii) $Sub(1)$ is a Boolean algebra. Theorem 70 We have $HA \models \alpha$ iff $\vdash_{IL} \alpha$.
Theorem 71 For all topoi $\mathbf{C}$, the validity $\vdash_{IL} \alpha$ implies $\mathbf{C} \models \alpha$.
G.5.2
Predicate Logic
Predicate calculus generates a richer set of sentences whose validity is a function of the interpretation of predicate variables and individual variables and not only of abstract propositional variables. We are given a set $Y = \{\iota_0, \iota_1, \ldots\}$ of individual variables, a set $\Xi = \{!, \&, |, \to, \exists, \forall\}$ of predicate connectives, a set $\Pi = \coprod_{i=0,1,2,\ldots} \Pi_i$, $\Pi_i = \{A_i, B_i, \ldots\}$ of $i$-ary predicate variables, and a set $\Delta = \{(, )\}$ of brackets as above. Within the free monoid $PEX = FM(Y \cup \Xi \cup \Pi \cup \Delta)$ of predicate expressions, one exhibits the subset $FO(PEX)$ of formulae as follows: Property 6 Given the symbols $Y, \Xi, \Pi, \Delta$, we require
(i) (Atomic formulae) $A_i \iota_{i_1} \iota_{i_2} \cdots \iota_{i_i} \in FO(PEX)$ for any predicate variable $A_i \in \Pi_i$ and $\iota_{i_k} \in Y$; 0-ary predicate variables are constants.
(ii) (Propositional formulae) If $\alpha, \beta \in FO(PEX)$, then $(!\alpha) \in FO(PEX)$, $(\alpha)\&(\beta) \in FO(PEX)$, $(\alpha)|(\beta) \in FO(PEX)$, $(\alpha) \to (\beta) \in FO(PEX)$.
(iii) (Quantifier formulae) If $\alpha \in FO(PEX)$, $x \in Y$, then $(\forall x)\alpha \in FO(PEX)$, $(\exists x)\alpha \in FO(PEX)$.
An individual variable $x$ which appears in the formula after an expression of type $(\forall x)$ or $(\exists x)$ is bound, otherwise it is free. A model $\mathfrak{M}$ of a predicate logic $FO(PEX)$ is a set $M$, together with $i$-ary relations $a_i, b_i, \ldots \subseteq M^i$ (elements $a_0 \in M$ for constants). In such a model, the formal predicate expressions are interpreted via the interpretation of atomic formulae $A_i \iota_{i_1} \iota_{i_2} \cdots \iota_{i_i}$ by the truth value of $(x_{i_1}, x_{i_2}, \ldots, x_{i_i}) \in a_i$. The interpretation of a quantifier formula $(\forall x)\alpha$ means "true" if the truth value of the interpretation of $\alpha$ is "true" for all valuations in $M$ of all occurrences of the variable $x$, and the interpretation of a quantifier formula $(\exists x)\alpha$ means "true" if the truth value of the interpretation of $\alpha$ is "true" for at least one value in $M$ of the occurrences of the variable $x$. Of course, this is only decided if no free variables are left, in which case one calls the formula a sentence; otherwise, no truth value is defined and the formula is just a truth-value function of the remaining free variables. We write $\mathfrak{M} \models \alpha[x]$ for truth value "true" for the evaluation $[x]$ of the free variables of $\alpha$. The recursive calculation of truth values of compound formulas relates to the Boolean algebra $Sub(M)$ as follows: To begin with, if a logical combination of two formulae is considered, one may suppose that both variable sets of these formulas coincide by just taking their union if they do not coincide. If we fix such a variable set of cardinality $m$, say, the truth evaluation of a formula $\alpha$ with (at most) these $m$ free variables can be described by the inverse image $supp(\alpha) \subseteq M^m$ of true. Then evidently, $supp(!\alpha) = M^m - supp(\alpha)$, $supp((\alpha)\&(\beta)) = supp(\alpha) \cap supp(\beta)$, $supp((\alpha)|(\beta)) = supp(\alpha) \cup supp(\beta)$, $supp((\alpha) \to (\beta)) = supp(!\alpha) \cup supp(\beta)$.
For the quantifiers, we have this situation: If a variable $x$ is bound by a quantifier, we have the support $supp(\alpha) \subseteq M^m$ of the given formula $\alpha$ and the support $supp((\forall x)\alpha) \subseteq M^{m-1}$ or $supp((\exists x)\alpha) \subseteq M^{m-1}$, respectively. Suppose that $x$ is the $i$th variable, then we have the projection $p_i : M^m \to M^{m-1}$, which omits the $i$th coordinate, and the inverse image map $p^* : 2^{M^{m-1}} \to 2^{M^m}$. If $S$ is a support of a formula $\alpha$ in $M^m$, then the support $(\forall x)(S)$ of $(\forall x)\alpha$ is the set $\{(y_1, \ldots, y_{m-1}) \mid (y_1, \ldots, y_{i-1}, x, y_i, \ldots, y_{m-1}) \in S$ for all $x \in M\}$, while the support $(\exists x)(S)$ of $(\exists x)\alpha$ is the set $\{(y_1, \ldots, y_{m-1}) \mid (y_1, \ldots, y_{i-1}, x, y_i, \ldots, y_{m-1}) \in S$ for at least one $x \in M\}$. Proposition 114 The functor of partially ordered sets $\forall x : 2^{M^m} \to 2^{M^{m-1}}$ is a right adjoint of $p^* : 2^{M^{m-1}} \to 2^{M^m}$, while $\exists x : 2^{M^m} \to 2^{M^{m-1}}$ is its left adjoint, in other words, $p^*(X) \subset Y$ iff $X \subset \forall(x)(Y)$ and $Y \subset p^*(X)$ iff $\exists(x)(Y) \subset X$.
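Proposition 114 can be checked exhaustively for a very small model. The sketch below assumes a two-element set $M$, supports in $M^2$, and binds the second variable; the position index is 0-based and the function names are illustrative. It compares both adjunction conditions for all pairs of subsets.

```python
from itertools import combinations, product

M = (0, 1)
m, i = 2, 1                                  # bind the variable at position i
BIG = list(product(M, repeat=m))             # M^m
SMALL = list(product(M, repeat=m - 1))       # M^(m-1)

def insert(y, x):
    """Put the value x at position i of the shorter tuple y."""
    return y[:i] + (x,) + y[i:]

def p_star(X):
    """Inverse image under the projection that omits coordinate i."""
    return {t for t in BIG if t[:i] + t[i + 1:] in X}

def forall(S):
    """Support of the universally quantified formula."""
    return {y for y in SMALL if all(insert(y, x) in S for x in M)}

def exists(S):
    """Support of the existentially quantified formula."""
    return {y for y in SMALL if any(insert(y, x) in S for x in M)}

def subsets(items):
    return [set(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

right_adjoint = all((p_star(X) <= Y) == (X <= forall(Y))
                    for X in subsets(SMALL) for Y in subsets(BIG))
left_adjoint = all((Y <= p_star(X)) == (exists(Y) <= X)
                   for X in subsets(SMALL) for Y in subsets(BIG))
```

With $|M| = 2$ there are 4 subsets of $M^1$ and 16 subsets of $M^2$, so each check runs over 64 pairs.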
The topos-theoretic generalization of this result is the following theorem; for a proof, see [314, p.209, p.206]. Theorem 72 If $f : A \to B$ is a morphism in the topos $\mathbf{C}$, then the functor $Sub(B) \to Sub(A)$ of Heyting algebras (which are viewed as categories via their partial orders as morphisms) associated with the natural morphism $\Omega^f : \Omega^B \to \Omega^A$ has a right adjoint functor $\forall_x$ and a left adjoint functor $\exists_x$. In order to rewrite the predicate calculus in general topoi, one uses the characteristic maps associated with supports of predicates as follows: If $M$ is a non-zero object of the topos $\mathbf{C}$, we consider the characters $\chi_{a_m} : M^m \to \Omega$ of the "supports" $a_m \subset M^m$ of $m$-ary predicates. Their recombination via logical constructions runs as follows: Using the morphisms of negation, conjunction, disjunction, and implication defined above in section G.5.1 for $\Omega$, one has the evident combination of supports of formulas via their characters. The new thing here is the definition of quantifier supports. Given a character $\chi_\alpha : M^m \to \Omega$, we have the adjoint morphism $ad_i(\chi_\alpha) : M^{m-1} \to \Omega^M$ with respect to the $i$th coordinate. Then we have two arrows $\forall_M, \exists_M : \Omega^M \to \Omega$. The first, $\forall_M$, is the character of the adjoint of the composite $true \circ ! \circ pr_M : 1 \times M \to M \to \Omega$. The second, $\exists_M$, is the character of the image of the composed map $pr_{\Omega^M} \circ \in_M : \in \rightarrowtail \Omega^M \times M \to \Omega^M$, where $\in_M : \in \rightarrowtail \Omega^M \times M$ is the subobject whose character is the evaluation map $ev_M : \Omega^M \times M \to \Omega$ adjoint to the identity on $\Omega^M$. We then have these formulas:
$\forall(x)\alpha$ has the character $\forall_M \circ ad_i(\chi_\alpha)$, $\exists(x)\alpha$ has the character $\exists_M \circ ad_i(\chi_\alpha)$.
G.5.3
A Formal Setup for Consistent Domains of Forms
Since forms do not automatically exist if we allow circularity, it is important to set up a formal mathematical context in order to describe what a logically consistent domain of forms should be. This mathematical formalism turns out to be valid in an interesting general context. We have been working in the topos $\mathbf{Mod}^@$ of presheaves over the category $\mathbf{Mod}$, where we have the Yoneda embedding $Y : \mathbf{Mod} \to \mathbf{Mod}^@$. Without loss of generality, we may identify $\mathbf{Mod}$ with the full subcategory of represented presheaves $@M$, $M$ a module. More generally, we may consider Yoneda pairs $\mathcal{R} \subset \mathcal{E}$, where $\mathcal{R}$ is a full subcategory of a topos $\mathcal{E}$, $\mathcal{R}$ playing the role of represented modules (we also say that $\mathcal{R}$ is a Yoneda subcategory). This means that we require that the canonical Yoneda functor $\mathcal{E} \to \mathcal{E}^@ \to \mathcal{R}^@$ be fully faithful. By Yoneda's lemma, we may identify the evaluation $M@F$ of a "presheaf" $F \in \mathcal{E}$ at a "module" $M \in \mathcal{R}$ by the morphism set from $M$ to $F$: $M@F = Hom_{\mathcal{E}}(M, F)$. This setup in particular includes the classical case of $\mathcal{E} = Sets$ and $\mathcal{R}$ the one-element category consisting of a singleton $1$ (the terminal object in Sets), say $1 = \{\emptyset\}$, and its unique identity morphism. In this case, we may identify $1@F$ and the set $F$. To achieve the intended formalism, we consider the set $Mono(\mathcal{E})$ of monomorphisms in $\mathcal{E}$. We further consider the set $Types = \{Simple, Syn, Limit, Colimit, Power\}$ of form types. And we need the free monoid $Names = FM(UNICODE)$ over the UNICODE alphabet (which is an extension of the ASCII alphabet to non-European letters). We next need
the set $Dia(Names)$ of all diagram schemes with vertexes in $Names$. More precisely, a diagram scheme over $Names$ is a finite directed multigraph whose vertexes are the elements of a subset of $Names$, and whose arrows $i : A \to B$ are triples $(i, A, B)$, with $i = 1, 2, \ldots$ natural numbers to identify arrows for given vertexes. Next, consider the set $Dia(Names/\mathcal{E})$ of diagrams on $Dia(Names)$ with values in $\mathcal{E}$. Such a diagram is a map $dia : D \to \mathcal{E}$ which to every vertex of the diagram scheme $D$ associates an object of $\mathcal{E}$ and to every arrow associates a morphism in $\mathcal{E}$ between corresponding vertex objects. So $i : A \to B$ is mapped to the morphism $dia(i) : dia(A) \to dia(B)$. We also will identify two such diagrams iff their arrows for given names $A, B$ are permutations of each other, i.e., we only consider the orbits of diagrams modulo the permutation group of arrows on given names. Why? Because any construction of limits or colimits is invariant under this group since the limit condition is a logical conjunction which does not depend on the numbering of the arrows. So this identification will always be valid unless explicitly suspended. Observe further that a multiple appearance of a vertex in a diagram scheme is not allowed, so when constructing diagram schemes upon form names, one must add synonymous forms when multiple appearance of one and the same form in a diagram is desired. This is the advantage of form names: the annoying indexing of mathematical names can be absorbed by intrinsic renaming on the level of form names. With this notation, we can define a semiotic of $\mathcal{E}$-forms as follows (see also figure G.1):
Figure G.1: The formal setup of a semiotic of $\mathcal{E}$-forms (labels: FORMS, Names, $F$, sem($F$), Types, $Mono(\mathcal{E})$, $Dia(Names/\mathcal{E})$).
Definition 164 A semiotic of $\mathcal{E}$-forms is a set map
$sem : FORMS \to Types \times Mono(\mathcal{E}) \times Dia(Names/\mathcal{E})$
defined on a subset $FORMS \subset Names$ with the following properties (i) to (iv). To ease language, we use the following notation and terminology:
G.5. FORMAL LOGIC
• An element $F \in FORMS$ is called a form name, and the pair (F, sem) a form,
• $pr_1 \cdot sem(F) = t(F)$ (= type of F),
• $pr_2 \cdot sem(F) = id(F)$ (= identifier of F),
• $domain(id(F)) = fun(F)$ (= functor^12 or "space" of F),
• $codomain(id(F)) = frame(F)$ (= frame or "frame space" of F),
• $pr_3 \cdot sem(F) = coord(F)$ (= coordinator of F).
Then these properties are required:
(i) The empty word $\emptyset$ is not a member of FORMS.
(ii) For any vertex X of the coordinator diagram coord(F), if $X \in FORMS$, then we have $coord(F)(X) = fun(X)$.
(iii) If the type t(F) is given, we have the following for the corresponding frames:
• For Syn and Power, the coordinator has one vertex $G \in FORMS$ and no arrows, i.e., $coord(F) : G \to fun(G)$, which means that in these cases, the coordinator is determined by a form name G. Further, for Syn, we have $frame(F) = fun(G)$, and for Power, we have $frame(F) = \Omega^{fun(G)}$, if $coord(F) : G \to fun(G)$, as above.
• For Limit and Colimit, the coordinator is any diagram coord(F) whose names are all in FORMS. Further, for Limit, we have $frame(F) = lim(coord(F))$, and for Colimit, we have $frame(F) = colim(coord(F))$.
• For type Simple, the coordinator has the unique vertex $\emptyset$, and a value $coord(F) : \emptyset \to M$ for a 'module' $M \in \mathcal{R}$ (i.e., a represented presheaf M = @X in the case of presheaves over Mod), or, in a more sloppy notation: coord(F) = M.
Here, circular forms are evidently included via form names which refer to themselves in their diagrams or in deeper recursion structures. With this definition we may discuss the existence and size of form semiotics, i.e., the extent of the FORMS set, maximal such sets, gluing such sets together along compatible intersections, etc. However, we shall not pursue this interesting and logically essential branch for simple reasons of space and time.
^12 The functor in the special case $\mathcal{E} = Mod^@$.
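The name-level part of properties (i) to (iii) can be checked mechanically. The following Python fragment is a minimal sketch under simplifying assumptions: it ignores the topos data (identifiers, spaces, frames) and only records, for each form name, its type and the vertex names and arrows of its coordinator diagram scheme. The names `FormData`, `violations`, `forms`, and `broken` are hypothetical and serve only for illustration.

```python
from dataclasses import dataclass

TYPES = {"Simple", "Syn", "Limit", "Colimit", "Power"}
EMPTY = ""  # stands for the empty word


@dataclass(frozen=True)
class FormData:
    type: str            # t(F)
    vertices: tuple      # vertex names of the coordinator diagram scheme
    arrows: tuple = ()   # arrows as triples (i, A, B)


def violations(forms):
    """List violations of the name-level parts of properties (i) to (iii)."""
    found = []
    if EMPTY in forms:
        found.append("(i): the empty word is used as a form name")
    for name, f in forms.items():
        if f.type not in TYPES:
            found.append(f"{name!r}: unknown type {f.type!r}")
        elif f.type in ("Syn", "Power"):
            # exactly one vertex, which is a form name, and no arrows
            if len(f.vertices) != 1 or f.arrows or f.vertices[0] not in forms:
                found.append(f"{name!r}: needs one form-name vertex, no arrows")
        elif f.type in ("Limit", "Colimit"):
            # all vertex names are form names; arrows join listed vertexes
            if any(v not in forms for v in f.vertices):
                found.append(f"{name!r}: a vertex is not a form name")
            if any(a not in f.vertices or b not in f.vertices
                   for _, a, b in f.arrows):
                found.append(f"{name!r}: an arrow leaves the vertex set")
        else:
            # Simple: the unique vertex is the empty word, no arrows
            if f.vertices != (EMPTY,) or f.arrows:
                found.append(f"{name!r}: Simple needs the single vertex EMPTY")
    return found


# A circular example: Tree refers to Forest, which refers back to Tree.
forms = {
    "Label": FormData("Simple", (EMPTY,)),
    "Tree": FormData("Limit", ("Label", "Forest")),
    "Forest": FormData("Power", ("Tree",)),
}
# The same set with a synonym pointing to a name outside FORMS.
broken = dict(forms, Alias=FormData("Syn", ("Missing",)))
```

Here `violations(forms)` is empty although `Tree` and `Forest` refer to each other, whereas `violations(broken)` reports the dangling reference of `Alias`. Whether the spaces and frames of such a circular system actually exist in the topos is exactly the existence question left open in the text.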
G.5.3.1
Morphisms Between Semiotics of Forms
Although the theory of form semiotics is in its very beginnings, it is clear that two form semiotics with intersecting domains $FORMS_1$ and $FORMS_2$ need not be contradictory even if the semiotic maps do not coincide on the intersection $FORMS_1 \cap FORMS_2$. In fact, it could happen that on this intersection, the maps are just "equivalent" semiotics. More generally, it could happen that two form semiotics have subsemiotics which are in complete correspondence, and therefore we may glue them to a global semiotic structure. In other words: It is reasonable and feasible to consider morphisms and then categories of form semiotics, and therefore isomorphisms of semiotics, which enables us to construct global semiotics just by gluing together local "charts" as usual. Let us abbreviate $Sema(\mathcal{E}) = Types \times Mono(\mathcal{E}) \times Dia(Names/\mathcal{E})$, Sema being an abbreviation for semantic target space. Suppose that we are given two form semiotics $sem_1 : FORMS_1 \to Sema(\mathcal{E}_1)$, $sem_2 : FORMS_2 \to Sema(\mathcal{E}_2)$. We correspondingly denote by $fun_1, fun_2, t_1, t_2, id_1, id_2, frame_1, frame_2$, and $coord_1, coord_2$ the respective maps. Consider pairs (u, v) where $u : FORMS_1 \to FORMS_2$ is a set map, and where $v : \mathcal{E}_1 \to \mathcal{E}_2$ is a logical functor (see appendix G.3) sending $\mathcal{R}_1$ to $\mathcal{R}_2$. We say that the pair (u, v) is morphic (for $FORMS_1, FORMS_2$) iff
1. We have $u(\emptyset) = \emptyset$.
2. The functors commute with u, v, i.e., we have $v \cdot fun_1 = fun_2 \cdot u$.
3. The type is invariant under u, i.e., $t_2 \cdot u = t_1$.
In particular, mono- and epimorphisms on $\mathcal{E}_1$ are preserved (see appendix G.2.2). Suppose that we are given a diagram $C : D \to \mathcal{E}_1$ on a diagram scheme D (modulo permutations on the numberings of the arrows between fixed names, as announced!) associated with the form name $F \in FORMS_1$. Let |D| be the set of vertex names of D. We define a diagram scheme E as follows. Its vertexes are the image $|E| = u(|D|)$. For every vertex pair (X, Y) of |E| we take all arrows $i : A \to B$ with X = u(A), Y = u(B). By lexicographic order on the triples (A, B, i), we can order all these arrows and index them with positive natural numbers j = 1, ..., n(X, Y). This defines a unique new diagram scheme. Secondly, we define a new diagram $C' : E \to \mathcal{E}_2$ as follows: If the arrow $i : A \to B$ gives arrow $j(i) : X \to Y$, the new diagram C' maps this arrow to the morphism $v(C(i))$. Denote this diagram by (u, v)(C). Clearly, since we only retain orbits of diagram schemes, we have functoriality, i.e., if $(u_1, v_1)$, $(u_2, v_2)$ are two such morphic pairs for $FORMS_1, FORMS_2$, and $FORMS_2, FORMS_3$, respectively, we have
$(u_2, v_2)((u_1, v_1)(C)) = (u_2 \cdot u_1, v_2 \cdot v_1)(C)$. (G.28)
Definition 165 A morphic pair (u, v) for the pair $FORMS_1, FORMS_2$ is said to be a morphism of form semiotics $(u, v) : sem_1 \to sem_2$ if the following semiotic data with each given form name $F \in FORMS_1$ are verified:
1. Let F be simple, i.e., $F \xrightarrow{Id} Simple(M)$. Then $u(F) \xrightarrow{Id'} Simple(N)$^13, and we require that
• $N = v(M)$,
• $Id' = v(Id)$, i.e., the monomorphism of Id' is the v-image of the monomorphism of Id (the domains and codomains are already the right ones, only the morphism, a monomorphism by the conservation of limits, has to fit), so $Id' = v(Id) : fun_2(u(F)) = v(fun_1(F)) \rightarrowtail N = v(M)$.
2. Let F be synonymous, i.e., $F \xrightarrow{Id} Syn(G)$. Then $u(F) \xrightarrow{Id'} Syn(G')$, and we require that $G' = u(G)$ and $Id' = v(Id) : fun_2(u(F)) \rightarrowtail fun_2(u(G))$.
3. Let F be of power type, i.e., $F \xrightarrow{Id} Power(G)$. Then $u(F) \xrightarrow{Id'} Power(G')$, and we require that $G' = u(G)$ and $Id' = v(Id) : fun_2(u(F)) \rightarrowtail v(\Omega^{fun_1(G)}) \xrightarrow{\sim} \Omega^{v(fun_1(G))} = \Omega^{fun_2(u(G))}$.
4. Let F be of limit (resp. colimit) type, i.e., $F \xrightarrow{Id} Limit(C)$ (resp. $F \xrightarrow{Id} Colimit(C)$) with $C = coord(F)$. Then we have $u(F) \xrightarrow{Id'} Limit(C')$ (resp. $u(F) \xrightarrow{Id'} Colimit(C')$). We then require that $C' = (u, v)(C)$ and that $Id' = v(Id) : fun_2(u(F)) = v(fun_1(F)) \rightarrowtail v(lim(C)) \xrightarrow{\sim} lim(v \cdot C) \xrightarrow{\sim} lim((u, v)(C))$ (resp. the analogous expression with colimits).
In any case, the associated form u(F) is related to its ingredients through the given functor v and the recursive constructions on the coordinators via u. Clearly, the evident composition of two morphisms from formula (G.28) is again a morphism, and we obtain the category ForSem of form semiotics.
G.5.3.2
Local and Global Form Semiotics
It is clear what one should understand by a global form semiotic: This is a set G, together with a covering I and an atlas $f_i : I_i \xrightarrow{\sim} FORMS_i$ of bijections onto domains of form semiotics $sem_i : FORMS_i \to Sema$ such that all the induced bijections $u_{i,j} : FORMS_i|_j \xrightarrow{\sim} FORMS_j|_i$ extend to isomorphisms $(u_{i,j}, v_{i,j})$ of form semiotics. This means in particular that all intersections $FORMS_i|_j$ are form domains of form sub-semiotics in $sem_i$, and that the underlying functors on $\mathcal{E}_i$ are compatible. We leave the details to the interested reader.
^13 Observe that in the case $\mathcal{R} = Mod$, $\mathcal{E} = Mod^@$ of presheaves, we usually write the module as a coordinator, but we mean its represented functor @M.
G.5.3.3
Connotator From Semiotics
Denotator and form names were very simple word objects in the previous setup. But name spaces may also be required to encompass more articulated structures, in other words: we want names to be denotators as well, thereby turning the denotator concept into a 'connotator' concept. Here is the formal setup. We again suppose given a Yoneda pair $\mathcal{R}, \mathcal{E}$. We also retain the set $Mono(\mathcal{E})$. We are given two sets V of denotators and F of forms; they are supposed to parametrize denotators and forms according to the following system of maps. We have three maps on V:
$coordinate : V \to \mathcal{E}$, $form : V \to F$, $denotatorName : V \to V$.
The coordinate C of a denotator is supposed to be any morphism with domain A within $\mathcal{R}$, which is called the denotator's address. We require that a denotator be uniquely determined by its coordinate C, form F, and denotatorName N. This is why denotators are also written as quadruples N : A@F(C), where the address is denoted for comfort since it is important information. The denotator's form mimics the space where the denotator lives. To this end, we need two more sets. The set of types is T = {Limit, Colimit, Power, Simple}; it contains the basic constructors of objects in a topos. We omit synonymy in this generic setup because it can be mimicked by a limit with just one vertex. We also need the set $Diagrams(V/\mathcal{E})$ of finite diagrams whose vertexes are denotators, and whose arrows are numbered by 1, 2, 3, ... as above. This means that the diagram schemes are these symbols, and that the evaluation of the diagram scheme yields objects and morphisms in $\mathcal{E}$. Forms have, by hypothesis, uniquely determined values under these four maps:
$formName : F \to V$, $identifier : F \to Mono(\mathcal{E})$, $diagram : F \to Diagrams(V/\mathcal{E})$, $type : F \to T$.
This means that a form can be written as FN : Id.T(Dg), where FN is the form's name denotator, Id its identifier, T its type, and Dg its diagram. We impose a small number of axioms for these structures. To this end, we call the domain dom(Id) of a form FN : Id.T(Dg) the form's space, whereas the codomain cod(Id) is called its frame space. Accordingly, for a denotator N : A@F(C), the composition $Id \circ C$ with its form's identifier Id is called the frame coordinate; it uniquely determines the denotator's coordinate.
Axiom 8 Here are the conditions for this setup:
(i) The map formName is injective, i.e., the form's name is a key.
(ii) For all form diagrams, except for simple type, the vertex denotators of the diagram schemes are form names, and their values are the spaces of the respective forms.
(iii) If the form's type is Limit or Colimit, its frame space is the limit or colimit of the diagram.
(iv) If the form's type is Power, the diagram has just one vertex and no arrows, and the frame space is $\Omega^S$, where S is the space of the vertex form.
(v) There is a denotator $\emptyset$ which is not a form name, and for a simple type form, the diagram has exactly the vertex $\emptyset$, no arrow, and the value is a 'representable' object X in $\mathcal{R}$. In other words, the simple type frame space is just a representable object in disguise. Such a diagram is represented by $\emptyset X$.
The language of forms and denotators has been encoded in an ASCII-based textual form, like TeX, which is therefore called Denotex and is available in BNF^14. In RUBATO®, a Denotex parser is available for communication with Denotex files. Our present notation in this section, such as FN : Id.T(Dg) for forms and N : A@F(C) for denotators, is an illustration of the Denotex notation.
Example 100 An elementary form for names can be set up as follows: The form NF represented by fn : Id.Simple(Dg) is simple with the diagram $\emptyset\mathbb{Z}\langle UNICODE \rangle$. The identifier Id is the identity on the representable presheaf $@\mathbb{Z}\langle UNICODE \rangle$, and the name fn is a denotator fn : 0@NF(C), whose coordinate C is the zero-addressed homomorphism $C : 0 \to \mathbb{Z}\langle UNICODE \rangle$ with value C(0) = "NameForm" with denotatorName(fn : 0@NF(C)) = fn, i.e., it is its proper name denotator. So its identification resides on its coordinate value "NameForm" and the form named fn. This identifies the entire NF form. Then, general UNICODE names n may be defined by $n : 0@NF(C_n)$, where the value $C_n(0)$ = "anyName" is any UNICODE string combination, such as "3.Violin+4.Piano", and which are their proper name denotators, i.e., $denotatorName(n : 0@NF(C_n)) = n$.
^14 Denotex was developed in collaboration with Thomas Noll, Jörg Garbers, Stefan Göller, and Stefan Müller.
Appendix H
Complements on General and Algebraic Topology
H.1
Topology
Refer to [261] for general topology, and to [498] for algebraic topology.
H.1.1
General
A topological space is a pair $(X, Open_X)$ of a set X and a set $Open_X$ of open subsets of X such that X is open, $U \cap V$ is open if U, V are so, and $\bigcup_i U_i$ is open for any family $(U_i)$ of open sets; in particular the union of the empty family, the empty set, is open. The complement X − U of an open set U is called closed. Therefore the collection $Closed_X$ of closed sets fits with the corresponding axioms: the union of any two closed sets is closed, the intersection of any family of closed sets is closed^1, and the empty set is closed. If we define the closure $\overline{Y}$ of any subset Y of X as the intersection of all closed sets containing Y, then the topology is again defined by the axioms for the closure operator $\overline{?} : 2^X \to 2^X$, i.e., $\overline{\emptyset} = \emptyset$, $\overline{?}$ is idempotent, $Y \subseteq \overline{Y}$, and $\overline{Y \cup Z} = \overline{Y} \cup \overline{Z}$.
Given two topologies $Open_X, Open'_X$, one says that $Open_X$ is coarser than $Open'_X$ or that $Open'_X$ is finer than $Open_X$ iff $Open_X \subseteq Open'_X$. On any set X, the coarsest topology consists just of X and of the empty set; it is called the indiscrete topology, whereas the finest topology is the powerset of X; it is called the discrete topology. The intersection of any family of topologies on X is the finest topology which is coarser than each member of the family. Every set S of subsets of X is contained in the intersection of all topologies containing S, a family containing at least the discrete topology. This intersection consists of all unions of finite intersections (the empty intersection gives X) of members of S and is denoted by Open(S).
A neighborhood W of $x \in X$ is a subset containing an open set U which contains x. Finite intersections of neighborhoods of x are neighborhoods, supersets of neighborhoods are neighborhoods. An accumulation point of a subset Y of X is a point not in Y each of whose neighborhoods intersects Y. The closure of a subset Y of X is the union of Y and of its accumulation points. The interior $U^\circ$ of a subset U is the union of all open subsets of U. It is also the complement of the closure of its complement. The interior operator has an evident set of axioms corresponding to the closure axioms which also characterize the topology. The boundary $\partial U$ of a subset is the difference $\overline{U} - U^\circ$.
A set B of open sets is called a base for the topology iff any open set is the union of a family of members of B, or, equivalently, every neighborhood of a point contains a neighborhood from B. The axioms for a set of subsets B of X to be a base for a topology are that $X = \bigcup B$, and that for any two $U, V \in B$, $U \cap V$ is the union of members of B. A subbase for a topology $Open_X$ on X is a set S of subsets of X such that $Open_X = Open(S)$.
^1 The intersection of the empty family being defined as the total space X.
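For a finite set X, the construction of Open(S) from a subbase S and the closure operator can be carried out by brute force. The following Python sketch follows the description above (unions of finite intersections, with the empty intersection giving X and the empty union giving the empty set); the function names `generated_topology` and `closure` are hypothetical, and the enumeration is exponential in the size of S, so it is meant for small examples only.

```python
from itertools import combinations


def generated_topology(X, S):
    """Open(S): all unions of finite intersections of members of S."""
    X = frozenset(X)
    S = [frozenset(s) for s in S]
    base = {X}  # the empty intersection gives X
    for k in range(1, len(S) + 1):
        for combo in combinations(S, k):
            base.add(frozenset.intersection(*combo))
    opens = {frozenset()}  # the empty union gives the empty set
    for b in base:
        # after this step, opens holds every union of the base sets seen so far
        opens |= {o | b for o in opens}
    return opens


def closure(X, opens, Y):
    """Intersection of all closed sets containing Y."""
    X, Y = frozenset(X), frozenset(Y)
    result = X
    for o in opens:
        closed = X - o
        if Y <= closed:
            result &= closed
    return result


X = {1, 2, 3}
opens = generated_topology(X, [{1, 2}, {2, 3}])
```

In this example the open sets are the empty set, {2}, {1, 2}, {2, 3}, and X. The point 2 is dense, since `closure(X, opens, {2})` is all of X, whereas the point 1 is closed.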
H.1.2
The Category of Topological Spaces
Suppose that $(X, Open_X)$, $(Y, Open_Y)$ are topological spaces. A set map $X \to Y$ is continuous iff the inverse image map $2^Y \to 2^X$ induces a map $Open_Y \to Open_X$. The set-theoretic composition of continuous maps is continuous, the identity map is so, and therefore, we have the category Top of topological spaces and continuous maps. An isomorphism of topological spaces is called a homeomorphism. Any subset W of a topological space becomes a topological space by the coarsest topology $Open_W = Open_X|W$ such that the inclusion $W \subset X$ is continuous; its open sets are just the intersections of open sets of X with W. This topology is called the relative topology on W. More generally, given any set map $f : X \to Y$ into a topological space $(Y, Open_Y)$, the coarsest topology $Open_Y|f$ (smallest set of open sets) on X such that f becomes continuous is given by the set of inverse images of open sets of Y; we also call it the relative topology with respect to f. Conversely, for a set map $f : X \to Y$, where $(X, Open_X)$ is a topological space, we have a finest topology on Y such that f becomes continuous; it is given by the set of all subsets of Y such that their inverse image is open in X. This is the quotient topology $Open_X/f$.
If $(X_i)$ is a family of topological spaces, the cartesian product $\prod_i X_i$ has the coarsest topology such that the projections to all factors become continuous. A base of this product topology is given by the products $\prod_i U_i$ of open sets $U_i \subseteq X_i$ with $U_i = X_i$ except for a finite number of indices. This is a limit in the category Top. The coarsest topology on the set-theoretical limit $lim(\mathcal{D})$ of a diagram $\mathcal{D}$ of continuous maps is the limit in Top; a similar construction (this time with the finest topology) yields the colimit of a diagram of continuous maps.
If we are given a family $f_i : X_i \to X$ of set maps whose domains are topological spaces, there is a finest topology on X which makes these maps continuous. Its universal property is that with this topology on X, a map $g : X \to Y$ into a topological space Y is continuous iff all compositions $g \circ f_i$ are so. This is a particular case of a quotient topology for the situation $\coprod_i X_i \to X$. This topology is called the coinduced topology. If the maps $f_i$ are inclusions of subspaces $X_i$ of a topological space X, the topology of X is called coherent or weak if it is coinduced from the relative topologies on the spaces $X_i$. Suppose that we are given a set X, together with a collection of subsets $C_i$ of X which are topological spaces such that for all indexes i, j, the intersection $C_i \cap C_j$ has the same relative topology as inherited from $C_i$ or from $C_j$, and that these intersections are closed in both $C_i$ and $C_j$. Then the coinduced topology is coherent with this family; in other words, the coinduced topology relativizes to the given topologies on all $C_i$.
H.1.3
Uniform Spaces
Topologies are often defined by relations that stem from metrical distance functions. The axiomatics is as follows:
Definition 166 A uniformity on a set X is a set $\mathcal{U}$ of uniform sets $U \subseteq X^2$ such that:
(i) Each uniform set contains the diagonal $\Delta$.
(ii) If U is uniform, so is $U^{-1}$.
(iii) If U is uniform, then there is a uniform V such that $V \circ V \subseteq U$.
(iv) If U, V are uniform, then so is $U \cap V$.
(v) If U is uniform, then so is every superset in $X^2$.
The prototype of a uniformity is given by a distance function, i.e., a pseudo-metric $d : X \times X \to \mathbb{R}$ as defined in definition 171 in appendix I.1.1. The uniformity contains all $U \subseteq X^2$ which contain a set of type $U_\epsilon = \{(x, y) \mid d(x, y) < \epsilon\}$, $\epsilon > 0$. Each uniformity $\mathcal{U}$ gives rise to a uniform topology $Open(\mathcal{U})$ whose open sets are those V such that for each $x \in V$, there is a uniform set U with $U[x] \subseteq V$, where $U[x] = \{y \mid (x, y) \in U\}$. So the uniform topology imitates metrical neighborhoods.
H.1.4
Special Issues
Definition 167 A topological space X is said to be:
(i) $T_0$ iff for any two different points $x, y \in X$, at least one of them is not the specialization of the other;
(ii) $T_1$ iff every point is closed, i.e., no other point dominates it;
(iii) $T_2$ (Hausdorff) iff every two different points have disjoint neighborhoods.
Definition 168 A subset $L \subset X$ of a topological space X is said to be locally closed iff one of the equivalent properties holds:
(i) $L = O \cap C$, O open, C closed.
(ii) Every point $l \in L$ has an open neighborhood $U_l$ such that $U_l \cap L$ is closed in $U_l$.
(iii) L is open in its closure in X.
See [65, I,§3.3] for a proof.
Definition 169 A topological space X is called quasi-compact iff every covering of X by open sets admits a finite subcovering. A Hausdorff quasi-compact space is called compact. Typically, prime spectra of commutative rings are quasi-compact but not compact.
H.2
Algebraic Topology
Refer to [498] for this section.
H.2.1
Simplicial Complexes
A simplicial complex K is a set V of vertexes, together with a subset K of $2^V$ whose elements are called simplexes such that (1) each singleton {v}, $v \in V$, is a simplex, (2) each non-empty subset of a simplex is a simplex. If for a simplex s of K, card(s) = q + 1, one says that s is a q-simplex or a q-dimensional simplex. A subsimplex $s' \subseteq s$ of a simplex s is called a face of s; it is called a q-face if it is a q-dimensional simplex. We also write $s' \leq s$ instead of $s' \subseteq s$. Evidently, a simplicial complex is completely determined by its simplex set K and may be identified with it.
Example 101 Let U be a covering of a set X by non-empty subsets. The nerve n(U) of U is the simplicial complex with V = U, and the simplexes s being those finite sets $s = \{u_0, u_1, \ldots, u_p\}$ in U which have non-empty intersection $\bigcap s = \bigcap_i u_i$.
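For a finite covering, the nerve of Example 101 can be enumerated directly. The following Python sketch tests every non-empty family of charts for a common element; the function name `nerve` and the sample covering are hypothetical. The sample charts are four triads written as sets of pitch classes modulo 12, chosen here only for illustration in the spirit of viewing chords as charts. The enumeration is exponential in the number of charts.

```python
from itertools import combinations


def nerve(cover):
    """Simplexes of the nerve: families of charts with a common element.

    cover maps a chart name to its (non-empty) set of points.
    """
    names = sorted(cover)
    simplexes = set()
    for k in range(1, len(names) + 1):
        for combo in combinations(names, k):
            if set.intersection(*(set(cover[n]) for n in combo)):
                simplexes.add(frozenset(combo))
    return simplexes


# Hypothetical covering: four triads as sets of pitch classes modulo 12.
cover = {
    "C": {0, 4, 7},
    "Am": {9, 0, 4},
    "Em": {4, 7, 11},
    "G": {7, 11, 2},
}
simplexes = nerve(cover)
dimension = max(len(s) for s in simplexes) - 1
```

For this covering the nerve has four 0-simplexes, five 1-simplexes (all pairs except Am and G, which share no tone), and the two 2-simplexes {C, Am, Em} and {C, Em, G}, so its dimension is 2.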
The dimension dim(K) of a simplicial complex K is the maximal dimension of its simplexes, including the special cases $dim(\emptyset) = -1$ and $dim(K) = \infty$ if no maximal dimension exists. A simplicial map $f : K_1 \to K_2$ is a set map $f : V_1 \to V_2$ on the underlying vertex sets such that the induced map $2^f : 2^{V_1} \to 2^{V_2}$ carries simplexes to simplexes, i.e., restricts to a map $f : K_1 \to K_2$, meaning that if $s \in K_1$, then $f(s) \in K_2$. One may also say that it is a set map $F : K_1 \to K_2$ which is induced by a map f on the underlying vertex sets. The simplicial complexes and their simplicial maps define the category Simpl of simplicial complexes. A subcomplex L of a simplicial complex K is a subset of simplexes which is also a simplicial complex. L is full iff a simplex of K whose vertexes belong to L is also in L. For example, given a simplicial complex K and a natural number k, the k-dimensional skeleton $K|_k$ is the subcomplex of all simplexes of dimension $\leq k$. For a covering U, the k-dimensional skeleton of its nerve is denoted by $n_k(U)$.
Example 102 Let Covens be the category of set coverings, whose objects are pairs (X, I) of sets X and coverings I of X by non-empty subsets. The morphisms are pairs $(f, \phi) : (X, I) \to (Y, J)$ with $f : X \to Y$, $\phi : I \to J$ two maps such that for all $i \in I$, $f(i) \subset \phi(i)$. We then have the nerve functor $n : Covens \to Simpl : (X, I) \mapsto n(I)$.
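The two notions just introduced, skeleton and simplicial map, are easy to test on finite complexes. The following Python sketch represents a complex by its set of simplexes (as frozensets of vertexes) and a vertex map by a dictionary; the names `full_complex`, `skeleton`, and `is_simplicial` are hypothetical.

```python
from itertools import combinations


def full_complex(vertices):
    """All non-empty subsets of the vertex set, i.e., one full simplex."""
    vertices = list(vertices)
    return {frozenset(c)
            for k in range(1, len(vertices) + 1)
            for c in combinations(vertices, k)}


def skeleton(K, k):
    """Subcomplex of all simplexes of dimension <= k."""
    return {s for s in K if len(s) - 1 <= k}


def is_simplicial(f, K1, K2):
    """Does the vertex map f carry every simplex of K1 to a simplex of K2?"""
    return all(frozenset(f[v] for v in s) in K2 for s in K1)


triangle = full_complex("abc")   # a 2-simplex with all its faces
edge = full_complex("pq")        # a 1-simplex with its two vertexes
two_points = skeleton(edge, 0)   # the same vertexes, without the edge
f = {"a": "p", "b": "p", "c": "q"}
```

The map f collapses the triangle onto the edge and is simplicial as a map into `edge`, but not as a map into `two_points`, because the image {p, q} of the simplex {a, c} is no longer a simplex there.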
H.2.2
Geometric Realization of a Simplicial Complex
We have a functorially defined geometric representation of simplicial complexes K by topological spaces |K| as follows. The set |K| is the subset of those functions $\alpha : V(K) \to I = [0, 1]$ into the real unit interval I such that
1. the support $supp(\alpha) = \{v \in V(K) \mid \alpha(v) \neq 0\}$ of $\alpha$ is a simplex,
2. $\sum_{v \in V(K)} \alpha(v) = 1$.
The value $\alpha(v)$ is called the v-th barycentric coordinate of $\alpha$. On the set $I^{(V(K))}$ of functions with finite support, one has the Euclidean metric $d(\alpha, \beta) = \|\alpha - \beta\|_2$. We induce this metric and its associated topology (see section H.1.3) on |K| and denote the resulting space by $|K|_d$. For a simplex $s \in K$, the closed simplex |s| is defined by
$|s| = \{\alpha \in |K| \mid supp(\alpha) \subseteq s\}$.
Evidently, if dim(s) = q, there is a homeomorphism $|s|_d \xrightarrow{\sim} \Delta^q = \{x \in I^{q+1} \mid \sum x_i = 1\}$ onto the "standard closed q-simplex". If $s, t \in K$, then $s \cap t$ is either empty or a common face, and then $|s \cap t| = |s| \cap |t|$, so $|s|_d \cap |t|_d$ is closed in both $|s|_d$ and $|t|_d$, and the relative topologies from $|s|_d$, $|t|_d$ coincide on the intersection. By the remarks on coinduced topologies in section H.1.2, we have the coherent topology on |K| which is coinduced from the topologies on the closed simplexes. This means that
Fact 19 A subset $E \subseteq |K|$ is closed/open iff each intersection $E \cap |s|_d$ is closed/open.
Therefore, a function $f : |K| \to X$ into a topological space X is continuous iff its restrictions $f||s|$ are so for all simplexes s of K. In particular, the identity $|K| \to |K|_d$ is continuous, therefore, |K| is Hausdorff; it is also normal, see [498, 3.1, Th.17]. Also, |K| is compact iff K is finite. Call K locally finite iff every vertex belongs to a finite number of simplexes. Then
Theorem 73 For a simplicial complex K the following statements are equivalent:
(i) K is locally finite.
(ii) The identity $|K| \to |K|_d$ is a homeomorphism.
(iii) |K| is metrizable, i.e., there is a metric whose topology is the coherent topology.
See [498, 3.2, Th.8] for a proof. If $f : K_1 \to K_2$ is a simplicial map, we have the map $|f| : |K_1| \to |K_2|$ defined by
$|f|(\alpha)(v) = \sum_{f(w) = v} \alpha(w)$,
which is continuous for both topologies on |K|. We are therefore given two functors $|?|, |?|_d : Simpl \to Top$ and a natural transformation $Id : |?| \to |?|_d$. A continuous map $f : |K| \to X \subset \mathbb{R}^n$ is said to be linear iff $f(\alpha) = \sum_{v \in V(K)} \alpha(v) f(v)$ for all $\alpha \in |K|$. Any function on the vertexes may uniquely be extended to a continuous linear map; this is the universal property of affine pointsets in general position. In particular, the map |f| associated with a simplicial map f is linear.
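The formula for |f| simply adds up the barycentric coordinates of all vertexes with the same image. A minimal Python sketch, with points of |K| stored as dictionaries from vertexes to weights and exact rational arithmetic to avoid rounding; the name `realize` and the sample data are hypothetical.

```python
from fractions import Fraction


def realize(f, alpha):
    """|f|(alpha)(v) = sum of alpha(w) over all w with f(w) = v."""
    beta = {}
    for w, weight in alpha.items():
        v = f[w]
        beta[v] = beta.get(v, 0) + weight
    return beta


# Vertex map collapsing a triangle {a, b, c} onto an edge {p, q}.
f = {"a": "p", "b": "p", "c": "q"}
# A point of the triangle given by its barycentric coordinates.
alpha = {"a": Fraction(1, 2), "b": Fraction(1, 4), "c": Fraction(1, 4)}
beta = realize(f, alpha)
```

The image point has coordinates 3/4 at p and 1/4 at q; the coordinates again sum to 1, and the support {p, q} is the image of the support of alpha, as required for a point of $|K_2|$.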
Definition 170 A geometric realization of a simplicial complex K in $\mathbb{R}^n$ is a linear embedding (injection) of |K| in $\mathbb{R}^n$.
Theorem 74 If a simplicial complex K has a geometric realization in $\mathbb{R}^n$, then it is countable, locally finite and has dimension $\leq n$. Conversely, if it is countable, locally finite, and has dimension $\leq n$, then it has a geometric realization as a closed subset of $\mathbb{R}^{2n+1}$.
Example 103 For the nerve n(U) of a finite covering U, we write N(U) for the geometric realization |n(U)|; we also write $N_k(U)$ for $|n_k(U)|$.
H.2.3
Contiguity
A simplicial pair is a couple (K, L), where K is a subcomplex of L. A simplicial map of pairs $f : (K_1, L_1) \to (K_2, L_2)$ is a simplicial map $f : L_1 \to L_2$ which induces a simplicial map on the respective subcomplexes. Two simplicial maps $f, f' : (K_1, L_1) \to (K_2, L_2)$ are called contiguous if for every simplex s in $K_1$ or $L_1$, the union $f(s) \cup f'(s)$ is a simplex in $K_2$ or $L_2$, respectively. Contiguity is reflexive and symmetric; the equivalence relation it generates defines contiguity classes of simplicial maps. Two continuous maps $f, g : X \to Y$ of topological spaces are called homotopic iff there is a continuous map (a homotopy) $F : X \times I \to Y$ such that f = F(?, 0), g = F(?, 1); the homotopy relation is an equivalence relation. If $X' \subseteq X$ is a subspace, and if $f|X' = g|X'$, a homotopy is called relative to this subspace iff $F|X' \times t = f|X' = g|X'$ for all $t \in I$.
Lemma 96 ([498, lemma 2, p.130]) Contiguous simplicial maps which agree on a subcomplex define continuous maps which are homotopic relative to the space of the subcomplex.
H.3
Simplicial Coefficient Systems
A simplicial complex K can be viewed as a category whose objects are the simplexes s of K, and whose morphisms are the inclusions $s \subseteq t$ of simplexes. For a commutative ring R, a coefficient system of R-modules is a covariant functor $M : K \to {}_R Mod$ with values in the category ${}_R Mod$ of R-modules and affine homomorphisms. Let $\Delta_q = \{0, 1, 2, \ldots, q\}$ be the standard simplex of dimension q. A singular simplex of dimension q is a simplicial map $s : \Delta_q \to K$, i.e., a sequence $s_0, s_1, \ldots, s_q$ of points in K which define a simplex. If we have any set map $f : \Delta_p \to \Delta_q$, we have the singular p-simplex $f(s) = s \circ f : \Delta_p \to K$. For a singular simplex s, we denote M(s) = M(Im(s)). Clearly $Im(f(s)) \subseteq Im(s)$. Therefore we have an affine homomorphism $f_s : M(f(s)) \to M(s)$. Denote by $S_n(K)$ the set of singular simplexes of dimension n in K. Then we have a module $C^n(K; M) = \prod_{s \in S_n(K)} M(s)$, whose elements are called the singular cochains of dimension n. For a map $f : \Delta_p \to \Delta_q$, we have an affine map
$f : C^p(K; M) \to C^q(K; M)$ (H.1)
which has $f((a_s)_{s \in S_p(K)}) = (b_t)_{t \in S_q(K)}$ and $b_t = f_t(a_{f(t)})$. In other words, $C^*(K; M) = (C^n(K; M))_n$ is a simplicial cochain complex.
H.3.1
Cohomology
Suppose now that the simplicial cochain complex stems from a system of coefficients with linear maps. Then all the transition maps of equation (H.1) are linear. Consider now the strictly increasing i-th face maps $F^i_n : \Delta_{n-1} \to \Delta_n$ leaving aside index i in $\Delta_n$, i.e., mapping $\Delta_{n-1}$ onto the subset $\{0, 1, 2, \ldots, \hat{i}, \ldots, n\}$. Then we have the coboundary map $d^n : C^n(K; M) \to C^{n+1}(K; M)$,
$d^n(a) = \sum_{j=0}^{n+1} (-1)^j F^j_{n+1}(a)$, (H.2)
and $d^{n+1} \circ d^n = 0$. This means that $Im(d^n) \subseteq Ker(d^{n+1})$, and we may consider the cohomology groups
$H^n(K; M) = Ker(d^n)/Im(d^{n-1})$ (H.3)
for $n \geq 0$, with the trivial extension to $C^{-1}(K; M) = 0$.
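The identity $d^{n+1} \circ d^n = 0$ can be checked numerically in the simplest situation. The following Python sketch assumes the constant coefficient system M(s) = Z with identity transition maps (an assumption made here for illustration, not the general case of the text), so that a cochain is an integer-valued function on singular simplexes and the face map $F^j_{n+1}$ acts by deleting the j-th entry of a vertex sequence. All function names are hypothetical.

```python
from itertools import combinations, product
import random


def full_complex(vertices):
    """All non-empty subsets of the vertex set."""
    return {frozenset(c)
            for k in range(1, len(vertices) + 1)
            for c in combinations(vertices, k)}


def singular_simplexes(K, vertices, n):
    """Vertex sequences (s_0, ..., s_n) whose underlying set is a simplex."""
    return [t for t in product(vertices, repeat=n + 1) if frozenset(t) in K]


def coboundary(a, K, vertices, n):
    """d^n(a)(t) = sum over j of (-1)^j * a(t with entry j deleted)."""
    return {t: sum((-1) ** j * a[t[:j] + t[j + 1:]] for j in range(n + 2))
            for t in singular_simplexes(K, vertices, n + 1)}


vertices = (0, 1, 2)
# Boundary of a triangle: all vertexes and edges, but not the 2-simplex.
K = full_complex(vertices) - {frozenset(vertices)}

random.seed(0)
a = {t: random.randint(-5, 5) for t in singular_simplexes(K, vertices, 0)}
da = coboundary(a, K, vertices, 0)
dda = coboundary(da, K, vertices, 1)
```

Every value of `dda` is zero, whatever 0-cochain `a` one starts with, because each doubly reduced sequence appears twice with opposite signs in the double sum.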
Appendix I
Complements on Calculus
I.1
Abstract on Calculus
I.1.1
Definition 171 A pseudo-metric on a set V is a (pseudo-distance) function $d : V \times V \to \mathbb{R}$ such that:
1. (Positivity) $0 \leq d(x, y)$, and $d(x, x) = 0$ for all $(x, y) \in V \times V$.
2. (Symmetry) $d(x, y) = d(y, x)$ for all $(x, y) \in V \times V$.
3. (Triangle inequality) $d(x, z) \leq d(x, y) + d(y, z)$ for all $(x, y, z) \in V \times V \times V$.
If conversely d(x, y) = 0 implies x = y, the pseudo-metric (pseudo-distance function) is called a metric (distance function).
Definition 172 For a pseudo-metric space (X, d), if 0 < r, $x \in X$, the open ball of radius r around x is $B_r(x) = \{y \mid d(y, x) < r\}$. The system of open balls $\{B_r(x) \mid 0 < r, x \in X\}$ is a base of a topology^1, the (uniform) topology associated with the pseudo-metric d. Evidently, this topology is Hausdorff iff the pseudo-metric is a metric. A map $f : V \to V$ of a pseudo-metric space V is called an isometry iff $d(f(x), f(y)) = d(x, y)$ for all $(x, y) \in V \times V$.
Lemma 97 ([73, Lemma 4]) Given an action $\mu : G \times V \to V$ of a group G on a pseudo-metric space (V, d) by isometries, then
$\inf_{g \in G} d(g.x, y) = \inf_{g \in G} d(g.x', y)$ whenever G.x = G.x', (I.1)
$\inf_{g \in G} d(g.x, y) = \inf_{g \in G} d(g.x, y')$ whenever G.y = G.y'. (I.2)
^1 In fact, the system $\{B_r = \{(x, y) \in V^2 \mid d(x, y) < r\} \mid 0 < r\}$ is a base of a uniformity.
With the above notation, we may define
$d^*(G.x, G.y) = \inf_{g \in G} d(g.x, y)$, (I.3)
and lemma 97 guarantees that this is a well-defined function $d^* : G\backslash V \times G\backslash V \to \mathbb{R}$.
Definition 173 If d(x, y) is a pseudo-metric on a set V, and if we have a group action $G \times V \to V$, we say that G acts by isometries iff each map $g. : V \to V$ is an isometry, i.e., iff $d(g.x, g.y) = d(x, y)$ for all $x, y \in V$, $g \in G$.
Lemma 98 ([73, Lemma 5]) Let d be a pseudo-metric on V, and $\mu : G \times V \to V$ a group action by isometries. Then the function $d^*$ defined in (I.3) is a pseudo-distance on the orbit space $G\backslash V$.
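For a finite group, the infimum in the definition of $d^*$ is a minimum and can be computed by enumeration. The following Python sketch uses an example chosen for illustration only: V is the set of pairs of pitch classes modulo 12, d is the sum of the circular distances of the entries, and G is the cyclic group of order 12 acting by simultaneous transposition, which preserves d. The names `circ`, `d`, `act`, and `d_star` are hypothetical.

```python
def circ(a, b, n=12):
    """Circular distance between two residues modulo n."""
    k = (a - b) % n
    return min(k, n - k)


def d(x, y):
    """Metric on tuples of residues: sum of circular distances."""
    return sum(circ(a, b) for a, b in zip(x, y))


def act(g, x, n=12):
    """Transposition by g, an isometry for d."""
    return tuple((a + g) % n for a in x)


def d_star(x, y, n=12):
    """Orbit pseudo-distance: minimum of d(g.x, y) over the group."""
    return min(d(act(g, x), y) for g in range(n))


major_third = (0, 4)
fourth = (2, 7)
```

Here `d_star(major_third, fourth)` is 1, and replacing `major_third` by its transposition (5, 9) gives the same value, in agreement with (I.1). Two pairs in the same orbit, such as (0, 4) and (3, 7), have orbit distance 0.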
I.1.2
Completeness
A Cauchy sequence in a uniform space $(X, \mathcal{U})$ is a sequence $(x_i)_{i = 0, 1, 2, \ldots}$ of elements in X such that for every uniform set $U \in \mathcal{U}$, there is an index t such that $(x_i, x_j) \in U$ for all i, j > t. A uniform space is (sequentially) complete iff every Cauchy sequence converges.
Lemma 99 A closed subspace of a complete uniform space (in particular: of a complete metric space) is complete.
Definition 174 A norm on a real vector space X is a function $\| \ \| : X \to \mathbb{R}$ such that for all $(x, y) \in X \times X$ and all $\lambda \in \mathbb{R}$:
1. (Positivity) $0 \leq \|x\|$, and $\|x\| = 0$ iff x = 0;
2. (Homogeneity) $\|\lambda.x\| = |\lambda| \cdot \|x\|$;
3. (Triangle inequality) $\|x + y\| \leq \|x\| + \|y\|$.
Every norm gives rise to an associated metric $d(x, y) = \|x - y\|$, and therefore to an associated topology. A normed vector space with a complete associated (uniform) topology is called a Banach space.
Example 104 On $\mathbb{R}^n$, we have three well-known norms. If $x = (x_1, \ldots, x_n) \in \mathbb{R}^n$, then
1. the absolute or 1-norm is $\|x\|_1 = \sum_i |x_i|$,
2. the Euclidean norm is $\|x\|_2 = \sqrt{\sum_i x_i^2}$,
3. the uniform norm is $\|x\|_\infty = \max\{|x_i| \mid i = 1, \ldots, n\}$.
For real numbers a < b, we have the vector space $C^0[a, b]$ of continuous real-valued functions on the interval [a, b]. On $C^0[a, b]$, we have three well-known norms (corresponding to the above three norms). For $f \in C^0[a, b]$, we have:
1. the absolute or 1-norm is $\|f\|_1 = \int_a^b |f|$,
2. the Euclidean norm is $\|f\|_2 = (\int_a^b f^2)^{1/2}$,
3. the uniform norm is $\|f\|_\infty = Max_{[a,b]} |f|$.
Two norms $\| \ \|_1$, $\| \ \|_2$ on a real vector space X are called equivalent iff there are two positive constants a, b such that $\| \ \|_1 \leq a \cdot \| \ \|_2$, $\| \ \|_2 \leq b \cdot \| \ \|_1$. Equivalent norms give rise to the same associated uniformities and topologies, so they have the same Cauchy sequences.
Theorem 75 Any two norms on a finite-dimensional real vector space are equivalent. See [307, Th.3.4.I] for a proof. The theorem implies that every finite-dimensional normed real vector space is Banach, since the standard $\mathbb{R}^n$ is so under the Euclidean norm. We shall therefore mainly work in $\mathbb{R}^n$.
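For the three norms of Example 104 on $\mathbb{R}^n$, the equivalence asserted by Theorem 75 is explicit: one has $\|x\|_\infty \leq \|x\|_2 \leq \|x\|_1 \leq n \cdot \|x\|_\infty$. The following Python sketch computes the three norms and lets one check these inequalities on sample vectors; the function names are hypothetical.

```python
import math


def norm1(x):
    """Absolute or 1-norm: sum of absolute values."""
    return sum(abs(t) for t in x)


def norm2(x):
    """Euclidean norm: square root of the sum of squares."""
    return math.sqrt(sum(t * t for t in x))


def norm_inf(x):
    """Uniform norm: maximal absolute value of a coordinate."""
    return max(abs(t) for t in x)


x = (3, -4)
values = (norm1(x), norm2(x), norm_inf(x))
```

For x = (3, -4) the three values are 7, 5, and 4, and the chain of inequalities reads 4 ≤ 5 ≤ 7 ≤ 8.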
I.1.3
Differentiation
We say that two functions $f, g : U \to \mathbb{R}^m$ that are defined in a neighborhood U of $0 \in \mathbb{R}^n$ define the same germ iff they coincide on a common neighborhood of 0. (We are in fact considering the colimit of function spaces on the neighborhood system of 0.) The set of germs in 0 of functions f with f(0) = 0 is denoted by $F_0$. Within this vector space, we have the vector subspace $DF_0$ of those f with f(0) = 0 and $\|f(z)\|/\|z\| \to 0$ if $z \to 0$. We evidently have $Lin_{\mathbb{R}}(\mathbb{R}^n, \mathbb{R}^m) \cap DF_0 = \{0\}$.
Definition 175 A function $f : U \to \mathbb{R}^m$ which is defined in a neighborhood $U \subseteq \mathbb{R}^n$ of a point $x$ is differentiable in $x$ iff there is a linear map $D \in Lin_{\mathbb{R}}(\mathbb{R}^n, \mathbb{R}^m)$ such that $\Delta_x f - D \in DF_0$, where $\Delta_x f(z) = f(x + z) - f(x)$. By the above, $D$ is uniquely determined and is denoted by $Df_x$. The coefficient of row $i$ and column $j$ of the matrix of $Df_x$ in the canonical basis is denoted by $\partial f_i/\partial x_j$, whereas the matrix is called the Jacobian of $f$ in $x$. A function $f : O \to V$ on an open set $O \subseteq \mathbb{R}^n$ with values in an open set $V \subseteq \mathbb{R}^m$ is differentiable if it is differentiable in each point of its domain $O$. A differentiable function on $O$ defines its derivative $Df : O \to Lin_{\mathbb{R}}(\mathbb{R}^n, \mathbb{R}^m) \cong \mathbb{R}^{nm}$, which may again be differentiated according to the norm on the space of linear maps. Inductively we define $D^{r+1} f = D(D^r f)$, if it exists. The function $f$ is $C^r$ iff all derivatives $Df, D^2 f, \ldots D^r f$ exist and are continuous; $C^0$ denotes just the set of continuous functions.
This definition is however not in the right shape for functorial behavior. One therefore adds the linear behavior to the function as follows: Let $TO = O \times \mathbb{R}^n$ be the tangent bundle of the open set $O$. Then we define $Tf : TO \to T\mathbb{R}^m$ by $Tf(x, u) = (f(x), Df_x(u))$. This implies that if $g : U \to \mathbb{R}^l$ is a second differentiable function on an open set $U \subseteq \mathbb{R}^m$, then $g \circ f$ is differentiable and
$T(g \circ f) = Tg \circ Tf$.
So we have a functor $T : f \mapsto Tf$ and the natural transformation $p_{T1} : T \to Id$ of first projection. More generally, defining $T^{r+1} f = T(T^r f)$, we also have
$T^r(g \circ f) = T^r g \circ T^r f$.
Moreover, if we identify $Lin_{\mathbb{R}}(\mathbb{R}^n, Lin_{\mathbb{R}}(\mathbb{R}^n, \mathbb{R}^m))$ with $Bil_{\mathbb{R}}(\mathbb{R}^n, \mathbb{R}^m)$, etc. for higher multilinear maps, the higher derivatives $D^r f_x$ identify to $r$-linear maps $(\mathbb{R}^n)^r \to \mathbb{R}^m$.
Proposition 115 If $f$ is $C^r$, then $D^r f_x$ is a symmetric $r$-linear map.
The category of $r$ times differentiable or $C^r$ functions has the property that the linear parts of the tangent maps compose as normal linear maps do, and this means that the Jacobians of isomorphisms are invertible square matrices. A curve in $\mathbb{R}^n$ is a $C^1$-map $y : U \to \mathbb{R}^n$. Its derivative $Dy_t$ in a point $t \in U$ is a linear map $\mathbb{R} \to \mathbb{R}^n$ which identifies to the image $Dy_t(1) \in \mathbb{R}^n$ of $1$, meaning that the derivative can be identified with a continuous map $y' : U \to \mathbb{R}^n : t \mapsto y'(t) = Dy_t(1)$.
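The functoriality $T(g \circ f) = Tg \circ Tf$ of the tangent map can be checked numerically. The sketch below is a hedged illustration, not the text's construction: the Jacobian is approximated by central differences, and the maps `f`, `g` as well as all helper names are hypothetical choices made for the example.

```python
import math

def jacobian(f, x, h=1e-6):
    """Central-difference approximation of the Jacobian matrix of f at x."""
    m, n = len(f(x)), len(x)
    J = [[0.0] * n for _ in range(m)]
    for j in range(n):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        fp, fm = f(xp), f(xm)
        for i in range(m):
            J[i][j] = (fp[i] - fm[i]) / (2 * h)
    return J

def apply_matrix(J, u):
    """Matrix-vector product J u."""
    return [sum(a * b for a, b in zip(row, u)) for row in J]

def tangent_map(f):
    """Return Tf : (x, u) -> (f(x), Df_x(u))."""
    def Tf(x, u):
        return f(x), apply_matrix(jacobian(f, x), u)
    return Tf

def f(x):            # sample map R^2 -> R^2
    return [x[0] * x[1], math.sin(x[0])]

def g(y):            # sample map R^2 -> R^1
    return [y[0] ** 2 + y[1]]

def g_after_f(x):
    return g(f(x))

x, u = [0.5, 2.0], [1.0, -1.0]
lhs = tangent_map(g_after_f)(x, u)            # T(g o f)(x, u)
rhs = tangent_map(g)(*tangent_map(f)(x, u))   # (Tg o Tf)(x, u)
print(lhs, rhs)
```

Both sides agree up to the finite-difference error; the second components are the chain rule $D(g \circ f)_x(u) = Dg_{f(x)}(Df_x(u))$.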
1.2
Ordinary Differential Equations (ODEs)
Throughout this section, $D$ denotes an open set in $\mathbb{R}^n$, and $f : D \to \mathbb{R}^n$ denotes a continuous vector field (a function) with components $f_i$, $i = 1, \ldots n$.
Definition 176 Let $\xi \in \mathbb{R}$, $\eta \in D$, $J(\xi)$ an open interval containing $\xi$, and $U(\eta) \subset D$ an open neighborhood of $\eta$ in $D$. Denote by $A(f, \xi, \eta, J(\xi), U(\eta))$ the set of all $C^1$-functions $y : J(\xi) \to U(\eta)$ such that
$y' = f \circ y$ and $y(\xi) = \eta$. (1.4)
Denote by $B(f, \xi, \eta, J(\xi), U(\eta))$ the set of all $C^1$-functions $y : J(\xi) \to U(\eta)$ such that
$y(t) = \eta + \int_\xi^t f \circ y$ for all $t \in J(\xi)$. (1.5)
Lemma 100 With the above definitions, we have
$A(f, \xi, \eta, J(\xi), U(\eta)) = B(f, \xi, \eta, J(\xi), U(\eta))$.
The easy proof is left to the reader.
1.2.1
The Fundamental Theorem: Local Case
The following theorem is called the local case of the fundamental theorem of ordinary differential equations.
Theorem 76 With the preceding notation and definitions, suppose that $f$ is locally Lipschitz, i.e., for every $x \in D$, there is a neighborhood $U(x) \subset D$ and a positive number $L$ such that $x_1, x_2 \in U(x)$ implies $|f(x_1) - f(x_2)| \le L\cdot|x_1 - x_2|$. Then for any "initial condition" $\xi \in \mathbb{R}, \eta \in D$, there is an open interval $J(\xi)$ containing $\xi$, and an open neighborhood $U(\eta)$ of $\eta$ such that $A(f, \xi, \eta, J(\xi), U(\eta))$ is a singleton. The element of $A$ is called the local solution of the differential equation $y' = f \circ y$ at $J(\xi), U(\eta)$.
The proof uses lemma 100 and refers to the set $B$. In fact, it is shown that the operator
$T(g)(t) = \eta + \int_\xi^t f \circ g$
is a contraction, and contractions have a unique fixpoint.
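The operator used in this proof, which sends a curve $g$ to $\eta + \int_\xi^t f \circ g$, can be iterated explicitly in simple cases (this is usually called Picard iteration). The following sketch makes strong simplifying assumptions that are not in the text: $D = \mathbb{R}$, the vector field is $f(y) = y$, and $\xi = 0$, so that every iterate is a polynomial and can be stored exactly as a coefficient list. All function names are hypothetical.

```python
import math
from fractions import Fraction

def picard_step(coeffs, eta):
    """Apply T(g) = eta + integral from 0 to t of g, for the field f(y) = y.

    A polynomial g is stored as its coefficient list [c0, c1, c2, ...].
    """
    integral = [Fraction(0)] + [c / (k + 1) for k, c in enumerate(coeffs)]
    integral[0] += eta
    return integral

def picard(eta, steps):
    """Start from the constant map t -> eta and apply T `steps` times."""
    eta = Fraction(eta)
    g = [eta]
    for _ in range(steps):
        g = picard_step(g, eta)
    return g

def evaluate(coeffs, t):
    """Evaluate the polynomial with the given coefficients at t."""
    return sum(float(c) * t ** k for k, c in enumerate(coeffs))

print(picard(1, 3))                                  # coefficients 1, 1, 1/2, 1/6
print(evaluate(picard(1, 15), 0.5), math.exp(0.5))   # iterate vs. exact solution
```

Under these assumptions the iterates are the Taylor partial sums of the exponential function, which is the unique fixpoint of $T$, i.e., the local solution of $y' = y$, $y(0) = 1$.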
Proposition 116 Let $T : X \to X$ be a contraction on a complete metric space² $(X, d)$, i.e., there is a constant $0 < c < 1$ such that $d(T(x), T(y)) \le c\cdot d(x, y)$ for all $x, y \in X$. Then, $T$ has a unique fixpoint $z = T(z)$.
Proof. It suffices to show that the sequence $(x_n = T^n(x))$ is Cauchy. In fact, setting $k = |n - m|$, we have $d(x_n, x_m) = d(T^n(x), T^m(x)) \le c^{Min(n,m)}\cdot d(x, T^k(x))$. But
$d(x, T^k(x)) \le d(x, T(x)) + d(T(x), T^2(x)) + \ldots d(T^{k-1}(x), T^k(x)) \le (1 + c + \ldots c^{k-1})\, d(x, T(x)) \le \frac{1}{1 - c}\, d(x, T(x))$.
So this term is limited, while $c^{Min(n,m)}$ tends to zero as $n, m$ tend to infinity, QED.
Corollary 30 Let $X$ be a complete metric space, and $B = B_r(x)$ the closed ball of radius $r > 0$ around $x$. Let $T : B \to X$ be a contraction with $d(T(x), x) \le (1 - c)r$, $0 < c < 1$. Then $T$ has a unique fixpoint in $B$.
Proof. We know from lemma 99 that $B$ is complete. Further, for $y \in B$, we have $d(T(y), x) \le d(T(y), T(x)) + d(T(x), x) \le c\cdot d(y, x) + (1 - c)r \le r$. Therefore, $T$ leaves $B$ invariant and the claim follows from proposition 116, QED.
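The geometric-series estimate in the proof of proposition 116 also yields the a priori bound $d(T^n(x), z) \le \frac{c^n}{1 - c}\, d(x, T(x))$ for the distance to the fixpoint $z$. The sketch below illustrates this for a sample contraction chosen for the example only: $T = \cos$ on the complete space $[0, 1]$, which maps $[0, 1]$ into itself and has contraction constant $c = \sin(1)$. The helper names are hypothetical.

```python
import math

def iterate(T, x0, n):
    """Return T^n(x0)."""
    x = x0
    for _ in range(n):
        x = T(x)
    return x

T = math.cos          # sample contraction on [0, 1]
c = math.sin(1.0)     # contraction constant: sup of |cos'| on [0, 1]
x0 = 1.0
z = iterate(T, x0, 200)   # numerical fixpoint, z = cos(z)

def a_priori_bound(n):
    """Upper bound c^n / (1 - c) * d(x0, T(x0)) for d(T^n(x0), z)."""
    return c ** n / (1.0 - c) * abs(x0 - T(x0))

for n in (0, 5, 10, 20):
    print(n, abs(iterate(T, x0, n) - z), a_priori_bound(n))
```

The actual error stays below the bound for every $n$; the bound is usually pessimistic because $c$ is a worst-case constant over the whole space.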
Corollary 31 With the notation of corollary 30, suppose that $T : B_r(x) \to X$ is a contraction with $d(T(x), x) < (1 - c)r$, $0 < c < 1$. Then there is a unique fixpoint of $T$ in $B_r(x)$.
Next, we need some auxiliary results concerning uniform convergence of continuous functions. Let $W$ be a Banach space (in our case $W = \mathbb{R}^n$), $A$ a set, then we set
$B(A, W) = \{f : A \to W \mid \|f\|_\infty < \infty\}$.
Proposition 117 The set $B(A, W)$ with the usual scalar multiplication and addition of functions is a Banach space.
It is clear that $B(A, W)$ is a vector space. Let $(f_n)_n$ be a Cauchy sequence in $B(A, W)$. Since for any $x \in A$, $\|f_n(x) - f_m(x)\| \le \|f_n - f_m\|$, and the right term converges to zero, $(f_n(x))_n$ is also a Cauchy sequence in $W$ and converges to $\lim_{n\to\infty} f_n(x) = f(x)$. We first show that $\lim_{n\to\infty} f_n = f$. For $0 < \varepsilon$, let $N$ be such that $n, m > N$ implies $\|f_m - f_n\| < \varepsilon$. Then by definition, for all $x \in A$, $\|f_n(x) - f(x)\| = \|f_n(x) - \lim_{m>N} f_m(x)\| = \lim_{m>N} \|f_n(x) - f_m(x)\| \le \varepsilon$. Therefore $\|f_n - f\| \le \varepsilon$, and $f = (f - f_n) + f_n$ is a sum of two elements of $B(A, W)$ and therefore lives in $B(A, W)$, whereas $\lim_{n\to\infty} f_n = f$, QED.
² See this appendix, section 1.1.1.
Theorem 77 Let $A$ be a metric space, $W$ a Banach space, and let
$BC(A, W) = B(A, W) \cap C^0(A, W)$
be the set of continuous functions with limited norm. Then $BC(A, W) \subset B(A, W)$ is a closed sub-vector space, and therefore also Banach.
Proof. It is clearly a sub-vector space. Let $(f_n)$ be a Cauchy sequence in $BC(A, W)$. It converges to $f$ in $B(A, W)$. We have to show that $f$ is also continuous. In fact, given $0 < \varepsilon$ select $n$ such that $\|f - f_n\| < \varepsilon/3$. Let $a \in A$. Take $0 < \delta$ such that $d(x, a) < \delta$ implies $\|f_n(x) - f_n(a)\| < \varepsilon/3$. Then $\|f(x) - f(a)\| \le \|f(x) - f_n(x)\| + \|f_n(x) - f_n(a)\| + \|f_n(a) - f(a)\| < \varepsilon/3 + \varepsilon/3 + \varepsilon/3$, QED.
We are now ready for the proof of the local theorem. Recall that we are given a locally Lipschitz vector field function $f : D \to \mathbb{R}^n$. Consider the Banach space $BC = BC(J(\xi), \mathbb{R}^n)$ for an interval $J(\xi)$ whose length $\delta$ will be determined in the course of the proof. Select $0 < r$ such that (1) the closed ball $\overline{B_r(\eta)} \subset D$, and (2) $f|\overline{B_r(\eta)}$ is Lipschitz with a constant $L$. Then $f$ is evidently limited on $\overline{B_r(\eta)}$, let $m$ be an upper bound. Let $\tilde\eta : J(\xi) \to \overline{B_r(\eta)} : t \mapsto \eta$ be the constant map. Consider the closed ball $\overline{B_r(\tilde\eta)} \subset BC$ around $\tilde\eta$. For every $g \in \overline{B_r(\tilde\eta)}$, $f \circ g : J(\xi) \to \mathbb{R}^n$ lives in $BC$. We now show that the operator $T(g)(t) = \eta + \int_\xi^t f \circ g$ defines a contraction with contraction constant $c$ such that $d(T(\tilde\eta), \tilde\eta) < (1 - c)r$. According to corollary 30, this will imply that $T$ has a unique fixpoint in $\overline{B_r(\tilde\eta)}$ and we are done. Evidently, $T(g)$ is continuous. Further, for any $x \in J(\xi)$, we have $|T(g)(x)| \le |\eta| + |\int_\xi^x f \circ g| \le |\eta| + |x - \xi|\cdot\|f \circ g\|_\infty$ which evidently is finite. We are left with the contraction claims. We have
$\|T(\tilde\eta) - \tilde\eta\|_\infty = lub_{J(\xi)} \|T(\tilde\eta)(t) - \tilde\eta(t)\| = lub_{J(\xi)} \|\int_\xi^t f(\eta)\| = lub_{J(\xi)} |t - \xi|\cdot|f(\eta)| \le \delta\cdot|f(\eta)| \le \delta\cdot m$.
For two functions $g_1, g_2 \in \overline{B_r(\tilde\eta)}$, we have
$\|T(g_1) - T(g_2)\| = lub_{J(\xi)} |\int_\xi^t f(g_1) - f(g_2)| \le \delta\cdot\|f \circ g_1 - f \circ g_2\| = \delta\cdot lub_{J(\xi)} |f(g_1(s)) - f(g_2(s))| \le \delta\cdot L\cdot lub_{J(\xi)} |g_1(s) - g_2(s)| = \delta\cdot L\cdot\|g_1 - g_2\|$.
This means that $T$ is a contraction with $c = \delta\cdot L$ if $\delta$ is such that $\delta\cdot L < 1$. Further, we need $\delta\cdot m < (1 - c)r = (1 - \delta\cdot L)r$, i.e., $\delta < \frac{r}{m + L r}$ solves the problem, QED.
1.2.2
The Fundamental Theorem: Global Case
The global fundamental theorem deals with maximal integral curves $y : J \to D$ for the differential equation $y' = f \circ y$.
Definition 177 We say that $u \sim v$ for $u, v \in D$ iff there is a curve $y : J \to D$, defined on an open interval $J$, which solves the differential equation $y' = f \circ y$, and such that $\{u, v\} \subset y(J)$.
Lemma 101 The relation $\sim$ is an equivalence relation. The equivalence class of an element $x \in D$ is denoted by $[x]$.
It is clearly reflexive and symmetric. It is transitive for the following reason. Let $y_i : J_i \to D$, $i = 1, 2$ be two integral curves such that $y_1(t_1) = x$, $y_1(t_2) = y$, $y_2(t_3) = y$, $y_2(t_4) = z$. By an evident parameter shift, we may suppose $t_2 = t_3$. We claim that
$y_1|J_1 \cap J_2 = y_2|J_1 \cap J_2$.
Suppose that $y_1(t) \ne y_2(t)$ for a $t > t_2$. Let $t_2 \le t_0$ be the infimum of these $t$. Since our curves are continuous, we have $y_1(t_0) = y_2(t_0)$. But then, according to the local theorem 76, there is an $\varepsilon$-ball $U_\varepsilon(t_0)$ around $t_0$ and a neighborhood $U(y_1(t_0))$ of $y_1(t_0) = y_2(t_0)$ such that there is a unique integral curve $y : U_\varepsilon(t_0) \to U(y_1(t_0))$. But we may suppose WLOG that $\varepsilon$ is so small that both $y_1|U_\varepsilon(t_0)$, $y_2|U_\varepsilon(t_0)$ have their codomains in $U(y_1(t_0))$. Evidently, these solutions must then coincide with the unique solution on the open interval $U_\varepsilon(t_0)$, but this contradicts the choice of $t_0$. A symmetric argument holds for the supremum $s_0 \le t_2$ of those arguments with $y_1(t) \ne y_2(t)$. Therefore, $y_1|J_1 \cap J_2 = y_2|J_1 \cap J_2$, and we may extend the integral curves $y_1, y_2$ to the domain $J_1 \cup J_2$, whence the transitivity of the $\sim$-relation, QED.
Theorem 78 Let $x \in D$. Then there is a unique integral curve $y : J \to D$ with $y(0) = x$, $y' = f \circ y$, and such that $J$ contains all domains of any integral curve $z$, $z(0) = x$, $z' = f \circ z$. We have $y(J) = [x]$ and write $\int_x f$ for this curve; it is called the global solution through $x$.
Proof. Let $\Gamma = \{\Gamma_{y_i} \subset \mathbb{R} \times D \mid \Gamma_{y_i}$ = graph of solution $y_i$ of $y_i' = f \circ y_i$, $x = y_i(0)\}$. Since two solutions coincide on the intersection of their domains, the union $\bigcup\Gamma$ is functional, and the union of the domains is an open interval $J$. Further, the function $y$ of this graph is a solution of the differential equation $y' = f \circ y$ which reaches all elements equivalent to $x$, QED.
Definition 178 The quotient $D/\!\sim\ = \{\int_x f \mid x \in D\}$ is called the phase portrait of the vector field $f$ and denoted by $D/f$. An integral curve which is not an injective function of its parameter is called a cycle of the field $f$.
Proposition 118 Let $\int_w f$ be a cycle with $\int_w f(t_1) = \int_w f(t_1 + T)$. Then the cycle's domain is $\mathbb{R}$ and $\int_w f$ is $T$-periodic.
Proof. Let $y : J \to D$ be the cycle $\int_w f$ with $y(t_1) = z$. Consider the function $\tilde y = y \circ e^T : J - T \to D$. Evidently, $\tilde y(t_1) = y(t_1)$, and $\tilde y$ also solves the differential equation since $\tilde y'(t) = y'(t + T) = f \circ y(t + T) = f \circ \tilde y(t)$. So, since $\tilde y$ has a common value with $y$ at $t_1$, by maximality of $y$, we have $\tilde y = y|J - T$, and $J - T \subset J$, whence $J = ]-\infty, b[$. Symmetrically, exchanging $t_1$ with $t_1 + T$, and $T$ with $-T$, we obtain $J = ]a, \infty[$, i.e., $J = \mathbb{R}$. Now, for any $t \in \mathbb{R}$, with $\tilde y = y \circ e^T$, uniqueness guarantees $\tilde y = y$, whence the periodicity of $y$, QED.
Proposition 119 Suppose that $\overline{D}$ is compact (e.g.: $D$ is bounded), and that $f$ is locally Lipschitz on $\overline{D}$. If the domain $J = ]a, b[$ of a maximal curve $\int_x f$ has finite upper bound $b$, then $t \to b$ implies $\int_x f(t) \to \partial D$.
Sketch of proof: Write $y = \int_x f$, and suppose that the closure of $y(J)$ were in $D$. Then, since $\overline{y(J)}$ is compact, there is a convergent sequence $t_n \to b$ with a convergent image sequence $y(t_n) \to q$, $q \in D$. It can be shown that $q$ is uniquely determined, i.e., another such sequence yields the same limit. We then set $y(b) = \lim_{t\to b} y(t) = q$, and $y$ may be extended to a local solution containing $b$, a contradiction, QED.
1.2.3
Flows and Differential Equations
On an open set $O \subseteq \mathbb{R}^n$, a vector field (a $C^1$-map) $f : O \to \mathbb{R}^n$ can also be viewed by its graph as a section $O \to TO : x \mapsto F(x) = (x, f(x))$. If $x \in O$, an integral curve of $F$ in $x$ is a curve $y : U(0) \to O$ defined on an open neighborhood $U(0)$ of $0$ such that $y(0) = x$ and $y' = f \circ y$. By the main theorem 78 of ODEs, there is a unique maximal integral curve $\int_x f$ for every point $x \in O$.
For a vector field $F$ on $O \subseteq \mathbb{R}^n$, a flow box is a triple $(U, a, W)$ where $U \subseteq O$ is open, $a$ is a positive real number or $\infty$, and $W : U \times ]-a, a[ \to O$ is $C^1$ such that for all $x \in U$, $W_x : ]-a, a[ \to O : t \mapsto W(x, t)$ is an integral curve of $F$ at $x$. Two flow boxes $(U, a, W)$, $(U', a', W')$ always coincide in their maps $W, W'$ on the intersection $(U \cap U') \times (]-a, a[ \cap ]-a', a'[)$ of their domains. For each point $x \in O$, there is a flow box $(U, a, W)$ with $x \in U$. Let $\mathcal{D}_F = \{(t, x) \in \mathbb{R} \times O \mid$ there is an integral curve $\int_x$ whose domain contains $t\}$. Then (1) $\mathcal{D}_F$ is open in $\mathbb{R} \times \mathbb{R}^n$; (2) there is a unique map $W_F : \mathcal{D}_F \to O$ such that $t \mapsto W_F(t, x)$ is an integral curve at $x$ for all $x \in O$.
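Since two integral curves through the same point coincide, following the flow for time $s$ and then for time $t$ gives the same point as following it for time $s + t$. The sketch below illustrates this numerically under assumptions of its own: the rotation field $f(x_1, x_2) = (-x_2, x_1)$ on $\mathbb{R}^2$, whose integral curves are circles, and a classical Runge-Kutta integrator standing in for the exact flow. The names `rk4_flow` and `rotation_field` are hypothetical.

```python
import math

def rk4_flow(f, x, t, steps=1000):
    """Approximate the point reached from x after time t along the field f."""
    h = t / steps
    x = list(x)
    for _ in range(steps):
        k1 = f(x)
        k2 = f([xi + 0.5 * h * ki for xi, ki in zip(x, k1)])
        k3 = f([xi + 0.5 * h * ki for xi, ki in zip(x, k2)])
        k4 = f([xi + h * ki for xi, ki in zip(x, k3)])
        x = [xi + h / 6.0 * (a + 2 * b + 2 * c + d)
             for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]
    return x

def rotation_field(x):
    """Sample vector field on R^2 whose integral curves are circles."""
    return [-x[1], x[0]]

x = [1.0, 0.0]
direct = rk4_flow(rotation_field, x, 0.7)
in_two_steps = rk4_flow(rotation_field, rk4_flow(rotation_field, x, 0.3), 0.4)
print(direct, in_two_steps, [math.cos(0.7), math.sin(0.7)])
```

For this field the flow is defined for all times, so one may take $a = \infty$ in the flow box.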
1.2.4
Vector Fields and Derivations
For a $C^1$-function $f : O \to \mathbb{R}$, we have the derivative $Tf : TO \to T\mathbb{R}$, whose second component evaluates to linear forms on $\mathbb{R}^n$. This map $df = p_{T2} \circ Tf = Df$ is called the differential of $f$. If $F : O \to TO$ is a vector field, the composition $L_F f = df \circ F : O \to \mathbb{R}$ is called the Lie derivative of $f$ with respect to $F$. If we denote by $grad(f)$ the differential of $f$ as a tangent vector $(\partial_{x_1} f, \ldots \partial_{x_n} f)$ (the old-fashioned gradient of $f$), the Lie derivative is just the scalar product of $grad(f)$ with the vector field. If $\mathcal{F}(O)$ denotes the real algebra³ of $C^1$-functions on $O$, the map $L_F : \mathcal{F}(O) \to \mathcal{F}(O)$ is a derivation in the sense that:
(i) $L_F$ is linear;
(ii) for $f, g \in \mathcal{F}(O)$, we have $L_F(f\cdot g) = f\cdot L_F(g) + L_F(f)\cdot g$;
(iii) if $c \in \mathcal{F}(O)$ is constant, then $L_F c = 0$.
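The description of the Lie derivative as the scalar product of the gradient with the vector field, and the derivation property (ii), can be checked numerically. The following sketch is only an illustration: the gradient is approximated by central differences, and the field `F` and the functions `f`, `g` are sample choices that do not come from the text.

```python
def grad(f, x, h=1e-6):
    """Central-difference approximation of the gradient of f at x."""
    result = []
    for j in range(len(x)):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        result.append((f(xp) - f(xm)) / (2 * h))
    return result

def lie_derivative(F, f):
    """Return the function L_F f : x -> scalar product of grad f(x) with F(x)."""
    def LFf(x):
        return sum(gi * Fi for gi, Fi in zip(grad(f, x), F(x)))
    return LFf

def F(x):            # sample vector field (rotation) on R^2
    return [-x[1], x[0]]

def f(x):            # constant along the circles, so L_F f = 0
    return x[0] ** 2 + x[1] ** 2

def g(x):            # L_F g = x1^2 - x2^2
    return x[0] * x[1]

def fg(x):           # pointwise product f.g
    return f(x) * g(x)

x = [1.5, 0.5]
left = lie_derivative(F, fg)(x)
right = f(x) * lie_derivative(F, g)(x) + lie_derivative(F, f)(x) * g(x)
print(left, right)   # both close to 5.0
```

In this example $L_F f$ vanishes because $f$ is constant on the integral curves of $F$, which is the typical use of the Lie derivative: it measures the change of a function along the flow.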
Therefore, we also have $d(f\cdot g) = df\cdot g + f\cdot dg$ and $dc = 0$ for a constant $c$. Denote by $VF(O)$ the vector space of all $C^1$-vector fields on $O$. Then:
Theorem 79 The Lie map $L_? : VF(O) \to Der(\mathcal{F}(O)) : F \mapsto L_F$ is an isomorphism of vector spaces.
See [2, Th.8.1O] for a proof. In particular, the Lie bracket $[L_F, L_G] = L_F \circ L_G - L_G \circ L_F$ which is a derivation, must be the Lie derivative of a unique vector field which is denoted by $[F, G]$, the Lie bracket of the vector fields $F$ and $G$. The Lie bracket makes the vector space $VF(O)$ into a real Lie algebra, see section E.4.4.
³ Multiplication goes pointwise.
1.3
Partial Differential Equations
For this section, refer to [252]. We only need a short review of quasi-linear first order partial differential equations (PDE). Recall that a PDE is an equation of type
$E(x_1, x_2, \ldots u, u_{x_1}, u_{x_2}, \ldots u_{x_1x_1}, u_{x_1x_2}, \ldots) = 0$
where $u$ is a function of the $n$ real variables $x_1, x_2, \ldots x_n$, with its partial derivatives $u_{x_1}, \ldots$, the higher partial derivatives $u_{x_1x_1} \ldots$ etc. A solution is meant to be such a function $u$ which is defined in an open set $O$ of $\mathbb{R}^n$. Its order $m$ is the highest number of iterated partial derivatives, whereas $E$ is called quasi-linear iff it is an affine function of the derivatives of $u$ of highest order $m$, with coefficients that are functions of the variables $x_1, x_2, \ldots u, u_{x_1} \ldots$ until derivatives of order $m - 1$.
A first-order quasi-linear PDE has the shape
$\sum_{i=1}^n a_i(x_1, \ldots x_n, u)\, u_{x_i} = c(x_1, \ldots x_n, u)$
and can be solved by a system of ODEs; this is the method of characteristics. We illustrate the method for two variables, i.e., for the equation
$a(x, y, u)u_x + b(x, y, u)u_y = c(x, y, u)$. (1.6)
The solution $u(x, y)$ is represented as a surface $z = u(x, y)$ in $\mathbb{R}^3$. Such a surface is called an integral surface of the equation (1.6). We have a vector field
$F(x, y, z) = (a(x, y, z), b(x, y, z), c(x, y, z))$
on the common domain $U$ of the three functions $a, b, c$. The tangent space of an integral surface at $(x, y, u(x, y))$ is spanned by the vectors $X = (1, 0, u_x)$ and $Y = (0, 1, u_y)$. Their vector product $Y \wedge X = (u_x, u_y, -1)$ is the normal vector to the integral surface. Therefore equation (1.6) just means that the scalar product $(F(x, y, z), Y \wedge X)$ vanishes identically, i.e., the vector field $F$ is tangent to the integral surface. Clearly, only the directions of the vectors of the vector field $F$, the characteristic directions, matter for the equation (1.6). It is easily seen that an integral curve of $F$, if it crosses a point of an integral surface, is entirely contained in this surface. Therefore an integral surface is the union of integral curves of the directional vector field $F$. An integral surface can be constructed by finding a curve $\Gamma$ which lies in an integral surface, and which is never parallel to an integral curve of $F$. This is the Cauchy problem for the equation (1.6). Then, the parameter of $\Gamma$ and the curve parameter of a flow box (see section 1.2.3) around $\Gamma$ describe the integral surface. Technically, the existence condition for $\Gamma(t) = (\Gamma_x(t), \Gamma_y(t), \Gamma_z(t))$ to generate a surface is that the projection $\Gamma_{xy}(t) = (\Gamma_x(t), \Gamma_y(t))$ is never parallel to the projection $F_{xy}$ of the directional field on the $xy$ plane. The existence of a curve $\Gamma$ is again guaranteed by the main theorem of ODEs, and we are done.
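The construction can be followed on a small example. The sketch below is a hedged illustration with data chosen for the purpose, none of which comes from the text: the equation $u_x + u_y = u$ (so $a = b = 1$, $c = z$), the Cauchy curve $\Gamma(s) = (s, 0, \sin s)$, and a Runge-Kutta integrator standing in for the exact flow of $F$. Because $a = b = 1$, the point $(x, y)$ is reached from $\Gamma(x - y)$ after flow time $y$; in general this inversion has to be computed.

```python
import math

def rk4(F, p, t, steps=200):
    """Follow the integral curve of F from the point p for time t."""
    h = t / steps
    p = list(p)
    for _ in range(steps):
        k1 = F(p)
        k2 = F([pi + 0.5 * h * ki for pi, ki in zip(p, k1)])
        k3 = F([pi + 0.5 * h * ki for pi, ki in zip(p, k2)])
        k4 = F([pi + h * ki for pi, ki in zip(p, k3)])
        p = [pi + h / 6.0 * (a + 2 * b + 2 * c + d)
             for pi, a, b, c, d in zip(p, k1, k2, k3, k4)]
    return p

def F(p):
    """Characteristic field (a, b, c) = (1, 1, z) of the sample equation."""
    x, y, z = p
    return [1.0, 1.0, z]

def gamma(s):
    """Sample Cauchy curve; its xy-projection (s, 0) is never parallel to (1, 1)."""
    return [s, 0.0, math.sin(s)]

def u(x, y):
    """Height of the integral surface over (x, y), via the characteristic."""
    s, t = x - y, y
    return rk4(F, gamma(s), t)[2]

print(u(1.3, 0.8), math.sin(0.5) * math.exp(0.8))
```

For this example the integral surface is known in closed form, $u(x, y) = \sin(x - y)\cdot e^y$, which the numerical value reproduces.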
Part XVII
Appendix: Tables
Appendix J
Euler's Gradus Function
This table lists the rational numbers $x/y$ with Euler's gradus suavitatis $\Gamma(x/y) \le 10$, see also [71].

Γ     Intervals
2     1/2
3     1/3, 1/4
4     1/6, 2/3, 1/8
5     1/5, 1/9, 1/12, 3/4, 1/16
6     1/10, 2/5, 1/18, 2/9, 1/24, 3/8, 1/32
7     1/7, 1/15, 3/5, 1/20, 4/5, 1/27, 1/36, 4/9, 1/48, 3/16, 1/64
8     1/14, 2/7, 1/30, 2/15, 3/10, 5/6, 1/40, 5/8, 1/54, 2/27, 1/72, 8/9, 1/96, 3/32, 1/128
9     1/21, 3/7, 1/25, 1/28, 4/7, 1/45, 5/9, 1/60, 3/20, 4/15, 5/12, 1/80, 5/16, 1/81, 1/108, 4/27, 1/144, 9/16, 1/192, 3/64, 1/256
10    1/42, 2/21, 3/14, 6/7, 1/50, 2/25, 1/56, 7/8, 1/90, 2/45, 5/18, 9/10, 1/120, 3/40, 5/24, 8/15, 1/160, 5/32, 1/162, 2/81, 1/216, 8/27, 1/288, 9/32, 1/384, 3/128, 1/512
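The rows of the table can be recomputed from the usual formula for Euler's gradus function, which is assumed here since this appendix does not restate it: for a reduced fraction $x/y$ and the prime factorization $x\cdot y = \prod_i p_i^{e_i}$, one sets $\Gamma(x/y) = 1 + \sum_i e_i(p_i - 1)$. The function names in the following sketch are hypothetical.

```python
from math import gcd, isqrt

def prime_factors(n):
    """Return the prime factorization of n as a dict {prime: exponent}."""
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors

def gradus(x, y):
    """Gradus of x/y: 1 + sum of e * (p - 1) over the primes of x*y (reduced)."""
    d = gcd(x, y)
    n = (x // d) * (y // d)   # lcm of the reduced numerator and denominator
    return 1 + sum(e * (p - 1) for p, e in prime_factors(n).items())

def intervals(r):
    """All reduced fractions x/y with x < y and gradus r, as pairs (x, y)."""
    result = []
    # gradus(1, n) >= 1 + log2(n), so n = x*y never exceeds 2**(r - 1)
    for n in range(2, 2 ** (r - 1) + 1):
        if gradus(1, n) != r:
            continue
        for x in range(1, isqrt(n) + 1):
            if n % x == 0 and gcd(x, n // x) == 1:
                result.append((x, n // x))
    return result

print(intervals(5))   # [(1, 5), (1, 9), (1, 12), (3, 4), (1, 16)]
```

The output for $\Gamma = 5$ reproduces the corresponding row of the table, in the same order.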
Appendix K
Just and Well-Tempered Tuning
This table lists the just coordinates of the just tuning intervals (with respect to c, second tone in first column) according to Vogel [547], see subsubsection 7.2.1.2, together with the value in Cents, and the deviation in % from the tempered tuning with 100, 200, 300, etc. Cents.

Tone    Frequency    Octave    Fifth     Third     Pitch      %
name    ratio        coord.    coord.    coord.    (Ct)       deviation
c       1             0         0         0           0         0
d♭      16/15         4        -1        -1         111.73    +11.73
d       9/8          -3         2         0         203.91     +1.96
e♭      6/5           1         1        -1         315.65     +5.22
e       5/4          -2         0         1         386.31     -3.42
f       4/3           2        -1         0         498.05     -0.39
f♯      45/32        -5         2         1         590.22     -1.63
g       3/2          -1         1         0         701.96     +0.28
a♭      8/5           3         0        -1         813.69     +1.71
a       5/3           0        -1         1         884.36     -1.74
b♭      16/9          4        -2         0         996.09     -0.39
b       15/8         -3         1         1        1088.27     -1.07
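The columns of this table are related by simple arithmetic, which the following sketch retraces. It rests on a reading of the table rows rather than on a statement in the text: the three coordinates are taken as the exponents of 2, 3 and 5 in the frequency ratio, the pitch is $1200\cdot\log_2$ of the ratio, and the deviation is taken relative to the tempered value of $100\cdot k$ Cents for the $k$-th semitone. The function names are hypothetical.

```python
import math
from fractions import Fraction

def just_ratio(octave, fifth, third):
    """Frequency ratio 2^octave * 3^fifth * 5^third as an exact fraction."""
    return Fraction(2) ** octave * Fraction(3) ** fifth * Fraction(5) ** third

def cents(ratio):
    """Pitch of a frequency ratio in Cents."""
    return 1200.0 * math.log2(float(ratio))

def deviation_percent(ratio, semitones):
    """Relative deviation in % from the tempered pitch of 100 * semitones Cents."""
    tempered = 100.0 * semitones
    return 100.0 * (cents(ratio) - tempered) / tempered

r = just_ratio(4, -1, -1)                  # coordinates of the row with ratio 16/15
print(r, round(cents(r), 2), round(deviation_percent(r, 1), 2))
```

For the row with coordinates (4, -1, -1) this gives the ratio 16/15 and the pitch 111.73 Cents listed in the table.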
Appendix L
Chord and Third Chain Classes
L.1
Chord Classes
This section contains the list of all isomorphism classes of zero-addressed chords in $PiMod_{12}$. The meanings of the column items are explained in subsection 11.3.7; here we give a short definition.
• Class Nr. is the number of the isomorphism class, numbers with extension ".1" indicate the class number for classification under symmetries from Z (no fifth or fourth transformations). Autocomplementary classes have a star after the number.
• Representative of Nr. without hat is the number's representative in full circles, the one with hat is the complementary chord.
• Group of symmetries is Sym(Nr.). To keep notation readable, we use the notation with linear factor to the left.
• Conj. Class denotes the conjugacy class symbol of Sym(Nr.) and refers to the numbering 1, 2, ... 19 from [402].
• Card. End. Cl. Nr.|N̂r. is the pair of numbers of conjugacy classes of endomorphisms in Nr. and in its complement N̂r., respectively.
-
Class Nr.
Representative Nr. =., Nr. =
1
••••••••••••
0
Chord Classes Group of Symmetries
Conj. Class
~
19
28128
8
1131
3
3123
8
3125
8
3119
8
3131
13
3128
(-le- 2)
2
4114
{I}
1
4130
{I}
1
8136
(5)
4
4120
(7e 6 ) {I, 7, -leB , 5e B }
6
5129
8
4118
GL(ZI2)
End.
Nr·lNr:
One/Eleven Element
2
• 0 0 0 D 0 0 0 0 0 00
Z~2
3
• • 0 0 0 0 0 0 0 0 00
(-Ie-I)
3.1
• 0 0 0 0 • 0 0 0 0 00
4
• 0 • 0 0 0 0 0 0 0 00
Two/Ten Elements
5
• 0 0 • 0 0 0 00 0 00
{I, 7, -le- 2, 5e- 2} {1,5,
7e- 3 , 5e B ,
6
• 0 0 0 • 0 0 0 0 0 00
{I, 7,
7
• 0 0 0 0 0 • 0 0 0 00
Z~2 ~
-le- 3 }
-leB }
e6Z12
Three/Nine Elements
8
• • • 000000000
8.1
• 0 • 0 0 0 0 • 0 0 00
9
• • 0 • 0 0 0 0 0 0 00
9.1
• 0 0 0 0 0 0 0 0 0 00
10
• • 0 0 • 0 0 0 0 0 00
10.1
• 0 0 • 0 0 0 • 0 0 00
11
• • 0 0 0 • 0 0 0 0 00
12
• • 0 0 0 0 • 0 0 0 00
13
• 0 • 0 • 0 0 0 0 0 00
14
• 0 • 0 0 0 • 0 0 0 00
(7)
6
8131
15
• 0 0 • 0 0 • 0 0 0 00
{I, 5, -le6 , 7e 6 }
8
5132
16
• 0 0 0 • 0 0 0 • 0 00
Z~2 ~ e4Z12
15
4120
17
• • • • 00000000
(-le- 3 )
3
418
{I}
1
5119
{I}
1
5119
{I}
1
7123
Four/Ei9ht Elements
17.1
• 0 • 0 0 • 0 • 0 0 00
18
• • • 0.0000000
18.1
• 0 • 0 • 0 0 • 0 0 00
19
• • • 00.000000
19.1
• • 0 • 0 0 0 0 • 0 00
20
• • • 000.00000
Chord Classes-Continued
Class , Represent~e Nr. Nr. =., Nr. =
0
, conj .' ~ End. Class Nr.lNr.
, Group of Symmetries
20.1
• • 0 0 0 • 0 • 0 0 00
21
••• 0000.0000
{l, 7, -le- 2 , 5e- 2 }
9
719
(-le- 4 )
2
6120
22
• • 0 • • 0000000
22.1
• 0 • 0 0 • 0 0 0 • 00
23
• • 0 • 0 • 0 0 0 0 00
(5)
4
5113
24
• • 0 • 0 0 • 0 0 0 00
(7e 6 )
6
6117
25
• • 0 • 0 0 0 • 0 0 00
{l}
1
013
25.1
• • 0 • 0 • 0 0 0 0 00
26
• • 0 • 0 0 0 0 0 • 00
{I}
1
12131
26.1
• 0 • 0 0 • 0 0 • 0 00
27
• • 0 • 0 0 0 0 0 0
{l , 7e- 3 , 5e 2 , -le- l }
11
5113
28
• • 00 • • 000000
(-le 7 )
3
6114
28.1
• • 0 0 0 • 0 0 • 0 00
29
• • 0 0 • 0 0 • 0 0 00
(7)
6
10123
• • 0 0 • 0 0 0 • 0 00
(5e 4 )
4
111 23
31
• • 0 0 • 0 0 0 0 • 00
{I ,
10
9119
32
• • 000 • • 00000
{1,5,-le 6 ,7e6 }
8
7115
33
• • 0 0 0 0 • • 0000
{l, 7, -le- l , 5e- l ,
14
7114
9
6117
8
11119
13
9128
17
7121
(-le- 4 )
2
517
{I}
1
6110
{I}
1
81 12
{I}
1
8112
30
.0
-le- l
,
5e- 4
,
7e 3 }
e6 , 7e 6 , 5e 5 , -le5 } 34
• 0 • 0 • 0 • 0 0 0 00
35
• 0 • 0 • 0 0 0 • 0 00
36
• 0 • 0 0 0 • 0 • 0 00
{1,7,-le 6 ,5e6 } {I, 7, -le\ 5e 4 } {I, 7, -le- 2 , 5e- 2 , e6
37
• 0 0 • 0 0 • 0 0 • 00
,
7e 6
Z;2 ~
,
5e4
,
-le4 }
e3Z12
Five/Seven Elements
38
• • • • • 0000000
38.1
• 0 • 0 • 0 0 • 0 • 00
39
• • • • 0.000000
39.1
• • 0 • 0 • 0 0 0 0 .0
40
• • • • 00.00000
40.1
• • 0 • 0 0 • 0 • 0 00
41
• • • • 000.0000
Class Nr.
Chord Classes-Continued Group of Symmetries Nr. =., Nr. = 0
41.1
••• 00.0.0000
42
• • • 0 • • 000000
-
Conj.
~
Class
Nr·lN;
{I}
1
6116
{I}
1
8120
Representative
End.
42.1
• • 0 • 0 • 0 0 • 0 00
43
••• 0.0.00000
43.1
• • 0 • 0 • 0 • 0 0 00
44
••• 0.00.0000
(7)
6
719
45
••• 0.000.000
{I}
1
16122
45.1
• • 0 0 • 0 • 0 • 0 00
46
••• 0.0000.00
(5e- 4 )
4
5112
47
••• 0.00000.0
(-le- 2 )
2
8114
47.1
• 0 • 0 • 0 • 0 0 • 00
48
• • • 00 • • 00000
{I}
1
8118
48.1
• • 0 . 0 0 0 • • 000
49
••• 00.00.000
{I}
1
10118
49.1
• • 0 . 0 0 0 0 • • 00
50
••• 00.000.00
(-le- 2 )
2
9113
50.1
•• 0 •• 000.000
51
• • • 000 • • 0000
6
9111
8
7117
{I}
1
10120
{I}
1
14126
{I, 5, -le6 , 7e 6 }
8
818 16116
52
••• 000.0.000
53
•• 0 •• 0.00000
53.1
• • 0 • 0 0 • 0 0 0 .0
54
•• 0 •• 00.0000
54.1
• • 0 0 • 0 • 0 0 • 00
55
• • 0 . 0 • • 00000
(7) {I ,
-le- 2
"
7e 6
5e 4 }
56
• • 0 • 0 • 0 0 0 • 00
(5)
4
57
• • 0 . 0 0 • • 0000
(7e 6 )
6
12116
58
• • 0 • 0 0 • 0 0 • 00
(7e 6 )
6
18123
59
• • 0 • 0 0 0 • 0 • 00
(7)
6
13129
60
• • 00 • • 0 0 . 0 0 0
(5)
61 62
4
11119
• • 0 0 . 0 0 • • 000
{I, 7,
-le4,5e4 }
8
14114
• 0 • 0 • 0 • 0 • 0 00
{I, 7, -le4 , 7e4 }
8
11119
L.1. CHORD CLASSES
Class Nr.
1173
Chord Classes-Continued Representative , Group of Nr. =., Nr. = 0 Symmetries
-
, coni.' ~ End. Class Nr.lNr.
Six/Six Elements
63*
• • • • • • 000000
63.1*
• • 0 • 0 • 0 0 • 0 .0
64*
• • • • • 0.00000
64.1*
• • 0 • 0 • 0 • 0 0 .0
65
• • • • • 00.0000
65.1
••• 0.00.0.00
66
• • • • • 000.000
66.1
••• 00.0.0.00
67*
•••• 0.0.0000
(-le- 5 )
3
515
{I}
1
919
{I}
1
919
(-le- 4 )
2
1216
(5e- 2 )
5
616
68*
•••• 0.00.000
{I}
1
919
69
•••• 0.000.00
{I}
1
15111
69.1
•• 0 •• 0.0.000
70*
•••• 0.0000.0
{I, 5, -le- 3 , 7e- 3 }
10
616
71*
• • • • 00 • • 0000
{I}
1
11111
71.1 *
• • • 0 0 . 0 • • 000
72
•••• 00.0.000
6
8110
73
•••• 00.00.00
(7e 6 ) (-le- 3 )
3
1319
73.1
• • 0 . 0 0 . 0 • • 00
74*
• • • • 000 • • 000
(-le- 3 )
3
75*
• • • 0 •• 00.000
{I}
1
717 17117
75.1*
• • 00 • • 0 0 . 0 . 0
76*
••• 0 •• 000.00
(5e- 5 )
4
10110
77*
••• 0.0.0.000
(5e 4 )
4
14114
78*
••• 0.0.000.0
{I}
1
23123
78.1*
• • 0 • 0 • 0 • 0 • 00
79
• • • 0 . 0 • • 0000
(7)
6
18110
80
••• 0.00.00.0
{l, 7, 5e- 2 , -le- 2 }
9
151 11
81*
• • • 0 . 0 0 0 • • 00
(5e- 4 )
4
11111
82*
• • • 00 • • 0 0 . 0 0
{I}
1
17117
82.1*
• • 0 • • 00 • • 000
83*
• • • 000 • • • 000
{I, -le- 2 , 5e- 2 , 7}
13
12112
1174
APPENDIX L. CHORD AND THIRD CHAIN CLASSES Chord Classes-Continued Class Nr.
Representative Nr.=., Nr.=o
-
Group of Symmetries
Conj.
~
Class
Nr·lNr:
7
14114
8
151 23
12
20120
16
12112
18
12112
End.
Ke 6Z,2 84*
•• 0 •• 0.00.00
85
•• 0 •• 00.0.00
86*
•• 0.00 •• 0.00
(7)
87*
• • 0 • • 00 • • 000
{1,e 4 ,e8 ,5,5e4 ,5e8 , -Ie-I, -le3 , -le 7 , 7e-1,
88*
• 0 • 0 • 0 • 0 • 0
.0
(7e 3 ) {I, -le-4, 5e-4, 7} K e 6Z,2
7e 3 ,7e7 } Z;2 K eZl2
L.2
Third Chain Classes
The following list of third chain translation classes shows the class number in the first column, where equivalence ($\sim$) means that the same pc set is generated. The second column shows the pitch classes in the order of appearance along the third chain. The third column shows the third chain, the fourth column shows the chord class of the pc set, and the fifth column shows lead-sheet symbols as systematically derived in subsection 25.2.1.
Third Chains
Chain Nr. ∼ equiv. | Pitch Classes from 0 | Third Chain | Chord Class | Lead-Sheet Symbols
Two Pitch Classes
1 2
1
0,3 0,4
I:
1 1 :
trd Trd
Three Pitch Classes
3 4 5 6
0,3,6 0,3,7 0,4,7 0,4,8
33 34 43 44
7 8 9 10 11 12 13
0,3,6,9 0,3,6,10 0,3,7,10 0,3,7,11 0,4,7,10 0,4,7,11 0,4,8,11
14 15 16 17 18 19 20 21 22 23 24
0,3,6,9,1 0,3,6,10,1 0,3,6,10,2 0,3,7,10,1 0,3,7,10,2 0,3,7,11,2
3334 3343 3344 3433 3434 3443
0,4,7,10,1 0,4,7,10,2 0,4,7,11,2 . 0,4,7,11,3 '0,4,8,11,2
4333 4334 4343 4344 4433
15 10.1 10.1 16
CO,Cm5-
Cm C C+,C5+
Four Pitch Classes
333 37 334 26.1 343 22.1 344 30 433 26.1 434 28.1 443 30 Five Pitch Classes 58 53.1 56 53.1 42.1
C07C07 Cm7 Cm7+ C7
C7+ C+7+ C07-/9C09C09 Cm9Cm9
59 58 47.1 42.1
C7+/9, Cmaj7/9
60 56
C+7+/9
Cm7+/9, Cmmaj7/9 C9-
C9 C7+/9+
Chain Nr. equiv.
I Pitch Classes
rv
from 0
25
0,4,8,11,3
26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46
0,3,6,9,1,4 0,3,6,9,1,5 0,3,6,10,1,4 0,3,6,10,1,5 0,3,6,10,2,5 0,3,7,10,1,4 0,3,7,10,1,5 0,3,7,10,2,5 0,3,7,10,2,6 0,3,7,11,2,5 0,3,7,11,2,6 0,4,7,10,1,5 0,4,7,10,2,5 0,4,7,10,2,6 0,4,7,11,2,5 0,4,7,11,2,6 0,4,7,11,3,6 0,4,8,11,2,5 0,4,8,11,2,6 0,4,8,11,3,6 0,4,8,11,3,7
47 48 49 50 51 52 53 54 55 56 57
0,3,6,9,1,4,7 0,3,6,9,1,4,8 0,3,6,9,1,5,8 0,3,6,10,1,4,7 0,3,6,10,1,4,8 0,3,6,10,1,5,8 0,3,6,10,1,5,9 0,3,6,10,2,5,8 0,3,6,10,2,5,9 0,3,7,10,1,4,8 0,3,7,10,1,5,8
1 J
Third Chains-Continued
1
Chord Class
Lead-Sheet Symbols
4434 60 Six Pitch Classes 84* 33343 85~ 33344 33433 79 65.C 33434 69.1 33443 84* 34333 64.1* 34334 34343 63.1* 75.1* 34344 34433 69.1 82* 34434 73.1 43334 64.1* 43343 78.1* 43344 43433 65.C 43434 66.C 43443 82* 85~ 44333 78.1* 44334 44343 75.1* 87* 44344 Seven Pitch Classes 58~ 333433 54.C 333434 333443 54.C 58~ 334333 47.1 334334 38.1 334343 54.C 334344 47.C 334433 54.C 334434 54.C 343334 38.C 343343
C+7+/9+
Third Chain
C07-/9-/11C07-/9-/11 C09-/11C09-/11 COlI Cm9-/11Cm9-/11 Cm11 Cm11+ Cm7+/11 Cm7+/11+ C9-/11 C11 C11+ C7+/11 C7+/11+ C7+/9+/11+ C+7+/11 C+7+/11+ C+7+/9+/11+ C+7+/9+/(11)/13C07-/9-/11-/13C07-/9-/11-/13 C07-/9-/13 C09- /11-/13C09-/11-/13 C09-/13 C09-/13+ C013 C013+ Cm9-/11-/13 Cm9-/13
L.2. THIRD CHAIN CLASSES
1177 Third Chains-Continued
Chain Nr. '" equiv.
_I
Pitch Classes from 0
_I
Third Chain
1 Chord 1 Lead-Sheet Class Symbols
58
0,3,7,10,1,5,9
343344
47.C
59
0,3,7,10,2,5,8
343433
38.C
Cm9-/13+ Cm13
60
0,3,7,10,2,5,9
343434
38.C
Cm13+
61
0,3,7,10,2,6,9
343443
54.C
Cm11+/13+
62
0,3,7,11,2,5,8
344333
54.C
Cm7+/13
63
0,3,7,11,2,5,9
344334
47.C
Cm7+/13+
64
0,3,7,11,2,6,9
344343
54.C
65
0,3,7,11,2,6,10
344344
60~
Cm7+/11+/13+ Cm7+ /11 + / (13) /15-
66
0,4,7,10,1,5,8
433343
54.C
C9-/13
67
0,4,7,10,1,5,9
433344
54.C
68
0,4,7,10,2,5,8
433433
47.C
C9-/13+ C13
69
0,4,7,10,2,5,9
433434
38.1~
CI3+
70
0,4,7,10,2,6,9
433443
47.C
Cl1+/13+
71
0,4,7,11,2,5,8
434333
54.C
C7+/13
72
0,4,7,11,2,5,9
434334
38.C
C7+/13+
73
0,4,7,11,2,6,9
434343
38.C
74
0,4,7,11,2,6,10
434344
45.C
C7+/11+/13+ C7+ /11 + / (13) /15-
75
0,4,7,11,3,6,9
434433
54.C
76
0,4,7,11,3,6,10
434434
55~
77
0,4,8,11,2,5,9
443334
54.C
C+7+/13+
78
0,4,8,11,2,6,9
443343
47.C
79
0,4,8,11,2,6,10
443344
62~
C+7+/11+/13+ C+7+/11+/(13)/15-
80
0,4,8,11,3,6,9
443433
54.C
81
0,4,8,11,3,6,10
443434
45.C
82
0,4,8,11,3,7,10
443443
60~
83
0,3,6,9,1,4,7,10
3334333
84
0,3,6,9,1,4,7,11
3334334
85
0,3,6,9,1,4,8,11
86
C7+/9+/11+/13+ C7+ /9+ /11+ /(13)/15-
C+7+/9+/11+/13+ C+ 7+ /9+ /11+ /(13)/15C+ 7+ /9+/(11)/13-/15-
Eight Pitch Classes
3334343
3r 26.C 22.C
C07-/9-/11-/13- . .. C07-/9-/11-/13 . ..
0,3,6,9,1,5,8,11
3334433
26.C
C07-/9-/13 ...
87
0,3,6,10,1,4,7,11
3343334
29~
C09-/11-/13- . ..
88
0,3,6,10,1,4,8,11
3343343
18.C
C09-/11-/13 . ..
89
0,3,6,10,1,5,8,11
3343433
17.C
C09-/13 ...
90
0,3,6,10,2,5,8,11
3344333
26.1~
COI3 ...
91
0,3,6,10,2,5,9,1
3344344
3C
C013+ ...
C07-/9-/11-/13- ...
Chain Nr. '" equiv.
92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107", 84 108 109 110 111 112 113 114 115 116 117
'" 87
'" 85 '" 88 '" 92
I Pitch Class~s from 0
0,3,7,10,1,4,8,11 0,3,7,10,1,5,8,11 0,3,7,10,2,5,8,11 0,3,7,10,2,5,9,1 0,3,7,10,2,6,9,1 0,3,7,11 ,2,5,9, 1 0,3,7,11,2,6,9,1 0,3,7,11,2,6,10,1 0,4,7,10,1,5,8,11 0,4,7,10,2,5,8,11 0,4,7,10,2,5,9,1 0,4,7,10,2,6,9,1 0,4,7,11,2,5,9,1 0,4,7,11,2,6,9,1 0,4,7,11,2,6,10,1 0,4,7,11,3,6,9,1 0,4,7,11,3,6,10,1 0,4,7,11,3,6,10,2 0,4,8,11,2,5,9,1 0,4,8,11,2,6,9,1 0,4,8,11,2,6,10,1 . 0,4,8,11,3,6,9,1 0,4,8,11,3,6,10,1 0,4,8,11,3,6,10,2 0,4,8,11,3,7,10,1 0,4,8,11,3,7,10,2
Third Chains-Continued
I Third
Chain
3433343 3433433 3434333 3434344 3434434 3443344 3443434 3443443 4333433 4334333 4334344 4334434 4343344 4343434 4343443 4344334 4344343 4344344 4433344 - 4433434 4433443 4434334 4434343 4434344 4434433 4434434
I Chord I Lead-Sheet Class
Symbols
3C 18.C 22.C 18.C
Cm9-/11-/13 ... Cm9-/13 ... Cm13 ... Cm13+ ... Cm11+/13+ ... Cm7+/13+ ... Cm7+/11+/13+ ... Cm7+/11+/(13)/15- ... C9-/13 ... C15 C13+ ...
29~ 34~ 29~ 28~ 29~
18.C 22.C 26.C 18.C 17.C 25.C 26.C 29~ 30~
3C 18.C 3r 22.C 18.C 35~
3C 30~
C11+/13+ . .. C7+/13+ ... C7+/11+/13+ ... C7+/11+/(13)/15- ... C7+/9+/11+/13+ ... C7+/9+/11+/(13)/15- ... C7+/9+/11+/(13)/15- ... C+7+/13+ ... C+7+/11+/13+ ... C+7+/11+/(13)/15- ... C+7+/9+/11+/13+ ... C+7+/9+/11+/(13)/15- ... C+7+/9+/11+/(13)/15- ... C+7+/9+/(11)/13-/15- ... C+7+/9+/(11)/13-/15- ...
Nine Pitch Classes
118 119 120 121 122 123 124 125
0,3,6,9,1,4,7,10,2 0,3,6,9,1,4,7,11,2 0,3,6,9,1,4,8,11,2 0,3,6,9,1,5,8,11,2 0,3,6,10,1,4,7,11,2 0,3,6,10,1,4,8,11,2 0,3,6,10,1,5,8,11,2 0,3,6,10,2,5,9,1,4
33343334 33343343 33343433 33344333 33433343 33433433 33434333 33443443
15~
9.C 9.C 15~ 1O~
13~
9.C 1O~
C07-/9-/11-/13- ... C07-/9-/11-/13- ... C07-/9-/11-/13 ... C07-/9-/13 ... C09-/11-/13- . .. C09-/11-/13 . .. C09-/13 ... C013+ ...
Chain NT. equiv.
I Pitch Classes
rv
from 0
126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151
0,3,7,10,1,4,8,11,2 0,3,7,10,1,5,8,11,2 0,3,7,10,2,5,9,1,4 0,3,7,10,2,6,9,1,4 0,3,7,11,2,5,9,1,4 0,3,7,11,2,6,9,1,4 0,3,7,11,2,6,9,1,5 0,3,7,11,2,6,10,1,4 0,3,7,11,2,6,10,1,5 0,4,7,10,1,5,8,11,2 0,4,7,10,1,5,8,11,3 0,4,7,10,2,5,8,11,3 0,4,7,10,2,6,9,1,5 0,4,7,11,2,6,9,1,5 0,4,7,11,2,q,1O,1,5 0,4,7,11,3,6,9,1,5 0,4,7,11,3,6,10,1,5 0,4,7,11,3,6,10,2,5 0,4,8,11,2,6,9,1,5 0,4,8,11,2,6,10,1,5 0,4,8,11,3,6,9,1,5 0,4,8,11,3,6,10,1,5 0,4,8,11,3,6,10,2,5 0,4,8,11,3,7,10,1,5 0,4,8,11,3,7,10,2,5 0,4,8,11,3,7,10,2,6
rv
119
rv
122
rv rv
136 137
Third Chains-Continued
I Third
I Chord I Lead-Sheet
Chain
Class
Symbols
34333433 34334333 34343443 34344343 34433443 34434343 34434344 34434433 34434434 43334333 43334334 43343334 43344344 43434344 43434434 43443344 43443434 43443443 44334344 44334434 44343344 44343434 44343443 44344334 44344343 44344344
1O~
Cm9-/11-/13 ... Cm9-/13 ... Cm13+ ... Cm11+/13+ ... Cm7+/13+ ... Cm7+/11+/13+ ... Cm7+/11+/13+ ... Cm7+/11+/(13)/15- ... Cm7+/11+/(13)/15- ... C9-/13 ... C9-/13 ... C17 C11+/13+ ... C7+/11+/13+ ... C7+/11+/(13)/15- ... C7+/9+/11+/13+ ... C7+/9+/11+/(13)/15- ... C7+/9+/11+/(13)/15- ... C+7+/11+/13+ ... C+7+/11+/(13)/15- ... C+7+/9+/11+/13+ ... C+7+/9+/11+/(13)/15- ... C+7+/9+/11+/(13)/15- ... C+7+/9+/(11)/13-/15- ... C+7+/9+/(11)/13-/15- ... C+7+/9+/(11)/13-/15- ...
9.C 9.C 15~
13~
9.C 1r 1O~
1C 15~
1O.C 1O.C 10.C 8.C 12~
1r 12~
1C 1O.1~ 14~
10.1~ 8.1~
1r 1O.C 10.C 16~
Ten Pitch Classes
152 153 154 155 156 157 158 159
rv
152
0,3,6,9,1,4,7,10,2,5 0,3,6,9,1,4,7,11,2,5 0,3,6,9,1,4,8,11,2,5 0,3,6,10,1,4,7,11,2,5 0,3,6,10,1,4,8,11,2,5 0,3,6,10,2,5,9,1,4,7 0,3,6,10,2,5,9,1,4,8 0,3,7,10,1,4,8,11,2,5
333433343 333433433 333434333 334333433 334334333 334434433 334434434 343334333
5~ 4~
5~ 3~ 4~
5~ 6~ 5~
C07-/9-/11-/13- ... C07-/9-/11-/13- ... C07-/9-/11-/13 ... C09-/11-/13- . .. C09-/11-/13 . .. C013+ ... C013+ ... Cm9-/11-/13 ...
Third Chains-Continued
Chain Nr. | ~ equiv. | Pitch Classes from 0 | Third Chain | Chord Class | Lead-Sheet Symbols
Cm9-/11-/13 ... Cm9-/13 ... Cm13+ ... Cm11+/13+ ... Cm7+/13+ ... Cm7+/11+/13+ ... Cm7+/11+/13+ ... Cm7+/11+/(13)/15- ... Cm7+/11+/(13)/15- ... Cm7+/11+/(13)/15- ... C9-/13 ... C9-/13 ... C19
160
0,3,7,10,1,4,8,11,2,6
343334334
6~
161
0,3,7,10,1,5,8,11,2,6
343343334
162
0,3,7,10,2,5,9,1,4,8
343434434
3.C 3.C
163
0,3,7,10,2,6,9,1,4,8
343443434
r
164
0,3,7,11,2,5,9,1,4,8
344334434
6~
165
0,3,7,11,2,6,9,1,4,8
344343434
3.C
166
0,3,7,11,2,6,9,1,5,8
344343443
r
167 '" 160
0,3,7,11,2,6,10,1,4,8
344344334
6~
168 '" 161 169
0,3,7,11,2,6,10,1,5,8
344344343
3.C
0,3,7,11,2,6,10,1,5,9
344344344
6~
170
0,4,7,10,1,5,8,11,2,6
433343334
r
171
0,4,7,10,1,5,8,11,3,6
433343343
3.C
172
0,4,7,10,2,5,8,11,3,6
433433343
6~
173
0,4,7,10,2,6,9,1,5,8
433443443
6~
174
0,4,7,11,2,6,9,1,5,8
434343443
3.C
175 '" 170 176 177", 171
0,4,7,11,2,6,10,1,5,8
434344343
0,4,7,11,2,6,10,1,5,9
434344344
0,4,7,11,3,6,9,1,5,8
434433443
0,4,7,11,3,6,10,1,5,8 0,4,7,11,3,6,10,1,5,9
434434343 434434344
r
0,4,7,11,3,6,10,2,5,8
178 179 180", 172
r
3.C 3.C 6~
434434433
6~
181
0,4,7,11,3,6,10,2,5,9
434434434
3.C
182
0,4,8,11,2,6,10,1,5,9
443344344
6~
3.C
183
0,4,8,11,3,6,10,1,5,9
443434344
184
0,4,8,11,3,6,10,2,5,9
443434434
r
185
0,4,8,11,3,7,10,1,5,9
443443344
6~
186
0,4,8,11,3,7,10,2,5,9
443443434
3.C
187
0,4,8,11 ,3,7,10,2,6,9
443443443
6~
C11+/13+ ... C7+/11+/13+ ... C7+/11+/(13)/15- ... C7+/11+/(13)/15- ... C7+/9+/11+/13+ ... C7+/9+/11+/(13)/15- ... C7+/9+/11+/(13)/15- ... C7+/9+/11+/(13)/15- ... C7+/9+/11+/(13)/15- ... C+7+/11+/(13)/15- ... C+7+/9+/11+/(13)/15- ... C+7+/9+/(11)/13-/15- ... C+7+/9+/(11)/13-/15- ... C+7+/9+/(11)/13-/15- ... C+7+/9+/(11)/13-/15- ...
Eleven Pitch Classes
Chain Nr. | ~ equiv. | Pitch Classes from 0 | Third Chain | Chord Class | Lead-Sheet Symbols
188 | | 0,3,6,9,1,4,7,10,2,5,8 | 3334333433 | 2~ | C07-/9-/11-/13- ...
189 | | 0,3,6,9,1,4,7,11,2,5,8 | 3334334333 | 2~ | C07-/9-/11-/13- ...
190 | | 0,3,6,10,1,4,7,11,2,5,8 | 3343334333 | 2~ | C09-/11-/13- ...
191 | | 0,3,6,10,1,4,7,11,2,5,9 | 3343334334 | 2~ | C09-/11-/13- ...
192 | | 0,3,6,10,1,4,8,11,2,5,9 | 3343343334 | 2~ | C09-/11-/13 ...
193 | ~ 191 | 0,3,6,10,2,5,9,1,4,7,11 | 3344344334 | 2~ | C013+ ...
194 | ~ 192 | 0,3,6,10,2,5,9,1,4,8,11 | 3344344343 | 2~ | C013+ ...
195 | | 0,3,7,10,1,4,8,11,2,5,9 | 3433343334 | 2~ | Cm9-/11-/13 ...
196 | | 0,3,7,10,1,4,8,11,2,6,9 | 3433343343 | 2~ | Cm9-/11-/13 ...
197 | | 0,3,7,10,1,5,8,11,2,6,9 | 3433433343 | 2~ | Cm9-/13 ...
198 | ~ 195 | 0,3,7,10,2,5,9,1,4,8,11 | 3434344343 | 2~ | Cm13+ ...
199 | | 0,4,7,10,1,5,8,11,2,6,9 | 4333433343 | 2~ | C9-/13 ...
200 | | 0,4,7,10,1,5,8,11,3,6,9 | 4333433433 | 2~ | C9-/13 ...
201 | | 0,4,7,10,2,5,8,11,3,6,9 | 4334333433 | 2~ | C21
202 | ~ 199 | 0,4,7,10,2,6,9,1,5,8,11 | 4334434433 | 2~ | C11+/13+ ...
203 | ~ 191 | 0,4,7,11,3,6,10,2,5,9,1 | 4344344344 | 2~ | C7+/9+/11+/(13)/15- ...
204 | ~ 192 | 0,4,8,11,3,6,10,2,5,9,1 | 4434344344 | 2~ | C+7+/9+/(11)/13-/15- ...
205 | ~ 195 | 0,4,8,11,3,7,10,2,5,9,1 | 4434434344 | 2~ | C+7+/9+/(11)/13-/15- ...
206 | ~ 196 | 0,4,8,11,3,7,10,2,6,9,1 | 4434434434 | 2~ | C+7+/9+/(11)/13-/15- ...
Twelve Pitch Classes
207 | | 0,3,6,9,1,4,7,10,2,5,8,11 | 33343334333 | C | C07-/9-/11-/13- ...
208 | ~ 207 | 0,4,7,10,2,5,8,11,3,6,9,1 | 43343334334 | C | C23
209 | ~ 207 | 0,4,7,10,2,6,9,1,5,8,11,3 | 43344344334 | C | C11+/13+ ...
210 | ~ 207 | 0,4,8,11,3,7,10,2,6,9,1,5 | 44344344344 | C | C+7+/9+/(11)/13-/15- ...
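The "Pitch Classes from 0" and "Third Chain" columns of the third chain tables carry the same information: reading the chain digit by digit and adding each step (3 for a minor third, 4 for a major third) modulo 12 reproduces the pitch classes. The following Python sketch illustrates this reading; the function name `chain_to_pitch_classes` is hypothetical and not part of the appendix.

```python
def chain_to_pitch_classes(chain: str, start: int = 0, modulus: int = 12) -> list[int]:
    """Accumulate the steps of a third chain into pitch classes modulo `modulus`."""
    pitch_classes = [start % modulus]
    for step in chain:
        if step not in "34":
            raise ValueError(f"a third chain uses only the digits 3 and 4, got {step!r}")
        # each digit is the size of the next third in semitones
        pitch_classes.append((pitch_classes[-1] + int(step)) % modulus)
    return pitch_classes


# Chain Nr. 188 from the table of eleven pitch classes
print(chain_to_pitch_classes("3334333433"))
# [0, 3, 6, 9, 1, 4, 7, 10, 2, 5, 8]
```

A chain with k digits always yields k + 1 pitch classes, which is why the chains with eleven pitch classes have ten digits.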
Appendix M
Two, Three, and Four Tone Motif Classes
M.1 Two Tone Motifs in OnPiMod_{12,12}
Class Nr. | Representative
1 | (0,0),(0,1)
2 | (0,0),(0,2)
3 | (0,0),(0,3)
4 | (0,0),(0,4)
5 | (0,0),(0,6)
M.2 Two Tone Motifs in OnPiMod_{5,12}
Class Nr. | Representative
1 | (0,0),(0,1)
2 | (0,0),(0,2)
3 | (0,0),(0,3)
4 | (0,0),(0,4)
5 | (0,0),(0,6)
6 | (0,0),(1,0)
7 | (0,0),(1,1)
8 | (0,0),(1,2)
9 | (0,0),(1,3)
10 | (0,0),(1,4)
11 | (0,0),(1,6)
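The class counts for two tone motifs can be checked by brute force. The sketch below assumes that two motifs are in the same class when an affine map (a translation followed by a group automorphism) of Z_m x Z_n carries one onto the other; this assumption is ours, since the appendix defers the definition to subsection 11.3.8. Under it, a two tone motif is determined by its non-zero difference vector up to automorphisms. The name `two_tone_classes` is hypothetical.

```python
from itertools import product


def two_tone_classes(m: int, n: int) -> list[tuple[int, int]]:
    """Orbit representatives of non-zero differences in Z_m x Z_n under all automorphisms."""
    points = list(product(range(m), range(n)))

    def killed_by(img, order):
        # the image of a generator of order `order` must vanish when multiplied by `order`
        return (order * img[0]) % m == 0 and (order * img[1]) % n == 0

    automorphisms = []
    for e1, e2 in product(points, repeat=2):
        if not (killed_by(e1, m) and killed_by(e2, n)):
            continue  # (x, y) -> x*e1 + y*e2 would not be well defined
        image = {((x * e1[0] + y * e2[0]) % m, (x * e1[1] + y * e2[1]) % n)
                 for x, y in points}
        if len(image) == len(points):  # bijective, hence an automorphism
            automorphisms.append((e1, e2))

    seen, representatives = set(), []
    for v in points:
        if v == (0, 0) or v in seen:
            continue
        orbit = {((v[0] * e1[0] + v[1] * e2[0]) % m, (v[0] * e1[1] + v[1] * e2[1]) % n)
                 for e1, e2 in automorphisms}
        seen |= orbit
        representatives.append(min(orbit))
    return representatives


print(two_tone_classes(12, 12))      # [(0, 1), (0, 2), (0, 3), (0, 4), (0, 6)]
print(len(two_tone_classes(5, 12)))  # 11
```

Each returned vector d stands for the motif (0,0), d. The enumeration is exhaustive rather than clever; it is meant as a cross-check of the tables, not as the algorithm used to produce them.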
M.3 Three Tone Motifs in OnPiMod_{12,12}
Refer to the discussion in subsection 11.3.8 for the entries of this table. The order of these representatives is a historical one. After this table, the representatives are also visualized on a 12 x 12 square in list M.1.
Three-Element Motif Classes in OnPiMod_{12,12}
Class Nr. | Representative | Kernel | Class Weight
1 2 3 4 5 6 7 8 9 10 11 12 13 14
(0,0),(1,0),(2,0) (0,0),(1,0),(3,0) (0,0),(1,0),(4,0) (0,0),(1,0),(5,0) (0,0),(1,0),(6,0) (0,0),(2,0),(4,0) (0,0),(2,0),(6,0) (0,0),(3,0),(6,0) (0,0),(4,0),(8,0) (0,0),(1,0),(0,1) (0,0),(2,0),(0,1)
Z.(1,2) Z.(1,2) Z.(I,O) Z.(1,2) Z.(1,2)
x x x x x
Z.(I, 1) Z.(O, 1) Z.(O, 1) Z.(I, 1) Z.(I, 0)
(Z4 x 2Z4) x Z.(I, 1) (Z4 x 2Z4) x Z.(O, 1) Z.(1,2) x Z~ (Z~) x Z.(I, 1) OxO Z.(2,0) x 0 o x Z.(I, 0)
15
(0,0), (3,0), (0, 1) (0,1),(0,2),(3,0) (0,0), (0, 1), (4,0) (0,0),(1,2),(2,0)
16 17 18 19 20 21 22
(0,0), (2,0), (0,2) (0,0),(6,0),(0,1) (0,0),(3,0),(0,2) (0,0),(0,2),(3,1) (0,0), (4,0), (0, 2) (0,0),(4,0),(0,4) (0,0),(6,0),(0,2)
23 24
(0,2),(0,4),(6,0)
2Z~ x Z.(I, 1)
(0,0),(4,0),(0,4)
Z~ x 0
25
(0,0),(6,0),(0,3) (0,0),(6,0),(0,6)
Z.(2,0) x Z~
26
Ox Z.(I, 1) Z.(I,O) x 0 Z.(1,2) x 0 2Z~ x 0
Z.(2,0) x Z.(I, 0) Z.(2,0) x Z.(O, 1) Z.(2,0) x Z.(I, 1) (Z4 x 2Z4) x 0
o x Z~ 2Z~
x Z.(I, 0)
2Z~ x Z~
(1,1,2) (1,2,3) (1,3,4) (1,1,4) (1,1,6) (2,2,4) (2,4,6) (3,3,6) (4,4,4)
I Volume 0 0 0 0 0 0 0 0 0 1 2
(1,1,1) (1,1,2) (1,1,3) (1,1,1) (1,1,4)
3 3 4
(1,1,2)
4
(2,2,2) (1,1,6) (1,2,3) (1,1,2) (2,2,4) (3,3,3) (2,2,6)
4 6 6 6 4 3 0
(2,2,2) (4,4,4)
0 4
(3,3,6) (6,6,6)
6 0
Figure M.1: Representatives of the 26 isomorphism classes of three-element motives in OnPiMod_{12,12}.
Figure M.2: Diagram of dominance/specialization among the 26 isomorphism classes of motives in OnPiMod_{12,12}.
M.4 Four Tone Motifs in OnPiMod_{12,12}
This list was calculated by Straub in [513]; refer to subsection 11.3.8 for details. The list's numbering follows Straub's algorithm; an asterisk (*) denotes classes that are not determined by volume and class weight.
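The meaning of the asterisk can be illustrated on a small excerpt of the list: a class is starred when at least one other class shares both its class weight and its volume. The Python sketch below uses classes 29 to 38 as transcribed from the table; comparing class weights as sorted tuples is our assumption, and the name `undetermined_classes` is hypothetical.

```python
from collections import defaultdict

# Excerpt of the table: class number -> (class weight, volume)
EXCERPT = {
    29: ((1, 10, 10, 11), 1),
    30: ((4, 10, 10, 14), 1),
    31: ((5, 10, 10, 17), 1),
    32: ((2, 10, 11, 12), 1),
    33: ((3, 10, 12, 14), 1),
    34: ((10, 10, 10, 10), 1),
    35: ((10, 10, 10, 10), 1),
    36: ((10, 10, 10, 10), 1),
    37: ((10, 10, 10, 13), 1),
    38: ((10, 10, 10, 13), 1),
}


def undetermined_classes(table):
    """Return the class numbers whose (class weight, volume) pair is shared with another class."""
    groups = defaultdict(list)
    for number, (weight, volume) in table.items():
        # assumption: the order of the entries in a class weight is irrelevant
        groups[(tuple(sorted(weight)), volume)].append(number)
    return sorted(n for members in groups.values() if len(members) > 1 for n in members)


print(undetermined_classes(EXCERPT))  # [34, 35, 36, 37, 38]
```

On this excerpt the result agrees with the starred entries 34* to 38*. Applied to a partial excerpt in general, the check can miss a class whose partner lies outside the excerpt.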
Four-Element Motif Classes
Class Nr. | Representative | Class Weight | Volume
0 | (0,0),(0,1),(0,2),(0,7) | (1,1,5,5) | 0
1 | (0,0),(0,1),(0,2),(0,3) | (1,1,2,2) | 0
2 | (0,0),(0,1),(0,2),(0,6) | (1,4,5,7) | 0
3 | (0,0),(0,1),(0,2),(0,5) | (1,4,2,3) | 0
4 | (0,0),(0,1),(0,2),(0,4) | (1,2,3,6) | 0
5 | (0,0),(0,1),(0,5),(0,6) | (4,4,5,5) | 0
6 | (0,0),(0,1),(0,4),(0,5) | (4,4,3,3) | 0
7 | (0,0),(0,1),(0,3),(0,5) | (4,2,2,6) | 0
8 | (0,0),(0,1),(0,4),(0,8) | (4,3,3,9) | 0
9 | (0,0),(0,1),(0,6),(0,7) | (5,5,5,5) | 0
10 | (0,0),(0,1),(0,3),(0,6) | (5,2,2,8) | 0
11 | (0,0),(0,1),(0,3),(0,7) | (5,2,3,7) | 0
12 | (0,0),(0,1),(0,4),(0,7) | (5,3,3,8) | 0
13 | (0,0),(0,1),(0,3),(0,10) | (2,2,2,2) | 0
14 | (0,0),(0,1),(0,3),(0,4) | (2,2,3,3) | 0
15 | (0,0),(0,1),(0,3),(0,9) | (2,3,7,8) | 0
16 | (0,0),(0,1),(0,4),(0,9) | (3,3,3,3) | 0
17 | (0,0),(0,2),(6,0),(6,10) | (23,23,22,22) | 0
18 | (0,0),(0,2),(0,4),(6,0) | (23,6,22,7) | 0
19 | (0,0),(0,2),(0,4),(6,2) | (6,6,22,22) | 0
20 | (0,0),(0,2),(0,4),(0,6) | (6,6,7,7) | 0
21 | (0,0),(0,2),(0,4),(0,8) | (6,7,7,9) | 0
22 | (0,0),(0,2),(6,0),(6,2) | (22,22,22,22) | 0
23 | (0,0),(0,2),(6,0),(6,6) | (22,22,22,26) | 0
24 | (0,0),(0,2),(0,6),(6,2) | (22,22,7,7) | 0
25 | (0,0),(0,2),(0,6),(6,0) | (22,7,7,26) | 0
26 | (0,0),(0,2),(0,6),(0,8) | (7,7,7,7) | 0
27 | (0,0),(0,3),(0,6),(0,9) | (8,8,8,8) | 0
28 | (0,0),(0,6),(6,0),(6,6) | (26,26,26,26) | 0
Four-Element Motif Classes-Continued
Class Nr. | Representative | Class Weight | Volume
29 | (0,0),(0,1),(0,2),(1,0) | (1,10,10,11) | 1
30 | (0,0),(0,1),(0,5),(1,0) | (4,10,10,14) | 1
31 | (0,0),(0,1),(0,6),(1,0) | (5,10,10,17) | 1
32 | (0,0),(0,1),(0,3),(1,0) | (2,10,11,12) | 1
33 | (0,0),(0,1),(0,4),(1,0) | (3,10,12,14) | 1
34* | (0,0),(0,1),(1,0),(1,5) | (10,10,10,10) | 1
35* | (0,0),(0,1),(1,0),(7,7) | (10,10,10,10) | 1
36* | (0,0),(0,1),(1,0),(1,1) | (10,10,10,10) | 1
37* | (0,0),(0,1),(1,0),(3,5) | (10,10,10,13) | 1
38* | (0,0),(0,1),(1,0),(3,11) | (10,10,10,13) | 1
39* | (0,0),(0,1),(1,0),(1,2) | (10,10,11,11) | 1
40* | (0,0),(0,1),(1,0),(5,10) | (10,10,11,11) | 1
41* | (0,0),(0,1),(1,0),(4,10) | (10,10,11,15) | 1
42* | (0,0),(0,1),(1,0),(2,4) | (10,10,11,15) | 1
43 | (0,0),(0,1),(1,0),(2,5) | (10,10,11,19) | 1
44* | (0,0),(0,1),(1,0),(1,3) | (10,10,12,12) | 1
45* | (0,0),(0,1),(1,0),(7,9) | (10,10,12,12) | 1
46* | (0,0),(0,1),(1,0),(3,3) | (10,10,12,12) | 1
47 | (0,0),(0,1),(1,0),(6,8) | (10,10,15,19) | 1
48* | (0,0),(0,1),(1,0),(1,4) | (10,10,14,14) | 1
49* | (0,0),(0,1),(1,0),(4,4) | (10,10,14,14) | 1
50 | (0,0),(0,1),(1,0),(1,6) | (10,10,17,17) | 1
51 | (0,0),(0,1),(1,0),(2,2) | (10,11,11,13) | 1
52 | (0,0),(0,1),(1,0),(2,3) | (10,11,13,15) | 1
53 | (0,0),(0,1),(1,0),(6,9) | (10,11,12,18) | 1
54 | (0,0),(0,1),(1,0),(8,8) | (10,13,14,14) | 1
55 | (0,0),(0,1),(1,0),(3,4) | (10,12,15,18) | 1
56 | (0,0),(0,1),(2,0),(3,1) | (11,11,12,12) | 1
57 | (0,0),(0,1),(2,0),(3,4) | (11,12,12,15) | 1
58 | (0,0),(0,1),(3,0),(4,1) | (12,12,14,14) | 1
59 | (0,0),(0,1),(0,2),(2,1) | (1,11,11,15) | 2
60 | (0,0),(0,1),(0,2),(2,0) | (1,11,11,16) | 2
61 | (0,0),(0,1),(0,5),(2,0) | (4,11,11,14) | 2
62 | (0,0),(0,1),(0,5),(2,1) | (4,11,11,20) | 2
Four-Element Motif Classes-Continued
Class Nr. | Representative | Class Weight | Volume
63 | (0,0),(0,1),(0,6),(2,1) | (5,5,11,11) | 2
64 | (0,0),(0,1),(0,6),(2,0) | (5,22,11,11) | 2
65 | (0,0),(0,1),(0,3),(2,0) | (2,11,15,18) | 2
66 | (0,0),(0,1),(0,3),(2,1) | (2,11,16,18) | 2
67 | (0,0),(0,1),(0,4),(2,1) | (3,11,14,18) | 2
68 | (0,0),(0,1),(0,4),(2,0) | (3,11,20,18) | 2
69 | (0,0),(0,1),(2,0),(4,6) | (23,11,11,15) | 2
70 | (0,0),(0,1),(2,0),(4,0) | (6,11,11,14) | 2
71 | (0,0),(0,1),(2,0),(6,6) | (22,11,15,17) | 2
72 | (0,0),(0,1),(2,0),(6,0) | (7,11,14,17) | 2
73* | (0,0),(0,1),(2,0),(2,1) | (11,11,11,11) | 2
74* | (0,0),(0,1),(2,0),(2,5) | (11,11,11,11) | 2
75* | (0,0),(0,1),(2,0),(2,7) | (11,11,11,11) | 2
76* | (0,0),(0,1),(2,0),(2,11) | (11,11,11,11) | 2
77* | (0,0),(0,1),(2,0),(6,5) | (11,11,11,19) | 2
78* | (0,0),(0,1),(2,0),(6,11) | (11,11,11,19) | 2
79* | (0,0),(0,1),(2,0),(4,7) | (11,11,15,15) | 2
80* | (0,0),(0,1),(2,0),(8,11) | (11,11,15,15) | 2
81* | (0,0),(0,1),(2,0),(2,2) | (11,11,15,16) | 2
82* | (0,0),(0,1),(2,0),(8,10) | (11,11,15,16) | 2
83* | (0,0),(0,1),(2,0),(4,1) | (11,11,14,14) | 2
84* | (0,0),(0,1),(2,0),(8,5) | (11,11,14,14) | 2
85* | (0,0),(0,1),(2,0),(8,4) | (11,11,14,20) | 2
86* | (0,0),(0,1),(2,0),(2,4) | (11,11,14,20) | 2
87* | (0,0),(0,1),(2,0),(6,7) | (11,11,17,17) | 2
88* | (0,0),(0,1),(2,0),(6,1) | (11,11,17,17) | 2
89* | (0,0),(0,1),(2,0),(2,9) | (11,11,18,18) | 2
90* | (0,0),(0,1),(2,0),(2,3) | (11,11,18,18) | 2
91 | (0,0),(0,1),(2,0),(4,3) | (11,15,15,19) | 2
92 | (0,0),(0,1),(2,0),(4,2) | (11,15,16,19) | 2
93 | (0,0),(0,1),(2,0),(4,9) | (11,14,14,19) | 2
94 | (0,0),(0,1),(2,0),(4,8) | (11,14,20,19) | 2
95 | (0,0),(0,1),(4,2),(6,1) | (15,15,17,17) | 2
96 | (0,0),(0,1),(4,2),(6,4) | (15,16,18,18) | 2
Four-Element Motif Classes-Continued
Class Nr. | Representative | Class Weight | Volume
97
(0,0) ,(0, 1 ),( 4,0),( 6,1)
(14,14,17,17)
2
98
(0,0), (0,1),( 4,0), (6,4)
(14,20,18,18)
2
99
(0,0), (0,1), (0,2), (3,0)
(1,13,12,18)
3
100
(0,0), (0,1), (0,2), (3,1)
(1,12,12,19)
3
101
(0,0), (0,1), (0,5), (3,0)
(4,4,12,12)
3
102
(0,0), (0, 1), (0,4), (3,2)
(4,3,13,12)
3
103
(0,0) ,(0, 1) ,(0,6), (3,2)
(5,13,13,17)
3
104
(0,0) ,(0, 1) ,(0,6), (3, 1)
(5,12,12,17)
3
105
(0,0) ,(0, 1) ,(0,6) ,(3,0)
(5,12,12,25)
3
106
(0,0), (0,1) ,(0,3), (3,2)
(2,13,12,19)
3
107
(0,0), (0,1), (0,3), (3, 1)
(2,12,12,18)
3
108
(0,0), (0,1), (0,3), (3,0)
(2,12,21,18)
3
109
(0,0), (0,1), (0,4), (3,0)
(3,3,12,12)
3
110
(0,0), (0, 1), (0,4), (3,1)
(3,3,12,21)
3
111
(0,0), (0, 1) ,(3,0), (6,0)
(8,12,12,17)
3
112
(0,0), (0,3) ,(0,6), (3,0)
(8,21,21,25)
3
113*
(0,0) ,(0, 1) ,(3,0), (3,5)
(13,13,12,12)
3
114*
(0,0) ,(0, 1) ,(3,0) ,(3, 11)
(13,13,12,12)
3
115*
(0,0), (0,1), (3,0) ,(9, 11)
(13,13,12,12)
3
116
(0,0), (0, 1), (3,2) ,(3,8)
(13,13,17,17)
3
117
(0,0), (0, 1), (3,0) ,(3,2)
(13,12,19,18)
3
118* 119*
(0,0), (0,1), (3,0), (9,7) (0,0), (0, 1), (3,0), (3,7)
(12,12,12,12) (12,12,12,12)
3 3
120*
(0,0), (0,1), (3,0), (3,1)
(12,12,12,12)
3
121*
(0,0) ,(0, 1), (3,0), (9,3)
(12,12,12,21 )
3
122*
(0,0) ,(0, 1) ,(3,0) ,(3,3)
(12,12,12,21 )
3
123
(0,0), (0, 1), (3,0),( 6,5)
(12,12,19,19)
3
124
(0,0), (0,1) ,(3,0),( 6,1)
(12,12,17,17)
3
125
(0,0), (0,1), (3,0) ,(3,6)
(12,12,17,25)
3
126
(0,0), (0,1), (3,0), (3,10)
(12,12,18,18)
3
127
(0,0), (0,1), (3,0), (6,9)
(12,21,18,18)
3
128*
(0,0), (0,3), (3,0), (3,3)
(21,21,21,21)
3
129*
(0,0), (0,3), (3,0), (9,9)
(21,21,21,21)
3
130
(0,0) ,(0,3) ,(3,0) ,(3,6)
(21,21,25,25)
3
Four-Element Motif Classes-Continued
Class Nr. | Representative | Class Weight | Volume
131 | (0,0),(0,1),(0,2),(4,3) | (1,15,15,15) | 4
132 | (0,0),(0,1),(0,2),(4,1) | (1,15,14,14) | 4
133 | (0,0),(0,1),(0,2),(4,0) | (1,15,14,20) | 4
134 | (0,0),(0,1),(0,5),(4,2) | (4,15,15,14) | 4
135 | (0,0),(0,1),(0,5),(4,3) | (4,15,15,20) | 4
136 | (0,0),(0,1),(0,5),(4,0) | (4,14,14,14) | 4
137 | (0,0),(0,1),(0,5),(4,1) | (4,14,14,24) | 4
138 | (0,0),(0,1),(0,6),(4,3) | (5,5,15,15) | 4
139 | (0,0),(0,1),(0,6),(4,1) | (5,5,14,14) | 4
140 | (0,0),(0,1),(0,6),(4,0) | (5,7,15,14) | 4
141 | (0,0),(0,1),(0,3),(4,2) | (2,2,15,15) | 4
142 | (0,0),(0,1),(0,3),(4,1) | (2,2,14,20) | 4
143 | (0,0),(0,1),(0,3),(4,0) | (2,3,15,14) | 4
144 | (0,0),(0,1),(0,3),(4,3) | (2,3,15,20) | 4
145 | (0,0),(0,1),(0,4),(4,1) | (3,3,14,14) | 4
146 | (0,0),(0,1),(0,4),(4,0) | (3,3,14,24) | 4
147 | (0,0),(0,2),(2,0),(6,10) | (23,16,16,16) | 4
148 | (0,0),(0,2),(2,0),(4,4) | (23,16,20,20) | 4
149 | (0,0),(0,1),(4,0),(8,6) | (6,15,15,14) | 4
150 | (0,0),(0,2),(0,4),(2,0) | (6,16,16,20) | 4
151 | (0,0),(0,2),(0,4),(4,2) | (6,20,20,20) | 4
152 | (0,0),(0,2),(0,4),(4,0) | (6,20,20,24) | 4
153 | (0,0),(0,2),(2,0),(2,6) | (22,22,16,16) | 4
154 | (0,0),(0,2),(4,0),(6,2) | (22,22,20,20) | 4
155 | (0,0),(0,2),(0,6),(2,0) | (22,7,16,20) | 4
156 | (0,0),(0,2),(0,6),(4,0) | (7,7,20,20) | 4
157 | (0,0),(0,2),(0,6),(4,2) | (7,7,20,24) | 4
158 | (0,0),(0,1),(4,0),(8,0) | (9,14,14,14) | 4
159 | (0,0),(0,2),(4,0),(8,0) | (9,20,20,20) | 4
160 | (0,0),(0,4),(0,8),(4,0) | (9,24,24,24) | 4
161* | (0,0),(0,1),(4,2),(4,7) | (15,15,15,15) | 4
162* | (0,0),(0,1),(4,2),(4,3) | (15,15,15,15) | 4
163* | (0,0),(0,1),(4,0),(4,7) | (15,15,14,14) | 4
164* | (0,0),(0,1),(4,0),(4,11) | (15,15,14,14) | 4
Four-Element Motif Classes-Continued
Class Nr. | Representative | Class Weight | Volume
165*
(0,0),(0,1),( 4,0),( 4,2)
(15,15,14,20)
4
166*
(0,0),(0,1),(4,0),(4,10)
(15,15,14,20)
4
167*
(0,0),(0,1),(4,0),(4,1)
(14,14,14,14)
4
168*
(0,0) ,(0,1 ),( 4,0),( 4,5)
(14,14,14,14)
4
169
(0,0),(0,1),(4,0),( 4,4)
(14,14,14,24)
4
170
(0,0),(0,2),(2,0) ,(2,2)
(16,16,16,16)
4
171*
(0,0),(0,2),(2,0),(8,8)
(16,16,20,20)
4
172*
(0,0),(0,2),(2,0),(2,4)
(16,16,20,20)
4
173*
(0,0),(0,2),(4,0),( 4,2)
(20,20,20,20)
4
174*
(0,0),(0,2),( 4,0),( 4,10)
(20,20,20,20)
4
175
(0,0),(0,2),( 4,0),( 4,4)
(20,20,20,24)
4
176
(0,0),(0,4),( 4,0),( 4,4)
(24,24,24,24)
4
177
(0,0) ,(0,1) ,(0,2) ,(6,1)
(1,1,17,17)
6
178
(0,0) ,(0,1) ,(0,2), (6,3)
(1,2,19,18)
6
179
(0,0),(0,1 ),(0,2),( 6,4)
(1,23,18,18)
6
180
(0,0) ,(0,1 ),(0,2) ,(6,0)
(1,22,19,17)
6
181
(0,0) ,(0,1) ,(0,5),( 6,0)
(4,4,17,17)
6
182
(0,0) ,(0,1) ,(0,4),( 6,5)
(4,3,19,18)
6
183
(0,0), (0,1), (0,5) ,(6,3)
(4,6,18,18)
6
184
(0,0), (0,1) ,(0,5),( 6,1)
(4,7,19,17)
6
185
(0,0),(0,1),(0,6),(6,5)
(5,5,19,19)
6
186 187 188
(0,0) ,(0,1), (0,6), (6,1) (0,0), (0,1), (0,6), (6,2)
(5,5,17,17) (5,22,19,19)
(0,0),(0,1), (0,6),( 6,4)
(5,22,18,18)
6 6 6
189
(0,0),(0,1) ,(0,6),( 6,3)
(5,8,18,18)
6
190
(0,0) ,(0,1) ,(0,6),( 6,0)
(5,26,17,17)
6
191
(0,0), (0,1) ,(0,3),( 6,0)
(2,2,17,25)
6
192
(0,0) ,(0,1) ,(0,3),( 6,4)
(2,2,18,18)
6
193
(0,0) ,(0,1), (0,3),( 6,5)
(2,23,19,18)
6
194
(0,0) ,(0,1 ),(0,3),( 6,1)
(2,22,17,18)
6
195
(0,0) ,(0,1 ),(0,3),( 6,3)
(2,22,18,25)
6
196
(0,0) ,(0,1 ),(0,4),( 6,1)
(3,3,17,25)
6
197
(0,0) ,(0,1) ,(0,4),( 6,3)
(3,3,18,18)
6
198
(0,0),(0,1) ,(0,4),( 6,2)
(3,6,19,18)
6
Four-Element Motif Classes-Continued
Class Nr. | Representative | Class Weight | Volume
199
(0,0), (0,1), (0,4), (6,0)
(3,7,17,18)
6
200
(0,0), (0,1), (0,4),( 6,4)
(3,7,18,25)
6
201
(0,0), (0,1), (6,3),( 6,9)
(22,8,18,18)
6
202
(0,0) ,(0,3) ,(0,6),( 6,3)
(8,8,25,25)
6
203
(0,0) ,(0,3) ,(0,6),( 6,0)
(8,26,25,25)
6
204*
(0,0), (0,1),( 6,0),( 6,5)
(19,19,17,17)
6
205*
(0,0) ,(0, 1),( 6,0), (6,11)
(19,19,17,17)
6
206*
(0,0), (0, 1),( 6,2), (6,3)
(19,19,18,18)
6
207*
(0,0) ,(0, 1),( 6,2), (6,5)
(19,19,18,18)
6
208*
(0,0), (0,1), (6,0), (6,1)
(17,17,17,17)
6
209*
(0,0), (0,1), (6,0), (6,7)
(17,17,17,17)
6
210*
(0,0), (0,1), (6,0), (6,3)
(17,18,18,25)
6
211*
(0,0), (0,1), (6,0),( 6,9)
(17,18,18,25)
6
212*
(0,0), (0,1),( 6,3),( 6,4)
(18,18,18,18)
6
213*
(0,0), (0,1),( 6,3),( 6,10)
(18,18,18,18)
6
214*
(0,0), (0,3),( 6,0),( 6,3)
(25,25,25,25)
6
215*
(0,0) ,(0,3),( 6,0),( 6,9)
(25,25,25,25)
6
M.5 Three Tone Motifs in OnPiMod_{5,12}
Refer to the discussion in subsection 11.3.8 for the entries of this table. The order of these representatives is a historical one.
Cl. Nr.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
Three-Element Motif Classes in OnPiMod5 ,12 Representative Kernel in Z5 x Z12
(0,0),(1,0),(0,4) (0,0),(1,0),( 4,4) (0,0),(1,0),(3,4) (0,0),(0,4),(1,8) (0,0),(1,4),( 4,8) (0,0) ,(1,0) ,(0,10) (0,0),(1,0),( 4,10) (0,0),(1,0),(3,10) (0,0),(0,4),(1,2) (0,0),(1,4),(0,2) (0,0),(1,4),( 4,2) (0,0),(1,4),(3,2) (0,0),(0,4),(1,6) (0,0),(1,4),(0,6) (0,0),(1,4),( 4,6) (0,0),(1,4),(3,6) (0,0),(3,4),(1,6) (0,0), (1,4), (1,6) (0,0),(1,0) ,(0,1) (0,0),(1,0),( 4,1) (0,0),(1,0) ,(3,1) (0,0),(0,4),(1,5) (0,0),(1,4),(0,5) (0,0),(1,4),(4,5) (0,0),(1,4),(3,5) (0,0),(0,4),(1,9) (0,0),(1,4),(0,9)
(Z3 X 0) X (0 x Z5) (Z3 X 0) x Z5.(1, 1) Z~ x (Z3 X 0) x Z5.(2, 1) Z~ x Z3.(1, 1) x (Z5 X 0) Z~ x Z3.(1, 1) x Z5.(1, 1) (Z4 x 2Z4) X (Z3 x 0) X (0 x Z5) (Z4 x 2Z4) X (Z3 x 0) x Z5.(1, 1) (Z4 x 2Z4) X (Z3 x 0) x Z5.(2, 1) (Z4 x 2Z4) x Z3.(1, 1) x (Z5 X 0) (Z4 x 2Z4) x Z3.(1, 1) x (0 x Z5) (Z4 x 2Z4) x Z3.(1, 1) x Z5.(1, 1) (Z4 x 2Z4) x Z3.(1, 1) x Z5.(2, 1) (Z4 x 2Z4) x (0 x Z3) X (Z5 x 0) (Z4 x 2Z4) x (0 x Z3) x (0 x Z5) (Z4 x 2Z4) x (0 x Z3) x Z5.(1, 1) (Z4 x 2Z4) x (0 x Z3) x Zs-(2, 1) (Z4 x 2Z4) x (0 x Z3) x Z5.(1, 2) (Z4 x 2Z4) x (0 x Z3) x Z5.(4, 1) (Z4 x 0) X (Z3 X 0) X (0 x Z5) (Z4 x 0) X (Z3 X 0) x Zs-(l, 1) (Z4 x 0) X (Z3 X 0) x Z5.(2, 1) (Z4 x 0) x Z3.(1, 1) x (Z5 X 0) (Z4 x 0) x Z3.(1, 1) x (0 x Z5) (Z4 x 0) x Z3.(1, 1) x Z5.(1, 1) (Z4 x 0) x Z3.(1, 1) x Zs-(2, 1) (Z4 x 0) X (0 x Z3) X (Z5 x 0) (Z4 x 0) X (0 x Z3) x (0 x Z5) Z~ x Z~ x
APPENDIX M. TWO, THREE, AND FOUR TONE MOTIF CLASSES
1196
Three-Element Motif Classes in OnPiMod 5,12-continued Cl. Representative Kernel Nr. in Z5 x Z12 28
(0,0),(1,4),( 4,9)
(Z4 x 0)
X
(0 x Z3) x Z5.(I, 1)
29
(0,0),(1,4),(3,9)
(Z4 x 0)
X
(0 x Z3) x Z5.(2, 1)
30
(0,0),(3,4),(1,9)
(Z4 x 0)
X
(0 x Z3) x Z5.(I, 2)
31
(0,0),(1,4),(1,9)
(Z4 x 0)
X
(0 x Z3) x Z5.(4, 1)
32
(0,0) ,(0,6),(1,1)
Z4.(1,2)
X
(Z3
0)
X
(Z5
33
(0,0) ,(1,6),(0,1)
Zdl,2) x (Z3 x 0)
X
(0 x Z5)
34
(0,0) ,(1 ,6),( 4,1)
Zdl,2) x (Z3
X
0)
X
Z5.(1, 1)
35
(0,0) ,(1,6),(3,1)
Z4.(1,2) x (Z3
X
0) x Z5.(2, 1)
36
(0,0) ,(0,10) ,(1,5)
Zdl,2) x Z3.(I, 1) x (Z5
37
(0,0) ,(1 ,10),(0,5)
Z4.(1,2) x Z3.(I, 1) x (0 x Z5)
38
(0,0),(1,10),( 4,5)
Z4.(1,2) x Z3.(I, 1) x Zdl, 1)
39
(0,0) ,(1,10) ,(3,5)
Z4.(1,2) x Z3.(I, 1) x Z5.(2, 1)
40
(0,0), (0,10),(1,9)
Zdl,2) x (0 x Z3)
41
(0,0) ,(1,10) ,(0,9)
Z4.(1,2) x (0 x Z3) x (0 x Z5)
42
(0,0),( 4,10),(1,9)
Z4.(1,2) x (0 x Z3) x Z5.(I, 1)
43
(0,0) ,(3,10),(1,9)
Zdl,2) x (0 x Z3) x Z5.(I, 2)
44
(0,0), (1,10),(3,9)
Z4.(1,2) x (0 x Z3) x Z5.(2, 1)
45
(0,0),(1,10) ,(1,9)
Z4.(1,2) x (0 x Z3) x Z5.(4, 1)
X
X
X
X
0)
0)
(Z5 x 0)
Appendix N
Well-Tempered and Just Modulation Steps
N.1 12-Tempered Modulation Steps
N.1.1 Scale Orbits and Number of Quantized Modulations
In the following table, the exclamation sign (!) in column 6 means that quantization is not possible for every translation quantity p in the notation of theorem 30.
Orbits and Number of Quantized Modulations
Class | # Min. Cadence Sets | # Quanta | # Quant. Mod.
38 | 9 | 42 | 54 (!)
38.1 | 5 | 20 | 26
47 | 6 | 28 | 30
47.1 | 15 | 66 | 114
50 | 7 | 34 | 42
50.1 | 6 | 36 | 46
52 | 5 | 24 | 24 (!)
55 | 6 | 30 | 32 (!)
61 | 10 | 38 | 62
62 | 5 | 24 | 24 (!)
39 | 9 | 29 | 93
39.1 | 6 | 23 | 55
40 | 10 | 24 | 108
40.1 | 7 | 26 | 72
Orbits and Number of Quant. Mod.-Continued
Class | # Min. Cadence Sets | # Quanta | # Quant. Mod.
1
41 | 7 | 25 | 75
41.1 | 6 | 21 | 53
42 | 6 | 22 | 54
42.1 | 7 | 28 | 74
43 | 6 | 22 | 57
43.1 | 7 | 26 | 72
44 | 9 | 23 | 89
45 | 7 | 21 | 63
45.1 | 10 | 21 | 105
46 | 6 | 26 | 56
48 | 10 | 23 | 109
48.1 | 7 | 28 | 68
49 | 7 | 21 | 71
49.1 | 7 | 26 | 74
51 | 9 | 13 | 86
53 | 7 | 27 | 67
53.1 | 9 | 25 | 91
54 | 7 | 32 | 71
54.1 | 21 | 32 | 226
56 | 7 | 24 | 70
57 | 8 | 21 | 71
58 | 18 | 17 | 185
59 | 11 | 22 | 101
60 | 6 | 21 | 60
N.1.2 Quanta and Pivots for the Modulations Between Diatonic Major Scales (No. 38.1)
Quanta and Pivots for the Modulations Between Diatonic Major Scales
Transl. p 1 1
2 2 2
3 3 4 4 4
I Cadence I Quantum {II, V} {II, III} {VII} {II, V} {IV, V} {II, V} {II, III} {VII} {IV, V} {II,III}
I {VII} 6 6 6 6
{II,III} {IV, V} {IV, V} {II, III} I {VII}
8 8 8
9 9
10 10
10 n n
{VII} {IV, V} {II, III} {II, V} {IV, V} {VII} {II, V} {II,III} {II, V} {IV, V}
I Modulator I Pivots e 5n e 5n
{II,III, V, VII} {II, III, V, VII}
0 • • 0 • • 0 . 0 . O.
e6n e6n
0 • • 0 • • 0 . 0 . O.
e 6n
{II, IV, V II} { II ,IV, V, V II} { II ,IV, V, V II}
e7n e7n
{II,III, V, VII} {II,III, V, VII}
0 • • • • • • • 0 . O.
e 8n e 8n
• • • • 0 • • • • • O.
e 8n
{II,IV, V, VII} {II,III, V, VII} {V, VII}
.0 •• 0 ••••••• .0 •• 0 •••••••
0 •• 0 •• 0.000.
.0.00.0 ••••• .0.00.0 ••••• 00 • • 0 • • 0 0 . O.
10 0 . 0 • • 0 . 0 0 • •
I e9 11
0 ••••• 0 •••••
0 • • • • • • • • • O.
e6 elOn
•••• 0.0 •••••
e6 elOn
1.0.00 •• 00.0.1
e ll 11
• • • • 0 • • • • • O.
0 ••••• 0 •••••
eon eon
•••• 0.0.0 •••
eOn
0 0 . 0 • • • • • • O.
eln
0 0 . 0 • • • • • • O.
eln
0 •• 00.0.00 ••
I {II,IV, VII} {II,III, V, VII} { II ,IV, V, V II} { II ,IV, V, V II} {II,III, V, VII} I {III, V, VII}
{II, VII} { II ,IV, V, V II} {II, III, V, VII} {II ,IV, V, V II} { II ,IV, V, V II}
• 0 • • 0 • 0 • 0 • O.
e2n e 2n
• 0 • • 0 • 0 • 0 • O.
e2n
{III, V, VII} {II,III, V, VII} {II,III, V, VII}
e 3n e3n
{ II ,IV, V, V II} {II,IV, V, VII}
• 0 • • 0 • 0 0 0 • O.
0 •• 0 •••••••• 0 •• 0 ••••••••
N.1.3 Quanta and Pivots for the Modulations Between Melodic Minor Scales (No. 47.1)
The symbol p stands for "translation p"; M stands for "Modulator".
Quanta and Pivots for the Modulations Between Melodic Minor Scales p 1 1 1 1 1 1 2 2 2
3 3 3 3
I Cadence {II,IV}, {IV, VII} {III, VI}, {V, VI} {III, VII} {IV, V} {II, III} {I,VII} {III, V}, {III, VII}, {II, III} {II,IV}, {II, VI},{I,II} {I, III}, {III, VI},{III,IV} {III, V}, {III, VI}, {V, VI}, {I, V} {III, V}, {III, VI}, {V, VI}, {I, V} {III, V}, {III, VI}, {V, VI}, {I, V} {III, V}, {III, VI}, {V, VI}, {I, V}
4
{III, V} {I,III} {II, VI}, {I, II} {IV, VII} {III, VI} {III, VII} {V, VI} {III,IV} {II,III} {I, VII} {I,III} {III, V} {III, VII},{II,III} {III, VI}, {III, IV}
5
{I,II},{I,V},{III,VI},{V,VI}
4 4 4 4 4 4 4 4 4 4 4 4 4
I Quantum
IM
• • • • 0 • • 00 • • 0
e3 11
• • • • • • 0 •• 0 ••
e3 11
. 0 0 • • 0 • • • • O.
e3 11
•••• 0 • • • • • •0
e3 11
.00 •••••••••
e3 11
• • • • • 0 • • • • O.
e3 11
• • 0 • • • 0 . 0 . O.
e4 11
• 0 • 0 • • 0 • 0 • O.
e4 11
0 • • • 0 . 0 . 0 . O.
e4 11
. 0 • • 0 • • • 00 • •
e5 11
. 0 • • 0 • • • 00 • •
e5 11
. 0 • • 0 • • • 00 • •
e5 11
. 0 • • 0 • • • 00 • •
e5 11
00 • • • 0 0 . 0 0 0 .
e6 11 e6 11
• • • 0 • • • • 0 . O.
e6 11
• • • • • • • 00.00
e6 11
0 • • 0 • • 00 • • • 0
• 0 0 • 0 0 • • 0 • O.
e6 11 e6 11
• • • • • • • • 000.
e6 11
0 • • • • • 0 . 0 . O.
e6 11
• • 0 . 0 • • • 0 . O.
e6 11
. 0 • • • 0 • • 0 . O.
e6 11
. 0 0 • • 00 • • 0 0 .
e4
00 • • 00 • • 00 • •
e4
0 ••• 0 ••• 0 •••
e4
•• 0 ••• 0 ••• O •
e4
• 0 • • • • 0 •• 00.
e7 11
0 ••••• 0.000.
I Pivots {II,IV, VII} {I,III, V, VI} {III, V, VII} {II,IV, V, VII} {II, III, V, VII} {I, III, V, VII} {II,III, V, VII} {I, II,IV, V I} {I, III,IV, V I} {I,III, V, VI} {I,III, V, VI} {I,III, V, VI} {I,III, V, VI} {III, V} {I,III} {I, II,IV, V I} {II,IV, V II} {I,III, VI} {III, V, VII} {I, III, V, VI} {I, II I,IV, V I} {II,III, V, VII} {I, III, V, VII} {I,III} {III, V} {II,III, V, VII} {I, II I, IV, V I} {I , III " V VI}
Quanta and Pivots for Melodic Minor Scales-Continued p
5 5 5 6 6 6 6
7 7 7 7 8 8 8 8 8 8 8 8 8 8 8 8 8 8 9 9 9 9
10 10
10
I Cadence {II, VI}, {I, II} {IV, VII}, {IV, V} {III, VII} {III, V}, {III, VII}, {II, III} {I, III}, {III, VI}, {III, IV} {I,III},{III, VI},{III,IV} {III, V}, {III, VII}, {II, III} {III, V}, {I, V}, {III, VII}, {I, VII} {II, VI}, {I, II} {IV, VII}, {IV, V} {III, VI} {III, V} {I,III} {II, VI} {IV, VII}, {IV, V} {III, VI} {III, VII} {V, VI} {III,IV} {II,III} {I, VII} {I,III} {III, V} {III, VII}, {II,III} {III, VI}, {II, IV} {III, V} {II, IV}, {II, VI} {I, III}, {I, V}, {III, VII}, {I, VII} {I,II} {II, V}, {III, VII}, {II, III} {II, IV}, {IV, VII}, {IV, V} {I,III},{III,VI},{III,IV}
I Quantum
IM
.0 •••• 0.0 •• 0
e 7n e 7n
. 0 0 • • 00 • • • • •
e 7n
.0.00.0 •••••
•• 0.0.0 •••
o.
0 • • • 0 • • • 0 . O. • • 0 . 0 • • • 0 . O. 0 • • • 0 . 0 • • • O. . 0 • • 00 • • 0 • • • .0.0 •• 0.0 •••
e8n e8 n e6 e6 e9 n e 9n
00 • • • • • • 00 • •
e9 n e9 n
.00.000.00 ••
elOn
00 • • 0 0 0 • • 0 0 .
elOn
••• 00.00 ••••
elOn
•••• 0.0 ••••0
elOn
00 • • 0 . 0 • • 0 0 .
elOn
•• 0.000.0 •••
elOn
.0 •• 0.0 •• 0 ••
elOn
.0 • • • • • • 0.00
0 • • • 0 . 0 • • • O.
elOn
•• 0.0.0.0 •••
elOn
• • • • 000 • • • • •
elOn
. 0 0 • • 00 • • 0 0 . 00 • • 00 • • 00 • • 0 ••• 0 ••• 0 ••• • • 0 • • • 0 • • • O. . 0 0 • • 00 • • 0 0 . • 0 • 0 0 • • 0 0 • O.
e8 e8 e8 e8 e l1 n e l1 n
. 0 . 0 • • • • 0 . O.
e l1 n e l1 n
• • 0 • 0 • 0 • 0 • O.
eOn
.0 •• 0.0.0 ••0
eOn
0 ••• 0.0.0 •••
eOn
. 0 • • • 00 • • • O.
I Pivots {I, II,IV, V I} {II,IV, V, VII} {III, V, VII} {II, III, V, VII} {I,I II,IV, V I} {I,I II ,IV, V I} {II, III, V, VII} {I,III, V, VII} {I, II,IV, V I} { II ,IV, V, V II} {I,III, VI} {III, V} {I, III} {II,IV, VI} {II,IV, V, V II} {I, III, VI} {III, V, VII} {I,III, V, VI} {I, III,IV, VI} {II,III, V, VII} {I,III, V, VII} {I, III} {III, V} {II, III, V, VII} {I,III,IV, VI} {III, V} {II,IV, VI} {I, III, V, VII} {I, III,IV, VI} {II, III, V, VII} {II,IV, V, VII} {I, III,IV, VI}
1202
p
APPENDIX N. WELL-TEMPERED AND JUST MODULATION STEPS
I Cadence
Quanta and Pivots for Melodic Minor Scales-Continued
I Quantum
{II, IV}, {II, V I} {III, VI} {III, VII}, {I, VII} {V, VI} {III, IV} {I,ll}
11 11 11 11 11 11
N.l.4
IM
• • • 0 • • 00 • • O.
e 1 11
00 • • 0 • • • • 0 • •
e 1 11
•••• 0 •• 0 ••••
e 1 11
• • • • 0 • • • • 0 ••
e 1 11
00 • • • • • • • • • •
e 1 11
• • • 0 • • • • • • O.
e 1 11
I Pivots {I,IV, VI} {I,III, VI} {I, III, V, VII} {I,III, V, VI} {I,III, IV, VI} {I,II,IV, VI}
Quanta and Pivots for the Modulations Between Harmonic Minor Scales (No.54.1)
For this table, we need a numbering of the 21 minimal cadence sets:
0 = {II, VII}    1 = {I, III}    2 = {II, IV}    3 = {III, V}    4 = {IV, VI}
5 = {V, VII}    6 = {I, VI}    7 = {IV, VII}    8 = {I, V}    9 = {II, VI}
10 = {III, VII}    11 = {I, IV}    12 = {II, V}    13 = {III, VI}    14 = {VI, VII}
15 = {I, VII}    16 = {I, II}    17 = {II, III}    18 = {III, IV}    19 = {IV, V}
20 = {V, VI}
Quanta and Pivots for the Modulations Between Harmonic Minor Scales
Transl. p
I Cadence Nr.
I Quantum
3/9
1,3,6,8,10,11,15-20
o
3/9
2,4,7,9,14
. 0 • • 0 • • 0 • • O.
3/9
5,12
0 •• 0 •• 0 •• 0 ••
4/8
0,7,12,14-17,19
o
4/8
1,6,13
. 0 0 • • 00 • • 0 0 .
4/8
2
••• 0 ••• 0 •••0
4/8
3
00 • • 00 • • 00 • •
4/8
4,11,18
• • 0 • • • 0 • • • O.
4/8
5,10
0 ••• 0 ••• 0 •••
4/8
8,20
.0 • • • 0 • • • 0 ••
6
2,7
• 0 • 0 0 • • 0 • 0 O.
6
3,10,17
0 • • • 0 . 0 • • • O.
0 0 0 0 0 0 0 0 0 00
0 0 0 0 0 0 0 0 0 00
1Pivots
{I,ll, III, IV, V, VI, VII} {II, IV, VI, VII} {II, V, VII} {I, II, II I,IV, V, V I, V II} {I,ll, VI} {II, IV} {III, V} {I,III,IV, VI} {III, V, VII} {I, III, V, VI} {II ,IV, V II} {II, III, V, VII}
Quanta and Pivots for Harmonic Minor Scales-Continued
I Quantum
I Pivots
Transl. p
Cadence Nr.
6
4,9,14
.0 •• 0 •• 0 ••
6
5,12
0 •• 00.0 •• 00.
6
8,11,13,15,16,18
• • • • 0 • • • • • O.
6
19
• • • 00 • • • • 0 0 .
{II, IV, VI, VII} {II, V, VII} {I, II, II,IV, V, VI, VII} {II, IV, V, VII}
1,2,5,7,10,11
any cadence set
••••••••••••
{I, II, II,IV, V, VI, VII}
N.1.5
o.
Examples of 12-Tempered Modulations for all Fourth Relations Examples of Modulations for all Fourth Relations
Start
-7
C""F C"" B, C""E, C "" A, C""D, C""G, C""B C""E C""A C""D C""G
N.2
Target
I Neutral Ie Ie,Ve Ie,Ve Ie,IVe
Ie Ie,IVe U VIe, VIe U VIle Ie, Ve,IIe Ie, VIe Ie,Ve Ie, VIe, Ve Ie,Ve
I Pivots
I Cadence
VIIF U IlF
IF, IVF, VF , IF
IIIB" VB, U VIIB, lIE" VE, U VIlE, IlA, U VIlA, IIID"IID, U VIID, IIG, U VIIG,,IIG, IVB,IlBUVIIB VE U VIlE IVA VIlD UVD
VIIB,,IB, VE, UVIIE"IE,
IIIG
IVA" VA"lA, lID" VD,,ID, VG, U VIIG,,IG, IVB, VB,IB IVE, VE,IE IVA U VIlA, VA,IA lID, VD,ID IIG, VG,IG
2-3-5-Just Modulation Steps
The following tables show data for modulations from C-tonic.
N.2.1 Modulation Steps between Just Major Scales
Here, we have the two modulators
(N.1) (N.2)
The numbering of minimal cadence sets is the one used in formula (26.6). The tonics D* and B♭* are the usual third-comma shifted representatives of D and B-flat.
Pivots for the Modulations Between Just Major Scales Translation
1Target Tonic I Modulator I Cadence 1Pivots
(1,0) (-1,0) (2,0) (-2,0) (0,1) (0,1) (0,-1) (0,-1) (1,1) (-1,-1) (1,-1) (1,-1) (-1,1) (-1,1) (-2,1) (2,-1)
N.2.2
<1>2
5
F
<1>2
D
<1>2
BD
<1>2
E E
<1>2
AD AD B DD ED ED
<1>2
A A
<1>2
D*
<1>2
B;
<1>2
1 1 5 1 5 5 5 1 5 1 1 5 1 5 1
G
<1>1 <1>1 <1>2 <1>2 <1>2 <1>1 <1>1
{V, VII} {II,IV} {II, IV} {V, VII} {II, V, VII} {II, V, VII} {II, IV, VII} {II, V, VII} {II, V, VI} {II, V, VII} {II, V, VII} {II ,IV, V II} {II, IV, V II} {II, IV, VII} {III, V, VII} {II,IV, VI}
Modulation Steps between Natural Minor Scales
We have the two modulators
The minimal cadence sets are these:
J 1 = {VII}, h = {III, VI}, J 3 = {V, VI}, J 4 = {IV, V}, J 5 = {II}, J 6 = {III, IV}. Pivots for the Modulations Between Natural Minor Scales Translation
(-1,0) (1,0) (-2,0) (2,0)
I Target Tonic I Modulator I Cadence F
<1>2
5
G
<1>2
BD
<1>2
D
<1>2
1 1 5
Pivots
{II,IV} {V, VII} {V, VII} {II,IV}
Pivots for the Modulations Between Natural Minor Scales-Continued
Translation
I Target Tonic I Modulator I Cadence I Pivots
(0, -1) (0, -1) (0,1) (0,1) (-1, -1) (1,1) (1, -1) (1, -1) (-1,1) (-1,1) (2, -1) (-2,1)
N.2.3
A, A, E E D, B
E,
E, A A
,
B*
D*
1
{II,IV, VII} {II,IV, VII} {II,IV, VII} {II, V, VII} {III, V, VII} {II,IV, VI} {II, V, VII} {II, V, VII} {II, V, VII} {II,IV, V II} {II,IV, VI} {III, V, VII}
5 5 5 1 5 1 5 1 1 5 1
Modulation Steps From Natural Minor to Major Scales
We have the two modulators
eb.A, e b .B,B =
(1°
The minimal cadence sets are the same as for major scales.
Pivots for the Modulations From Natural Minor to Major Scales
Translation | Target Tonic | Modulator | Cadence | Pivots
(-2,0) (-1,0) (1,0) (2,0) (-1,-1) (0,-1) (1,-1) (2,-1)
B,
F
G
D D,
A, E,
B;
5
{V, VII} {II,IV} {V, VII} {II,IV} {II, V, VII}
5 1 1
{II,IV, VII} {II, V,vII} {II,IV, VI}
5 1 5 1
N.2.4
Modulation Steps From Major to Natural Minor Scales
We have the two modulators (same as above).
e b .A,
(1°
The minimal cadence sets are those of minor scales.
Pivots for the Modulations From Major to Natural Minor Scales
Translation | Target Tonic | Modulator | Cadence | Pivots
(-2,0) (-1,0) (1,0) (2,0) (-2,1) (-1,1) (0,1) (1,1)
N.2.5
1 5 1 5 1 1 5 5
Bo F
G
D D A E
B
{V, VII} {II, IV} {V,VII} {II,IV} {III, V, VII} {II,IV, VII} {II, V, VII} {II,IV, VI}
Modulation Steps Between Harmonic Minor Scales
We have the unique translation modulator
The minimal cadence sets are these:
J_1 = {III}, J_2 = {II}, J_3 = {VII}, J_4 = {I, IV}, J_5 = {I, V}, J_6 = {I, VI}, J_7 = {IV, V}, J_8 = {IV, VI}, J_9 = {V, VI}.
Pivots for the Modulations Between Harmonic Minor Scales
Translation | Target Tonic | Modulator | Cadence | Pivots
(3,0) (-3,0) (0,1) (0,1) (0,1)
A* Ef: E E E
8 8
2 4
8
{II,IV, VI} {II, IV, VI} {II, IV} {I,III,IV, VI} {I, III,IV, VI}
Pivots for the Modulations Between Harmonic Minor Scales (Continued)
Translation | Target Tonic | Modulator | Cadence | Pivots
(0,-1) (0,-1) (0,-1) (2,1) (-2,-1) (0,2) (0,-2) (1,2) (-1,-2)
AD AD
2
4
AD
8
h
8
CD
8
Cu
4
FD
4
D" BD,D
4
4
(1,0) (-1,0) (2,0) (-2,0)
C
F
D BD
(3,0) (-3,0) (-2 ... 1,1) (-2 ... - 1,1)
A*
(2,1) (1. .. 2,1) (1,1) (-1. .. 2,1) (1. .. 2,-1) (-2,-1) (-2 ... -1,-1) (-1, -1) (0,2) (0, -2)
FU
1,5,7,9 1,5,7,9 9 9 9 9 7 9 5,9
4
8
7 9 5,9
N.2.6
E;
B
CD
4
DD
8
C"
FD
7 7
{II, IV} {I, III,IV, VI} {I, III,IV, VI} {II,IV, VI, VII} {II, IV, VI, VII} {I, III,IV, VI} {I, III, IV, VI} {I, II I,IV, V I} {I,III, IV, VI} {J. .. VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {I ... VII} {II,IV, VI} {II,IV, VI} {I ... VII}
Modulation Steps Between Melodic Minor Scales
We have the two modulators
The minimal cadence sets are these:
J_1 = {I}, J_2 = {II}, J_3 = {III}, J_4 = {I, IV}, J_5 = {VI}, J_6 = {VII}.
Pivots for the Modulations Between Melodic Minor Scales
Translation | Target Tonic | Modulator | Cadence | Pivots
(1,0)
G
iI>2
6
(-1,0)
F
iI>2
2
(2,0)
D
iI>2
2
(2,0)
D
iI>1
(-2,0)
BD BD E E E E
iI>1
5 6
iI>1
3 5 6
AD AD AD AD A A A A
iI>1
5
iI>1
(1,-1)
ED ED
6 2
iI>1
3
(-1,1)
ED
iI>2
3
(-1,1)
ED
iI>2
2
(-2,1)
D*
iI>2
5
(2,-1)
B:
iI>2
1
(-2,0) (0,1) (0,1) (0,1) (0,1) (0,-1) (0,-1) (0,-1) (0,-1) (-1,1) (-1,1) (-1,1) (-1,1) (1,-1)
N.2.7
iI>1 iI>2 iI>2 iI>1
5 2
iI>2
5
iI>1
6
<1>2
6
iI>1
2
iI>1
3 5
iI>2 iI>2
{V, VII} {II,IV} {II, IV} {II, IV, VII} {V, VII} {II, IV, VII} {II, V, VII} {I,III,IV, VI} {I,III,IV, VI} {II, III, V, VII} {I, III, IV, VI} {I, III, V, VI} {II, III, V, VII} {II, IV, V II} {II, IV, VI, VII} {I, III, V, VI} {I, III, V, VI} {II ,IV, V II} {II, IV, VI, VII}
{I, III, V, VI} {I, III, IV, VI} {II, V, VII} {III, V, VII} {II, IV, VI}
General Modulation Behaviour for 32 Alterated Scales
The following list refers to the 32 scales as defined in 27.1.6.1. Following Radl [429], we say that a scale type
• has no modulations if its modulation domain is empty (always excluding the start tonality!),
• has infinite modulations if its modulation domain is infinite,
• has modulations if its modulation domain is not empty (always excluding the start tonality!),
• has limited modulations if the transitive closure (all tonics which can be reached by successive modulations from relative modulation domains) of its modulation domain is not the total space.
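The distinction between the last two cases can be illustrated with a small sketch. It rests on assumptions that are not stated in the text: the modulation domain is encoded as a finite set of translation vectors (x, y) relative to the start tonic, the set is symmetric (it contains -v whenever it contains v), and the "total space" is the integer lattice of all such pairs. Under these assumptions the transitive closure is the lattice generated by the vectors, and it is the whole lattice exactly when the greatest common divisor of all 2x2 determinants equals 1. The function name `classify` is hypothetical, and infinite modulation domains are outside the scope of this sketch.

```python
from itertools import combinations
from math import gcd


def classify(domain):
    """Classify a finite, symmetric modulation domain given as (x, y) pairs.

    Assumption: the start tonality (0, 0) is excluded, and reachability by
    successive modulations equals the integer lattice spanned by the vectors.
    """
    steps = {v for v in domain if v != (0, 0)}
    if not steps:
        return "no modulations"
    # gcd of all 2x2 minors; it is 1 exactly when the vectors span every
    # integer pair, and 0 when all vectors lie on one line.
    g = 0
    for (a, b), (c, d) in combinations(steps, 2):
        g = gcd(g, abs(a * d - b * c))
    return "modulations" if g == 1 else "limited modulations"


if __name__ == "__main__":
    print(classify({(1, 0), (-1, 0), (0, 1), (0, -1)}))  # spans the lattice
    print(classify({(1, 0), (-1, 0)}))                   # one line only
    print(classify(set()))
```

For example, a domain of the form ±(1,0), ±(0,1) reaches every lattice point, whereas ±(1,0) alone never leaves the chain of fifths.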
Modulation Behaviour in 32 Alterated Scale Types
No. | Scale Type | Behaviour
1 | c, d, e, f, g, a, b | has modulations: see table N.2.1
2 | c, d, e, f, g, a, b♭ | has modulations: ±(1,0) with III, ±(0,1) with V, VI
3 | c, d, e, f, g, a♭, b | has modulations: corresponds to No. 11
4 | c, d, e, f, g, a♭, b♭ | has modulations: corresponds to No. 9
5 | c, d, e, f♯, g, a, b | has modulations: see table N.2.2
6 | c, d, e, f♯, g, a, b♭ | has modulations (special table in [429])
7 | c, d, e, f♯, g, a♭, b | has modulations: ±(1,0) with II, ±(0,1) with III, VII
8 | c, d, e, f♯, g, a♭, b♭ | has limited modulations: see No. 25
9 | c, d, e♭, f, g, a, b | has modulations: see table N.2.6
10 | c, d, e♭, f, g, a, b♭ | has modulations: corresponds to No. 2
11 | c, d, e♭, f, g, a♭, b | has modulations: see table N.2.5
12 | c, d, e♭, f, g, a♭, b♭ | has modulations: see table N.2.2
13 | c, d, e♭, f♯, g, a, b | has modulations: ±(1,0) with II, ±(-1,1) with I, VII
14 | c, d, e♭, f♯, g, a, b♭ | has modulations: ±(1,1) with I, V, ±(0,1) with I, V
15 | c, d, e♭, f♯, g, a♭, b | has no modulations
16 | c, d, e♭, f♯, g, a♭, b♭ | has modulations: corresponds to No. 7
17 | c, d♭, e, f, g, a, b | has modulations: see No. 7
18 | c, d♭, e, f, g, a, b♭ | has modulations: corresponds to No. 14
19 | c, d♭, e, f, g, a♭, b | has no modulations: see No. 15
20 | c, d♭, e, f, g, a♭, b♭ | has modulations: corresponds to No. 13
21 | c, d♭, e, f♯, g, a, b | has infinite modulations
22 | c, d♭, e, f♯, g, a, b♭ | has infinite modulations
23 | c, d♭, e, f♯, g, a♭, b | has modulations: ±(1,0) with V, ±(0,1) with II
24 | c, d♭, e, f♯, g, a♭, b♭ | has infinite modulations
25 | c, d♭, e♭, f, g, a, b | has limited modulations
26 | c, d♭, e♭, f, g, a, b♭ | has modulations: corresponds to No. 6
27 | c, d♭, e♭, f, g, a♭, b | has modulations: see No. 16
28 | c, d♭, e♭, f, g, a♭, b♭ | has modulations: see table N.2.1
29 | c, d♭, e♭, f♯, g, a, b | has infinite modulations: corresponds to No. 24
30 | c, d♭, e♭, f♯, g, a, b♭ | has infinite modulations: corresponds to No. 22
31 | c, d♭, e♭, f♯, g, a♭, b | has modulations: corresponds to No. 23
32 | c, d♭, e♭, f♯, g, a♭, b♭ | has infinite modulations: corresponds to No. 21
Appendix O
Counterpoint Steps
O.1
Contrapuntal Symmetries
All the following tables relate to representatives of strong dichotomies (X/Y) which are indicated in the table after the counterpoint theorem 33 in subsection 31.3.3.
O.1.1
Class Nr. 64
Ik Ig 2
e6 (1
+ c:.6) (1 + c:.6)Z12 +
e£·85
+ c:.6)
4 5
e£·ll11
7
e6 (1 + c:.6) e6 (1 + c:.6) e£·37 e£·37
11
e£·ll11
e£·85
c:.e6 X
+c:.e85X seek=2 Z12
e6 (1
9
I g.x[c:] n X[c:]
I g.X[c:]
Z12
+
c:.e ll 11X
seek=2 seek=2 Z12 +c:.e37X seek=9 seek=5 seek=2
z even: z+c:.{5,11} z odd: z+c:.X Z12 +c:.{4,5, 7,9} seek=2
+ c:.{2, 4, 7, 9} seek=2 seek=2 Z12 + c:.{2, 4, 5, 7} seek=9 seek=5 seek=2 Z12
I card(g.X[c:] n X[c:]) I 48
48 48 48 48 48
O.1.2
Class Nr. 68
I g.X[e] n X[e]
I g.X[e] 0
ec . 6 7
1.:12 +e.e6 7X
1
ec . 3 7
1.:12
ec .6 (1
+ e.6)
+ e.e3 7X (1 + e.6)1.: 12 + e.e 6 X
+ e.(X - {O}) 1.:12 + e.{O, 2, 3, 5} z even: z + e.{2, 8} 1.:12
I card(g.x[e] n X[e]) I 60 48
z odd: z +e.X
+ e.6)1.: 12 + e.7X
7+e.6
(1
2
5
1.:12 +e.5X
3
ec .6 (1
z odd:
+ e.6)
(1
+ e.6)1.:12 + e.e6 X
+ e.{O, 2, 8} z + e.(X - {O})
z even: z
+ e.{O, 1,3, 5} even: z + e.{2, 8}
1.:12
48
z
48
z odd: Z+e.X 7+e.6
(1 + e.6)1.: 12
+ e.7X
z even: z z odd: z
5
ec .3 11 e c .6 (1
8
+ e.6)
1.:12
+ e.e 3 X
1.:12
+ e.{O, 2, 8} + e.(X - {O})
+ e.{O, 1,2, 3}
seek=l
seek=l
7+e.6 ec .3 11
seek=3
seek=3
seek=5
seek=5
ec . 3 7
seek=l
seek=l
5
seek=2
seek=2
48
48
O.1.3
Class Nr. 71
We abbreviate C(g) = card(g.X[ε] ∩ X[ε]).
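The cardinalities C(g) can be checked against the case-by-case descriptions of the intersections given in these tables. The sketch below assumes that each intersection is encoded as a list of cases, where a case pairs a set of base tones z in Z12 with the set of intervals k such that z + ε.k belongs to the intersection. The encoding and the function name `fibre_cardinality` are assumptions made for illustration only.

```python
def fibre_cardinality(cases):
    """Count the elements z + eps.k of a set described case by case.

    `cases` is a list of (base_tones, intervals) pairs; every base tone z
    in a case contributes one element per interval k of that case.
    """
    seen = set()
    total = 0
    for base_tones, intervals in cases:
        base_tones = {z % 12 for z in base_tones}
        if base_tones & seen:
            raise ValueError("a base tone appears in two cases")
        seen |= base_tones
        total += len(base_tones) * len(set(intervals))
    return total


# Z12 + eps.{1, 2, 6, 7}: the same four intervals over every base tone.
uniform = [(range(12), {1, 2, 6, 7})]

# A description split by the residue of z modulo 4.
by_residue = [
    ({0, 4, 8}, {0, 3}),
    ({1, 5, 9}, {0, 1, 3, 6, 7}),
    ({2, 6, 10}, {3, 6}),
    ({3, 7, 11}, {0, 1, 2, 6, 7}),
]

if __name__ == "__main__":
    print(fibre_cardinality(uniform))
    print(fibre_cardinality(by_residue))
```

The two sample descriptions give 12 · 4 = 48 and 3 · 2 + 3 · 5 + 3 · 2 + 3 · 5 = 42, which are the values of C(g) appearing for Class Nr. 71.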
I g.X[c]
°
ec .8 (5
1
ec .8 5
I g.x[c] n X[c]
I C(g) I 48
+ 10.6)
Z12 +c.e8 5X (1 + c.6)Z12 + c.e 8 5X
Z12 + c.{l, 2, 6, 7} z even: z + c.{l, 2, 6, 7} z odd: z + c.{O, 1,2, 7}
ec . 3 (7 + 10.3)
(1 + c.3)Z12 + c.e 3 7X
ec .3 (7 - 10.3)
(1 - c.3)Z12 + c.e 3 7X
ec .9 (7 + 10.3)
(1 + c.3)Z12 + c.e 9 7X
ec .9 (7 - 10.3)
(1 - c.3)Z12 + c.e 9 7X
= 0,4,8 : z + c.{O, 3} 42 z = 1,5,9 : z + c.{O, 1,3,6, 7} z = 2,6,10 : Z + c.{3, 6} Z = 3,7,11 : Z + c.{O, 1, 2, 6, 7} Z = 0,4,8: Z + c.{O, 3} Z = 1,5,9 : Z + c.{O, 1,2,6, 7} Z = 2,6,10: Z +c.{3,6} Z = 3,7,11 : Z + c.{O, 1,3,6, 7} Z = 0,4,8 : Z + c.{3, 6} Z = 1,5,9 : Z + c.{O, 1,2,6, 7} Z = 2,6,10 : Z + c.{O, 3} Z = 3,7,11 : Z + c.{O, 1,3,6, 7} Z = 0,4,8 : Z + c.{3, 6} Z = 1,5,9: Z + c.{O, 1,3,6, 7} Z = 2,6,10 : Z + c.{O, 3} Z = 3,7,11 : Z + c.{O, 1,2,6, 7}
2
ec .6 (1 + 10.6)
(1 + c.6)Z12 + c.e6 X
Z
even: Z + c.{O, 1,6, 7} odd: z+c.X
60
3 6
ec .6 (1 + 10.6) ec . 25 ec . 2(5 + 10.6)
seek=2 Z12 +c.e 2 5X (1 + c.6)Z12 + c.e 2 5X
seek=2 Z12 + c.{O, 1,2, 7} Z even: Z + c.{O, 1,2, 7} Z odd: Z + c.{l, 2, 6, 7}
60 48
7
ec .9 (7 + 10.3)
seek=1 seek=1
seek=1 seek=1
42
z
Z
ec .3 (7
+ 10.3)
O.1.4
Class Nr. 75
I k I9
°
I g.x[c]
ef:·97
I g.X[c] n X[c]
Z12 +c.e 9 7X
ef:·9(7 + 10.4)
ef:·9(7 - 10.4)
ef:·6(1 + 10.6)
(1 + c.4)Z12 +
(1 - c.4)Z12 +
(1 + C-.6)Z12 +
c.e9 7X
c.e 9 7X
c-.e6 X
Z12 + c.{1, 4, 5, 8} Z = 0,3,6,9: Z + c.{1, 4, 5, 8} Z
= 1,4,7,10 : Z + c.{O, 1, 5, 8}
Z
= 2,5,8,11: z+c.{0,1,4,5}
Z
= 0,3,6,9 : Z + c.{I, 4, 5, 8}
Z
= 1,4,7,10 : Z + c.{O, 1,4, 5}
Z
= 2,5,8,11 : Z + c.{O, 1,5, 8}
Z
even:
Z
odd: z+c-.X
Z
ef:·6(1 + c-.6)
seek=O
seek=O
2
ef:·8(5 + c-.4)
(1 + C-.4)Z12 + c-.e8 5X
Z
= 0,3,6,9 : Z + c-.{O, 1,4, 8}
Z
= 1,4,7,10 : Z + c-.{O, 1,4,5, 8}
Z Z
= 2,5,8,11 : Z + c-.{O, 2, 4, 5, 8} = 0,3,6,9 : Z + c.{O, 1,4, 8}
Z
= 1,4,7,10 : Z + c-.{O, 2, 4, 5, 8}
Z
= 2,5,8,11 : Z + c-.{O, 1,4,5, 8}
(1 - C-.4)Z12 + c-.e8 5X
48
+ c.{2, 8}
1
ef:·8(5 - c-.4)
I C(g) I
48 56
4
ef:·6(1 + c-.6)
seek=1
seek=1
5
ef:·8(1
± c-.4)
seek=2
seek=2
56
8
ef:·511
Z12 + c.e 5 11X (1 + C-.4)Z12 + c-.e5 11X
Z12 + c.{O, 1,4, 5}
48
ef:·5(11 + c-.4)
Z Z Z
ef:·5(11 - 10.4)
(1 - C-.4)Z12 + c.e 5 11X
= 0,3,6,9 : Z + c-.{O, 1,4, 5} = 1,4,7,10 : Z + c.{I, 4, 5, 8} = 2,5,8,11 : Z + c-.{O, 1,5, 8}
Z
= 0,3,6,9: Z + c-.{O, 1,4, 5} = 1,4,7,10 : Z + c-.{O, 1, 5, 8}
Z
= 2,5,8,11 : Z + c.{I, 4, 5, 8}
Z
48
O.1.5
Class Nr. 78
Ik I9
°
ee 09(7 + c03)
I goX[c] (1 + c03)Z12 + coe97X
I goX[c] n X[c] = 0,4,8: z + co{l, 4} = 1,5,9 : Z + co{O, 2, 4, 6, 1O} = 2,6,10 : Z + co{1, 1O} = 3,7,11 : Z + co{O, 1,4,6, 1O} = 0,4,8: Z + co{l, 4} = 1,5,9 : Z + co{O, 1,4,6, 1O} = 2,6,10 : Z + co{1, 1O} = 3,7,11 : Z + co{O, 2, 4, 6, 1O} = 0,4,8 : Z + co{l, 1O} = 1,5,9 : Z + co{O, 1,4,6, 1O} = 2,6,10: Z +co{1,4} = 3,7,11 : Z + co{O, 2, 4, 6, 1O} = 0,4,8 : Z + co{l, 1O} = 1,5,9 : Z + co{O, 2, 4, 6, 1O} = 2,6,10 : Z + co{l, 4} = 3,7,11 : Z + co{O, 1,4,6, 1O}
42
+ co{O, 4, 6, 1O} Z odd: Z +coX seek=l Z = 0,3,6,9 : Z + co{O, 2, 6, 1O} Z = 1,4,7,10 : Z + co{O, 2, 4, 6, 1O} Z = 2,5,8,11 : Z + co{l, 2, 4, 6, 1O} Z = 0,3,6,9 : Z + co{O, 2, 6, 1O} Z = 1,4,7,10 : Z + co{l, 2, 4, 6, 1O} Z = 2,5,8,11 : Z + co{O, 2, 4, 6, 1O}
60
42
z z Z Z
ee 09(7 - c03)
(1 - c03)Z12 + coe97X
Z Z Z Z
ee 03(7 + c03)
(1 + c03)Z12 + coe37X
Z Z Z Z
ee 03(7 - c03)
(1 - c03)Z12 + coe37X
Z Z Z Z
1
ee 06(1 + c06)
(1 + c06)Z12 + coe6X
2 4
ee 06(1 + c06)
seek=l
5+c.4
(1 + c.4)Z12 + c05X
5 - c.4
(1 - c.4)Z12 + c05X
ee 09(7 ± c03) ee 03(7 ± c03) ee06(5 + co2)
seek=O seek=O
seek=O seek=O
(1 + c02)Z12 + coe65X
Z
6 10
Z
Z Z Z Z Z
ee 06(5 - co2)
(1 - c02)Z12 + coe65X
I C(g) I
Z Z Z Z Z Z
even:
Z
= 0,6 : Z + co{O, 2, 4, 6} = 1,7 : Z + co{l, 2, 4, 6, 1O} = 2,8 : Z + co{O, 4, 6, 1O} = 3,9 : Z + co{O, 2, 6, 1O} = 4, 10: Z +co{0,2,4, 1O} = 5, 11: Z +co{0,2,4,6, 1O} = 0,6: Z + co{O, 2,4,6} = 1,7 : Z + co{O, 2, 4, 6, 1O} = 2,8 : Z + co{O, 2, 4, 1O} = 3,9 : Z + co{O, 2, 6, 1O} = 4,10 : Z + co{O, 4, 6, 1O} = 5,11 : Z + co{l, 2, 4, 6, 1O}
60 56
52
O.1.6
Class Nr. 82
I g.X[c] 0
I g.x[c] n X[c]
ee.6(1 + c.6)
(1 + c.6)Z12 + c.e 6 X
Z
ee.6(7 + c.6)
(1 + c.6)Z12 + c.e 67X
z even: z + c.{3, 7, 9}
ee.l1 (11 - c.4)
(1 + c.4)Z12 + c.e l1 11X
even: Z + c.{3, 9} z odd: z +c.X
I C(g) I 48
z odd: z +c.(X - {7}) z z z
ee.l1 (11 + c.4)
(1 - c.4)Z12 + c.e l1 11X
z z z
3
4 7 8
= 0,3,6,9: z + c.{3, 4, 7, 8} = 1,4,7,10 : z + c.{O, 3, 7, 8} = 2,5,8,11 : z + c.{O, 3, 4, 7} = 0,3,6,9: z + c.{3,4, 7,8} = 1,4,7,10 : z + c.{O, 3, 4, 7} = 2,5,8,11 : z + c.{O, 3, 7, 8}
ee· l1 11
Z12 + c.e l1 11X
ee.8(5 - c.4)
(1 + c.4)Z12 + c.e 85X
ee.8(5 + c.4)
(1 - c.4)Z12 + c.e 85X
ee.6(1 + c.6)
seek=O
ee.6(7 + c.6)
(1 + c.6)Z12 + c.e 6 X (1 + c.6)Z12 + c.e6 7X
7 ee.37
Z12 +c.7X Z12 + c.e 37X
Z12 + c.(X - {7})
ee.6(1 + c.6) ee.3(7 + c.4)
(1 + c.6)Z12 + c.e6 X (1 + c.6)Z12 + c.e6 7X (1 + c.4)Z12 + c.e3 7X
ee.3(7 - c.4)
c.e3 7X
ee.6(7 + c.6)
Z12 + c.{3, 4,7, 8} z = 0,3,6,9 : z + c.{O, 4,7, 8} z = 1,4,7,10 : z + c.(X - {7}) z = 2,5,8,11 : z + c.(X - {9}) z = 0,3,6,9 : z + c.{O, 4, 7, 8} z = 1,4,7,10 : z + c.(X - {9}) z = 2,5,8, 11: z +c.(X - {7})
48
seek=O
Z12 + c.{O, 3, 4, 7} seek=O seek=O z = 0,3,6,9 : z + c.{O, 3, 4, 7} z = 1,4,7,10 : z + c.{3, 4, 7, 8} z
(1 - c.4)Z12 +
56
z z
= 2,5,8,11 : z + c.{O, 3, 7, 8} = 0,3,6,9: z + c.{O, 3, 4, 7} = 1,4,7,10 : z + c.{O, 3, 7, 8}
z=2,5,8,11 :z+c.{3,4,7,8}
56 48
O.2
Permitted Successors for the Major Scale
For the sweeping orientation, given a cantus firmus step x ↦ y, one is allowed to move from a consonance c (i.e., x + ε.c) in the top row to a consonance d (i.e., y + ε.d) in the right column iff there is a * in the corresponding matrix entry.
1. Oblique Motion in Cantus Firmus
0 ↦ 0
2 ↦ 2
4 ↦ 4
5 ↦ 5
7 ↦ 7
9 ↦ 9
11 ↦ 11
0479
0379
0378
0479
0479
0378
038
**0 * * ** 47 ** *** 9
* * *0 * * * 37 ** * * 9
***0 * * ** 37 ** *** 8
**0 * **4 * * * 97 ***
**0 * * ** 74 ** *** 9
2. Minor Ascending Second in Cantus Firmus
4 ↦ 5
11 ↦ 0
0378
****0 * * * ** 74 ** ****9
038
***0
***4 ***7 ***9
3. Minor Descending Second in Cantus Firmus
5 ↦ 4
0 ↦ 11
0479
0479
****0 ****3 ** *7 ****8
****0 ****3 ****8
4. Major Ascending Second in Cantus Firmus
0 ↦ 2
2 ↦ 4
5 ↦ 7
9 ↦ 11
7 ↦ 9
0479
*
**0
* * * ** 37 ** ****9
0379
****0 * * * ** 37 ** *** *8
0478
* ** ** 04 * *7 ** ** * * 9
0479
* **0 ****8 *
* * * ** 37
0378
**** **** ****
0 3 8
5. Major Descending Second in Cantus Firmus
2 ↦ 0
4 ↦ 2
7 ↦ 5
9 ↦ 7
11 ↦ 9
0379
**0 ** * * ** 47 ** ****9
0378
****0 ****3 ** *7 ****9
0479
* ** * ** ***
*0 *4 *7 *9
0378
* * *** *** *
*0 *4 *7 *9
038
***0 ***3
***7 ***8
***0 * **3 * * * 87 ***
**0 * * 83 **
6. Minor Ascending Third in Cantus Firmus
2 ↦ 5
4 ↦ 7
9 ↦ 0
11 ↦ 2
0379
0378
0378
038
** * ** ** 04 * * * 97 * *
****0 * * * *4 * * * ** 97 *
* ** ** 04 * * * * ** 97 *
***0 * *3 ***7 * *9
7. Minor Descending Third in Cantus Firmus
5 ↦ 2
7 ↦ 4
0 ↦ 9
0479
0479
0479
****0 * * * * 37 ** *** 9
* * * *
0
* * * * 37 ** * * * *8
****
2 ↦ 11
0
* * * * 37 ** * ** *8
0379
****0 * * * *3 ****8
8. Major Ascending Third in Cantus Firmus
0 ↦ 4
5 ↦ 9
7 ↦ 11
0479
0479
0479
* ** ** *
**0 * *3 *7 **8
** * ** ** 30 * * * ** 87 *
* **0 ****3 * **8
9. Major Descending Third in Cantus Firmus
4 ↦ 0
9 ↦ 5
11 ↦ 7
0378
0378
038
****0 * * * *4 ** *7 * * * *
9
****0 * * * *4 ** *7 * ***9
***0 * * *4 ***7 * * *
9
10. Ascending Fourth in Cantus Firmus
0 ↦ 5
2 ↦ 7
4 ↦ 9
7 ↦ 0
9 ↦ 2
11 ↦ 4
0479
0379 * * * * 0 * * * *4 ** *7 * ***9
0378
0479
0378
038
** *7 ****9
** ** * ** 87
* * *7 * * *8
* * **
** * * ** *
*0 ** 47 *9
* * **
* * *0 * * *3 ** * ** 87
* * * *0 * * * *4
* * * *0 ****3
* **0 ***3
11. Descending Fourth in Cantus Firmus
5 ↦ 0
7 ↦ 2
9 ↦ 4
0 ↦ 7
2 ↦ 9
4 ↦ 11
0479
0479
0378
0479
0379
0378
****0 ****4 ** *7 ****9
* * * *
0
* * * ** 37 ** ****9
* * * *0
****3 ** *7 ****8
* * **0 ****4 ** *7 ****9
12. Ascending Tritone in Cantus Firmus
5 ↦ 11
11 ↦ 5
0479
038
**0 * * ** * 83 *
**0 ***4 ***7 * *9
****0 * * * *3 ** *7 ****8
****0 ****3 ****8
Part XVIII
References
Bibliography
[1] Abel V and Reiss P: MUTABOR II - Software Manual. Mutabor Soft, Darmstadt 1991
[2] Abraham R and Marsden J: Foundations of Mechanics. Benjamin, New York et al. 1967
[3] Ackermann Ph: Computer und Musik. Springer, Wien and New York 1991
[4] Aczel P: Non-well-founded Sets. No. 14 in CSLI Lecture Notes. Center for the Study of Language and Information, Stanford 1988
[5] Ackermann Ph: Developing Object-Oriented Multimedia Software. dpunkt, Heidelberg 1996
[6] Adorno Th W: Fragment über Musik und Sprache. Stuttgart, Jahresring 1956
[7] Adorno Th W: Der getreue Korrepetitor (1963). Gesammelte Schriften, Bd. 15, Suhrkamp, Frankfurt am Main 1976
[8] Agawu V K: Playing with Signs. Princeton University Press, Princeton 1991
[9] Agmon E: A Mathematical model of the diatonic system. JMT, 33, 1-25, 1989
[10] Agmon E: Coherent Tone-Systems: a study in the theory of diatonicism. JMT, 40(1), 39-59, 1996
[11] Agon A: OpenMusic: Un langage visuel pour la composition musicale assistée par ordinateur. PhD Dissertation, Université Paris VI, Paris 1998
[12] Akmajian A et al.: Linguistics. MIT Press, Cambridge MA 1995
[13] Alain: Alain Citation on Almada's painting in the Gulbenkian Foundation Center. Lisbon 1968
[14] D'Alembert J Le Rond: Einleitung zur Enzyklopädie (1751). (German Translation) Fischer, Frankfurt/Main 1989
[15] Amuedo J: Computational Description of Extended Tonality. Master Thesis, U Southern California, Los Angeles 1995
[16] Andreatta M: Group-theoretical Methods Applied to Music. Independent Study Dissertation, University of Sussex, Sussex 1997
[17] Andreatta M: La théorie mathématique de la musique de Guerino Mazzola et les canons rythmiques. Mémoire DEA, EHESS Paris IV & IRCAM, Paris 1999
[18] Ansermet E: Die Grundlagen der Musik im menschlichen Bewusstsein. 4th Ed. Piper, München and Zürich 1986
[19] Apfel E: Diskant und Kontrapunkt in der Musiktheorie des 12. bis 15. Jahrhunderts. Heinrichshofen, Wilhelmshaven 1982
[20] Aristoteles: Topik (Organon V, 345 b.C.). Rolfes E (German transl.), Meiner, Hamburg 1992
[21] Artin M: Algebra. Birkhäuser, Basel et al. 1993
[22] Artin M, Grothendieck A, Verdier J L: Théorie des Topos et Cohomologie Étale des Schémas (Tomes 1,2,3). Springer LN 269, 270, 305, Springer, New York et al. 1972-1973
[23] Askenfelt A et al.: Musical Performance. A Synthesis-by-Rule Approach. Computer Music J. 7/1, 1983
[24] Assayag G: Du calcul secret au calcul visuel. In: Delalande F and Vinet H (eds.): Interface homme-machine et création musicale. Hermes, Paris 1999
[25] Auroux S: La sémiotique des encyclopédistes. Payot, Paris 1979
[26] Babbitt M: Some Aspects of Twelve-Tone Composition. In: Hays W (ed.): The Score and IMA Magazine, 12, 53-61, 1955 (reprinted in "Twentieth Century Views of Music History", 364-371, Scribner, New York 1972)
[27] Babbitt M: Twelve-Tone Invariants as Compositional Determinants. Musical Quarterly, 46, 245-259, 1960
[28] Babbitt M: Set Structure as a Compositional Determinant. JMT, 5(2), 72-94, 1961
[29] Babbitt M: Twelve-Tone Rhythmic Structure and the Electronic Medium. PNM, 1(1), 49-79, 1962
[30] Babbitt M: The Structure and Function of Music Theory. College Music Symposium, Vol. 5, 1965 (reprinted in Boretz and Cone, 1972, 10-21)
[31] Babbitt M: Words about Music. Dembski S and Straus J N (eds.), University of Wisconsin Press, Madison 1987
[32] Bach J S: Krebskanon. In: Musikalisches Opfer, 70. (BWV 1079), Neue Gesamtausg. sämtl. Werke Ser. VIII, Bd.1, Bärenreiter, Kassel 1978
[33] Bach J S: Choral No.6, In: Himmelfahrtsoratorium (BWV 11), Neue Gesamtausg. sämtl. Werke Ser. II, Bd.8, Bärenreiter, Kassel 1978
[34] Bacon F: De dignitate et augmentis scientiarum. 1st Ed. 1605, improved 1623
[35] Baker J, Beach D, and Bernard J: Music Theory in Concept and Practice. Eastman Studies in Music, University of Rochester Press, 1997
[36] Balzano G: The group-theoretic description of 12-fold and microtonal pitch systems. CMJ, 4, 66-84, 1980
[37] Bandemer H and Gottwald H: Fuzzy Sets, Fuzzy Logic, Fuzzy Methods. Wiley, New York et al. 1995
[38] Banter H: Akkord-Lexikon. Schott, Mainz 1982
[39] Barlow K: Über die Rationalisierung einer harmonisch irrationalen Tonhöhenmenge. Preprint, Köln 1985
[40] Barrow J and Tipler F: The Anthropic Cosmological Principle. Oxford University Press, New York 1986
[41] Barthes R: Éléments de sémiologie. Communications 4/1964
[42] Basten A: Personal e-mail communication to G.M. November 22, 1996
[43] Bätschmann O: Einführung in die kunstgeschichtliche Hermeneutik. Wissenschaftliche Buchgemeinschaft Darmstadt, Darmstadt 1986
[44] Bauer M: Die Lieder Franz Schuberts. Breitkopf und Härtel, Leipzig 1915
[45] Bazelow A and Brickel F: A Partition Problem Posed by Milton Babbitt. PNM, 14(2), 15(1), 280-293, 1976
[46] Beethoven L van: Grosse Sonate für das Hammerklavier op.106 (1817-1818). Ed. Peters, Leipzig 1975
[47] Bekesy G von: Experiments in Hearing. McGraw-Hill, New York 1960
[48] Beran J: Cirri. Centaur Records, 1991
[49] Beran J and Mazzola G: Immaculate Concept. SToA music 1002.92, Zürich 1992
[50] Beran J and Mazzola G: Analyzing Musical Structure and Performance - a Statistical Approach. Statistical Science, Vol. 14, No. 1, 47-79, 1999
[51] Beran J and Mazzola G: Visualizing the Relationship Between Two Time Series by Hierarchical Smoothing. Journal of Computational and Graphical Statistics, Vol. 8, No. 2, 213-238, 1999
[52] Beran J and Mazzola G: Timing Microstructure in Schumann's "Träumerei" as an Expression of Harmony, Rhythm, and Motivic Structure in Music Performance. Computers and Mathematics with Applications, Vol. 39, Issue 5/6, 99-130, 2000
[53] Beran J: Maximum likelihood estimation of the differencing parameter for invertible short and long-memory ARIMA models. J.R. Statist. Soc. B, 57, No. 4, 659-672, 1995
[54] Beranek L L: Acoustics, 1954-1993, The Acoustical Society of America, ISBN: 0-88318-494-X
[55] Berger M: Geometry I, II. Springer, Berlin et al. 1987
[56] Bernard J: Chord, Collection, and Set in Twentieth-Century Theory. In: Baker J et al. this bibliography, 11-52
[57] Biok H-R: Zur Intonationsbeurteilung kontextbezogener sukzessiver Intervalle. Bosse, Regensburg 1975
[58] Blood A, Zatorre J, Bermudez P, Evans A C: Emotional response to pleasant and unpleasant music correlates with activity in paralimbic brain regions. Nature Neuroscience, Vol. 2, No. 4, 382-387, April 1999
[59] Boretz B and Cone E T: Perspectives on Contemporary Music Theory. W.W. Norton and Company, New York 1972
[60] Boulez P: Musikdenken heute I, II; Darmstädter Beiträge V, VI. Schott, Mainz 1963, 1985
[61] Boulez P: Le timbre et l'écriture, le timbre et le langage. In: Bourgeois Chr (ed.): Le timbre, métaphore pour la composition. IRCAM, Collection Musique/Passé/Présent, Paris 1991
[62] Bourgeois Chr (ed.): Le timbre, métaphore pour la composition. IRCAM, Collection Musique/Passé/Présent, Paris 1991
[63] Bourbaki N: Éléments de Mathématique, Algèbre, Ch.1-9. Hermann, Paris 1970-1973
[64] Bourbaki N: Éléments de Mathématique, Algèbre Commutative, Ch.1-7. Hermann, Paris 1961-65
[65] Bourbaki N: Éléments de Mathématique, Topologie Générale, Ch.1-4. Hermann, Paris 1971
[66] Brandle L: Die "Wesentlichen Manieren" (Ornamente in der Musik). Oesterreichischer Bundesverlag, Wien 1987
[67] Brandt C and Roemer C: Standardized Chord Symbol Notation. Roerick Music, Sherman Oaks, CA 1976
[68] Brecht B et al.: Conductor Follower. In: ICMA (ed.): Proceedings of the ICMC 95, S. Francisco 1995
[69] Brockwell P J and Davis R A: Time Series: Theory and Methods. Springer, New York 1987
[70] Bruhn H: Harmonielehre als Grammatik der Musik. Psychologie Verlags Union, München et al. 1988
[71] Busch H R: Leonhard Eulers Beitrag zur Musiktheorie. Bosse, Regensburg 1970
[72] Buser P and Imbert M: Audition. Hermann, Paris 1987
[73] Buteau Ch: Motivic Topologies and their Signification in Musical Motivic Analysis. Masters Thesis, U Laval/Quebec 1998
[74] Buteau Ch and Mazzola G: From Contour Similarity to Motivic Topologies. Musicae Scientiae, Vol. IV, No. 2, 125-149, 2000
[75] Buteau Ch: Reciprocity Between Presence and Content Functions on a Gestalt Composition Space. Tatra Mt. Math. Publ. 23, 17-45, 2001
[76] Cabral B and Leedom L: Imaging vector fields using line integral convolution. Computer Graphics 27 (SIGGRAPH93 Proceedings), 263-272, 1993
[77] Calvet O et al.: Modal synthesis: compilation of mechanical sub-structure and acoustical sub-systems. In: Arnold S, Hair G, ICMA (eds.): Proceedings of the 1990 International Computer Music Conference. San Francisco 1990
[78] Camuri A et al.: Toward a cognitive model for the representation and reasoning on music and multimedia knowledge. In: Haus G and Pighi I (eds.): X Colloquio di Informatica Musicale. AIMI, LIM-Dsi, Milano 1993
[79] Carey N and Clampitt D: Aspects of well-formed scales. MTS, 11, 187-206, 1989
[80] Cardelli L and Wegner P: On Understanding types, data Abstraction and polymorphism. Computing Surveys, Vol. 17, 4, 1985
[81] Castagna G: Foundations of Object-oriented programming. ETAPS, Lisbone 1998
[82] Castine P: Set Theory Objects. Lang, Frankfurt/Main et al. 1994
[83] CERN. Internet Information via http://www.cern.ch 1995
[84] Chatterjee S and Price B: Regression Analysis by Example. Wiley, 2nd ed., New York 1995
[85] Chomsky N and Halle M: The Sound Pattern of English. Harper and Row, New York 1968
[86] Chowning J: The Synthesis of Complex Audio Spectra by Means of Frequency Modulation. Journal of the Audio Engineering Society 21 (7), 1985
[87] Clarke E: Imitating and Evaluating Real and Transformed Musical Performances. Music Perception 10/3, 317-341, 1993
[88] Clough J and Myerson G: Variety and multiplicity in diatonic systems. JMT, 29, 249-270, 1985
[89] Clough J and Douthett J: Maximally even sets. JMT, 35, 93-173, 1991
[90] Clough J: Diatonic Interval Cycles and Hierarchical Structure. PNM, 32(1), 228-253, 1994
[91] Clynes M: Sentics. The Touch of Emotions. Anchor Doubleday, New York 1977
[92] Clynes M: Secrets of Life in Music. Analytica, Stockholm 1985
[93] Cohn R: Introduction to Neo-Riemannian Theory: A Survey and a Historical Perspective. JMT 42(2), 167-180, 1998
[94] Conen H: Formel-Komposition. Schott, Mainz et al. 1991
[95] Cooper K N D et al.: Handwritten Music-Manuscript Recognition. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[96] Couasnon B et al.: Using a Grammar For a Reliable Full Score Recognition System. In: ICMA (ed.): Proceedings of the ICMC 95, S. Francisco 1995
[97] Creutzfeldt O D: Cortex Cerebri. Springer, Berlin et al. 1983
[98] Czerny C: Pianoforte Schule. 1840
[99] Dahlhaus C: Zur Theorie des klassischen Kontrapunkts. Kirchenmusikalisches Jb 45, 1961
[100] Dahlhaus C: Über den Begriff der tonalen Funktion. In: Vogel M (ed.): Beiträge zur Musiktheorie des 19. Jahrhunderts. Bosse, Regensburg 1966
[101] Dahlhaus C and Eggebrecht H H: Was ist Musik? Heinrichshofen, Wilhelmshaven et al. 1985
[102] Dahlhaus C: Untersuchung über die Entstehung der harmonischen Tonalität. Bärenreiter, Kassel et al. 1967
[103] Dahlhaus C et al.: Neues Handbuch der Musikwissenschaft, Bd. 1-13. Athenaion and Laaber, Laaber 1980-1993
[104] Dahlhaus C and Mayer G: Musiksoziologische Reflexionen. In: Dahlhaus C and de la Motte-Haber H (eds.): Neues Handbuch der Musikwissenschaft, Bd. 10: Systematische Musikwissenschaft. Laaber, Laaber 1982
[105] Dahlhaus C: Ludwig van Beethoven und seine Zeit. Laaber, Laaber 1987
[106] d'Alembert J Le Rond: Einleitung zur 'Enzyklopädie'. Fischer, Frankfurt/Main 1989
[107] Dallos P: The active cochlea. J. Neurosci. Dec;12(12):4575-85, 1992
[108] Dannenberg R B: An on-line algorithm for real-time accompaniment. In: ICMA (ed.): Proceedings of the ICMC 84, S. Francisco 1984
[109] Dannenberg R B et al.: Automatic Ensemble Performance. In: ICMA (ed.): Proceedings of the ICMC 94, S. Francisco 1994
[110] Danuser H et al. (eds.): Neues Handbuch der Musikwissenschaft, Bd. 11: Interpretation. Laaber, Laaber 1992
[111] de Bruijn N G: Pólya's Theory of Counting. In: Beckenbach E F (ed.): Applied Combinatorial Mathematics, Ch.5. Wiley, New York 1964
[112] de Bruijn N G: On the number of partition patterns of a set. Nederl. Akad. Wetensch. Proc. Ser. A 82 = Indag. Math. 41, 1979
[113] Debussy C: Préludes, Livre I (1907-1910). Henle, München 1986
[114] Dechelle F et al.: The Ircam Real-Time Platform and Applications. In: ICMA (ed.): Proceedings of the ICMC 95, S. Francisco 1995
[115] Delalande F: La gestique de Gould. In: Guertin G (ed.): Glenn Gould pluriel. Corteau, Verdun 1988
[116] Degazio B: A Computer-Based Editor For Lerdahl and Jackendoff's Rhythmic Structures. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[117] de la Motte D: Harmonielehre. Bärenreiter/dtv, Kassel 1976
[118] de la Motte-Haber H and Emons H: Filmmusik. Eine systematische Beschreibung. München, Hanser 1980
[119] de la Motte-Haber H: Handbuch der Musikpsychologie. Laaber-Verlag, 2. Ed., Laaber 1996
[120] de la Motte-Haber H: Rationalität und Affekt. In: Götze H and Wille R (eds.): Musik und Mathematik. Springer, Berlin et al. 1985
[121] de la Motte-Haber H: Musikalische Hermeneutik und empirische Forschung. In: Dahlhaus C and de la Motte-Haber H (eds.): Neues Handbuch der Musikwissenschaft, Bd. 10: Systematische Musikwissenschaft. Laaber, Laaber 1982
[122] de la Motte-Haber H: Die Umwandlung der Interpretationsparameter in Struktureigenschaften. In: "Das Paradox musikalischer Interpretation", Symposion zum 80. Geburtstag von K. von Fischer, Univ. Zürich 1993
[123] Demazure M and Gabriel P: Groupes Algébriques. Masson & Cie./North-Holland, Paris/Amsterdam 1970
[124] Dennett D C: Quining qualia. In: Marcel A and Bisiach E (eds.): Consciousness in Contemporary Science. Oxford University Press, 1988
[125] Desain P and Honing H: The Quantization of Musical Time: A Connectionist Approach. Computer Music Journal 13 (3), 56-66, 1989
[126] Descartes R: Musicae Compendium. Herausgegeben und ins Deutsche übertragen als "Leitfaden der Musik" von J. Brockt, Wiss. Buchgesellschaft, Darmstadt 1978
[127] Dieudonné J: Foundations of Modern Analysis. Academic Press, New York et al. 1960
[128] Döhl F: Webern - Weberns Beitrag zur Stilwende der Neuen Musik. Katzbichler, München et al. 1976
[129] Dreiding A et al.: Classification of Mobile Molecules by Category Theory. In: Symmetries and Properties of Non-Rigid Molecules. Studies in Physical and Theoretical Chemistry, 23, 1983
[130] Dufourt H: Les difficultés d'une prise de conscience théorique. In: Le compositeur et l'ordinateur. 6-12, Ircam, Centre Georges Pompidou, Paris 1981
[131] Eberle G: "Absolute Harmonie" und "Ultrachromatik". In: Kolleritsch O (ed.): Alexander Skrjabin. Universal Edition, Graz 1980
[132] Eco U: Kunst und Schönheit im Mittelalter. Hanser, Wien 1991
[133] Eco U: Die Suche nach der vollkommenen Sprache. Beck, München 1994
[134] Eggebrecht H H: Interpretation. In: "Das Paradox musikalischer Interpretation", Symposion zum 80. Geburtstag von K. von Fischer, Univ. Zürich 1993
[135] Eggebrecht H H: Musik im Abendland. Piper, München and Zürich 1996
[136] Ehrenfels Chr von: Über Gestaltqualitäten. Vierteljahresschrift für wissenschaftliche Philosophie XIV, 1890
[137] Eitz C: Das mathematisch reine Tonsystem. Leipzig 1891
[138] Eimert H: Grundlagen der musikalischen Reihentechnik. Universal Edition, Wien 1964
[139] Eisenbud D: Commutative Algebra with a View Toward Algebraic Geometry. Springer, New York et al. 1996
[140] Eisenbud D and Harris J: The Geometry of Schemes. Springer, New York 2000
[141] Engstrom B: Stereocilia of sensory cells in normal and hearing impaired ears. Scand. Audiol. Suppl. 19, 1-34, 1983
[142] Essl K: Strukturgeneratoren. Beiträge zur elektronischen Musik 5, IEM, Graz 1996
[143] Euler L: Tentamen novae theoriae musicae (1739). In: Opera Omnia, Ser. III, Vol. 1 (Ed. Bernoulli E et al.). Teubner, Stuttgart 1926
[144] Euler L: Conjecture sur la raison de quelques dissonances générales reçues dans la musique (1764). In: Opera Omnia, Ser. III, Vol. 1 (Ed. Bernoulli E et al.). Teubner, Stuttgart 1926
[145] Euler L: De harmoniae veris principiis per speculum musicum representatis (1773). In: Opera Omnia, Ser. III, Vol. 1 (Ed. Bernoulli E et al.). Teubner, Stuttgart 1926
[146] Feldman J et al.: Force Dynamics of Tempo Change in Music. Music Perception, 10, 1992
[147] Ferretti R and Mazzola G: Algebraic Varieties of Musical Performances. Tatra Mt. Math. Publ. 23, 59-69, 2001
[148] Feulner J et al.: MELONET: Neural Networks that Learn Harmony-Based Melodic Variations. In: ICMA (ed.): Proceedings of the ICMC 94, S. Francisco 1994
[149] Fichtner R: Die verborgene Geometrie in Raffaels "Schule von Athen". Oldenburg 1984
[150] Fink E: Grundlagen der Quantenmechanik. Akademische Verlagsgesellschaft, Leipzig 1968
[151] Finscher L: Studien zur Geschichte des Streichquartetts. Bärenreiter, Kassel 1974
[152] Finsler P: Über die Grundlegung der Mengenlehre. Erster Teil. Die Mengen und ihre Axiome. Math. Z. 25, 683-713, 1926
[153] Finsler P: Aufsätze zur Mengenlehre. Unger G (ed.), Wiss. Buchgesellschaft, Darmstadt 1975
[154] Fleischer A: Eine Analyse theoretischer Konzepte der Harmonielehre mit Hilfe des Computers. Magisterarbeit, MWS, HU Berlin 1996
[155] Fleischer A, Mazzola G, Noll Th: Zur Konzeption der Software RUBATO für musikalische Analyse und Performance. Musiktheorie, Heft 4, 314-325, 2000
[156] Forster M: Technik modaler Komposition bei Olivier Messiaen. Hänssler, Neuhausen-Stuttgart 1976
[157] Forte A: A Theory of Set-Complexes for Music. JMT, 8(2), 136-183, 1964
[158] Forte A: Structure of Atonal Music: Practical Aspects of a Computer-Oriented Research Project. In: Musicology and the Computer. Musicology 1966-2000. A Practical Program. Three Symposia. American Musicological Society, NY 1970
[159] Forte A: Structure of Atonal Music. Yale University Press, New Haven 1973
[160] Forte A: La Set-complex theory: Élevons les enjeux! Analyse musicale, 4e trimestre, 80-86, 1989
[161] Frank H: RUBATO® Broadcast. ORF2: Modern Times, Jan. 10, 1997
[162] Freedman D Z and Nieuwenhuizen P van: Supergravitation und die Einheit der Naturgesetze. In: Dosch H G (ed.): Teilchen, Felder und Symmetrien. Spektrum der Wissenschaft, Heidelberg 1984
[163] Friberg A: Generative Rules for Music Performance: A Formal Description of a Rule System. Computer Music Journal, Vol. 15, No.2, 1991
[164] Friberg A et al.: Performance Rules for Computer-Controlled Contemporary Keyboard Music. Computer Music Journal, Vol. 15, No.2, 1991
[165] Friberg A et al.: Recent Musical Performance Research at KTH. In: Sundberg J (ed.): Generative Grammars for Music Performance. KTH, Stockholm 1994
[166] Friberg A: A Quantitative Rule System for Musical Performance. KTH PhD-Thesis, Stockholm 1995
[167] Fripertinger H: Enumeration in Musical Theory. Beiträge zur elektronischen Musik 1, Hochschule für Musik und Darstellende Kunst, Graz 1991
[168] Fripertinger H: Die Abzähltheorie von Pólya. Diplomarbeit, Univ. Graz 1991
[169] Fripertinger H: Endliche Gruppenaktionen in Funktionenmengen - Das Lemma von Burnside - Repräsentantenkonstruktionen - Anwendungen in der Musiktheorie. Doctoral Thesis, Univ. Graz 1993
[170] Fripertinger H: Untersuchungen über die Anzahl verschiedener Intervalle, Akkorde, Tonreihen und anderer musikalischer Objekte in n-Ton Musik. Magisterarbeit, Hochschule für Musik und Darstellende Kunst, Graz 1993
[171] Fripertinger H: Anwendungen der Kombinatorik unter Gruppenaktionen zur Bestimmung der Anzahl "wesentlich" verschiedener Intervalle, Chorde, Tonreihen usw. Referat an der Univ. Innsbruck, Math. Institut d. Karl-Franzens-Univ., Graz 1996
[172] Fripertinger H: Enumeration of Mosaics. Discrete Mathematics, 199, 49-60, 1999
[173] Fripertinger H: Enumeration of Non-isomorphic Canons. Tatra Mt. Math. Publ. 23, 47-57, 2001
[174] Fux J J: Gradus ad Parnassum (1725). Dt. und kommentiert von L. Mitzler, Leipzig 1742
[175] Gabriel P: Personal Communication. Zürich 1979
[176] Gabrielsson A: Music Performance. In: Deutsch D (ed.): The Psychology of Music (2nd ed.). Academic Press, New York
[177] Gabrielsson A: Expressive Intention and Performance. In: Steinberg R (ed.): Music and the Mind Machine. Springer, Berlin et al. 1995
[178] Gamer C: Some combinatorial resources of equal-tempered systems. JMT, 11, 32-59, 1967
[179] Gamma E, Helm R, Johnson R, Vlissides J: Design Patterns, Elements of Reusable Object-Oriented Software. Addison-Wesley, Reading, Mass. et al. 1994
[180] Gerwin Th: IDEAMA, Zentrum für Kunst und Medientechnologie, Karlsruhe 1996
[181] Geweke J: A comparison of tests of independence of two covariance-stationary time series. J. Am. Statist. Assoc., 76, 363-373, 1981
[182] Giannitrapani D: The Electrophysiology of Intellectual Functions. Karger, Basel 1985
[183] Gilson E: Introduction aux arts du beau. Vrin, Paris 1963
[184] Godement R: Topologie algébrique et théorie des faisceaux. Hermann, Paris 1964
[185] Goethe J W von: Brief an Zelter. 9. Nov. 1829
[186] Goldblatt R: Topoi. North-Holland, Amsterdam et al. 1984
[187] Goldstein J L: An Optimum Processor Theory for the Central Formation of the Pitch of Complex Tones. J. Acoust. Soc. Am. 54, 1496, 1973
[188] Gorenstein D: Classifying the finite simple groups. Bull. A.M.S. 14, 1-98, 1986
[189] Gottschewski H: Tempohierarchien. Musiktheorie, Heft 2, 1993
[190] Götze H and Wille R (eds.): Musik und Mathematik. Springer, Berlin et al. 1985
[191] Gould G: The Glenn Gould Reader. Alfred A. Knopf, New York 1984
[192] Goupillaud P, Grossmann A, Morlet J: Cycle-octave and related transforms in seismic signal analysis. Geoexploration, 23, 85-102, 1984-1985
[193] Grabusow N: Vielfalt akustischer Grundlagen der Tonarten und Zusammenklänge - Theorie der Polybasiertheit. Musiksektion des Staatsverlags, Moskau 1929
[194] Graeser W: Bachs "Kunst der Fuge". In: Bach-Jahrbuch 1924
[195] Greimas A J: Les actants, les acteurs et les figures. In: Chabrol C (ed.): Sémiotique narrative et textuelle. Larousse, Paris 1974
[196] Greub W: Linear Algebra. Springer, Berlin et al. 1967
[197] Gross D: A Set of Computer Programs to Aid in Music Analysis. Ph. Diss. Indiana Univ. 1975
[198] Grothendieck A and Dieudonné J: Éléments de Géométrie Algébrique I. Springer, Berlin et al. 1971
[199] Grothendieck A and Dieudonné J: Éléments de Géométrie Algébrique I-IV. Publ. Math IHES no. 4, 8, 11, 17, 20, 24, 28, 32, Bures-sur-Yvette 1960-1967
[200] Grothendieck A: Correspondence with G. Mazzola. April 1, 1990
[201] Guevara R C L et al.: A Modal Distribution Approach to Piano Analysis and Synthesis. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[202] Gurlitt et al. (eds.): Riemann Musiklexikon/Sachteil. Schott, Mainz 1967
[203] Habermann R: Elementary Applied PDEs. Prentice Hall 1983
[204] Halsey D and Hewitt E: Eine gruppentheoretische Methode in der Musiktheorie. Jahresber. d. Dt. Math.-Vereinigung 80, 1978
[205] Handschin J: Der Toncharakter. Zürich 1948
[206] Hanslick E: Vom Musikalisch-Schönen. Breitkopf und Härtel (1854), Wiesbaden 1980
[207] Hardt M: Zur Zahlenpoetik Dantes. In: Baum R and Hirdt W (eds.): Dante Alighieri 1985. Stauffenburg, Tübingen 1985
[208] Harris C and Brinkman A R: A unified set of software tools for computer-assisted set-theoretic and serial analysis of contemporary music. Proc. ICMC 1986, ICA, San Francisco 1986
[209] Hartshorne R: Algebraic Geometry. Springer, New York et al. 1977
[210] Hashimoto S and Sawada H: Musical Performance Control Using Gesture: Towards Kansei Technology for Art. In: Kopiez R and Auhagen W (eds.): Controlling Creative Processes in Music. Peter Lang, Frankfurt am Main et al. 1998
[211] Haugh L D: Checking for independence of two covariance stationary time series: a univariate residual cross correlation approach. J. Am. Statist. Assoc., 71, 378-385, 1976
[212] Haus G et al.: Stazione di Lavoro Musicale Intelligente. In: Haus G and Pighi I (eds.): X Colloquio di Informatica Musicale. AIMI, LIM-Dsi, Milano 1993
[213] Hebb D O: Essay on mind. Hillsdale, New Jersey, Lawrence Erlbaum Associates, 1980
[214] Hegel G W F: Wissenschaft der Logik I (1812). Felix Meiner, Hamburg 1963
[215] Heiberg J L and Menge H (eds.): Euclidis opera omnia. 8 vol. & supplement, in Greek. Teubner, Leipzig 1883-1916
[216] Heijink H, Desain P, Honing H, Windsor L: Make me a match: An evaluation of different approaches to score-performance matching. Computer Music Journal, 24(1), 43-56, 2000
[217] Helmholtz H von: Die Lehre von den Tonempfindungen als physiologische Grundlage der Musik (1863). Nachdr. Darmstadt 1968
[218] Henck H: Karlheinz Stockhausens Klavierstück IX. Verlag für systematische Musikwissenschaft, Bonn-Bad Godesberg 1978
[219] Hentoff N: Liner notes to Coltrane's last album "Expression". Impulse AS-9120, New York 1967
[220] Herbort H J: Keine Ausweispflicht für cis. Die Zeit Nr. 43, 21. Oktober 1988
[221] Hesse H: Das Glasperlenspiel (1943). Suhrkamp, Frankfurt/M. 1973
[222] Heussenstamm G: Norton Manual of Music Notation. Norton & Comp., New York 1987
[223] Hichert J: Verallgemeinerung des Kontrapunkttheorems für die Hierarchie aller starken Dichotomien in temperierter Stimmung. Diplomarbeit, TU Ilmenau 1993
[224] Hindemith P: Unterweisung im Tonsatz. Schott, Mainz 1940
[225] Hiller L and Ruiz P: Synthesizing sounds by solving the wave equation for vibration objects. J. of the Audio Engineering Soc. 19: 463-470, 542-551, 1971
[226] Hjelmslev L: La Stratification du Langage. Minuit, Paris 1954
[227] Hjelmslev L: Prolégomènes à une théorie du langage. Minuit, Paris 1968-71
[228] Hjelmslev L: Nouveaux essais. PUF, Paris 1985
[229] Hofstadter D: Gödel, Escher, Bach. Basic Books, New York 1979
[230] Hong Y: Testing for independence between two covariance stationary time series. Biometrika, 83, No.3, 615-626, 1996
[231] Honing H: Expresso, a strong and small editor for expression. In: ICMA (ed.): Proceedings of the ICMC 92, S. Francisco 1992
[232] Hooft G 't: Symmetrien in der Physik der Elementarteilchen. In: Dosch H G (ed.): Teilchen, Felder und Symmetrien. Spektrum der Wissenschaft, Heidelberg 1984
[233] Hörnl D et al.: Learning Musical Structure and Style by Recognition, Prediction and Evolution. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[234] Horry Y: A Graphical User Interface for MIDI Signal Generation and Sound Synthesis. In: ICMA (ed.): Proceedings of the ICMC 94, S. Francisco 1994
[235] Howe H: Some combinatorial properties of Pitch-Structures. PNM, 4(1), 45-61, 1965
[236] Hu S-T: Mathematical Theory of Switching Circuits and Automata. University of California Press, Berkeley and Los Angeles 1968
[237] Hudak P et al.: Haskore Music Notation - An Algebra of Music. J. Functional Programming Vol. 6 (3), 1996
[238] Hudspeth A J and Corey D P: Sensitivity, Polarity, and Conductance Change in the Response of Vertebrate Hair Cells to Controlled Mechanical Stimuli. Proc. Nat. Acad. Sci. Am. 74(6), 2407-2411, 1977
[239] Humphreys J E: Introduction to Lie Algebras and Representation Theory. Springer, New York et al. 1972
[240] Hung R et al.: The Analysis and Resynthesis of Sustained Musical Signals in the Time Domain. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[241] Hunziker E and Mazzola G: Ansichten eines Hirns. Birkhäuser, Basel 1990
[242] Husmann H: Einführung in die Musikwissenschaft. Heinrichshofen, Wilhelmshaven 1975
[243] Jackendoff R and Lerdahl F: A Generative Theory of Tonal Music. MIT Press, Cambridge MA, 1983
[244] Jakobson R and Halle M: Fundamentals of Language. Mouton, The Hague 1957
[245] Jakobson R: Linguistics and Poetics. In: Sebeok T A (ed.): Style in Language. Wiley, New York 1960
[246] Jakobson R: Language in relation to other communication systems. In: Linguaggi nella società e nella tecnica. Edizioni di Comunità, Milano 1960
[247] Jakobson R: Hölderlin, Klee, Brecht. Suhrkamp, Frankfurt/Main 1976
[248] Jakobson R and Pomorska K: Poesie und Grammatik. Dialoge. Suhrkamp, Frankfurt/Main 1982
[249] Jakobson R: Semiotik - Ausgewählte Texte 1919-1982. Holenstein E (ed.), Suhrkamp, Frankfurt/Main 1988
[250] Jauss H R: Rückschau auf die Rezeptionstheorie - Ad usum Musicae Scientiae. In: Danuser H and Krummacher F (Hsg.): Rezeptionsästhetik und Rezeptionsgeschichte in der Musikwissenschaft. Laaber, Laaber 1991
[251] Jeppesen K: Kontrapunkt. Breitkopf und Härtel, Wiesbaden 1952
[252] John F: Partial Differential Equations. Springer, Heidelberg et al. 1978
[253] Johnson T: See his web page at http://www.tom.johnson.org
[254] Julia G: Sur l'itération des fonctions rationnelles. J. de Math. Pure et Appl. 8, 1918
[255] Kagel M: Translation - Rotation. Die Reihe Bd.7, Universal Edition, Wien 1960
[256] Kahle W: Taschenatlas der Anatomie, Bd.3: Nervensystem und Sinnesorgane. Thieme/dtv, Stuttgart 1979
[257] Kaiser J: Beethovens 32 Klaviersonaten und ihre Interpreten. Fischer, Frankfurt/Main 1979
[258] Kant I: Kritik der reinen Vernunft. Meiner, Hamburg 1956
[259] Karg-Elert S: Polaristische Klang- und Tonalitätslehre (1931). Out of print, cf.: Schenk P: Karg-Elerts polaristische Harmonielehre. In: Vogel M (ed.): Beiträge zur Musiktheorie des 19. Jahrhunderts. Bosse, Regensburg 1966
[260] Katayose H et al.: Demonstration of Gesture Sensors for the Shakuhachi. In: ICMA (ed.): Proceedings of the ICMC 94, S. Francisco 1994
[261] Kelley J L: General Topology. Van Nostrand, Princeton et al. 1955
[262] Kiczales G, Rivieres J, Bobrow D G: The Art of the Metaobject Protocol. The MIT Press, Boston 1991
[263] Kirsch E: Wesen und Aufbau der Lehre von den harmonischen Funktionen. Leipzig 1928
[264] Klemm M: Symmetrien von Ornamenten und Kristallen. Springer, Berlin et al. 1982
[265] Knapp J: Franz Liszt. Berlin 1909
[266] Köhler E: Brief an Guerino Mazzola. Hamburg 1988
[267] Koenig Th: Robert Schumanns Kinderszenen op.15. In: Metzger H-K und Riehn R (Hrg.): Robert Schumann II. edition text+kritik, München 1982
[268] Kollmann A: An Essay on Musical Harmony. London 1796
[269] Komparu K: The Noh Theatre. Weatherhill/Tankosha, New York et al. 1983
[270] Kopiez R and Langner J: Entwurf einer neuen Methode der Performanceanalyse auf Grundlage einer Theorie oszillierender Systeme. In: Behne K-E and de la Motte H (eds.): J.buch der D. Ges. für Musikpsychologie. 12 (1995), Wilhelmshaven 1996
[271] Langner J, Kopiez R, Feiten B: Perception and Representation of Multiple Tempo Hierarchies in Musical Performance and Composition: Perspectives from a New Theoretical Approach. In: Kopiez R and Auhagen W (eds.): Controlling Creative Processes in Music. Peter Lang, Frankfurt am Main et al. 1998
[272] Kopiez R: Aspekte der Performanceforschung. In: de la Motte H: Handbuch der Musikpsychologie. Laaber, Laaber 1996
[273] Kopiez R: Mensch - Musik - Maschine. Musica, 50 (1), 1996
[274] Kopiez R: "The most wanted song/The most unwanted song" - Klangfarbe als wahrnehmungsästhetische Kategorie. Musicology Conference, Halle 1998
[275] Kopiez R, Langner J, Stoffel Ch: Realtime analysis of dynamic shaping. Talk at the 6th International Conference on Music Perception and Cognition (ICMPC), 5.-10.8.2000, Keele, England
[276] Körner T W: Fourier Analysis. Cambridge University Press, Cambridge 1988
[277] Kostelanetz R: John Cage. Praeger, New York 1968
[278] Kouzes R T et al.: Collaboratories: Doing Science on the Internet. Computer, August 1996
[279] Kronland-Martinet R: The Wavelet Transform for Analysis, Synthesis, and Processing of Speech and Music Sounds. Computer Music Journal, 12 (4), 1988
[280] Kronman U and Sundberg J: Is the Musical Ritard an Allusion to Physical Motion? In: Gabrielsson A (ed.): Action and Perception in Rhythm and Meter. Publ. of the Royal Swedish Acad. of Sci. 55, Stockholm 1987
[281] Kuratowski K and Mostowski A: Set Theory. North Holland, Amsterdam 1968
[282] Kuhner H: Virtual Table. CG TOPICS 3/97
[283] Lang S: Introduction to Differentiable Manifolds. Interscience, New York et al. 1962
[284] Lang S: Elliptic Functions. Addison-Wesley, Reading, Mass. 1973
[285] Lang S: SL2(R). Addison-Wesley, Reading, Mass. 1975
[286] Langner G: Evidence for Neuronal Periodicity Detection in the Auditory System of Guinea Fowl: Implications for Pitch Analysis in the Time Domain. Exp. Brain Res. 52, 333-355, 1983
[287] Langner G: Periodicity coding in the auditory system. Hear. Res. 60(2), 115-142, 1992
[288] Langer S: Feeling and Form. Routledge and Kegan Paul, London 1953
[289] Langner J, Kopiez R, Stoffel Ch, Wilz M: Realtime analysis of dynamic shaping. In: Woods C et al. (eds.): Proceedings of the Sixth International Conference on Music Perception and Cognition, Keele, UK, 2000
[290] Lawvere F W: An elementary theory of the category of sets. Proc. Natl. Acad. Sci. 52, 1506-1511, 1964
[291] Leach J L: Towards a Universal Algorithmic System for Composition of Music and Audio-Visual Works. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[292] Leman M: Schema Theory. Springer, Berlin et al. 1996
[293] Lerdahl F: Timbral Hierarchies. Contemporary Music Review, vol.2, no.1, 1987
[294] Lévi-Strauss C: Le cru et le cuit: Mythologies 1. Plon, Paris 1964
[295] Lewin D: Re: Intervallic Relations between Two Collections of Notes. JMT, 3(2), 298-301, 1959
[296] Lewin D: The Intervallic Content of a Collection of Notes, Intervallic Relations between a Collection of Notes and Its Complement: An Application to Schoenberg's Hexachordal Pieces. JMT, 4(1), 98-101, 1960
[297] Lewin D: Forte's Interval Vector, My Interval Function, and Regener's Common-Note Function. JMT, 21(2), 194-237, 1977
[298] Lewin D: A Formal Theory of Generalized Tonal Functions. JMT 26(1), 32-60, 1982
[299] Lewin D: On Formal Intervals between Time-Spans. Music Perception, 1(4), 414-423, 1984
[300] Lewin D: Generalized Musical Intervals and Transformations. Yale University Press, New Haven and London 1987
[301] Lewin D: Musical Form and Transformation: 4 Analytic Essays. Yale University Press, New Haven and London 1993
[302] Lewis C I: Mind and the World Order. Dover, New York 1956
[303] Leyton M: Symmetry, Causality, Mind. MIT Press, Cambridge/MA and London 1992
[304] Leyton M: A Generative Theory of Shape. Springer, Berlin et al. 2001
[305] Lichtenhahn E: Romantik: Aussen- und Innenseiten der Musik. In: "Das Paradox musikalischer Interpretation", Symposion zum 80. Geburtstag von K. von Fischer, Univ. Zürich 1993
[306] Lippe E and ter Hofstede A H M: A Category Theory Approach to Conceptual Data Modeling. Informatique Théorique et Applications, Vol 30, 1, 31-79, 1996
[307] Loomis L H and Sternberg S: Advanced Calculus. Addison-Wesley, Reading, Mass. 1965
[308] Louis A K et al.: Wavelets. Teubner, Stuttgart 1994
[309] Ludi W: Fax to G Mazzola. Malans 01/23/1991
[310] Ludwig H: Marin Mersenne und seine Musiklehre. Olms, Hildesheim 1971
[311] Lussy M: Traité de l'expression musicale. Paris 1874
[312] Lutz R: Le poète dans son oeuvre. Seminar Jung Univ. Zürich, 1980
[313] Mac Lane S: Categories for the Working Mathematician. Springer, New York et al. 1971
[314] Mac Lane S and Moerdijk I: Sheaves in Geometry and Logic. Springer, New York et al. 1994
[315] MacLean P D: The triune brain, emotion, and scientific bias. In: Schmitt F O (ed.): The Neurosciences: Second Study Program, 336-348, Rockefeller Univ. Press, New York 1970
[316] Maeder R: Programming Mathematica. Addison-Wesley, Reading, Mass. 1991
[317] Marek C: Lehre des Klavierspiels. Atlantis, Zürich 1977
[318] Martinet A: Éléments de Linguistique Générale. Colin, Paris 1960
[319] Martino D: The Source Set and Its Aggregate Formations. JMT, 5(2), 224-273, 1961
[320] Marx A B: Die Lehre von der musikalischen Komposition. 4 Bde., Leipzig 1837-47
[321] Mason R M: Enumeration of Synthetic Musical Scales (...). J. of Music Theory 14, 1970
[322] Mathiesen Th J: Transmitting Text and Graphics in Online Database: The Thesaurus Musicarum Latinarum Model. In: Hewlett W B and Selfridge-Field E (eds.): Computing in Musicology Vol.9, CCARH, Menlo Park 1993-94
[323] Mattheson J: Der vollkommene Kapellmeister. Hamburg 1739
[324] Maxwell H J: An Expert System for Harmonizing Analysis of Tonal Music. In: Balaban M et al. (eds.): Understanding Music with AI: Perspectives on Music Cognition. MIT Press, Cambridge MA 1992
[325] Mazzola G: Akroasis - Beethoven's Hammerklavier-Sonate in Drehung (für Cecil Taylor). LP, recorded August 30, 1979, WERGO/Schott, Mainz 1980
[326] Mazzola G: Musique et théorie des groupes. Conference at Institut de Mathématique, U Genève, January 31, 1980
[327] Mazzola G: Die gruppentheoretische Methode in der Musik. Lecture Notes, Notices by H. Gross, SS 1981, Mathematisches Institut der Universität, Zürich 1981
[328] Mazzola G: Gruppen und Kategorien in der Musik. Heldermann, Berlin 1985
[329] Mazzola G et al.: Rasterbild - Bildraster. Springer, Berlin et al. 1986
[330] Mazzola G: Die Rolle des Symmetriedenkens für die Entwicklungsgeschichte der europäischen Musik. In: Symmetrie, Katalogband Vol.1 zur Symmetrieausstellung, Mathildenhöhe, Darmstadt 1986
[331] Mazzola G: Obertöne oder Symmetrie: Was ist Harmonie? In: Herf F R (ed.): Mikrotöne. Helbling, Innsbruck 1986
[332] Mazzola G: Mathematische Betrachtungen in der Musik I, II. Lecture Notes, Univ. Zürich 1986/87
[333] Mazzola G: Der Kontrapunkt und die K/D-Dichotomie. Manuscript, University of Zürich 1987
[334] Mazzola G: Die Wahl der Zahl - eine systematische Betrachtung zum Streichquartett. In: Dissonanz 17, 1988
[335] Mazzola G and Hofmann G R: Der Music Designer MD-Z71 - Hardware und Software für die Mathematische Musiktheorie. In: Petsche H (ed.): Musik - Gehirn - Spiel, Beiträge zum 4. Herbert-von-Karajan-Symposion. Birkhäuser, Basel 1989
[336] Mazzola G et al.: A Symmetry-Oriented Mathematical Model of Classical Counterpoint and Related Neurophysiological Investigations by Depth-EEG. In: Hargittai I (ed.): Symmetry II, CAMWA, Pergamon, New York 1989
[337] Mazzola G et al.: Hirnelektrische Vorgänge im limbischen System bei konsonanten und dissonanten Klängen. In: Petsche H (ed.): Musik - Gehirn - Spiel, Beiträge zum 4. Herbert-von-Karajan-Symposion. Birkhäuser, Basel 1989
[338] Mazzola G: presto Software Manual. SToA music, Zürich 1989-1994
[339] Mazzola G: Synthesis. SToA 1001.90, Zürich 1990
[340] Mazzola G: Geometrie der Töne. Birkhäuser, Basel et al. 1990
[341] Mazzola G and Muzzulini D: Tempo- und Stimmungsfelder: Perspektiven künftiger Musikcomputer. In: Hesse H P (ed.): Mikrotöne III. Edition Helbling, Innsbruck 1990
[342] Mazzola G and Muzzulini D: Deduktion des Quintparallelenverbots aus der Konsonanz-Dissonanz-Dichotomie. Accepted for publication in: Musiktheorie, Laaber 1990
[343] Mazzola G: Mathematische Musiktheorie: Status quo 1990. Jber. d. Dt. Math.-Verein. 93, 6-29, 1991
[344] Mazzola G: Mathematical Music Theory - An Informal Survey. Edizioni Cerfim, Locarno 1993
[345] Mazzola G: RUBATO at SMAC KTH, Stockholm 1993
[346] Mazzola G and Zahorka O: Tempo Curves Revisited: Hierarchies of Performance Fields. Computer Music Journal 18, No.1, 1994
[347] Mazzola G and Zahorka O: The RUBATO Performance Workstation on NeXTSTEP. In: ICMA (ed.): Proceedings of the ICMC 94, S. Francisco 1994
[348] Mazzola G and Zahorka O: Geometry and Logic of Musical Performance I, II, III. SNSF Research Reports (469pp.), Universität Zürich, Zürich 1993-1995
[349] Mazzola G et al.: Analysis and Performance of a Dream. In: Sundberg J (ed.): Proceedings of the 1995 Symposium on Musical Performance. KTH, Stockholm 1995
[350] Mazzola G et al.: The RUBATO Platform. In: Hewlett W B and Selfridge-Field E (eds.): Computing in Musicology 10. CCARH, Menlo Park 1995
[351] Mazzola G and Zahorka O: The PrediBase Data Base System of RUBATO on NEXTSTEP. In: Selfridge-Field E (ed.): Handbook of Musical Codes. CCARH, Menlo Park 1995
[352] Mazzola G: Inverse Performance Theory. In: ICMA (ed.): Proceedings of the ICMC 95, S. Francisco 1995
[353] Mazzola G et al.: Neuronal Response in Limbic and Neocortical Structures During Perception of Consonances and Dissonances. In: Steinberg R (ed.): Music and the Mind Machine. Springer, Berlin et al. 1995
[354] Mazzola G and Zahorka O: RUBATO und der Einsatz von Hypermedien in der Musikforschung. Zeitschrift des Deutschen Bibliotheksinstituts, Berlin Jan. 1996
[355] Mazzola G and Zahorka O: Topologien gestalteter Motive in Kompositionen. To appear in: Auhagen W et al. (eds.): Festschrift zum 65. Geburtstag J.P. Fricke. Preliminary online version: http://www.uni-koeln.de/phil-fak/ muwi/ publ/ fdricke / festschrift.html
[356] Mazzola G: Towards Big Science. Geometry and Logic of Music and its Technology. In: Enders B and Knolle N (eds.): Symposionsband Klangart '95, Rasch, Osnabrück 1998
[357] Mazzola G et al.: The RUBATO Homepage. http://www.rubato.org, Univ. Zürich, since 1996
[358] Mazzola G: Objective C and Category Theory. Seminar Notes, Multimedia Lab, CS Department, U Zürich, Zürich 1996
[359] Mazzola G et al.: Orbit. Music & Arts CD-1015, Berkeley 1997
[360] Mazzola G and Beran J: Rational Composition of Performance. In: Kopiez R and Auhagen W (eds.): Proceedings of the Conference "Controlling Creative Processes in Music". Lang, Frankfurt and New York 1998
[361] Mazzola G: Semiotic Aspects of Musicology: Semiotics of Music. In: Posner R et al. (eds.): A Handbook on the Sign-Theoretic Foundations of Nature and Culture. Walter de Gruyter, Berlin and New York 1998
[362] Mazzola G: Music@EncycloSpace. In: Enders B and Stange-Elbe J (eds.): Musik im virtuellen Raum (Proceedings of the klangart congress '98). Rasch, Osnabrück 2000
[363] Mazzola G: Humanities@EncycloSpace. FER-Studie Nr.XX. Schweizerischer Wissenschaftsrat, Bern 1998, http://www.swr.ch
[364] Mazzola G: Die schöne Gefangene - Metaphorik und Komplexität in der Musikwissenschaft. In: Kopiez R et al. (eds.): Musikwissenschaft zwischen Kunst, Aesthetik und Experiment. Königshausen & Neumann, Würzburg 1998
[365] Mazzola G: The Topos Geometry of Musical Logic. Appears in the Proceedings of the Fourth Diderot Symposium of the European Math. Soc., Springer, Heidelberg et al. 2002
[366] Mazzola G: Degenerative Theory of Tonal Music. To appear in: Proceedings of the klangart congress '99. Universität Osnabrück 2002
[367] Mazzola G: Classifying Algebraic Schemes for Musical Manifolds. Tatra Mt. Math. Publ. 23, 71-90, 2001
[368] Mazzola G: L'Essence du Bleu (sonate pour piano). Acanthus, Rüttenen 2002
[369] Mesnage M: La Set-Complex Theory: de quels enjeux s'agit-il? Analyse musicale, 4e trimestre, 87-90, 1989
[370] Messiaen O: Technique de mon langage musical. Leduc, Paris 1944
[371] Meyer J: Akustik und musikalische Aufführungspraxis. Verlag Das Musikinstrument, Frankfurt/Main 1980
[372] Meyer-Eppler W: Grundlagen und Anwendungen der Informationstheorie. Springer, Berlin 1959
[373] Meyer-Eppler W: Zur Systematik der elektrischen Klangtransformationen. In: Darmstädter Beiträge III, Schott, Mainz 1960
[374] Michels U: dtv-Atlas zur Musik I, II. dtv/Bärenreiter, München/Kassel 1977, 1985
[375] Miklaszewski K: A Case Study of a Pianist Preparing a Musical Performance. Psychology of Music, 17, 95-109, 1989
[376] Misch C and Wille R: Stimmungslogiken auf MUTABOR: Eine Programmiersprache. In: Herf F R (ed.): Mikrotöne II. Edition Helbling, Innsbruck 1988
[377] Molino J: Fait Musical et Sémiologie de la Musique. Musique en Jeu 17, 1975
[378] Montiel Hernandez M: El Denotador: Su Estructura, Construcción y Papel en la Teoría Matemática de la Música. UNAM, Mexico City 1999
[379] Moog R: MIDI, Musical Instrument Digital Interface. Audio Eng. Soc. 34, Nr.5, 1986
[380] Morris R D: Composition with Pitch-Classes. Yale University Press, New Haven et al. 1987
[381] Morris R D: Compositional Spaces and Other Territories. PNM 33, 328-358, 1995
[382] Morris R D: K, Kh, and Beyond. In: Baker J et al. this bibliography, 275-306
[383] Mozart W A: Die Zauberflöte. (Klavierauszug) Ed. Peters o.J.
[384] Müller W: Darstellungstheorie von endlichen Gruppen. Teubner, Stuttgart 1980
[385] Mumford D: Lectures on Curves on an Algebraic Surface. Princeton University Press, Princeton 1966
[386] Mumford D and Suominen K: Introduction to the theory of moduli. In: Oort F (ed.): Algebraic Geometry Oslo 1970. Wolters-Noordhoff, Groningen 1972
[387] Murenzi R: Wavelets. Combes J M, Grossmann A, Tchamitchian P (eds.), Springer, Berlin et al. 1988
[388] MusicKit Online-Documentation. Version 4.0, 1994
[389] Muzzulini D: Konsonanz und Dissonanz in Musiktheorie und Psychoakustik. Lizenziatsarbeit, MWS Univ. Zürich 1990
[390] Muzzulini D: Musical Modulation by Symmetries. J. for Music Theory 1995
[391] Muzzulini D: Tempo Modifications and Spline Functions. NF-Report 1993, Univ. Zürich 1993
[392] Narayan S S, Temchin A N, Recio A, Ruggero M A: Frequency tuning of basilar membrane and auditory nerve fibers in the same cochleae. Science 282(5395), 1882-1884, 1998
[393] Nattiez J-J: Fondements d'une Sémiologie de la Musique. Edition 10/18, Paris 1975
[394] Neumaier W: Was ist ein Tonsystem? Lang, Frankfurt/Main et al. 1986
[395] Neuwirth G: Josquin Desprez, "Erzählen von Zahlen". In: Musik-Konzepte 26/27, edition text+kritik, München 1982
[396] NeXTSTEP Online-Documentation; Version 3.3. NeXT Inc., Redwood City 1995
[397] Nieberle R: IRCAM Signal Processing Workstation. Keyboards, Dec. 1992
[398] Noether E: Hyperkomplexe Grössen und Darstellungstheorie. Math. Zeitschr., Vol. XXX, 1929
[399] Noll J: Musik-Programmierung. Addison-Wesley, Bonn 1994
[400] Noll Th: Morphologische Grundlagen der abendländischen Harmonik. Doctoral Thesis, TU Berlin 1995
[401] Noll Th: Fractal Depth Structure of Tonal Harmony. In: ICMA (ed.): Proceedings of the ICMC 95, S. Francisco 1995
[402] Noll Th: http://www.cs.tu-berlin.de/noll/ChordDictionary.sea.hqx. TU Berlin 1996
[403] Noll Th: Harmonische Morpheme. Musikometrika 8, 7-32, 1997
[404] Noll Th: The Consonance/Dissonance-Dichotomy Considered from a Morphological Point of View. In: Zannos I (ed.): Music and Signs. ASCO Publ., Bratislava 1999
[405] Nunn D et al.: Acoustic Quanta. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[406] Oettingen A von: Das duale Harmoniesystem. Leipzig 1913
[407] Okopenko A: Lexikon-Roman. Residenz Verlag, Salzburg 1970
[408] Opcode: MAX. http://www.opcode.com/products/max/. Opcode Systems Inc. 1997
[409] Osgood C E et al.: The Nature and Measurement of Meaning. Psychological Bulletin 49, 1952
[410] Parncutt R: The Perception of Pulse in Musical Rhythm. In: Gabrielsson A (ed.): Action and Perception in Rhythm and Music. Royal Swedish Academy of Music, No.55
[411] Parncutt R: Recording Piano Fingering in Live Performance. In: Enders B, Knolle N (eds.): KlangArt-Kongress 1995, Rasch, Osnabrück 1998
[412] Parncutt R et al.: Interdependence of Right and Left Hands in Sight-read, Written, and Rehearsed Fingerings. Proc. Euro. Soc. Cog. Sci. Music, Uppsala 1997
[413] Parncutt R: Modeling Piano Performance: Physics and Cognition of a Virtual Pianist. In: ICMA (ed.): Proceedings of the ICMC 97, S. Francisco 1997
[414] Perle G: Serial Composition and Atonality: An Introduction to the Music of Schoenberg, Berg and Webern. 5th ed., revised, University of California Press, Berkeley 1981
[415] Petsche H et al.: EEG in Music Psychological Studies. In: Steinberg R (ed.): Music and the Mind Machine. Springer, Berlin et al. 1995
[416] Petsche H: Private correspondence. Vienna, March 2001
[417] Platon: Phaidron.
[418] Plomp R and Levelt W: Tonal Consonance and Critical Bandwidth. J. Acoust. Soc. Am. 38, 548, 1965
[419] Pope S T: Music Notations and the Representation of Musical Structure and Knowledge. Perspectives of New Music, Spring-Summer 1986
[420] Popper K R: Conjectures and Refutations. Routledge & Kegan Paul, London 1963
[421] Posner R: Strukturalismus in der Gedichtinterpretation. In: Blumensath H (ed.): Strukturalismus in der Literaturwissenschaft. Kiepenheuer & Witsch, Köln 1972
[422] Prame E: Measurements of the Vibrato Rate of 10 Singers. In: Friberg A et al. (eds.): Proceedings of the 1993 Stockholm Music Acoustic Conference. KTH, Stockholm 1994
[423] Priestley M B: Spectral Analysis of Time Series. Academic Press, London 1981
[424] Promies W: Symmetrie in der Literatur. In: Symmetrie, Katalogband Vol.1 zur Symmetrieausstellung. Mathildenhöhe, Darmstadt 1986
[425] Promies W: Stolbergs Gedicht im Poesiegarten. In: Symmetrie, Katalogband Vol.3 zur Symmetrieausstellung. Mathildenhöhe, Darmstadt 1986
[426] Puckette M and Lippe C: Score Following in Practice. In: ICMA (ed.): Proceedings of the ICMC 92, S. Francisco 1992
[427] Pulkki V et al.: DSP Approach to Multichannel Audio Mixing. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[428] Quinn I: Fuzzy Extensions to the Theory of Contour. Music Theory Spectrum, Vol. 19/2, 1997
[429] Radl H: Versuch über die Modulationstheorie Mazzolas in reiner Stimmung. Diploma Thesis, U Augsburg, Augsburg 1998
[430] Rahn J: Basic Atonal Theory. Longman, New York 1980
[431] Rahn J: Review of D. Lewin's "Generalized Musical Intervals and Transformations". JMT, 31, 305-318, 1987
[432] Raffman D: Language, Music, and Mind. MIT Press, Cambridge et al. 1993
[433] Rameau J-Ph: Traité de l'Harmonie, Réduite à ses Principes Naturels. Paris 1722
[434] Ratz E: Einführung in die musikalische Formenlehre. Universal Edition, Wien 1973
[435] Read G: Music Notation. Crescendo Publ., Boston 1969
[436] Reeves H: Patience dans l'azur. L'évolution cosmique. Seuil, Paris 1981
[437] Reichardt J F: Vermischte Musikalien. Riga 1777
[438] Repp B: Diversity and Commonality in Music Performance: An Analysis of Timing Microstructure in Schumann's "Träumerei". J. Acoustic Soc. Am. 92, 1992
[439] Repp B: e-mail communication of tempo data. Haskins Laboratories, New Haven, June 2, 1995
[440] Repp B: Patterns of note onset asynchronies in expressive piano performance. J. Acoustic Soc. Am. 100, 1996
[441] Repp B: Pedal Timing and Tempo in Expressive Piano Performance: A Preliminary Investigation. Psychology of Music 24, 1996
[442] Repp B: The Art of Inaccuracy: Why Pianists' Errors Are Difficult to Hear. Music Perception, 14, 2 1997
[443] Repp B: Expressive Timing in a Debussy Prelude: A Comparison of Student and Expert Pianists. Musicae Scientiae 1, 1997
[444] Reti R: The Thematic Process in Music (1951). Greenwood Press, Westport 2nd ed. 1978
[445] Reti R, commented by Kopfermann M: Schumanns Kinderszenen: quasi Thema mit Variationen. In: Musik-Konzepte Sonderband Robert Schumann II, edition text + kritik, München 1982
[446] Rhode W S and Robles L: Evidence from Mössbauer Experiments for Nonlinear Vibration in the Cochlea. J. Acoust. Soc. Am. 48, 988, 1970
[447] Richter P et al.: How Consistent are Changes in EEG Coherence Patterns Elicited by Music Perception? In: Steinberg R (ed.): Music and the Mind Machine. Springer, Berlin et al. 1995
[448] Riemann B: Über die Hypothesen, welche der Geometrie zugrunde liegen (1854). Gött. Abh. No.13 (published in 1867)
[449] Riemann H: Musikalische Logik. Leipzig 1873
[450] Riemann H: Musikalische Syntaxis. Leipzig 1877
[451] Riemann H: Vereinfachte Harmonielehre oder die Lehre von den tonalen Funktionen der Akkorde. London 1893
[452] Riemann H: Über Agogik. In: Präludien und Studien II. Leipzig 1900
[453] Riemann H: System der musikalischen Rhythmik und Metrik. Breitkopf und Härtel, Leipzig 1903
[454] Riemann H: Handbuch der Harmonie- und Modulationslehre. Berlin o.J.
[455] Riemann H: Handbuch der Harmonielehre. Leipzig 6/1912
[456] Riemann H: Grundriss der Musikwissenschaft. Leipzig 1928
[457] Riemann Musiklexikon, Sachteil, 12. Auflage. Schott, Mainz 1967
[458] Ries F: Biographische Notizen über L. van Beethoven (1838). New edition by Kalischer A Ch, 1906
[459] Risset J-C: Real-time: Composition or performance? Reservations about real-time control in computer music and demonstration of a virtual piano partner. In: Kopiez R and Auhagen W (eds.): Controlling creative processes in music. Peter Lang, Frankfurt/Main 1998
[460] Roads C: The Computer Music Tutorial. MIT Press, Cambridge Mass. and London 1998
[461] Rodet X et al.: Xspect: a New X/Motif Signal Visualization, Analysis and Editing Program. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[462] Roederer J G: Physikalische und psychoakustische Grundlagen der Musik. Springer, Berlin 1977
[463] Rossi A: Die Geburt der modernen Wissenschaft in Europa. Beck, München 1997
[464] Ruetschi U-J: Denotative Geographical Modelling - an attempt at modelling geographical information with the Denotator system. Diploma thesis, University of Zurich, 2001
[465] Rufer J: Die Komposition mit zwölf Tönen. Bärenreiter, Kassel 1966
[466] Ruwet N: Langage, Musique, Poésie. Seuil, Paris 1972
[467] S-Plus. MathSoft Inc., Seattle, Washington 1995
[468] Sachs K-J: Der Contrapunctus im 14. und 15. Jahrhundert. AMW, Franz Steiner, Wiesbaden 1974
[469] Sattler D E (ed.): Friedrich Hölderlin - Einhundert Gedichte. Luchterhand, Frankfurt/Main 1989
[470] Salzer F: Structural Hearing: Tonal Coherence in Music. (German: Strukturelles Hören. Noetzel, Wilhelmshaven 1960) Dover, New York 1962
[471] Saussure F de: Cours de Linguistique Générale. Payot, Paris 1922
[472] Saussure F de: Linguistik und Semiotik - Notizen aus dem Nachlass. Fehr J (ed.), Suhrkamp, Frankfurt/Main 1997
[473] Scarborough D L et al.: Connectionist Models for Tonal Analysis. Computer Music Journal Vol. 13, 1989
[474] Schenker H: Fünf Urlinien-Tafeln. Universal Edition, Wien 1932
[475] Schenker H: Theorien und Phantasien III: Der freie Satz. Universal Edition, Wien 1935
[476] Schmidt J (Hrsg.): Celibidache. Book and Movie PARS (ISBN 3-9803265-1-9), München 1992
[477] Schmidt-Biggemann W: Topica universalis. Meiner, Hamburg 1983
[478] Schönberg A: Harmonielehre (1911). Universal Edition, Wien 1966
[479] Schönberg A: Die Komposition mit zwölf Tönen. In: Style and Idea, New York 1950
[480] Schubert F: Auf dem Wasser zu singen, op. 72 (1823). Bärenreiter/Henle München/Kassel 1982
[481] Schubert H: Kategorien I, II. Springer, Berlin et al. 1970
[482] Schumann R: Kinderszenen, op. 15 (1839). Henle, München 1977
[483] Schumann R: Dritter Quartett-Morgen. NZ f. Musik 1838
[484] Schweitzer A: Johann Sebastian Bach. (1908) Breitkopf & Härtel, Wiesbaden 1979
[485] Serres M (ed.): Éléments d'histoire des sciences. Bordas, Paris 1989
[486] Shaffer L H: Musical Performance as Interpretation. Psychology of Music, Vol. 23, 1995
[487] Siron J: La partition intérieure. Outre mesure, Paris 1992
[488] Slawson W: The Musical Control of Sound Color. Canadian Univ. Music Review, No.3, 1982
[489] Slawson W: Sound Color. U. Cal. Press, Berkeley 1985
[490] Sloan D: From DARMS to SMDL, and back again. In: Haus G and Pighi I (eds.): X Colloquio di Informatica Musicale. AIMI, LIM-Dsi, Milano 1993
[491] Sloboda J: The Musical Mind: An Introduction to the Cognitive Psychology of Music. Clarendon Press, Oxford 1985
[492] Sloterdijk P: Selbstexperiment. Hanser, München & Wien 1996
[493] SMDL committee: Standard Music Description Language Draft, Pittsford, NY 1995
[494] Smith III J O: Synthesis of bowed strings. In: Strawn J, Blum T, ICMA: Proceedings of the International Computer Music Conference, San Francisco 1982
[495] Smith III J O: Physical Modeling of Musical Instruments, part 1. Computer Music Journal, 16 (4), 1992
[496] Smith III J O: Physical Modeling Synthesis Update. Computer Music Journal, 20 (2), 1996
[497] Sodomka A: KUNSTRADIO at documentaX. http://www.culture.net/orfkunstradio/BIOS/sodomkabio.html, Vienna 1997
[498] Spanier E H: Algebraic Topology. McGraw-Hill, New York et al. 1966
[499] Späth H: Eindimensionale Spline-Interpolations-Algorithmen. Oldenburg, München et al. 1990
[500] Späth H: Zweidimensionale Spline-Interpolations-Algorithmen. Oldenburg, München et al. 1991
[501] Squire R L and Butters N (eds.): Neuropsychology of Memory. D. Guildford Press, New York and London 1984
[502] Stahnke M: Struktur und Ästhetik bei Boulez. Wagner, Hamburg 1979
[503] Stange-Elbe J, Mazzola G: Cooking a Canon with Rubato - Performance Aspects of J.S. Bach's "Kunst der Fuge". In: ICMA (ed.): Proceedings of the ICMC 98, 179-186, San Francisco 1998.
[504] Stange-Elbe J: Analyse- und Interpretationsperspektiven zu J.S. Bachs "Kunst der Fuge" mit Werkzeugen der objektorientierten Informationstechnologie. Habilitationsschrift (manuscript, available in several German libraries), Osnabrück 2000
[505] Stange-Elbe J: Instrumentaltechnische Voraussetzungen für eine computergestützte Interpretation. To appear in: Enders B (ed.): Proceedings of the klangart congress '01, Osnabrück
[506] Starr D and Morris R: A General Theory of Combinatoriality and the Aggregate. PNM, 16(1), 364-389, 16(2), 50-84, 1977-78
[507] Starr D: Sets, Invariance and Partitions. JMT, 22(1), 1-42, 1978
[508] Stein S and Sandor S: Algebra and Tiling: Homomorphisms in the Service of Geometry. Math. Assoc. Amer., Washington, DC 1994
[509] Steinberg R (ed.): Music and the Mind Machine. Springer, Heidelberg 1995
[510] Steele G L: Common Lisp: The Language, 2nd Edition. Digital Press, 1990
[511] Stone P: Symbolic Composer. http://www.xs4all.nl/~psto/
[512] Stopper B: Gleichstufig temperierte Skalen unter Berücksichtigung von Oktavstreckungen. In: Hosp I (Hrsg.): Bozener Treffen 1991 - Skalen und Harmonien. Südtiroler Kulturinstitut, Bozen 1992
[513] Straub H: Beiträge zur modultheoretischen Klassifikation musikalischer Motive. Diplomarbeit ETH-Zürich, Zürich 1989
[514] Straub H: Kadenzielle Mengen beliebiger Stufendefinitionen. Unpublished manuscript, Zürich 1999/2000
[515] Straus J N: Voice Leading in Atonal Music. In: Baker J et al. this bibliography, 237-274
[516] Stumpf C: Tonpsychologie. Leipzig 1883-1890
[517] Sundberg J, Askenfelt A, Fryden L: Musical Performance: A Synthesis-by-Rule Approach, Computer Music Journal, 7, 37-43, 1983
[518] Sundberg J and Verillo V: On the Anatomy of the Retard: A Study of Timing in Music. J. Acoust. Soc. Am. 68, 772-779, 1980
[519] Sundberg J: Music Performance Research. An Overview. In: Sundberg J, Nord L, Carlson R (eds.): Music language, Speech and Brain. London 1991
[520] Sundberg J (ed.): Generative Grammars for Music Performance. KTH, Stockholm 1994
[521] Sundin N-G: Musical Interpretation in Performance. Mirage, Stockholm 1984
[522] Suzuki T et al.: Musical Instrument Database with Multimedia. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[523] Suppan W: Zur Verwendung der Begriffe Gestalt, Struktur, Modell und Typus in der Musikethnologie. In: Stockmann D and Steszewski J (eds.): Analyse und Klassifikation von Volksmelodien. PWM Edition Krakau 1973
[524] Symmetrie. Katalogband Vol.1 zur Symmetrieausstellung, Mathildenhöhe, Darmstadt, 1986
[525] Terhardt E: Zur Tonhöhenwahrnehmung von Klängen. II. Ein Funktionsschema. Acustica 26, 187-199, 1972
[526] Thom R: Stabilité structurelle et morphogenèse. Benjamin, Reading MA 1972
[527] Tinctoris J: Opera Theoretica, Vol. 2: Liber de arte contrapuncti. Seay A (ed.): Amer. Institute of Musicology 1975 gopher://IUBVM.UCS.INDIANA.EDU/00/tml/15th/tincon2.text
[528] Tittel E: Der neue Gradus. Doblinger, Wien-München 1959
[529] Todd N P M: A Model of Expressive Timing in Tonal Music. Music Percep. 3, 33-58, 1985
[530] Todd N P M: Towards a Cognitive Theory of Expression: The Performance and Perception of Rubato. Contemporary Music Review 4, 1989
[531] Todd N P M: A Computational Model of Rubato. Contemporary Music Review 3, 1989
[532] Todd N P M: The Dynamics of Dynamics: A Model of Musical Expression. J. Acoustic Soc. Am. 91, 3540-3550, 1992
[533] Todoroff T: A Real-Time Analysis and Resynthesis Instrument for Transformation of Sounds in the Frequency Domain. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[534] Uhde J: Beethovens Klaviermusik III. Reclam, Stuttgart 1974
[535] Uhde J and Wieland R: Denken und Spielen. Bärenreiter, Kassel et al. 1988
[536] van Dalen D: Logic and Structure. Springer, Berlin et al. 1997
[537] Valéry P: Leçon Inaugurale du Cours de Poétique au Collège de France. Gallimard, Paris 1945
[538] Valéry P: Cahiers I-IV (1894-1914). Celeyrette-Pietri N and Robinson-Valéry J (eds.), Gallimard, Paris 1987
[539] Valtieri S: La Scuola d'Athene. Mitteilungen des Kunsthistorischen Instituts in Florenz 16, 1972
[540] van der Waerden B L: Algebra I, II. Springer, Berlin et al. 1966
[541] van der Waerden B L: Die Pythagoreer. Artemis, Zürich 1979
[542] Varèse E: Erinnerungen und Gedanken. In: Darmstädter Beiträge III. Schott, Mainz 1960
[543] Varèse E: Rückblick auf die Zukunft. edition text+kritik, München 1978
[544] Vercoe B: The Synthetic Performer in the Context of Live Performance. In: ICMA (ed.): Proceedings of the ICMC 84, S. Francisco 1984
[545] Vieru A: The Book of Modes. Editura Muzicala, Bucarest 1993
[546] Vinci A C: Die Notenschrift. Bärenreiter, Kassel 1988
[547] Vogel M: Die Lehre von den Tonbeziehungen. Verlag für systematische Musikwissenschaft, Bonn-Bad Godesberg 1975
[548] Vogel M: Arthur v. Oettingen und der harmonische Dualismus. In: Vogel M (ed.): Beiträge zur Musiktheorie des 19. Jahrhunderts. Bosse, Regensburg 1966
[549] Vogel M: Berechnung emmelischer und ekmelischer Mehrklänge. In: Herf F R (ed.): Mikrotöne II. Helbling, Innsbruck 1988
[550] Vogt H: Neue Musik seit 1945. Reclam, Stuttgart 1972
[551] Vossen G: Datenbankmodelle, Datenbanksprachen und Datenbank-Management-Systeme. Addison-Wesley, Bonn et al. 1994
[552] Vuza D T: Sur le rythme périodique. Revue Roumaine de Linguistique-Cahiers de Linguistique Théorique et Appliquée 22, no. 1 1985
[553] Vuza D T: Some Mathematical Aspects of David Lewin's Book Generalized Musical Intervals and Transformations. Perspectives of New Music, vol. 26, no. 1, 1988
[554] Vuza D T: Supplementary Sets and Regular Complementary Unending Canons (Part One). Perspectives of New Music, vol. 29, no. 2, 1991
[555] Vuza D T: Supplementary Sets and Regular Complementary Unending Canons (Part Two). Perspectives of New Music, vol. 30, no. 1, 1992
[556] Vuza D T: Supplementary Sets and Regular Complementary Unending Canons (Part Three). Perspectives of New Music, vol. 30, no. 2, 1992
[557] Vuza D T: Supplementary Sets and Regular Complementary Unending Canons (Part Four). Perspectives of New Music, vol. 31, no. 1, 1993
[558] Wanske H: Musiknotation. Schott, Mainz 1988
[559] Waldvogel M. et al.: presto source code. TRIMAX, Zürich 1986-1993
[560] Walker J S: Fourier Analysis and Wavelet Analysis. Notices of the AMS, vol.44, No.6, July 1997
[561] Waugh W A O'N: Music, probability, and statistics. In: Encyclopedia of Statistical Sciences, 6, 134-137, 1985
[562] Webern A: Variationen op.27 (mit Interpretationsanweisungen). UE 16845, Wien 1980
[563] Wegner P: Interactive Foundations of Object-Oriented Programming. Computer, IEEE Computer Soc., October 1995
[564] Weyl H: Symmetrie. Birkhäuser, Basel 1955
[565] Weinberger N M: Musica Research Notes. Music and Science Information Computer Archive, Center for Neurobiology of Learning and Memory, U California Irvine, Vol. III, Issue II, Fall 1996
[566] Wicinski A A: Psichologyiceskii analiz processa raboty pianista-ispolnitiela nad muzykalnym proizviedieniem. Izviestia Akademii Piedagogiceskich Nauc Vyp., 25 [Moscow], 171-215, 1950
[567] Widmer G: Learning Expression at Multiple Structural Levels. In: ICMA (ed.): Proceedings of the ICMC 94, S. Francisco 1994
[568] Widmer G: Modeling the Rational Basis of Musical Expression. Computer Music J. 18, 1994
[569] Wieser H-G: Electroclinical features of the psychomotor seizure. Fischer, Stuttgart and Butterworth, London 1983
[570] Wieser H-G and Mazzola G: Musical consonances and dissonances: are they distinguished independently by the right and left hippocampi? Neuropsychologia 24 (6):805-812, 1986
[571] Wieser H-G and Mazzola G: EEG responses to music in limbic and auditory cortices. In: Engel J Jr, Ojemann G A, Lüders H O, Williamson P D (eds.): Fundamental mechanisms of human function. Raven, New York 1987
[572] Wieser H-G: Musik und Gehirn. Revue Suisse Med. 7, 153-162, 1987
[573] Wieser H-G and Moser S: Improved multipolar foramen ovale electrode monitoring. J Epilepsy 1: 13-22, 1986
[574] Wille R: Symmetrie - Versuch einer Begriffsbestimmung. In: Symmetrie, Katalogband Vol.1 zur Symmetrieausstellung, Mathildenhöhe, Darmstadt, 1986
[575] Wille R: Personal Communication. Darmstadt 1982
[576] Wille R: Musiktheorie und Mathematik. In: Götze H and Wille R (eds.): Musik und Mathematik. Springer, Berlin et al. 1985
[577] Wille R: Bedeutungen von Begriffsverbänden. Preprint Nr. 1058, TH Darmstadt 1987
[578] Wille R: Personal Communication. Darmstadt 1985
[579] Winson I: Brain and Psyche. The Biology of the Unconscious. Anchor Press/Doubleday, Garden City NY 1985
[580] Wittgenstein L: Tractatus Logico-Philosophicus (1918). Suhrkamp, Frankfurt/Main 1969
[581] Wolfram S: Software für Mathematik und Naturwissenschaften. In: Chaos und Fraktale. Spektrum der Wissenschaft Verlagsgesellschaft, Heidelberg 1989
[582] Xenakis I: Formalized Music. Indiana Univ. Press, Bloomington 1972
[583] Yaglom A M and Yaglom I M: Wahrscheinlichkeit und Information. Deutscher Verlag der Wissenschaften, Berlin, 1967
[584] Yoneda N: On the homology theory of modules. J. Fac. Sci. Univ. Tokyo, Sect. I, 7, 1954
[585] Ystad S et al.: Of Parameters corresponding to a Propagative Synthesis Model Through the Analysis of Real Sounds. In: ICMA (ed.): Proceedings of the ICMC 96, S. Francisco 1996
[586] Yusuke A et al.: Hyperscore: A Design of a Hypertext Model for Musical Expression and Structure. J. of New Music Research, Vol. 24, 1995
[587] Zahorka O: Versuch einer Charakterisierung des altrömischen Melodiestils. Seminar Lütolf, MWS Univ. Zürich 1993
[588] Zahorka O: From Sign to Sound - Analysis and Performance of Musical Scores on RUBATO. In: Enders B (ed.): Symposionsband Klangart '95, Schott, Mainz 1997
[589] Zahorka O: PrediBase - Controlling Semantics of Symbolic Structures in Music. In: ICMA (ed.): Proceedings of the ICMC 95, S. Francisco 1995
[590] Zahorka O: RUBATO - Deep Blue in der Musik? Animato 97/3, 9-10, Zürich 1997
[591] Zarlino G: Istitutioni harmoniche. Venezia 1558
[592] Zekl G: Topos. Meiner, Hamburg 1990
[593] Zweifel P F: Generalized Diatonic and Pentatonic Scales: A Group-theoretic Approach. PNM, 34(1), 140-161, 1996
[594] Zwicker E and Fastl H: Psychoacoustics. Facts and Models. Springer, Berlin, et al. 1999
[595] Zwicker E and Zollner M: Elektroakustik (2nd edition). Springer, New York et al. 1987
Index Symbols 2Fu , 66 B,x, 280 Dia(Names/E) , 1138 Hi(t!../G 1 ), 433 Mono(E), 1137 Rn@B@M, 206 T 2 (R), 226 il,50 ([(3), 325 A 2 (U, V; W), 1093 IT,50 {},50 !, 1121, 1131 !/a, 402 (G), 1076 (G: H), 1067 (G1)V, 336 (h/Jd,631 (h/h),631 (KI/D 1 ),632 (K7!D 7 ),632 (X/C(X)), 631 (X [c]jY[c]) , 634 (XIC(X)), 631 (f /a), 405 (ri,j), 1085 (x, y), 1058 /N,360 OR, 1075 12-Temp, 512 1R, 1075 2, 1062 3CH,546 3Chains, 319 < N, 413
Arg, 748 ArtiSlurs,u, 769 Aut, 1066 Aut(K),174 Aut(c), 1116 AutR(M), 1083 B(~), 1096 BU, (, 'T}, J((), U('T})) , 1156 BA t= a, 1133 BD'Vean(w),787 BDv(w),787 BIT, 51, 71 B@F, 205 B N ,353 Br(x),1153 BIM,1095 BarLine, 81 Basic, 417 BeetMotChordFiber, 423 BeetMotChordFiberObject, 424 BetterForm, 101 C, 80, 534, 1030 C(F),64 C(G,F,P,w),233 C(X),630 C(i), 577 C+,534 C/OnBeetSon, 422 CO, 534 CD, 415, 821 CINT(S),254 CINT(Ser),253 CINTm (S),254 CN(G 1 ),346 COM(Cont),251 CONTn,k(X), 251 CP,475 CT(D),67 C*(Kj M), 1150 CO [a, b], 1154 C5,509 C i (X),630 cn(Kj M), 1150 C*(G1 ), 374 Cal, 351
Cad,552 Cc,631 CcFourier, 86 CcM, 630, 631 CellHierarchYBP, 725 Cellu, 725 CellBP, 725 Ch/PianoChord, 413 Chrono - Fourier, 288 ChronoFourierSound, 288 Ci,630 CiM, 630 Cl H (U),715 Cl n , 215 ClassZChord, 509 Closed x , 1145 Cm,534 ColimCirc, 77 CommaZModule, 511 Consonant, 286 Count, 417 Covens, 1148 Crescendo, 79 D, 80, 323, 721, 1028 DEG,556 DFo , 1155 DNR,256 Dr/,1155 Dt(M),486 D K , 1112 D/,1108 D.,(To),802 D.(M),482 Dk,n, 150 Daughters, 725 Dc, 630 DcM, 630 DenOrb(R, n, B, M), 207 Der(L), 1105 D/,1155 D/." 1155 Dg,557 Dg*, 557 Di,630 DiM, 630
INDEX Dia,470 Dia(Names), 1138 Dian, 476 Diak,470 Dil,1084 Dir,797 Dir(Q), 1064 Dom(X),461 Dp,557 Dup(T),407 Duration, 79 DurationValue, 977 DynSymb,81 E, 80, 1028 E2M,73 EHLD, 768, 950 EHLDGC,799 EMB(A,B),254 ENH,511 EX, 1131 E-F,129 EdRg,472 Ed~Rg, 472 El(G),489 El(M),489 Elast,471 End(A),170 End(c), 1116 EndR(M), 1083 Envelope, 84 EulerChord, 109 EulerChordEvent, 110 Euler Module, 73, 106, 109 EulerModuleq[c]' 618 Euler Plane, 110, 113 EulerZChord,509 EulerlRM odule, 684 Evts(U), 766 Ex, 511 Ex(D, Lo;,(3), 373 ExToPA(F), 521 ExtA(M), 518 F -1 G, 1120 F(D),67 FCM(Alphabet), 1064
FD,821 FG(Alphabet), 1067 F M(Alphabet), 1063 FM-Object,87 FO(PEX), 1135 FORMS, 1138 FSH,417 Fo,1155 Fx ,1107 F"" 672 F sym ,672 Fermata, 781 FermataE,s,769 Fib( e,), 914 Fieldu,726 Fieldu,R, 726 Fin, 1124 Fin(Onset),97 Fin(Pitch), 97 Fin(S), 1061 Flatten, 88 Flatten n ,89 FormList(S), 71 Fourier, 84, 86 FourierSound, 84 Frameu,726 Func(C, D), 1119 Fund(H), 715 Fushi,417 FushiOrnament, 417 FushiPic, 417 FushiST RG, 417 G, 80,1032 Gox,315 G/H,1067 G/n rh 0.*,770 GL(n,p), 1070 GLn(R), 1070 GI H, 1069 G I ,309 Gh ,351 GOpp, 1066 G m ,1066 G v ,k,I,226 Gesf(M), 474
Gest (M), 486 Gest(M),474 ClC t ,676 Glissando, 79 GlobPerfScoreBP.,729 GlobPer f ScoreBP, 728 Grassr,n, 1113 ll, 80, 715, 1018, 1032 llA ~ a, 1133 II \ G, 1067 II
INDEX Int(E),518 Int(M),518 IntMod 12 ,q[c], 621 IntMod 12 ,621 IntThirds 3,4,q[c], 621 IntThirds 3,4, 621 Inte(M), 518 IntA,522 Int g ,1067 Intonation, 684 Item(F),976 J(Th E , w), 794 JCK,416 JCKFU,418 JK(X),1129 JKt,325 K(X),1129 KUs,t,144 Kl.A K 2 , 217 K 4 ,251 KE,697 Kb,857 K Tops ,767 K er(J), 1067 K ernelu, 726 Knot, 87 KnotBasic, 88, 330 KnotperOnS l2l 612 KnotperOns,612 Kq,74 Kt,74 Klk,1148 L, 80, 1030 LB,971 LCP,475 LPS::;t~ument' 728 LPSrtstrument, 728 LPSrtstrument' 728 LFf,1160 L..pf(a, b), 289, 1025 Label, 415 LegatoSluru, 768 LegatoSluru,s, 769 Lev, 329 LimCirc,77
INDEX Limint(F) , 317 LinR,1083 LinR(G!, M), 351 List(F), 976 ListEntrYF' 417 ListF,417 LocCt , 670 LocPerjScoreBP, 724 LocF , 108 Loudness, 79 M/N, 351 MOT, 466 MOTF,466 MOTn , 466 MOTF,n, 466 M[c:) , 127 M@F, 1137 M@, 1091, 1119
Mlj/IdA,352 Mn, 209 M*,1063 M n ,253 M q ,1102 M s ,1102 M t ,546 M x (To),802 M 12 ,318 Mop, 466 Mo,466 M[cp],1084 Mju8t, 318 Md,a, 540 M i ,j,950 Maelzel,414 Maj,576 M akrOBasie, 88, 330 M akroPerOns12' 612 M akroPerOns, 612 M arcatou,s, 769 Match p , 1130 M athPitch, 72 Max(X),329 MaxMet(X),329 Media, 415
MelMin, 577 MelMin(3),577 Min(L, S), 705 Min(X, S), 705 ModelForm, 101 MonEnd(F),518 Mor(C),1116 Mos~, 378 Mos~.x, 379 MOS~k' 378 Moth~r, 725 Mt/Motij,413 N(D),67 N(F),63 N(U),1149 NComp, 415 NF,255 N)
P(E),639 P(Q), 1079 P(Q)/ cv, 1117 PEX, 1135 PF,256 PO, 612 P 1\ Q, 421 P+, 1130 PTriiumerei, 873 Para, 117, 119 Part(I), 378 Partial, 86 Pause, 81 Per(R),380 PerOns, 612 PerOns12, 612 Percussion, 612 Periods w, 1018 PhysCrescendo, 81 PhysDuration, 81 PhysGlissando, 81 Physlnstri, 673 PhysLoudness, 81 PhysOnset, 81 PhysOrchestra, 673 PhysPitch, 81 PhysicalBruteForceOperator, 791 PiM Od 12 ,(7), 618 PiModn , 75 PiThirds 3 ,4,620 PianoSelector, 54 Piano-Note, 51 Pitch, 79 PitchChange, 417 PitchSymb, 80 PowerCirc, 77 Ptch, 140 Pythagoras Line, 560 QNormalize(c),974 QReduce(c, [n]), 974 Qw,793 Q(3u!-, , 146 QO(El' E 2), 640 Qo(K/D),640 R(Para),327
INDEX R.S, 126 R.J,l72 REd,472 RT,977 RTC, 449 RULES, 1133 R[c], 1077 R(M),1077 R(Q), 1079 R*,612 R Ons ' 612 R 2,882 R C ,1085 R X , 1075 RR,1084 RCI, 351 RVuza, 380 R max , 726 R min , 726 Rad(M), 1088 Rat, 748 Rate, 414 RelDyn, 81, 779 RelDynamicEvents, 779 RemainderSplit ex: /-l, 789 RemoveFrom(c1,c2),974 RepA,n*, 363 Review, 415 Rg,469 Rgo (M),469 Rgp(M),469 Rhythm(Para) , 327 S!,419 S,323 S(E),639 S(EX),1131 S(p, u.), 1070 S/ cv, 1060 SEGk[M1/n], 252 SERMn , 150 SERMk,n, 150 SER n , 150 SERk,n, 150 S02(71),949 SPE, 286
INDEX S[Alphabet], 1077 S(Alphabet) , 1077 S®R?, 1091 S-lA, 1101 S-lM, 1102 So, 121 Sn(K),1150 Sq,618 Sq[c], 618 SMt t , 546 SA,n*, 363 Sat, 449 Sat~, 526 Satellite, 811 ScalarOperatorw , 795 ScoreForm, 724 ScoreInstri, 673 ScoreOrchestra, 673 SemInTop(F), 520 Sema(£), 1140 SemiEnd(F), 518 Semitone, 1032 Sg, 557 Sh(C, J), 1130 Simplexu, 725 Simplexesu, 725 Sound, 146 Sp, 557 Sp(X),459 Sp(x),329 Spl-'(x),467 Spec, 1109 Spec(A) , 1108 Spec(R) , 179 Spec (f) , 1109 Special, 417 Split ex /L, 789 Splitu,II' 788 Staccatissimou,s, 769 Staccatou,s, 769 StepTune, 782 Sub(X) , 1062 Sub(BP), 715 Sub M* (N), 480 Sube , 1126
Support, 1015 SupportForm(E), 411 Switch,71 Syllabic, 286 Sym(K),174 Symi(A, B), 254 SymbolicBrueForceOperator, 790 SynCirc, 76 T, 323, 721, 1017 T(E),697 T(F),64 T(G), 1070 T(p.,u:),1070 TF,544 TFj,t, 544 TG 1 , 677 TID, 721 TIn, 150 TK,669 TO, 1155 TRUTH(F) , 530 TRUTH(I),407 TRUTH(h),409 TTO, 253 T r I, 1155 TtK, 669 To, 801, 1147 Tl, 1147 T 2 ,1147 TkK, 669 T 2 ,R,475 Tx,x, 1112 TA,801 TTON,589 TVALmode' 589 TVAL type ' 589 T e ,128 T,,'f/,j, 1157 T re d,128 Tempo, 682 TempoOperator w , 793 Tension()..·,/L.,W,¢min),590 Tenutou,s, 769 Terminal, 976 TI, 669,1155
Tlk,669 Tg,557 TimeSig, 82 TimeSig(p/q)s, 769 Title, 415 Top(H) , 715 Tops, 767 TorSeq, 471 Toroidm,l 471 I ' Tp, 557 Tr(D, T), 636 Trans, 380 Trans(D, T), 636 Triv(K),174 Tune, 782 Types, 1137 UNICODE,406 U cgJ V, 1094 un, 727 UO, 1146 Ug , 141 Un, 253 Us, 141 Ux , 321 Un + l , 727 Ux / x + l , 321 Utai,418 V(E), 1108 VF(O),1160 VIIx , 321 VIx , 321 Vx,321 Vn,x, 213 Val(I),407 ValCh(p/q = r/s)s, 769 VerbAbsTpo, 414 Vowel, 286 WPn (U,1l'IR),727 W*,605 W1~\' 768 We'ight(U) , 727 WeightListBP, 726 Weightn(U), 727 WeightBP, 727 X(R,n,B,M),207
X(harmo), 879 X(melod), 879 X (metric) , 879 X@F, 1091 X -chromatic, 509 X-harm, 509 X-major, 509 X-mel,509 X(3), 321 xMaxMet, 329 xMetLg[LI, 329 xMetPer[PI, 329 X*, 1091 XCyc(g) , 1071 X 6 ,559 X~, 213 X add ,878 YAff , 1111 YCornRings, 1111 Year, 415 Z(R), 1075 Z(harmo), 879 Z (metric), 879 ZNF, 256 [C/C(X)], 631 [CIC(X)]' 631 [K),283 [K], 235 [M),489 [Ml/n], 252 [R], 407 [S], 1119 ['t], 378 [a, b], 556 [b, p, g], 328 [ll x 1k], 130 [8], 1060 [x),489 [x], 1159 [xy], 541 &, 1131 An, 1112 Ab,1107 Aff, 1109 111111,1155
INDEX 111112, 1155 1111100, 1155 Ilxlll' 1154 Ilxlb 1154 Ilxlloo, 1154 1111,1154 @M, 1091, 1119 @, 62,1090 @red R M , 1095 @R, 1090 @locX, 342 C,1076 CHR, 51 Colimit, 1137 ComGlobA, 347
ComLoClass!e;l~nd(B)' 207 ComLoClass';,e;l~t:p, 215 ComLoClass';,e+nlO ,210 , R ComLoClassn+l,oR' 215 ComLocA, 172 ComLocemb A , 172 ComLocf1;n, 172 ComLocT, 172 ComMod, 1108 ComRings, 1107 ComRings@, 1112 ARg, 469 Anf(G 1 ), 358 A, 686, 763, 1131 An, 211, 357 AGI, 358 Ai,j, 1098 Den, 402 Den(x, y), 402 DenColimit, 402 DenLimit, 402 Denpower, 402 DenSimple, 402 Dens yn , 402 Denoo , 406 Denoo / sig, 410 ForSem, 1141 fB f ,354 fAj,355
1261 f, 353, 1049, 1109 f(G 1 ), 351 f(U, F), 1109
f(J / IdA), 353 f(myFMObject) , 87 f W ,292 ft, _k468 f t ,468 f t (M,j),377 fDir, 803 fRedlndia,c, 470 ft,n, 468 Jl Glob,428 Gr, 1107, 1117 lHl, 1076 A!, 763 A 1, 763 A 1, 764 Aoo,764 LieR,1104 Limit, 1137 LinMod, 1117 LinMod R, 1117 Loc, 164 LOCEnd(A),170 LOC@A, 165 JlLoc, 428 LocRgSpaces, 1108 M R ,1117 Mm,n(R), 1084 Mod@, 1091 Mod, 1091, 1107, 1117 Mod R , 1091, 1117 Mon, 1107, 1117 N, 1058, 1116 Jl ObGlob, 428 ObLoc, 158 Jl ObLoc, 428 ObLoCEnd(A), 170 ObLoc@A, 165 fl ex /-t, 770 fl, 727, 763, 872, 1126 fl(p),588 flSh' 1130
1262 n~, 876 <1>, 1131
II, 743, 1135 II w ,232 Power, 1137 \II, 742 Q, 72, 73, 1058, 1076 ~, 51, 72, 1058, 1076 ~[1QI1' 72 =?, 1022, 1132 ®~, 100 Rings, 1107, 1117 BC(A, W), 1158 BP,715 B(A, W), 1157 C1 x C2 IK, 714 GlDiff, 676 Glob,335 LocDiff, 670 ObGlob,335 Tank,669 Tanbat' 670 Schemes, 1111 Setsu, 1116 I:'lMi ,1084 Simpl,1148 SinLoc, 169 SinLoCEnd(A) , 170 SinLoc@A, 170 Sob,1110 Simple, 1137 Syn, 1137 TON, 544, 545, 588, 590 1I'(h), 409 1I'1, 407 1I'1, 407 Tex, 406 Texig( Den) (h), 409 Texig( Den) 1, 406 Top, 1117, 1146 1, 1135 VAL, 544, 545, 588, 590 VAL mode , 589 VAL type , 589 3, 1131, 1135
INDEX 3D,508 3K,508 'IS,690 Z, 51, 1058, 1076 Zn' 1069, 1076 Z2,71 Z*1,947 'ISA,Dir, 797 'ISA,U, 801 'ISA, 797 'IS ex , 720 0:,638 0:+, 618, 689 eL,618 0:+, 128 0:_, 128 X,489 Z, 1076 (3,638 (3(x), 354 nV,1058 UV,1058 EBl M i , 1084 ..l, 408, 1132 ~, 1094 .,971 n,346 ?@A, 165 ti, 121 X, 506, 508 X(Y), 1062 X*,507 Xu, 1126 ill M i , 1062 8Y,324 8,638,743 8(XjY),633 8(YIX),633 8[XIYJ,633 8@A, 358 8ij , 1085 jjldA,359 8, 359 0, 176, 1057 0R , 1091
∃x, 1136 ∃x P(x, y), 421 ∀x, 1136 ∀x P(x, y), 421
(8)e, 1063
Xl, 1108 S), 722
B,1146 Cr ,1155
wtusQ5en, 159 wt, 1136 wt F a[x], 1136 wto, 559 wtI, 559 wt2 , 559
SM,1066 'r,484 'r/Ges, 484
(8), 513, 1063 (8), 1118 f---::-:- a, 1134 CL
£,1137 F(0),1160 M,21O MT) 213 R,1137 J-L,763
-', 1132 w,424 Y,1145
'rforoid' 489 'rforoid/Ges, 489 'rl-'/Ges, 484
1,1150
'rl-'/Ges, 486 'rt,P,d, 483 'rt,P,d/GeSt, 483
au, 718,
'rl-" 484, 486
!ID, 727
g[(L),1105 g[(n, R), 1105 ~,
716, 763
~IU, 718
.5[(L), 1105 .5[(n, R), 1105
,,(, 743 ----t
157
b,874
j,
157, 1025 ->,1131 E, 1057 F, 1122
Ie
Ix j, 1159 L,763 Lj, 1062, 1084
"', 552 ",(x),1108 "'J,553 "'RelDyn, 779 "'orb, 560
1146
aZ,689 a'IS'\,ex, 720 ali/axj,l155 a Y R,704 8yR,704
GL(n,p), 1070
7, 66,
aDynamics, 689 aIntonation, 689 8Tempo, 689
1084 ,I,477
TII M i,1062 '¢a,b, 289 PD,1016 a(X/Y),633 a(XIY),633 a[XIY]' 633
""',1060 s, 1046 Vx,1064 L, 483,1079 ,\i Li ,587 ,\i 10 Li' 79
""'P
0,1094 C,1057 ~, 1057
1264 r(X, D, S), 705 ISerll,280 ISer12, 280 B, 638 A,1108 X~, 213 x,216 T, 408, 1132 T 2(R),226 ---U,719 /1,764 'IS,719 Fa, 1132 'Px, 803 V, 1132 1\, 1132 tf.'...,1103 M,526 Pc, 712 jx[,325 +rx , 323 °rx , 323 RGlob,431 RGlobA,431 loc Loc@, 343 oC, 1116 lC,I116 ACOVn *, 357 AD.n , 357 AD.n* IN, 360 RR,1084 RSat~, 527 ~Glob, 431 ~GlobA' 431
a, alpha525 a Uc b, 1121 a Xc b, 1121 abcardt(M), 468 abcardt(m), 468 ad(x), 1105 ad,1062 add, 407 at, 545
boJ,874 book, 78, 85 bottom (x) , 1081 c, 1029 c-spacen(X) , 251 c5 ,509 cm ,925 c(3), 561 C~3), 561
~), 561 Char, 577 Cmel, 577 Cnat, 576 card(A) , 1059 causal End, 926 causal Start , 926 char(F), 1076 codom(f) , 1115 colim(D.) , 1121 coord(F) , 1139 ctp".(M), 497 cyc(g), 1071 d, 472, 1028 dB, 1029 d*,1154 d~x, 874 d p,478 dn ,1150 dt ,472 d1,c, 473 d 2 ,c, 473 dEx, 874 d pn, 478 d;',c, 473 deJormLiGr, 925 den(E),412 dJ,1160 dim(K), 1148 dim(M), 1085 dom(f) , 1115 drap( k), 207 e,1028 eM, 1090 em, 1090
INDEX e z ,913 enh,515 evp , 179 ex z , 523 exp( F, G), 405 exp(ad(x)) , 1105 ext(E),41O I la, 155 I : a ---- b, 1115 I@S, 121 log, 1115
110.,335
Ila*M, 350 l-l(C), 1062 I*F,1107 IS,T,1102 linalEnd, 926 linalStart, 926 Irame(F), 1139 lun(F), 1139 g, 556, 1032 go 1,1059 glb, 422 grad(J), 1160 groundclass( C I ), 381 h, 1031, 1032 h x ,1108 i, 1130 i < x,y >,253 i{x,y},253 ic < k >,253 ic{k}, 253 id(F), 1139 intex z ,B,G(Ch),559 intexz,B, 524 ip < a,b >,252 ip{a,b},252 isx, 459 ison, 346 j,1110 jak, 261 jak", , 266 k-Contra, 470 ks, 143
key, 1032 l, 1063 l(Cont),251 l(M), 328, 1089 lambda w (x),317 lev, 1061 lev(x),329 levi,329 lim(6.), 1121 limeD), 1078 loc, 342 lub,422 m- l , 1066 m x ,1108 m12,318 mjust, 318 modo, 508 modp , 112 monex z , 523 myFMW, 292 myFMw, 292 n(U),1148 nW,498 nW,,(M),497 nr(a), 351 nk(U),1148 noo(KI), 317 newset, 176 0, 73 o-TempClass, 110 obintexz,B, 526 p,556 p(M),328 p-ClassChord, 110 p-EulerClass, 110 p-Scale, 113 pr:, 1013 pftstrument' 728 PB,689 Pj, 1062 POnset, 119 Pu,v, 715 p1.,637 PfjUI' , 145 Pc!, 618
Pint, 618 Pmeter, 327 pet, 780 prp,,.(M), 497 prof, 460 pv, 73 q,73 qx,l1l1 r('ThE' w), 794 rH,1016
rt,253 rel(x, ai), 786 resar/IdA,358 reS@A, 358 res, ,358 f/ldA
resclass(C I ),381 ret, 363 revk,150 round(x) , 1081 s' :=::; s, 1148 S('ThE' w, t), 794 s(x), 1108 sG,557 sG*, 557 sP,557 scalemodp, 113 sem, 1138 set, 176 sigDen , 406 sing, 1124 sp(x), 329 span, 467 sti (x),317 supp(X),41O sym(A),254 t, 73, 468 t(F),1139 tto,tk 786 Popt ' tn, 468 top(x), 1081 true, 1126 w,763 w-Pitch, 160 w-PitchClass, 111
w-TemperedScale, 113 w~armo' 786 w!etro, 785 w!otif' 785, 786 WEvt.,ReIEvt ... 780 WarpArti, 783 Wloe, 921 Worth,921 Xhmax, 873 Xhmean, 873 Xmelodie, 873 Xmetrie, 873 x/E, 410 X < y, 1060 X> y, 1110 x@G, 315 X~, 403 x)- z, 278 X -* y, 488 xh,320 x m ,320 xu,v, 542 Xalt, 128 Xj,s, 859 Xred, 128 y(t i ,j),877 Cr pP b,1121 C/b, 1121 C F= a, 1133 C@, 1119 COPP, 1117 cspaees, 1107 C~paees, 1107 c taees , 1107 aP,1131 1,1131 1?1,1149 1?ld,1149 IKI,1148 If I, 1149 Iml a , 525 Isl,1149 Colimit, 69 Limit, 69 Lac, 105
Mod@, 63 Power, 68, 107 PerCell, 713 Simple, 68 Syn,68 head, 1063 tail, 1063 Colimit,64 Limit, 64 Power, 64 Simple, 64 Syn,64
A A-addressed function, 351 abelian, 1066 abelian group finitely generated -, 373 absolute dynamical sign, 777 dynamics, 831 logic, 176 music, 934 pitch, 700 symbolic dynamics, 81 tempo, 682, 780 absorbing point, 525 absorption, 1015 coefficient, 1015 abstract cardinality, 468 complement, 254 gestalt, 474 specialization, 488 identity, 16 inclusion, 254 motif, 468 onset, 150 specialization, 488 abstraction, 492, 494, 515 concept framework, 468 textual -, 440
accelerando, 739, 768, 782 accelerated motion, 738 accentuation, 720 accessory parameter, 999 accumulation point, 1145 acoustics virtual -, 850 action complement -, 630 faithful -, 1066 free -, 1066 left -, 1066 motor -,739 right -, 1066 transitive -, 1067 activities fundamental-, 4, 7 activity combinatorial -, 242 construction -, 197 instinctive -, 757 interpretative -, 300, 307, 308 acuteness, 290 ad-hoc polymorphism, 968 adapted tempo curve, 699 Add-Element, 983 address, 61, 63, 1091, 1120 change, 63, 83 technique, 83 faithful, 523 fixed vs. variable, 106 for a chord, 111 full-, 523 fully faithful -, 523 functor, 170 killing, 204 navigation, 169 variable, 61 zero -, 61, 62 addressed adjointness, 166 comma category, 165 adic representation, 1080
1268 adjoint left -, 1120 right -, 1120 adjointness addressed -, 166 adjunction, 1062 admitted successor, 647 tonalities, 566 Adorno, Theodor Wiesengrund, 186, 300, 302, 665, 691, 696, 741, 757, 792 Adrien, Jean-Marie, 1027 affine counterpoint group, 475 dual,1091 Lie bracket, 541 tensor product, 1094 transformation modular -, 948 affine functions complex of -, 351 on functorial global compositions, 431 after qualifier, 983 Agawu, Kofi, 400 Age of Enlightenment, 41 aggregate, 253 Agmon, Eytan, 248, 250 agogical architecture, 963 operator, 872 agogics, 304, 780 global -, 764 primavista -, 764 AgoLogic, 699, 758, 953, 963 Agon, Carlos, viii, 257, 382 Alain, 175 aleatoric component, 242 aleatorics, 70 algebra, 1075 Boolean -, 123, 1132 general linear -, 1105
Heyting -, 123, 1132 Lie -, 1104 logical -, 1132 monoid -, 1077 quiver -, 1079 Riemann -, 586 algebraic geometry, 668 topology, 200 algorithm, 1022 Euclidean -, 1076, 1080 in FM synthesis, 87 off-line -, 919 real-time -, 919 TX802, 289 algorithmic extraction of performance fields, 916 Alighieri, Dante, 138, 196 aliquid pro aliquo, 16 all-interval n-phonic series, 237 series, 244 allomorph, 539 allomorphic extension, 539 allowed successor pairing, 646 a-restriction, 525 alphabet of creativity, 242 of music, 106 alphabetic ordering, 40, 43, 58 alteration, 127, 129, 196, 198, 276, 567, 618 as tangent, 128 direction of -, 951 elementary -, 129 force field, 952 in pitch, 62 pitch -, 952 successively increased -, 952 two-dimensional -, 950 altered note, 127
INDEX scale, 585 ambient space, 107 ambient space, 107 coproduct -, 124 dual-, 128 product -, 124 ambiguity, 300, 307 theory of -, 300 tonal -, 601 ambitus, 320 American (musical) set theory, 139, 219, 247-258 contour theory, 467 jazz, 538 notation, 533 theory, 534 amplitude, 291, 1020 modulation, 288, 1003 spectrum, 1020 Amuedo's decimal normal rotation, 256 Amuedo, John, 255, 258, 534 analysis, 1018 -by-synthesis, 741, 755 chord -,533 coherent -, 772 comparative -, 333 FM -,289 immanent -,458,465 metrical -, 835 motivic -, 262, 491 musical -, 744 neutral -, 272, 305 normative -,457 principal component -, 898 regression -, 860, 877, 880 situs, 199, 277 sonic -, 842 spectral -, 638 text -, 741 analytical discourse, 12 vector, 876
weight, 666, 671, 785 anchor note, 760 Andreatta, Moreno, viii, 257, 382 ANSI-C,945 anthropic principle, 565, 567, 658 anthropology computer-aided -, 925 anti-homomorphism ring -, 1076 antisymmetric, 1059 antiworld, 560, 605, 933 anvil, 1037 Appassionata, 667, 907 application framework, 808 apposition, 18 approach bigeneric -, 540 categorical -, 967 historical -, 565, 574 nonparametric -, 856 statistical -, 855 systematic -, 574 transformational -, 249 approximate, 829 arbitrary, 18 archicortex, 642 architectural principle, 869 architecture agogical -, 963 modulatory -, 603 Argerich, Martha, 894, 895, 927 argument, 748 Aristotle, v, 30, 31, 43, 50, 934 arpeggio, 88, 161, 697, 720, 760 field, 698 arrow, 618, 1063 self-addressed -, 626 articulated listening, 304 articulation, 304, 682, 769, 783, 832 double -, 19 field, 687-689 initial -, 702 operator, 720 artistic fantasy, 692
1270 artistry combinatorial -, 243 arts, 5 Ashkenazy, Vladimir, 884, 893, 895, 897 Assayag, Gerard, viii, 955 associated metric, 1154 metrical rhythm, 327 topology, 1154 AST, 247, 332, 470, 498, 534 global -, 382-385 software for -, 255 asymmetries of communication, 910 Atari®,758 Mega ST4, 955 atlas, 308, 676 A-addressed, 309 projective -, 360 standard -, 357 atlases equivalent, 314 atom semantic -, 538 atomic formula, 1135 atomism ontological-, 27 atonal music, 248 attack, 1018 auditory cortex, 639, 641 gestalt, 481 nerve, 1037 representation, 240 augmentation, 161 augmented, 540 Augustinus, 564, 611 Auroux, Sylvain, 41 auto, 1116 auto complementarity, 220, 517 function, 508, 632 autocomplementary marked dichotomy, 631 autocorrelation, 925 autocorrelogram, 864 automorphism, 1066, 1075, 1116 group, 174, 1083
of interpretable compositions, 372 autonomy, 7 Avison, Charles, 303 axiom, 1133 of choice, 1060 axis third -, 113 diachronic -, 399, 575 fifth -, 113 of combination, 138, 260 of selection, 138, 260 paradigmatic -, 194 synchronic -, 399, 575 syntagmatic -, 18, 194 B Bätschmann's Bezugssystem, 12 Bätschmann, Oskar, 12, 187 Békésy, Georg von, 1041 Babbitt, Milton, 247-249 Bach, Johann Sebastian, 137, 141, 144, 196, 231, 243, 248, 303, 304, 394, 595, 693, 740, 835, 857, 860, 907 background, 503 Bacon, Francis, 5 ball open -, 1153 Banach space, 1154 Banach, Stefan, 1154 band frequency -, 640 bandwidth, 856, 874 bankruptcy scientific -, 24 bar grouping, 864 bar-line, 720, 768 bar-lines, 81 barline meter, 115 Barlow, Klarenz, 1053 Barthes, Roland, 17, 195 barycentric coordinate, 1149 base, 1085 for a topology, 1146 sheaf on -, 1108
Basic, 88 basic extension, 518 intension, 518 series, 150 theme, 246 basilar membrane, 1038 basis, 1129 calculation, 918 coordinate, 1028 deformation, 797 of a tangent composition, 669 of disciplines, 6 parameter, 79, 795 space, 689, 715, 763 specialization, 797 basis-pianola operator, 795 Baudelaire, Charles, 268, 601, 963 Bauer, Moritz, 262 beat, 457, 1051 frequency, 1051 meter, 115 strong -, 457 weak -, 457 beauty, 419 Beethoven, Ludwig van, 118, 145, 161, 245, 303, 327, 337, 394, 422, 492, 495, 559, 560, 563, 567, 594, 603, 693, 935, 941, 1035 before qualifier, 983 behave well, 483 Benjamin, Walter, 665, 691, 792 Beran operator, 876 Beran, Jan, viii, 245, 246, 594, 745, 855, 876 Berg, Alban, 150, 248, 301 Berger, Hans, 640 Bernard, Jonathan W., 249 Beschler, Edwin, ix Bessel function, 1024 Bezugssystem, Bätschmann's, 12 Bezuoli, Giuseppe, 30 biaffine, 1093 bidirectional dialog, 34
big bang, 399 big science, 239 bigeneric approach, 540 major tonality, 547 bigeneric morphemes construction of -, 541 bijective, 1059 bilinear form, 354 Binnenstruktur, 301 biological inheritance principle, 763 bipolar recording, 638 Bissonanz, 514 Blake, William, 505, 911 block, 716, 947 type, 379 Boccherini, Luigi, 295, 994 body performance -, 712 Bösendorfer, 700, 764, 765, 833, 849 boiling down method, 787 book concept, 56 BOOLE, 50 Boole, George, 1132 Boolean algebra, 123, 1132 combination of (class) chords, 111 operation, 947 topos, 1134 bottle Möbius -, 677 bottom wall, 768 Boulez, Pierre, 33, 39, 40, 45, 105, 106, 137, 152, 299, 301, 309, 349, 369, 939, 941, 999, 1007, 1048 bound Chopin rubato, 760 variable -, 1136 boundary, 1146 bow angle, 288 application, 288
parameter, 1003 pressure, 288, 1002 velocity, 288, 1002 box, 969 factory -, 972 flow -, 1160 temporal -, 979 Box-Value, 983 bracket Lie -, 1104, 1161 brain emotional -, 641 breaking symmetry -, 936 Brendel, Alfred, 883, 884 brilliance, 290 Brinkman, Alexander, 255 Bruijn, Nicolaas Govert de, 232, 236, 379 bundle of ontologies, 171 tangent -, 1155 Bunin, Stanislav, 891 Buteau, Chantal, viii, 456, 490
C C-major, 556 inner symmetry of -, 147 c-motif, 119 C-scale frame, 577 C-scheme, 1112 C-space, 1107 c-space, 251 CAC, 967 cadence, 502, 551-562, 566, 986, 1008 parameter, 552 Rameau's -, 554 cadential, 552, 565 family minimal -, 554 formula, 551 set, 554 Cage, John, 70, 306, 694 calculation basis -, 918 field -, 918
precision, 775 calculus, 692 camera obscura, 734 canon, 140, 194, 328 canon cancricans, 857, 860 canonical curve, 872, 890, 899 operator, 253 Para-meter, 329 program, 394 canons classification of -, 380 cantus durus, 320 firmus, 619 mollis, 320 Capova, Sylvia, 894 cardinality, 817,1059 abstract, 468 of a gestalt, 475 of a local composition, 107 carrier, 87, 1022 Cartesian product, 1058 cartesian closed, 1127 product, 1121 case linear -, 250 cyclic -, 250 Casella, Alfredo, 223 Castine, Peter, 255 catastrophe, 605, 606, 608, 610 theory, 277, 604 categorical approach, 967 categories equivalent -, 1119 category cocomma -, 1121 cocomplete -, 1122 comma -, 1121 complete -, 1122 finitely co complete -, 1122 complete -, 1122 isomorphic -, 1118
INDEX matrix -, 1117 of cellular hierarchies, 722 of commutative global composition, 347 of coverings of sets, 357 of denotators, 402-406 of elements, 158, 1122 of forms, 67 of functorial global compositions, 335 of local compositions, 105 of objective global compositions, 335 of performance cells, 713 of sheaves, 1130 of textual semioses, 409 opposite -,1117 path -,1079 product -, 1118 quotient -, 1117 skeleton -, 1117 Cauchy problem, 1162 sequence, 1154 Cauchy, Augustin, 1154, 1162 causal coherence, 929 depth, 821 relation, 985 causal-final variable, 927 causality, 925 CDC Cyber, 639 Cech cohomology, 431 Celibidache, Sergiu, 140 cell, 373 complex, 394 Deiters' -, 1039 hair -, 1038 outer hair -, 1039 pillar -, 1039 performance -, 711 cellular hierarchies category of, 722 classification of -, 718 hierarchy, 716, 725
product -, 718 restriction -, 718 type of a -, 716 organism, 394 Cent, 1031 center, 1075 central pitch detector, 1045 CERN, 239 chain, 1089 proof -, 1133 third -, 820 chamber pitch, 684, 1031 change of material, 982 of orientation, 619, 626, 646 of perspective, 393 program -, 947 value -, 769 CHANT, 291 chant Gregorian -, 620 character string, 71 characteristic, 1076 function, 407, 1062 map, 1126 characteristics method of -, 1161 charge semantic -, 490 chart, 307, 309 chart of level j, 329 Chim Chim Cheree, 219 choice axiom of -, 1060 Chomsky, Noam, 286 Chopin rubato, 667, 682, 698, 759, 924 bound -, 760 free -, 760 Chopin, Frédéric, 98, 760 chord, 109, 219-227, 304, 502 structural -, 503 a scale's -, 112 addresses, 111 analysis, 533
1274 circle -, 514 class -, 111 classification, 194, 502 closure, 523 complement, 111 core -,590 dictionary, 219 difference, 111 diminished seventh -,563,604,608,610 event, 110 foundation -, 535 fundamental-, 534 inspector, 820 intersection, 111 inversion, 509 isomorphism classes, 219 just class -, 111 n-,109 pivotal -, 563 progression, 502 prolongational -, 503 self-addressed -, 225 sequence, 591 coherent -, 591 standard -, 531 symbol, 533, 535 tempered class -, 111 tesselating -, 377 union, 111 CHORD-CLASSIFIER, 256, 535 Chowning, John, 1022 chromatic (tempered) class chord, 111 Michel-, 582 octave just -, 114 Roederer -, 582 scale, 506 Vogel-, 582 chronospectrum, 287 circle chord,514 of fifths, 513
INDEX of fourths, 513 circular colimit, 77 definition, 55, 176 denotator, 85-89 denotators folding -, 448 form, 76 limit, 77 set, 79 synonymy, 76 circularity conceptual -, 176 of forms, 56 CL,1134 class, 188, 968 contour -, 252 resultant -, 381 chord, 111 just -, 111 tempered -, 111 contiguity -, 1150 counterpoint dichotomy -,631 dichotomy -,630 equivalence -, 1060 ground -, 381 marked counterpoint dichotomy -,630 dichotomy -,630 meta-object -, 982 nerve, 346, 376, 390 number, 219 precedence list, 971 segment -,252,255 set -, 253, 254 third comma -,325 Vuza -,380 weight, 230, 346 classical logic, 1134 classification, 997 epistemological -, 6 semiotics of sound -, 294
INDEX chord -,502 geometric -, 216 in musicology, 192 local musical interpretation of -, 211 local theory of -, 191 of canons, 380 of cellular hierarchies, 718 of chords, 194 of motives, 228-231 of music-related activities, 4 of rhythms, 380 of sounds, 11 recursive -, 216 sound -,284 technique, 205 theory, 999 classifier subobject -, 1126 CLOS, 256, 967, 968 closed cartesian -, 1127 locally -, 1147 path, 1063 point, 279 set, 1145 sieve, 1130 simplex, 1149 closure, 1145 hierarchy -, 715 objective -, 524 Clough, John, 248 cluster Cortot -, 894, 898 Horowitz -, 894, 898 Clynes, Manfred, 734, 738 CMAP, 255 coarser, 1145 coboundary map, 1150 cochain complex of a global composition, 374 singular -, 1150 cochlear Fourier analysis, 1051
cocomma category, 1121 co complete category, 1122 cocone, 1119 coda, 304, 603 code, 259 codification of a symmetry, 154 codomain, 1059, 1115 coefficient absorption -, 1015 largest -, 890 system, 1150 coefficients signs of -, 887 cognitive dimension, 219 effort, 218 independence, 219 musicology, 23 psychology, 218, 276 science, 743 coherence, 503, 667, 772, 834 causal -, 929 final-, 929 harmonic -,544 inter-period -, 929 coherent analysis, 772 chord sequence, 591 topology, 1146 Cohn, Richard, 384 cohomology, 374 Cech -,431 group, 1151 l-adic -, 460 module of a global composition, 374 resolution -, 432 coinduced topology, 1146 Coleman, Ornette, 959 Coleman, Steve, 458 colimit, 308, 1121 circular -, 77 form, 67 topology, 1146 colinear, 281
1276 collaborative environment, 240 collaboratory, 35, 809 collective responsibility, 770 color coordinate, 1021 encoding, 923 parameter, 1004 sound -, 194 space, 1000 coloring, 947 Coltrane, John, 694, 733 COM matrix, 470 combination axis of -, 138, 260 linear -, 1084 weight -, 827 combinatorial activity, 242 artistry, 243 topology, 310 combinatoriality, 257 combinatorics creative -, 301 comes, 194, 243, 835 comma category, 1121 addressed -, 165 fifth -, 74 Pythagorean -, 74 syntonic -, 74, 115 third -, 74, 325 common language, 25 taste, 907 common-note function, 249 communication, 4, 5, 10, 12, 27 asymmetries of -, 910 coordinates, 16 process, 15 communicative dimension, 25 commutative, 1063 local composition module of a -, 125 diagram, 1118
INDEX global compositions, 347 local composition, 125 polynomials, 1077 commutativity relation, 1117 compact, 1147 comparative analysis, 333 discourse, 601 comparative criticism, 912 comparison matrix, 251 competence, 401 historical, 424 stylistic, 424 complement, 1058 abstract -, 254, 257 action, 630 literal -, 257 theorem, 254 complete category, 1122 harmony, 995 quiver, 1063 uniform space, 1154 completeness, 41, 48, 57 finite -, 166 completion semantic -, 57 complex cell -, 394 module -,350 numbers, 1076 of affine functions, 351 quotient -, 351 set -, 382 simplicial -, 940, 1148 simplicial cochain -, 1150 complexity, 201 degree of -, 197 formal -, 465 measure, 311 of performance, 664 component aleatoric -, 242 alteration, 128
INDEX idempotent -, 1064 irreducible -, 330 metrical -, 327 reduced -, 128 composed frame, 968 composer, 13 perspective of the -, 301 composition, 198, 1059 t-fold tangent -, 669 global standard -, 357 commutative local-, 125 computer assisted -, 967 computer-aided -, 935, 955 concept, 694 dodecaphonic -, 149 functorial local -, 121 generic, 212 global -, 47, 169, 999 N-formed -, 354 oriented -, 355 resolution of a -, 393 global functorial -, 314 global objective -, 309 interpretable -, 370 local -, 47, 89, 105, 107 embedded -, 126 dimension of a -, 217 generating -, 126 projecting -, 216 standard -, 357 local objective -, 107 locally free local -, 213 modular -, 307 musical -, 33 non-interpretable -, 371, 376 tangent -, 669 tools fractal -, 137 compositional design, 255 idea, 391 space, 249
compositions commutative global-, 347 local category of -, 105 computation symbolic -, 967 computational musicology, 23 computer assisted composition, 967 performance research, 764, 850 science, 7, 188 computer-aided anthropology, 925 composition, 935, 955 conativity, 259 concatenation, 159, 1059 principle, 160, 624 concept, 10, 39 architecture, 184 composition -, 694 construction history, 55 denotator -, 808 form -, 808 format, 48 framework, 3, 5, 9 abstraction -, 468 dynamic -, 399 fuzzy -, 200 grouping -, 305 human - construction, 55 leafing, 58, 60 of a book, 56 of instantaneous velocity, 30 of music, 23 paradigmatic -, 280 poietical -, 15 point -, 175 RUBATO@-, 807 score -, 307, 693, 909, 978 set -, 176 space, 23, 34, 36 stable -, 276 surgery, 99-102, 770 concepts
1278 standard of basic musicological -, 108 void pointer -, 35 conceptual circularity, 176 explicitness, 23 failure, 26 genealogy, 75 identification, 280 laboratory, 33 navigation, 39 precision, 35 profoundness, 109 universality, 109 zoom-in, 21 conceptualization dynamic -, 79 fuzzy -,455 human -,175 precise -, 258 process, 245 concert form, 957 master, 761 pitch, 699 concert for piano and orchestra, 307 condition initial -, 1156 instrumental -, 850 conductor, 668, 683, 761 cone, 1119 configuration counting series, 233 conjugation, 1067, 1076 conjugation class of endomorphisms, 220 of symmetry group, 220 conjunction, 421, 1131 connective predicate -, 1135 connotation, 19 connotator, 398, 1142 consonance, 564, 571, 619, 1049 deformed -, 646 imperfect -, 635, 646, 657 perfect -, 635, 646, 657 consonance-dissonance, 1035
dichotomy, 508, 632, 657 consonant, 286 interval, 503, 640 mode, 547 constant functor, 1119 module complex, 350 part, 525 shift, 129 structural -, 1105 constraint gestural -, 751 programming, 935, 967 constraints semiotic -, 284 construction activity, 197 of bigeneric morphemes, 541 recursive -, 49 construction history of concept, 55 contact, 259 point, 288 container, 973, 978 content, 17, 410, 497, 786, 817 interval -, 252 mathematical -, 17 maximal structure -, 1047 musical -, 17 context, 259, 895 problem, 819 real-time -, 917 contiguity, 18 class, 1150 contiguous simplicial maps, 1150 continuous, 1146 gesture, 986 method, 776 stemma, 803 weight, 775 contour, 193, 470 class, 252 space, 251 theory, 332
INDEX American -,467 contra, 646 contraction, 1157 contrapunctus III, 835 contrapuntal form, 304 group, 137 interval oriented -, 619 meaning of Z-addressed motives, 120 motion shape type, 470 sequence, 646 symmetry, 647 local character of a -, 647 technique, 194 tension, 646 tradition, 243, 1052 contravariant functor, 1118 contravariant-covariant rule, 972 control group, 936 interactive -, 982 of transformation, 244 conversation topos of -, 995 coordinate barycentric -, 1149 basis -, 1028 color -, 1021 fifth -, 1032 function, 212 geometric -, 1021 octave -, 1032 ontological -, 10 pianola -, 1028 third -, 1032 coordinates, 52 of existence, 701 coordinator, 50, 1139 form -,64 of a form, 65 coproduct, 1062 ambient space, 124 of local compositions, 124
type, 53 core chord, 590 correlate electrophysiological -, 637 cortex auditory -, 639, 641 Corti organ of -, 1038 Cortot cluster, 894, 898 Cortot, Alfred, 891, 894, 897 coset left -, 1067 right -, 1067 cosmology, 565 counterpoint, 161, 508, 618, 637, 995 dichotomy, 630 class, 631 double -, 624 theorem, 649, 653 theory, 936, 1008 countersubject, 836 counting series configuration -, 233 coupling monogamic -,769 polygamic -, 769 covariant functor, 1118 covering, 308 equivalent -, 309 family, 1129 motif, 467 sieve, 1129 cp,251 CPL,971 cpset, 251 creative combinatorics, 301 extension, 245 creativity, 242, 399 alphabet of -, 242 creator, 12, 13 crescendo, 79, 668, 722, 738, 1029 wedge, 778 critical
distance, 1015, 1016 fiber, 911, 915 criticism comparative -, 912 journalistic -, 885 music -,772 critique, 911 music -,905 cross-correlation stemmatic -, 771 cross-semantical relation, 745 cube topographic -, 19, 36 cul-de-sac, 657 interval, 653 culture of performance, 757 curve, 282, 1156 canonical -, 872, 890, 899 integral -, 1158 intonation -, 684 tempo -, 682, 738, 758, 877, 947 curvilinear reduction, 937 CX5M Yamaha -, 639 cycle, 1063, 1071, 1159 index, 233, 1071 of variations, 956 pitch -, 252 cyclic case, 250 extension, 253 group, 1069 interval succession, 253, 254 Czerny, Carl, 758, 924
D d'Alembert, Jean Le Rond, 5, 40, 58 da capo, 140 dactylus, 260 grid, 265, 266 Dahlhaus, Carl, 26, 147, 300, 323, 324, 544, 574, 594, 819, 994, 1053 dance, 735 Dannenberg, Roger, 918
data ethnomusicological -, 99 dataglove, 738 daughter, 752 tempo, 682 daughters, 725 Davin, Patrick, 986 DBMS, 808 de la Motte, Helga, 25, 694 Debussy, Claude, 223, 600, 756 decay, 1018 decomposition hierarchical -, 858, 872 natural -, 855 orthonormal -, 11 spectral -, 856 Sylow -, 542, 620 Deep Purple, 231 default weight function, 587 definition circular, 55, 176 of music, 6 of musical concepts, 114 deformation, 276, 720 basis -, 797 degree of -, 951 hierarchy -, 799 non-linear, 827 non-linear -, 776, 889 of a tempo curve, 699 pianola -, 720, 797 deformed consonance, 646 dichotomy, 646 dissonance, 646 degree, 304, 321, 535, 537, 566 different -, 323 modulation -, 566 of complexity, 197 of deformation, 951 of freedom, 219 of organization, 869 of symmetry, 254 parallel -, 324 system
INDEX irreducible -, 556 theory, 531 Deiters' cell, 1039 Delalande, Francois, 738, 740 delay, 1002 relative -, 288 Deligne, Pierre, 427 delta Kronecker -, 1085 Dennett, Daniel, 181 denotator, 47, 67-69 attributes, 48 circular -, 85-89 concept, 808 genealogy, 47 flow chart, 49 image, 69 language, 723 morphism, 108 name, 52 non-zero-addressed -, 82 ontology, 398 orchestra instrumentation -, 82 philosophy, 185 reference -, 403 regular -, 79-85 self-addressed -, 82 truth -,407 Z-addressed -, 62 denotators circular - folding, 448 linear ordering among -, 58 ordering on -, 89-99 ordering principle on -, 57 relations among -, 105 Denotex, 811, 1143 DenotexRUBETTE®, 811 dense, 280 densification, 986 depth, 23, 25, 240 causal -, 821 EEG,637 stereotactic -, 638 electrode, 639 final-, 821
in mathematics, 25 in musicology, 26 in the humanities, 591 semantic -, 465 derivation, 1105, 1160 inner -, 11 05 outer -, 1105 derivative, 1155 Lie -, 1160 derived serial motif, 237 Desain, Peter, 664 Descartes, Rene, 12, 178, 1049 description object -, 244 verbal -, 756 design compositional -, 255 matrix, 877 Desmond, Paul, 218 development, 304, 603 history, 745 software -, 723 syn- and diachronic of music, 242 Dezibel, 1029 di-alteration, 129 Diabelli Variations, 394 diachronic, 17 axis, 399, 575 index, 273 normalization, 909 diaffine homomorphism, 1090 diagonal embedding, 1098 field, 686 diagram, 1118 scheme, 1117 commutative -, 1118 filtered -, 1107 Hasse -, 267, 1061 of forms, 67 dialog, 996 bidirectional, 34 experimental navigation -, 35 dialogical principle, 997 diameter, 633
1282 diastematic, 816 index shape type, 470 shape type, 470 diatonic scale, 658 dichotomy class, 630 marked -, 630 consonance-dissonance -, 508, 632, 657 counterpoint -, 630 deformed -, 646 interval -, 630 major -, 631, 657 marked counterpoint -, 630 marked interval -, 630 Riemann -,636 Saussurean -, 17 dictionary of expressive rules, 747 Diderot, Denis, 5, 40, 58 difference, 1058 genealogical -, 912 phenomenological -, 912 different degree, 323 differentiable, 1155 differential, 1160 equation, 1156 semantic -, 198 differentiation rules, 742 digital age, 40 diinjective, 1101 dilatation, 160, 1097 time -, 83 dilinear homomorphism, 1084 part, 1090 dimension, 1085, 1148 cognitive -, 219 communicative -, 25 of a local composition, 217 of a simplex, 1148 ontological -, 19 diminished, 540 diminished seventh chord, 563, 604, 608, 610
INDEX Ding an sich, 23 direct image, 1107 sum module, 1084 directed graph, 1063 direction of alteration, 951 directional endomorphism, 797 Director Musices, 742 discantus, 619 disciplinarity dynamic -, 809 discipline basic -,6 discourse analytical -, 12 comparative -, 601 esthesic -, 15 discoursivity, 41, 48, 57 discrete, 774 interpretation, 311 field, 917 gesture, 986 nerve, 311 topology, 1145 disjoint, 1058 sum, 1121 disjunction, 421, 1131 dissonance, 564, 571, 619, 1049 deformed -, 646 emancipation of -, 33 dissonant interval, 640 mode, 547 distance, 276, 279 critical -, 1015, 1016 Euclidean - for diastematic types, 472 Euclidean - for rigid types, 472 for toroidal types, 473 function, 472 natural -, 441 on toroidal sequences, 473 relative Euclidean - for rigid types, 472 third -, 622 to an initial set, 704
INDEX Distributed RUBATO®, 922 distributed laboratory, 35 distributive, 1132 distributor, 835 divertimenti, 994 division of pitch distances, 72 of time regular -, 456 divisor resulting -, 382 documentation, 4, 5, 7 dodecaphonic composition, 149 composition principle, 137 method, 936, 940 paradigm, 150 series, 149, 197, 236, 301, 309, 394 vocabulary, 243 dodecaphonism, 162, 251 communicative problem of -, 162 esthetic principles of -, 162 domain, 1059, 1115 fundamental scientific -, 6 modulation -, 580 dominance, 267, 329 topology, 283, 488 dominant, 323, 502, 541, 545 role of major scale, 657 seventh, 508 dominate, 278, 1110 double articulation, 19 counterpoint, 624 drama musical -, 908 Dreiding, Andre, 355 Dress, Andreas, 355 driving grid, 951 drum ear -,1037 dual affine -, 1091
ambient space, 128 linear -, 1091 numbers, 127, 618, 1077 dualism between major and minor, 147 Dufourt, Hugues, 967 duration, 51, 79 period,115 dux, 194, 243 DX7,1022 dynamic concept framework, 399 conceptualization, 79 disciplinarity, 809 navigation, 45 dynamical initialization, 701 knowledge management, 399 modularity -,809 sign absolute -, 777 relative local -, 778 relative punctual -, 777 dynamically loadable module, 808 dynamics, 303, 304, 682, 685 absolute -, 831 historical -, 271, 273 mechanical-, 739 of performance, 800 primavista -, 764 relative -, 831 symbolic absolute, 81 relative, 81 E E-ball,280 E-neighborhood,482 E-paradigm, 280 ear drum, 1037 inner -, 1037
1284 middle -, 1037 outer -, 1036 ecclesiastical mode, 319, 655, 657 editing geometric -, 946 editor, 968 EEG depth -,637 response, 637 semantic charge of -, 638, 640 test, 638 effect groove -, 952 effective, 1066 Eggebrecht, Hans Heinrich, 23, 24, 26 Ego poetic -, 262, 268 Ehrenfels transpositional invariance criterion, 108 Ehrenfels, Christian von, 108,203,276,301, 332, 334, 465 Eimert, Herbert, 152, 258 Eitz, Carl, 1032 elastic, 816 shape type, 471 electrode depth -,639 electrophysiological correlate, 637 element, 1057 neutral -, 1063 elementary alteration, 129 gesture, 986 neighborhood, 489 shift, 129 elements category of -, 158, 1122 emancipation of dissonance, 33 embedded local composition, 126 embedding, 1118 diagonal -, 1098 number, 254 Yoneda -, 1091, 1120 emotion, 642, 734-737 emotional
INDEX brain, 641 function of music, 642 landscape, 295 emotivity, 259 empty form name, 55 set, 176 string, 52 encapsulated history, 675 encapsulation, 26, 188, 973 speculative -, 30 encoding color -,923 formula rubato -, 751 Encore, 986 Encyclopedie, 5, 41, 43, 58 encyclopedia, 40, 440 encyclopedic ordering, 58 encyclopedism, 56 encyclospace, 41, 43, 58 endo, 1116 endolymph, 1038 endomorphism, 1116 directional -, 797 enharmonic -, 516 right-absorbing -, 524 ring, 1083 energy, 739 spectrum, 1020 enharmonic, 515 endomorphism, 516 group, 517 identification, 515 ensemble rules, 742 Ensemble Intercontemporain, 986 enumeration musical - theory, 232 of motives, 238 theory global -, 376 envelope, 84, 1018 environment collaborative -, 240
experimental -, 827 epi, 1116 epilepsy therapy surgical -, 638 epileptiform potential, 638 epimorphism, 1116 epistemology of musicology, 29 epsilon gestalt topology, 484 topology, 483 Epstein, David, 739 equation differential -, 1156 spring -, 1020 equivalence phonological -, 263 class, 1060 paradigmatic transformation -, 259 perceptual -, 280 relation, 305, 1060 syntagmatic -, 263 equivalent atlases, 314 categories, 1119 covering, 309 norms, 1155 equivariant, 1067 Erwartung, 223 Escher, Cornelis Maurits, 196 EspressoRUBETTE®, 916, 922 essential parameter, 999 esthesic, 12, 1021 identification, 303 esthesis, 12, 15, 258 esthetic, 259 esthetics, 15, 259 of music, 393 ethnological form, 57 ethnology inverse -, 909 ethnomusicological data, 99 ethnomusicology, 909 Euclid, 178, 617 Euclidean
1285 algorithm, 1076, 1080 geometry, 353 metric, 279 Euler function, 1069 module, 73, 218 plane, 110 point, 73, 1031 space, 1031 Euler's identity, 1020 Euler, Leonhard, 73, 581, 619, 1032, 1049, 1165 European score notation, 79 Eustachian tube, 1037 evaluation, 359, 1132 event percussion -, 612 time -, 674 evolution, 763 exact sequence split -, 1069 exchange of pitch and onset, 152 parameter -, 160, 161 existence, 67, 397 mathematical -, 175, 398, 413 musical -, 413 experiment mental -, 666 musicological -, 33, 34 physical -, 32 experimental environment, 827 humanities, 29 material, 401 natural sciences, 29 strategy, 841, 851 experimentation, 982 experiments of the mind, 34 explanatory variable, 877 explicitness conceptual -, 23 exponentiable, 1127 exposition, 304, 603, 959 expression, 406, 733, 916
1286 human -,692 instrumental -, 994 rhetorical -, 692 expressive rules dictionary of -, 747 expressivity pure -,737 rhetorical -, 674 extension, 373, 401, 410, 670 allomorphic -, 539 basic -, 518 creative -, 245 cyclic -, 253 strict -, 539 topology, 521 exterior score, 694 extraterritorial part, 720 extroversive semiosis, 400
F J-morphism, 1107 F-to-enter level, 881 face, 1148 facticity, 397, 420, 565 finite - support, 411 full -, 410 factor pressure decrease -, 1016 strength -, 742 factory box, 972 faithful action, 1066 address, 523 functor, 1118 point, 523 False, 1132 family, 293 covering -, 1129 minimal cadential -, 554 of violins, 997 violin -, 295, 1009 fantasy artistic -, 692 faster uphill, 742
INDEX father, 752 Feldman, Jacob, 739 Feldman, Morton, 306 Fermat, Pierre de, 26 fermata, 668, 766, 769, 781 Ferretti, Roberto, viii feuilleton, 772 feuilletonism, 905 FFT,638 fiber, 743, 1062 critical -, 911, 915 group, 936 product, 1078, 1121 of local compositions, 167 structure, 913 sum, 1121 of local compositions, 169 Fibonacci sequence, 413 Fibonacci, Leonardo, 70, 413 fibration linear -, 914 fiction, 397, 565 fictitious performance history, 763 field, 726 arpeggio -, 698 calculation, 918 diagonal -, 686 discrete -, 917 finite -, 949 fundamental-, 720 interpolation, 918, 922 intonation -, 684 of equivalence, 191 of fractions, 1101 operator, 792 paradigmatic -, 150 parallel articulation -, 689 parallel crescendo -, 689 parallel glissando -, 689 performance -, 685, 690, 712 prime -, 1076 selection, 969 skew -,1075
1287
INDEX tempo -,683 tempo-intonation -, 686 vector -, 1156 writing, 969 fifth, 73, 1031 axis, 113 coordinate, 1032 sequence, 321 Fifth symphony, 303 film music, 733 filtered diagram, 1107 filtering input -, 918 final coherence, 929 depth,821 retard,738 vertex, 802 finale, 956 finalis, 319 finality, 925 fine arts, 14, 186 finer, 1145 fingering, 303, 738, 757 finite, 1057 completeness, 166 cover topology, 430 field, 949 locally -, 1149 monoid, 1063 multigraph, 1062 finitely cocomplete category, 1122 complete category, 1122 generated, 1069, 1084 finitely generated abelian group, 373 Finscher, Ludwig, 993, 994 Finsler's principle, 175 Finsler, Paul, vi, 175 first representative, 220 FIS, 249 Fitting's lemma, 1089
Fitting, Hans, 1089 fixpoint, 1157 group, 1066 flasque module complex, 370 flat, 130 flatten, 88 flattening operation, 88, 331 Fleischer, Anja, viii, 590 FLOAT, 50 flow box, 1160 interpolation, 706 flying carpet, 927 FM, 289, 1022 -object generalized, 292 analysis, 289 synthesis, 86 folding, 442 circular denotators, 448 colimit denotators, 446 limit denotators, 446 foramen ovale recording, 638 force field alteration -, 952 modulation -, 567, 571 forces in physics, 649 foreground, 503 form, 50, 61-67 bilinear -, 354 circular -, 76 circularity, 56 coli mit -, 67 concept, 808 concert -, 957 contrapuntal -, 304 coordinator, 64, 65 ethnological -, 57 Forte's prime -, 256 functor, 64 identifier, 64, 65 limit -, 67 list -, 976 morphisms
1288 wrap -,402 musical-,6 name, 50, 51 empty -,55 names, see Symbols normal -, 255 of a symmetry, 135 pointer character, 55 powerset -, 66 prime -,257 Rahn's normal -, 255 regular -, 76 semiotic global -, 1141 simple -,66 simplify to a -, 75 sonata -, 304, 603, 956 space, 64 Straus' zero normal -, 256 synonym -,66 type, 64 typology, 65 form semiotics morphism of -, 1141 formal complexity, 465 structure, 967 formalism Lie -, 800 formant, 291 manifold, 291 open - set, 291 forms category of -, 67 diagram of -, 67 ordering on -, 89-99 formula, 1135 atomic -, 1135 cadential-, 551 propositional -, 1136 quantifier -, 1136 Forte's prime form, 256 Forte, Allen, 247-249, 255, 383 foundation chord, 535 four
INDEX part texture, 995 Fourier analysis cochlear -, 1051 decomposition, 84 ideology, 286 paradigm, 284 representation, 899, 1000 theorem, 10 transform, 1025 Fourier's theorem, 1019 Fourier, Jean-Baptiste, 512 fractal, 70, 196, 198, 943 composition tools, 137 principle, 964 fractions field of -, 1101 frame, 712, 726, 968, 1026, 1139 composed -, 968 simple -, 968 space, 64 structure, 718 wavelet -, 290 framework, 973 application, 808 concept -, 3, 5, 9 hermeneutical -, 12 free action, 1066 Chopin rubato, 760 commutative monoid, 1064 group, 1067 jazz, 665 locally -, 1110 module, 1085 monoid, 1064 variable -, 1136 free jazz, 14 freedom of choice, 658 frequency, 72, 84, 1018, 1021 band, 640 beat -, 1051
fundamental -, 1019 modulation, 289, 1022 modulation -, 288, 1003 of variable inclusion, 888 Freud, Sigmund, 643 Friberg, Anders, 742 Fripertinger, Harald, viii, 203, 231, 257, 376, 378, 1071 Frost, Robert, 333 Fryden, Lars, 741, 755 fugue, 194, 243 full address, 523 functor, 1118 model, 880 point, 523 subcategory, 1118 subcomplex, 1148 fully faithful address, 523 functor, 1118 point, 523 function, 1059 A-addressed -, 351 autocomplementarity -, 508, 632 Bessel -, 1024 characteristic -, 407, 1062 common-note -, 249 Euler -, 1069 generic -, 971 gradus suavitatis -, 1049, 1165 horizontal poetical -, 942 index -, 1081 interval -, 249 inverse -, 1059 level -, 329, 1061 of a symmetry, 136 poetical -, 18, 138, 259, 295, 303, 934, 942 theory, 324, 531 tonal -, 304, 323, 544 value tonal -, 544 vertical poetical -, 942 function harmony, 35
functional, 1059 programming, 967 semantics, 541 functor, 1117 address -, 170 constant -, 1119 contravariant -, 1118 covariant -, 1118 faithful -, 1118 form -, 64 full -, 1118 fully faithful -, 1118 global section -, 1109 module -, 172 nerve -, 1148 of orbits, 1114 open -, 1113 open covering of -, 1113 representable -, 1120 resolution -, 358 support -, 314 functorial global composition, 314 local composition, 121 fundamental activities, 4, 7 chord, 534 field, 720 note, 535 period, 1019 pitch, 532 scientific domain, 6 series, 137 space, 715 fushi, 14, 416 Fux rule, 657 Fux, Johann Joseph, 636, 656, 1008, 1053 fuzziness, 531 fuzzy concept, 200 conceptualization, 455 logic, 409 set, 198 theory, 194
1290 G G-prime form, 254 Gotterdammerung, 814 Gabriel, Peter, 185 Gabrielsson, Alf, 734, 738 Galilei, Galileo, 29, 30, 32, 35, 664 Galois, Evariste, vi Garbers, J6rg, viii, 807, 1143 Garbusow, Nikolai, 221 gate function hippocampal-, 642 Gegenklang, 324, 556 Gell-Mann, Murray, 176 genealogical difference, 912 genealogy conceptual, 75 of denotator concept, 47 poietic, 154 general linear algebra, 1105 pause, 782 position, 391 General Midi, 287 general position, 126, 212 musical meaning of -, 127 generated finitely -, 1069, 1084 generating local composition, 126 Generative Theory of Tonal Music (=GTTM), 312, 457 generator sound -, 849 time -,936 generic composition, 212 function, 971 linear visualization, 440 point, 279, 330, 1110 score, 665 genotype, 943 geodesic, 295 geographic information system, 809
INDEX orientation, 43 geometric classification, 216 coordinate, 1021 editing, 946 parameter, 1000 realization, 1149 representation, 946 geometry analytical -, 178 algebraic -, 178, 201, 668 Euclidean -, 353 germ, 326, 1155 rhythmic -, 152, 326 germinal melody, 269, 270, 956 gestalt, 106, 203, 332, 465, 492 abstract -, 474 auditory -, 481 cardinality of a -, 475 global -, 307 musical -, 106, 152 paradigm, 816 psychology, 106 small-, 483 specialization, 488 category, 490 stability, 276 gestural constraint, 751 rationale, 908 semantics, 908 gesture, 735, 738-741 continuous -, 986 discrete -, 986 elementary -,986 instrumental -, 986 orchestral -, 986 Get-Editor, 983 Get-View, 983 Giannitrapani, Duilio, 640 Gianoli, Reine, 891 Gigue Nr. 32, 231 Gilels, Emil, 756 Gilson, Etienne, 13 GIS, 249, 384
structure, 248 Glarean, 320 glide reflection, 1097 glissando, 79, 668, 689, 722, 986, 1032 global, 299 affine functions module of -, 432 agogics, 764 AST, 382-385 composition, 169, 999 cochain complex of a -, 374 enumeration theory, 376 form semiotic, 1141 functorial composition, 314 morphism, 335 gestalt, 307 molecule, 355 molecules morphism of -, 355 morphisms, 300 object, 299 objective composition, 309 objective composition morphism, 335 performance score, 728 predicate, 552 score, 307, 946 section, 350, 1121 functor, 1109 slope, 822 solution, 1159 standard composition, 357 tangent composition, 675 technical parameter, 1005 tension, 822 theory, 269 threshold, 819 globalization
1291 metrical-, 116 orchestral -, 673 Goller, Stefan, viii, 441, 1143 Goethe, Johann Wolfgang von, 147, 198, 394, 996 Goldbach conjecture, 32 Goldbach, Christian, 32 Goldberg Variations, 394 golden section, 70 Goldstein, Julius, 1045 Gottschewski, Hermann, 31 Gould, Glenn, 667, 740, 841, 851, 907 GPL, vii GPS,728 gradus suavitatis function, 1049, 1165 Graeser, Wolfgang, 135, 137,248,304 Gram identity, 355 grammar locally linear -, 802 performance -, 747 rule-based -, 748 grand unification, 564 granddaughter, 802 grandmother, 802 graph, 1058, 1062 directed -, 1063 of a FM-denotator, 87 Riemann -, 821 weighted -, 292 graphical interface design, 439 MOP, 982 Grassmann scheme, 1113 greeking, 441, 463 Gregorian chant, 620 Greimas, Algirdas Julien, 934, 936, 937 grid dactylus -, 265, 266 driving -, 951 vector
1292 horizontal -, 951 vertical -, 951 groove effect, 952 Grothendieck topology, 180, 430, 1129 topos, 1130 Grothendieck, Alexander, vi, ix, 175, 180, 185, 427, 430, 436, 1129 ground class, 381 group, 1066 affine counterpoint -, 475 automorphism -, 174, 1083 cohomology -, 1151 contrapuntal -, 137 control -, 936 cyclic -, 1069 enharmonic -, 517 fiber -, 936 fixpoint -, 1066 free -, 1067 homomorphism, 1066 isomorphism, 1066 isotropy -, 1066 Klein -, 251, 548 linear counterpoint -, 475 opposite -, 1066 p-Sylow -, 1069 paradigmatic of isometries, 478 paradigmatic -,474 product -, 1068 quotient -, 1067 rhythmical -, 977 simple -, 1067 Sylow -,218 symmetric -, 1066 symmetry -, 174, 220, 571, 816 theory, 191, 259 torsion -, 1070 group-theoretical method, 241, 250 grouping, 456, 503, 739, 764 bar -, 864 concept, 305 hierarchical -, 743
INDEX instrumental -, 728 metrical -, 303 of sounds, 88 rules, 742 stemmatic -, 770 structure (=G), 457 time -, 118 GTTM, 312, 457, 752 Guarneri del Gesu, 1000 gyri Heschl's -, 1044 H Holderlin, Friedrich, 9, 138 Hiillakkord, 523 Haegi, Hans, 355 hair cell, 1038 Haj6s group, 377, 382 Haj6s, Gyorgy, 377 Halle, Morris, 286 Halsey, George, 376, 381 Hamilton, William, 1076 Hamiltonian, 800 hammer, 1037 Hammerklavier-Sonate, 118, 245, 327, 337, 559, 560, 563, 567, 594, 603, 667, 693, 907, 941 hanging orientation, 128, 619 Hanslick, Eduard, 17, 303, 307, 935, 997 harmolodic, 959 harmonic coherence, 544 knowledge, 591 logic, 546 minor, 584 morpheme, 546 motion, 503 path,586 progression, 154 semantics, 531 strip, 310, 321, 538 tension, 586, 587 topology, 538 weight, 587, 786
INDEX harmonic minor scale, 575, 577 tonality, 560 harmonical-rhythmical scale, 959 harmony, 106, 221,637 complete -, 995 jazz -, 337 Keplerian -, 33 Riemann -, 322 HarmoRUBETTE@, 546, 586, 787, 819, 866 Harnoncourt, Nicolas, 909 Harris, Craig, 255 Haschemann, 947 Hashimoto, Shuji, 738, 745 Hasse diagram, 267, 1061 specialization -, 269 hat Mexican -, 1025 Hauptmann, Moritz, 531 Hausdorff topology, 1147 Hausdorff, Felix, 1147 hayashi, 14 Haydn, Joseph, 295, 994, 995 Hazlitt, William, 831 heartbeat, 738 Hebb, Donald 0, 871 Hegel, Georg Wilhelm Friedrich, 39, 942 Heijink, Hank, 918 helicotrema, 1038 Helmholtz, Hermann von, 619, 1049, 1051 Hemmert, Werner, viii Hempfling, Thomas, ix Hentoff, Nat, 733 hermeneutics unicorn of -, 14 Hertz, 1018 Herve, Jean-Luc, 986 Heschl's gyri, 639, 1044 Hess system, 638 Hesse, Hermann, 200, 695 Hewitt, Edwin, 376, 381 hexameter, 261 Heyting algebra, 123, 407, 1132 logic, 530
Heyting, Arend, 407, 530, 1132 Hichert, Jens, 629, 649, 651, 653, 657 hidden symmetry, 136 hierarchical decomposition, 858, 872 grouping, 743 organism, 304 smoothing, 857 hierarchy, 306, 674, 767 cellular -, 716, 725 closure, 715 deformation, 799 metrical -, 455 of performance development, 757 parallel -, 718 performance -, 674 piano -, 722 space -, 715 standard -, 717 tempo -, 758 violin -, 722 Himmelfahrtsoratorium, 595 Hindemith, Paul, 147, 503, 512 Hintergrund, 503 hippocampal gate function, 642 memory function, 642 hippocampus, 639, 642 histogram, 865 historical dimension of music, 108 approach, 565, 574 dynamics, 271, 273 instrumentation, 393 localization, 273 musicology, 399 process, 763, 771 rationale, 994 reality, 594 historicity in music, 271 history, 5 development -, 745 encapsulated -, 675 of music, 6
1294 hit point, 706 problem, 704 Hjelmslev, Louis, 16, 19, 398 Hofmann, Ernst Theodor Amadeus, 303 homeomorphism, 1146 homomorphism diaffine -, 1090 dilinear -, 1084 group -, 1066 Lie algebra -, 1104 linear module -, 1083 monoid -, 1063 ring -, 1075 structural -, 1075 homotopy, 1150 relative -, 1150 Honing, Henkian, 664 Horace, 755 horizontal grid vector, 951 poetical function, 942 poeticity, 261 Horowitz cluster, 894, 898 Horowitz, Vladimir, 884, 891, 895, 897, 927 human conceptualization, 175 expression, 692 precision, 757 humanism, 997 humanities, 200, 275 experience in the -, 34 experimental-, 29 Husmann, Heinrich, 1052 hypermedia, 43 hyperouranios topos, 23, 42 I 1,24 Iberie, 223 ICMC, 240, 744 icon
INDEX instrumental -, 947 idea compositional -, 391 musical -, 934, 935 ideal, 1076 left -, 1076 right -, 1076 idempotent, 1064 component, 1064 identification conceptual -, 280 enharmonic -, 515 esthesic -, 303 identifier, 1139 form -, 64, 65 identity, 492, 496, 1115 of a point, 178 abstract, 16 Euler's -, 1020 Jacobi -, 1104 of a work, 16 slice, 336 ideology Fourier -, 286 IL,1134 image, 1062 denotator -, 69 direct -, 1107 inverse -, 1062 imagination, 5 imitation, 492, 494 immanent analysis, 465 imperfect consonance, 635, 646, 657 implementation, 763 implication, 421, 1131, 1132 importance relative -, 786 Impromptu, 760 improvisation, 45 jazz -, 218 in absentia, 18, 953 in praesentia, 18, 952 in-time music, 986 inbuilt performance
INDEX grammar, 907 included literally -, 255 inclusion abstract -, 254, 257 literal -, 257 incomplete semiosis, 401 incorrect politically -, 907 indecomposable, 1089 space, 715 independence cognitive -, 219 index cycle -, 1071 diachronic -, 273 function, 1081 set, 1060 indiscrete interpretation, 312 topology, 1145 individual variable, 1135 ineffability, 25, 693 infinite, 1057 interpretation, 317 message, 905 performance, 666 infinitely small, 692 infinitesimal, 774 information, 39 paratextual -, 831 system geographic -, 809 InfoRUBETTE®, 810 inharmonic, 819 inharmonicity, 290 inheritance, 763, 968 principle biological -, 763 property, 479 initial, 1121 articulation, 702 condition, 1156 design matrix, 878
moment, 697 performance, 712, 726 set, 696, 712, 725 polyhedral -, 704 value, 683 initial set distance to an -, 704 initialization dynamical-,701 injective, 1059 module, 1103 inlet, 972 inner derivation, 1105 ear, 1037 logic, 757 score, 665, 694 input filtering, 918 real-time -, 946 inspector chord -,820 instance, 968 instantiation, 968 instinctive activity, 757 instrument name, 82 space, 726 instrumental condition, 850 expression, 994 gesture, 986 grouping, 728 icon, 947 parameter, 1002, 1004 technique, 1002 variety, 673 vector, 1005 voice, 269 instrumentation historical -, 393 orchestra - denotator, 82 instrumentum,42 InUgmles, 394 INTEGER, 50
1296 integer, 1076 integral curve, 1158 surface, 1161 integrated serial motif, 237 integration method, 829 intensification, 964 intension, 401, 519 basic -, 518 topology, 520 intensity, 739 inter-period coherence, 929 interaction interpretative -, 873 matrix, 857 interactive control, 982 interface design graphical -, 439 interictal period, 638 interior, 1146 interlude, 836 intermediate performance, 756 internal structure, 311 interpolation, 692, 917 field -, 918, 922 flow -, 706 interpretable composition, 370 automorphism group of -, 372 molecule, 356, 385 interpretation, 269, 272, 665 discrete -, 311 indiscrete -, 312 infinite -, 317 iterated -, 317 just triadic degree -, 325 metrical, 328 motivic -, 332, 467 of a local composition, 316 of weights, 800 rhythmical -, 328 semantic -, 598 silly -, 312
singleton -, 336 sketchy -, 757 tangent -, 677 tetradic -, 337 third chain -, 319 triadic -, 337, 548, 553, 566 triadic degree -, 320 interpretative activity, 300, 307, 308 interaction, 873 interspace, 692 sequence, 234 structure, 234 interval unordered p-space -, 252 unordered pc -, 253 class content vector, 253 consonant -, 503, 640 content, 252 contrapuntal oriented -, 619 cul-de-sac -, 653 cyclic - succession, 253 dichotomy, 630 dissonant -, 640 function, 249 multiplication, 623 ordered p-space -, 252 ordered pc -, 253 succession, 252 cyclic -, 254 mth -, 254 successive -, 640 time -, 83 vector, 253, 257 interval-class vector, 249 intonation, 682, 683 curve, 684 field, 684 intratextual, 400 introversive semiosis, 400 intuition musical -, 246 intuitionistic logic, 539, 1134
INDEX invariance transformational -, 276, 332 vector, 254 invariant pcset, 254 inversa, 839 inverse, 1066 ethnology, 909 function, 1059 image, 1062 left -, 1066 performance theory, 743, 790, 913 right -, 1066 inversion, 140, 302, 321 chord -, 509 real, 148 retrograde -, 73, 144 tonal -, 148, 952 inverted weight, 827 invertible, 1075 !RCAM, 291, 967 irreducible, 716 component, 330 degree system, 556 topological space, 1110 iso, 1116 isometry, 305, 1153 isomorphic, 1066 category, 1118 isomorphism, 1116 group -, 1066 monoid -, 1063 ring -, 1075 isomorphism classes of local rhythms, 221 of chords, 219 isotropy group, 1066 isotypic tesselation, 376 ISPW, 918 istesso tempo, 673 iterated interpretation, 317
J Jackendoff, Ray, 305, 311, 457, 461, 752, 873 Jacobi identity, 1104 Jacobi, Carl, 1104 Jacobian, 1155 Jacobson, Nathan, 1088 Jakobson, Roman, 18, 138, 191, 259, 272, 286, 295, 305, 400, 934, 942 Jandl, Ernst, 963 Jauss, Hans Robert, 187 Java, 808 Java2D, 922 jazz, 13, 14, 45, 218, 694 American -, 538 CD review, 415 free -, 14, 665 harmony, 337 improvisation, 218 lead-sheet notation, 533 JCK, 416 jnd, 280 Johnson, Tom, 594, 953 join, 1132 journalistic criticism, 885 Józef Marja Hoene-Wronski, 387 Julia set, 198 Julia, Gaston, 198 Jupiter Symphony, 458 just, 111 chromatic octave, 114 class chord, 111 modulation, 577 scale, 113 triadic degree interpretation, 325 tuning, 1032 just-tempered tuning, 1033 justest scale, 325 tuning, 560 juxtaposition, 72
K
k-partition, 378 Kohler, Egmont, 230
1298 Kagel, Maurizio, 152, 394 kairos, 994 Kaiser, Joachim, 303, 603, 907 kansei, 738, 745 Kant, Immanuel, v, 10, 23, 32, 43, 175 Karajan, Herbert von, vi, 700, 740, 945 Karg-Elert, Sigfrid, 137, 504 Kepler, Johannes, 136 Keplerian harmony, 33 kernel, 726, 1067 Naradaya-Watson -, 857 smoothing, 856, 874 smoothing -, 857 symbolic -, 712 view, 826 key, 948 function of music, 643 musicogenic, 644 signature, 768 killing address -, 204 Kinderszenen, 495 kindred, 293 Kircher, Athanasius, 242 Klangrede, 19,996 Klavierstiick III, 385 Klein group, 251, 475, 548 knot in FM synthesis, 88 knowledge, 39, 440 crash,412 harmonic -, 591 hiding, 240, 440 human -,5 management dynamical -, 399 ontology, 420 private -, 29 space, 10, 29 Koenig, Thomas, 833 Kollmann, August, 995 Kontra-Punkte, 152 Kopiez, Reinhard, 291, 734, 736 KORG, 1027 Kronecker delta, 1085
INDEX Kronecker, Leopold, 1085 Kronmann, Ulf, 738, 739 Krull, Wolfgang, 1090 KTH school, 750, 755 Kubalek, Antonin, 894 Kunst der Fuge, 137,248,304,740,835,849 Kuriose Geschichte, 764, 765, 771, 849, 860 Kurzweil, 850 KV 449,231 L 1, 1029 L 'essence du bleu, 941 L 'isle joyeuse, 223 A-abstraction, 970 l-adic cohomology, 460 A&-calculus, 968 A-function, 969 L-system, 943 Liidi, Werner, 664, 955 Levi-Strauss, Claude, 593 La mort des artistes, 268, 963 laboratory conceptual -, 33 distributed -, 35 Lagrangian, 800 landscape emotional -, 295 Langer, Susan, 734 Langner, Jorg, 736 language, 19 common -,25 denotator -, 723 langue, 19 large orchestra performance of a -, 761 largest coefficient, 890 lattice, 1132 law Weber-Fechner -, 1029 Lawrence, David Herbert, 905 Lawvere, William, 180, 435 layer RUBATO@-, 810 layers of reality, 10
INDEX lazy path, 1063 LCA,639 Le sacre du printemps, 223 lead-sheet notation, 535, 694 learning process, 674 learning by doing, 36 leaves of a stemma, 764 left action, 1066 adjoint, 1120 coset, 1067 ideal, 1076 inverse, 1066 legato, 783 LEGO,943 Leibnitz, Gottfried Wilhelm, 565 )..£/),701/, 17 lemma Fitting's -, 1089 length, 328, 1089 minimal -, 835 of a local meter, 115 path -, 1063 LEP,239 Lerdahl, Fred, 290, 291, 294, 305, 311, 457, 461, 752, 873 Les fieurs du mal, 268, 963 level, 329 connotative -, 19 denotative -, 19 F-to-enter -, 881 function, 329, 1061 meta -,19 metrical -, 457 neutral -, 258 object-, 19 sound pressure -, 1029 levels of reality, 10 Levelt, Wilhelm, 1049, 1052 Lewin, David, 83, 247-250, 376, 384, 498 Lewis, Clarence Irving, 693 lexical, 418 lexicographic ordering, 58, 90, 1060 Leyton, Michael, viii, 933, 935
LH,764 library, 60, 446 Lie algebra, 1104 homomorphism, 1104 linear -, 1105 bracket, 1104, 1161 affine -, 541 derivative, 1160 formalism, 800 operator, 774 Lie, Sophus, 1104, 1160 Lied auf dem Wasser zu sing en ... , 262 Ligeti, Gyorgy, 33 limbic structure, 638 system, 642, 737, 1045 limit, 1121 circular -, 77 form, 67 ring, 1078 topology, 1146 limited modulations, 585 transposition, 151 mode with -, 151 line, 1062 linear (in)dependence, 1085 algebra special -, 1105 case, 250 combination, 1084 counterpoint group, 475 dual, 1091 fibration, 914 Lie algebra, 1105 module homomorphism, 1083 ordering, 1060 on a colimit, 92 on a limit, 92 on finite subsets, 92
1300 representation, 1086 visualization generic -, 440 metrical -, 441 linear ordering among denotators, 58 linguistics, 194 structuralist -, 305 Lipschitz locally -, 1156 Lipschitz, Rudolf, 1156 LISP, 534 list form, 976 listener, 12, 14 listening articulated -, 304 music -, 1035 procedure, 743 Liszt, Franz, 18, 20, 603 literally included, 255 Lluis Puebla, Emilio, viii local, 299 character of a contrapuntal symmetry, 647 composition, 89, 105, 107 commutative -, 125 embedded -, 126 functorial -, 121 generating -, 126 morphism, 124 objective -, 107 sequence of a -, 234 wrapped as -, 108 compositions coproduct of -, 124 fiber sum of -, 169 product of -, 124 meter, 115 length of a -, 115 period of a -, 115 meters simultaneous -, 609 morphism, 1108 optimization, 821 orientation, 324 Para-meter, 327
INDEX performance score, 724 rhythm, 116, 127 ring, 1089 score, 307, 946 solution, 1156 standard composition, 357 symmetry, 648, 649 technical parameter, 1005 threshold, 819 local topography, 19 local-global patchwork, 307 locality principle, 920 localization, 1101 historical -, 273 of epilepsy focus, 638 of musical existence, 24 locally closed, 1147 finite, 1149 free, 1110 linear grammar, 802 Lipschitz, 1156 ringed space, 1108 trivial structure, 307 locus Riemann -,820 logarithmic perception, 668 LoGeoRUBETTE®,811 logic, 419 absolute, 176 classical -, 1134 fuzzy, 409 harmonic -, 546 Heyting -, 530 inner -, 757 intuitionistic -, 539, 1134 musical -, 323 of orbits, 243 of toposes, 277
performance -, 674 performing -,934 predicate -, 530 logical, 1128 algebra, 1132 connective symbol, 1131 motivation, 776 switch operator, 71 time, 611 loop, 1063 Lord, John, 231 loudness, 51, 79, 739, 1029 LPS, 724, 755 Luening, Otto, 306 Lussy, Mathis, 747 M m th interval succession, 254 M.M., 670, 682 Malzel's metronome, 31, 1028 Malzel, Johannn Nepomuk, 414, 670, 682, 693 M a mere l'oye, 223 Mac OS X, 807, 813 machine performance -, 852 precision, 757 Turing -, 670 MacLean, Paul, 642 macro, 331 -event, 89 germ, 331 Maiguashca, Mesias, 70, 137 major, 146, 657 dichotomy, 631, 657 mode, 545 scale, 575, 576 dominant role of -, 657 third, 1031 tonality, 560, 582 bigeneric -, 547 major-minor problem, 146 making music, 24
Malt, Mikhail, 979 manifold, 307 formant -, 291 musical -, 295 of opinions, 997 semantic -, 295 map, 1059 characteristic -, 1126 coboundary -, 1150 performance -, 712 refinement -, 336 simplicial -, 1148 maquette, 978 Marek, Ceslav, 303 marked counterpoint dichotomy, 630 class, 630 dichotomy auto complementary -, 631 class, 630 rigid -, 631 strong -, 631 interval dichotomy, 630 Martinet, Andre, 17 Marx, Adolf Bernhard, 603 Maschke, Heinrich, 1088 Mason's theorem, 130 Mason, Robert, 129, 567 Mason-Mazzola theorem, 130 mass-spring, 1027 Massinger, Philip, 824 master concert -, 761 matching, 918 of structures, 869 score-performance -, 918 material change of -, 982 experimental -, 401 musical -, 978 of music, 106 time, 611 Math-motif, 494 Mathematica®, 929
1302 mathematical existence, 175, 398 model, 565 morphism, 344 overhead, 623 mathematically equivalent morphisms, 344 mathematics, 6, 195 matrilineal, 764 scheme, 762 matrix, 1085 category, 1117 comparison -, 251 design -, 877 initial design -, 878 interaction -, 857 product, 1085 Riemann -, 544, 586, 819 value -, 925 verse -, 261 Matterhorn, 183 Mattheson, Johann, 303, 996, 1050 MAX, 137, 256, 535, 953 maximal, 381 meter nerve topology, 460 topology, 329, 459 structure content, 1047 mayamalavagaula, 658 Mayer, Gunther, 271 Mazzola, Christina, ix Mazzola, Guerino, 268, 611, 613, 745, 873, 945 Mazzola, Silvio, ix MDZ71,293 mean performance, 881 tempo, 881 meaning of sound, 295 paratextual -, 400 textual -, 400
INDEX topological -, 192 transformational -, 193 measure for complexity, 311 measurement, 30 mechanical dynamics, 739 mechanism modulation -, 566 mediante tuning, 1033 mediation, 935 meet, 1132 mela, 658 melakarta, 658 melodic charge, 742 minor, 584,657 variation, 959 melodic minor scale, 575, 578 tonality, 560 melody, 276, 331 germinal -, 269, 270, 956 retrograde of a -, 137 MeloRUBETTE®, 467, 497, 785, 816 membrane basilar -, 1038 Reissner's -, 1038 tectorial -, 1039 memory, 641, 642 function hippocampal -, 642 mental experiment, 666 organization, 39 time, 664 tone parameters, 79 Mersenne, Marin, 1049 message, 13, 27 infinite -, 905 passing, 968 messaging, 188 Messiaen mode, 151 scale, 959 Messiaen, Olivier, 150, 152, 161, 959
INDEX meta-object, 968, 978, 982 class, 982 protocol, 982 meta-programming, 967, 982 meta-vocabulary, 243 metalanguage, 259 metalevel, 19 metamere, 1046 metaphor, 26 metasystem, 19 meter, 114,455-463, 1029 beat -,115 barline -, 115 local-, 115 method, 188, 968 boiling down -,787 continuous -, 776 dodecaphonic -, 936, 940 group-theoretical -, 241, 250 integration -, 829 of characteristics, 1161 operational -, 31 selection, 971 statistical -, 745, 818 metric, 1153 associated -, 1154 Euclidean -, 279 metrical analysis, 835 component, 327 globalization, 116 grouping, 303 hierarchy, 455 level, 457 linear visualization, 441 profile, 835 quality, 456 rhythm associated -, 327 similarity, 199, 472 structure (=M), 457 weight, 455, 456, 785 metronome, 118 Miilzel's -, 31, 1028
MetroRUBETTE®, 457, 814 Mexican hat, 289, 1025 Meyer wavelet, 1026 Meyer-Eppler, Werner, 1035, 1046 mezzoforte, 1030 Michel chromatic, 582 micro -motif,784 timing, 270 micrologic, 692 microstructure timing -, 871 middle ear, 1037 middleground, 503 MIDI, 287, 946 Mikaleszewski, Kacper, 756 minimal cadential set, 554 length,835 Minkowski, Hermann, 377,1070 minor, 146 harmonic -, 584 melodic -, 584, 657 mode, 545 natural -, 582 tonality, 582 Mittelgrund, 503 Mitzler, Laurentz, 1050 mixed weight, 815 Mobius bottle, 677 strip, 549, 579, 941 Mobius strip, 307, 322, 538 modal structure, 383 synthesis, 1027 mode, 152 aeolian -, 320 authentic -, 320 consonant -,547 dissonant -, 547 dorian -, 320 ecclesiastical -, 319, 655, 657 hypoaeolian -, 320
1304 hypodorian -, 320 hypoionian -, 320 hypolocrian -, 320 hypolydian -, 320 hypomixolydian -, 320 hypophrygian -, 320 ionian -, 320 locrian -, 320 lydian -, 320 Messiaen -, 151 mixolydian -, 320 phrygian -, 320 plagal -, 320 rhythmic -, 611 with limited transpositions, 151 model,1136 mathematical, 565 physical -, 29 template fitting -, 1045 modeling physical -, 289, 850 modification of functional relations, 982 syntax -, 982 modular affine transformation, 948 composition, 307 modularity dynamical -, 809 modulatio, 564 modulation, 559, 563-592, 1008 amplitude -, 288, 1003 degree, 566 domain, 580 force, 567, 571 frequenc~ 288, 1003 frequency -, 1022 just -, 577 mechanism, 566 path, 600 pedal -, 608 pitch -, 288, 1003 plan, 607
INDEX quantized -, 572 quantum, 567, 568, 572, 573 rhythmical -, 576, 610, 613, 959 theorem, 572 topos-theoretic background of -, 568 well-tempered -, 571 modulations limited -, 585 modulator, 87, 289, 572,596,600,602,1022 modulatory architecture, 603 region, 592 module, 1083 as basic space type, 69 complex, 350 constant -, 350 flasque -, 370 of A-addressed forms, 351 representative -, 363 retracted -, 352 direct sum -, 1084 dynamically loadable -, 808 free -, 1085 functor, 172 injective -, 1103 of a commutative local composition, 125 of global affine functions, 432 product -, 1084 projective -, 1102 semi-simple -, 1087 shaping -, 807 simple -, 1087 structuring -, 807 modules in music, 70 modus ponens, 1134 molecule, 355 global -, 355 interpretable -, 356, 385 Molino, Jean, 12, 14, 696 moment initial -, 697 mono, 1116 monochord, 24 monogamic coupling, 769 monoid, 1063
INDEX algebra, 71, 1077 finite -, 1063 free -, 1064 free commutative -, 1064 homomorphism, 1063 isomorphism, 1063 morpheme -, 540 multigeneric -, 543 trigeneric -, 540 word -, 1064 monomorphism, 1116 Monteverdi, Claudio, 909 Montiel Hernandez, Mariana, viii, 334 mood,736 MOP, 982 graphical -, 982 Morlet wavelet, 1025 morpheme harmonic -, 546 monoid,540 Morphemfeld, 506 morphic, 749, 1140 morphing, 952 morphism, 196, 1115 t-fold differentiable tangent -, 669 t-fold tangent -, 669 global -, 300 local -, 1108 mathematical-, 344 mathematically equivalent -, 344 of denotators, 108 of form semiotics, 1141 of formed compositions, 355 of functorial global compositions, 335 of functorial local compositions, 156 of global molecules, 355 of local compositions, 124, 154-158 of objective global compositions, 335 of objective local compositions, 154 of performance cells, 713 tangent -, 669, 676 Morris, Robert, 247, 249, 250, 258, 383, 385, 498
1305 Morrison, Joseph, 1027 MOSAIC, 1027 mosaic, 378 mother, 724, 752 primary -, 764, 765 prime -, 765 tempo, 682 motif, 118, 193, 279, 331 abstract -, 468 classification, 228-231 covering, 467 Reti's definition of a -, 491 rhythmic -, 613 serial, 149 space, 467 Z-addressed -, 120 motion, 734, 738 accelerated -, 738 harmonic -, 503 sense of -, 739 trigger, 738 motivated, 18 motivation, 419 geometric -, 420, 422 logical -, 420, 776 motives enumeration of -, 238 motivic analysis, 262, 491 interpretation, 332, 467 nerve, 467 simplex, 467 weight, 496, 785 work, 338 zig-zag, 339, 941 motor action, 739 movement tensed -, 646 Mozart, Wolfgang Amadeus, 231, 458, 598 Miiller, Stefan, viii, 912, 1143 multigeneric monoid, 543 multigraph, 1062 finite -, 1062 multimedia object, 441, 449 multiple-dispatching, 968
1306 multiplication interval -, 623 scalar -, 1083 multiplicity, 254 Mumford, David, 366 Murenz wavelet, 1025 music, 3, 8, 9, 14, 25 absolute -, 934 alphabet of -, 106 atonal -, 248 composition technology, 564 concept of -, 23 critic, 304 role of -, 906 criticism, 303, 772 critique, 905 definition of -, 6 deixis, 18 emotional function of -, 642 esthetics of -, 393 fact of -, 10 film -, 733 historical dimension of -, 108 history, 6 in-time -, 986 key function of -, 643 listening, 1035 material of -, 106 psychology, 291, 305 research, 8 semiotic perspective of -, 16 software, 307 syn- and diachronic development of -, 242 tape -, 306 theory, 813 thinking -, 25 music theory professional -, 247 musical concepts definition of -, 114 analysis, 744 composition, 33
drama, 908 gestalt, 106, 152 idea, 934, 935 intuition, 246 logic, 323 manifold, 295 material, 978 onset, 1028 ontology, 23 process, 978 prosody, 270 reality, 171 semantics, 162 taste, 643 tempo, 31 topography, 19 unit, 106 musicological experiment, 33, 34 ontology, 398 musicology, 3, 14, 813, 871 cognitive -, 23 computational -, 23 historical -, 399 systematic -, 399 traditional -, 24, 31 Musikalisches Opfer, 144 musique concrete, 306 Muzzulini, Daniel, 537, 554, 571, 585 Mystery Child, 952 N
n-circle, 513 n-cube, 671 N-formed global composition, 354 n-modular pitch, 250 n-phonic series all-interval -, 237 N-quotient, 360 name, 76 instrument, 82 of a denotator, 52 of a form, 51 names
INDEX ordering on -, 90 naming policy, 51, 52, 68 Naradaya-Watson kernel, 857 narration, 933 narrativity theory of -, 934 Nattiez, Jean-Jacques, 272, 304, 473, 940 natural, 1119 decomposition, 855 distance, 441 minor, 582 transformation, 1118 natural minor tonality, 560 natural sciences experience in the -, 34 nature exterior -, 32 interior -, 32 nature's performance, 925 navigation, 34, 43 address -, 169 conceptual -, 39 dynamic -,45 productive -, 44, 45 receptive -, 44, 89 topographical -, 21 trajectory, 35 visual -, 439 negation, 421, 1131, 1132 neighborhood, 199, 276, 817, 1145 elementary -,489 nerve, 937, 1148 auditory -, 1037 class -, 346, 376, 390 discrete -, 311 functor, 1148 motivic -, 467 of a global functorial composition, 344 of a global objective composition, 310 weight, 460 induced -, 460 Neuhaus, Harry, 756 VeVILa, 193
neumes, 16, 193, 693
1307 neural pitch processing, 1045 neuronal oscillator, 737 neutral, 12, 1021 analysis, 272, 305 element, 1063 level, 258 neutral level, 12, 14 neutralization, 565 Newton, Isaac, 399 NeXT, 808, 833 NEXTSTEP, 807, 813 nexus, 382 nihil ex nihilo, 28 nilpotent, 1089 Ninth Symphony, 495 Noether, Emmy, 138 Noh, 14, 416, 768 Noll, Thomas, viii, 82, 221, 506, 510, 512, 515, 519, 524, 529, 538, 540, 546, 564, 571, 633, 636, 744, 820, 1064, 1143 non-commutative polynomials, 1077 non-interpretable composition, 371, 376 non-invertible symmetry, 153 non-lexical, 418 non-linear deformation, 776, 827, 889 non-linearity, 1043 non-parametric approach, 856 norm, 19, 1154 normal form, 255 Rahn's, 255 order, 257 subgroup, 1067 normalization diachronic -, 909 synchronic -, 909 normative analysis, 457 norms equivalent -, 1155 not parallel, 795 notation American jazz -, 533 European score -,79
lead-sheet -, 533, 535, 694 notched tone space, 1048 note alterated, 127 anchor -, 760 satellite -, 760 note against note, 619, 646 number embedding -, 254 prime -, 278 numbers complex -, 1076 dual -, 618, 1077 rational -, 1076 real -, 1076
O
object, 188, 968, 1116 description, 244 global -, 299 multimedia -, 441, 449 prototypical -, 280 visualization principle, 441 object-oriented programming, 55, 723, 763, 766, 770, 967, 968 objective closure, 524 global - composition, 309 local - composition, 107 trace, 121 Objective C, 808, 825 objectlevel, 19 objectystem, 19 observation, 30 OCR,767 octave, 73, 1031 coordinate, 1032 period,110 octave class, 139 ODE, 792, 829, 1156 Ode an die Freude, 495 Oettingen, Arthur von, 137, 147, 504, 512, 514,1032 off-line algorithm, 919
ON-OFF, 71 ondeggiando, 720 onomatopoiesis, 18, 938 onset, 51, 79 abstract -, 150 musical -, 1028 origin, 115 physical -, 1028 self-addressed -, 83 time, 1013 weight, 116 ontological atomism, 27 coordinate, 10 dimension, 19 perspective, 6 shift, 171 ontology, 9, 171, 180, 184, 398 musicological -, 398 denotator -, 398 knowledge -, 420 musical -, 23 time -, 936 open ball, 1153 covering of a functor, 1113 formant set, 291 functor, 1113 semiosis, 401 set, 278, 1145 source, 808 Open-Editor, 983 OpenMusic, 256, 384, 935, 943, 967-990 openness, 290 operation Boolean -, 947 flattening -, 88, 331 operationalization, 245 operationalized thinking, 196 operator, 727, 749, 752 agogical -, 872 articulation -, 720 basis-pianola -, 795 Beran -, 876 canonical -, 253
field -, 792 Lie -, 774 performance -, 727, 744, 773-803 physical -, 749, 791 pianola -, 801 prima vista -, 749 smoothing -, 874 splitting -, 788 sub-path -, 588, 1079 support -, 1015 symbolic -, 749, 789 tempo -, 793 test -, 791 Todd -, 752 TTO -, 253 validation -, 424 opinions manifold of -, 997 opposite category, 1117 group -, 1066 opposition, 18 optimal path, 820 optimization local -, 821 orbit, 1066 set-theoretic -, 1114 space, 1066 orbits functor of -, 1114 Orchestervariationen, 137 orchestra instrumentation denotator, 82 orchestral gesture, 986 globalization, 673 orchestration, 948 order, 1067, 1069 normal -, 257 of a PDE, 1161 ordered p-space interval, 252 pair, 1058 pc interval, 253 ordering, 440 alphabetic -, 40, 43, 58
encyclopedic -, 58 lexicographic -, 58, 90, 1060 linear -, 1060 on a colimit, 92 on a limit, 92 on finite subsets, 92 on coefficient rings, 94 compound (naive) denotators, 59 compound (naive) forms, 59 coordinators, 91 denotators, 89-99 diagrams, 91 direct sums, 94, 95 forms, 89-99 identifiers, 91 matrix modules, 95 Mod, 93-96 morphisms, 95 names, 90 simple forms, 90 types, 90 universal construction functors, 92 ℤ⟨ASCII⟩, 95 partial -, 1060 powerset -, 60 principle, 440 on denotators, 57 universal -, 44 ordinal, 1116 Oresme, Nicholas, 30, 664 organ of Corti, 1038 organic composition principle, 868 principle, 198 organism cellular, 394 hierarchical -, 304 organization degree of -, 869 mental -, 39 orientation, 8, 322, 619 hanging -, 128 change of -, 619, 626, 646
geographic -, 43 hanging -, 619 local -, 324 ontological -, 9 recursive -, 21 sweeping -, 128, 619 oriented contrapuntal interval, 619 global composition, 355 origin, 328 of onset, 115 OrnaMagic, 941, 950, 952, 953 ornament, 720, 949 pattern, 246 OrnamentOperator, 784 orthogonality principle, 920 orthonormal decomposition, 11 Orthonormalization, 879 oscillator, 736 neuronal -, 737 oscillogram, 736, 737 Osgood, Charles, 198 ostinato, 979 ottava battuta, 657 outer derivation, 1105 ear, 1036 hair cell, 1039 pillar cell, 1039 outlet, 972 output presto® -, 946 oval window, 1037, 1041 overhead mathematical -, 623 overloading, 968
P
p-group, 1069 p-pitch, 252 p-scale, 112 p-space, 252
p-Sylow group, 1069 Paganini, Niccolo, 996 painting, 183, 946 pair ordered -, 1058 polarized -, 646 simplicial -, 1150 Yoneda -, 1137 Palestrina-Fux theory, 655 paper science, 176 Papez, James, 642 Paré, Ambroise, 32 Para-rhythm, 327 paradigm, 18 dodecaphonic -, 150 Fourier -, 284 general affine -, 161 gestalt -, 816 phonological -, 269 παράδειγμα, 192 paradigmatic concept, 280 field, 150 group, 474 strategy, 940 theme, 272, 473 tool, 953 transformation equivalence, 259 paradigmatics uncontrolled -, 201 parallel, 795 articulation field, 689 crescendo field, 689 degree, 324 glissando field, 689 hierarchy, 718 not -, 795 performance field, 689 map, 689 space, 718 Parallelklang, 556
parameter accessory -, 999 basis -, 79, 795 bow -, 1003 cadence -, 552 color -, 1004 essential -, 999 exchange, 160, 161 geometric -, 1000 global technical -, 1005 instrumental -, 1002, 1004 local technical -, 1005 pianola -, 79, 795 primavista -, 722 space, 434 system -, 575 technical -, 289 vibrato -, 1003 parametric polymorphism, 968 paratextual, 769 information, 831 meaning, 400 paratextuality, 424 Parncutt, Richard, 738 parole, 19 part, 301, 307, 334 dilinear -, 1090 extraterritorial -, 720 translation -, 1090 partial, 86, 513 ordering, 1060 partials, 10 participation value, 639 particle physics, 567 partition, 1058 intérieure, 14 partitioning, 257 passing message -, 968 patch, 969, 978 patchwork local-global -, 307 path, 1063
category, 1079 closed -, 1063 harmonic -, 586 lazy -, 1063 length, 1063 modulation -, 600 optimal -, 820 patrilineal, 764 pattern, 246 pause, 81, 768 general -, 782 pc, 253 pc-space, 253 pcseg, 253 pcset, 253 invariant -, 254 PDE, 798, 1161 quasi-linear -, 1161 Peano axioms, 32 pedal modulation, 608 voice, 608 peer, 810 perception logarithmic -, 668 perceptional pitch concept, 1047 perceptual equivalence, 280 percussion, 269 event, 612 perfect consonance, 635, 646, 657 performance body, 712 cell, 711 cells category of -, 713 morphism of -, 713 complexity of -, 664 culture of -, 757 development hierarchy of -, 757 dynamics of -, 800 field, 685, 690, 712 parallel -, 689 prime mother -, 768 fields
algorithmic extraction of -, 916 grammar, 747 inbuilt -, 907 hierarchy, 674 history fictitious -, 763 real -, 763 infinite -, 666 initial -, 712, 726 intermediate -, 756 logic, 674 machine, 852 map, 712 parallel -, 689 mean -, 881 nature's -, 925 of a large orchestra, 761 operator, 727, 744, 773-803 plan, 757 primavista -, 766 procedure, 743 real-time -, 738 research computer-assisted -, 764, 850 score global -, 728 local -, 724 structural rationale of -, 395 synthetic -, 741 theory, 387, 393 inverse -, 743, 790, 913 tradition, 907 PerformanceRUBETTE®, 708, 792, 794, 824, 889 performer, 27 performing logic, 934 perilymph, 1038 period, 503 fundamental -, 1019 in the Euler module, 110, 112 interictal -, 638 octave -, 110 of a local meter, 115 of a Vuza canon, 381
of a Vuza rhythm, 380 of duration, 115 temporal -, 456 periodicity, 856 higher level -, 117 Perle, George, 248 permutation, 1059 perspective, 27, 181, 184, 393, 566 change of -, 393 / -, 336 of the composer, 301 ontological -, 6 variation of -, 182 perspectives sum of -, 394 Petsche, Hellmuth, 638, 640 phase, 1020 portrait, 1159 spectrum, 291, 1020 phaticity, 259 phenomenological difference, 912 phenotype, 943 philosophy, 5 denotator -, 185 Yoneda -, 997 phoneme, 287 phonological equivalence, 263 paradigm, 269 poeticity, 263 photography, 183 phrasing, 303 physical model, 29 modeling, 289, 850 onset, 1028 operator, 749, 791 pitch, 1031 sound, 84 time, 664 tone parameters, 81 PhysicalOperator, 829 physics, 6 particle -, 567 Pi-rank, 1070
pianissimo, 738 piano hierarchy, 722 Piano concert No. 1, 245, 246 Pianoforte Schule, 758 pianola coordinate, 1028 deformation, 720, 797 operator, 801 parameter, 79, 795 space, 689, 715, 763 specialization, 801 piecewise smooth, 1018 Pinocchio, 450 pitch, 51, 79 -class self-addressed -, 83 -class set, 248, 253 absolute -, 700 alteration, 62, 952 chamber -, 684, 1031 class, 111, 139 segment, 253 set, 253 concept perceptional -, 1047 concert -, 699 cycle, 252 detector central -, 1045 difference, 73 distance, 72 fundamental -, 532 mathematical -, 72 modulation, 288, 1003 physical -, 1031 processing neural -, 1045 segment, 252 spaces, 250 symbolic -, 80 pivot, 572 pivotal chord, 563 pixel, 417 plan performance -, 757
plane transformation, 949 Plato, 29, 43, 202 platonic ideas, 23 playing, 24 Plomp, Reiner, 1049, 1052 Podrazik, Janusz, 257 Poe, Edgar Allan, 933 Poem of Wind, 268 poetic Ego, 268 poetical function, 18, 138, 259, 295, 303, 934, 942 functions spectrum of -, 266 poeticity, 138, 259 vertical -, 261 horizontal -, 261 phonological, 263 poetics timbral -, 295 verse -, 303 poetology, 258 poiesis, 12, 13, 258 retrograde -, 15 poietic, 12, 1021 genealogy, 154 point, 177-181, 1057 generic -, 279 absorbing -, 525 accumulation -, 1145 closed -, 279 concept, 175 etymology, 178 Euler -, 1031 faithful -, 523 full, 523 fully faithful -, 523 generic -, 330, 1110 identity, 178 turning -, 565 pointer, 25, 26, 43, 177 scheme, 55 polarity, 631, 640 at x, 637 in musical cultures, 658
profile, 279 polarized pair, 646 politically incorrect, 907 Pólya enumeration theory main theorems of -, 233 theory, 232 weight function, 232 Pólya, George, 232, 378 polygamic coupling, 769 polygon, 948 polyhedral initial set, 704 polymorphism ad-hoc -, 968 parametric -, 968 polynomials commutative -, 1077 non-commutative -, 1077 polyphony, 995 polyrhythm, 965 polysemy, 129, 200 Popper, Karl, 997 Porphyrean tree, 191 portrait phase -, 1159 position general -, 391 privileged -, 633 Posner, Roland, 261, 942 post-serialism, 245 potential epileptiform -, 638 sink -, 739 power spectral -, 640 window, 638 powerset, 1062 form, 66 ordering, 60 type, 54 PR, 457 Preludes, 600 practising, 769, 771 pre-Hilbert space, 1020 pre-morphism, 410
pre-object, 410 precise conceptualization, 258 precision calculation -, 775 conceptual -, 35 human -, 757 machine -, 757 PrediBase, 808 predicate, 410 atomic -, 412 connective, 1135 deictic, 420 global -, 552 logic, 530 mathematical -, 412, 420 morphic -, 410 objective, 410 primavista -, 414, 769 European -, 414 non-European -, 414 punctual -, 410 PV -, 414 relational -, 410 shifter -, 418 textual -, 544, 552 variable, 1135 preferences, 820 prehistory of the string quartet, 994 presence, 497, 785, 817 presheaf, 1119 pressure bow -, 1002 decrease factor, 1016 variation, 1013 presto®, 47, 137, 245, 246, 268, 269, 293, 532, 639, 699, 758, 941, 943, 945-953, 955 output, 946 presto®, primary mother, 764, 765 primavista, 674, 766 agogics, 764 dynamics, 764
operator, 749 parameter, 722 performance, 751, 766 predicate, 769 PrimavistaOperator, 766 PrimavistaRUBETTE®, 831 prime, 1080 field, 1076 form, 257 mother, 765 performance field, 768 number, 278 spectrum, 278, 293, 1108 stemma, 764 vector, 73, 1033 principal component analysis, 898 principal homogeneous G-set, 249 principle anthropic -, 565, 567, 658 architectural -, 869 concatenation -, 160, 624 dialogical -, 997 fractal -, 964 locality -, 920 normative -, 458 object visualization -, 441 of relevance, 17 ordering -, 440 organic composition -, 868 organic -, 198 orthogonality -, 920 packing -, 441 sonata -, 163 variation -, 394 priority, 879, 891 privileged position, 633 problem Cauchy -, 1162 context -, 819 hit point -, 704 wild -, 913 procedure listening -, 743 performance -, 743
rule based -, 747 rule learning -, 747 statistical -, 242 process, 19, 401 historical -, 763, 771 learning -, 674 musical -, 978 of conceptualization, 245 product, 1062 ambient space, 124 Cartesian -, 1058 cartesian -, 1121 category, 1118 cellular hierarchy, 718 fiber -, 1078, 1121 group, 1068 matrix -, 1085 module, 1084 of local compositions, 124 of the cells, 714 ring, 1077 semidirect -, 1068 tensor -, 1078 topology, 1146 type, 52 weight function, 232 wreath -, 1069 production, 4 of a musical work, 13 profile metrical -, 835 program canonical -, 394 change, 947 programme narratif, 935 programming constraint -, 935, 967 functional -, 967 language visual -, 967 object-oriented -, 55, 188, 723, 763, 766, 770, 967, 968 progression harmonic, 304 chord -, 502
contrapuntal, 304 harmonic -, 154 Project of Music for Magnetic Tape, 306 projecting local composition, 216 projection, 153, 1062 projective atlas, 360 functions, 362 module, 1102 Prokofiev, Serge, 223 prolongational reduction (=PR), 457 proof chain, 1133 propagation sexual -, 763, 773 property inheritance -, 479 propositional formula, 1136 variable, 1131 prosody musical -, 270 protocol meta-object -, 982 prototype, 241 prototypical object, 280 pseg, 252 pseudo-metric, 1153 on abstract gestalt space, 478 psychological reality, 665 psychology, 6 cognitive -, 218, 276 gestalt -, 106 music -, 291, 305 psychometrics, 199, 279 Puckette, Miller, 918 pullback, 1121 pure expressivity, 737 pushout, 1121 PVBrowserRUBETTE®, 811 Pythagoras, 530 Pythagorean, 33 school, 114, 413 tonality, 561 tradition, 24, 186, 1049 tuning, 325, 581, 1032
Pythagoreans, 12
Q
quale, 693 qualifier after -, 983 before -, 983 quality metrical -, 456 quantifier existence -, 421 formula, 1136 universal -, 421 quantization, 835 quantized modulation, 572 quantum modulation -, 567, 568, 572, 573 quantum mechanics, 505, 516 quartet string -, 934, 993 quasi-coherent, 1109 quasi-compact, 1147 quasi-homeomorphism, 1111 quasi-linear PDE, 1161 quaternions, 1076 quatuor concertant, 994 quatuor dialogue, 996 quiver, 1063 algebra, 1079 complete -, 1063 Riemann -, 586 Riemann index -, 589 stemma -, 801 quotient category, 1117 complex, 351 dominance topology, 283 group, 1067 ring, 1076 topology, 1146
R
radical, 1064 Radl, Hildegard, 560, 576, 581, 585 Raffael, 186 Raffman, Diana, 25, 693
raga, 658 Rahn, John, 247-250, 498 Rameau's cadence, 554 Rameau, Jean-Philippe, 502, 512, 531, 554, 1050 ramification mode, 48 random, 15 rank, 1085 torsion-free -, 1070 Raphael, 201 rational numbers, 1076 rationale, 748 gestural -, 908 historical -, 994 Ratner, Leonard, 400 Ratz, Erwin, 567, 604 Ravel, Maurice, 223 RCA, 639 real, 829 inversion, 148 numbers, 1076 performance history, 763 real-time algorithm, 919 context, 917 input, 946 performance, 738 reality, 10 historical -, 594 levels of -, 10 mental -, 12 musical -, 171 physical -, 11 psychological -, 12, 665 realization geometric -, 1149 reason, 5 recapitulation, 304, 603 receiver, 259 reception, 4 receptive navigation, 89 recitation tone, 319 recombination, 982 weight -, 776
reconstruction, 493 recording bipolar -, 638 foramen ovale -, 638 recta, 839 recursive classification, 216 construction, 49 orientation, 21 typology, 48, 56 reduced diastematic shape type, 470 strict style, 657 reduction, 1095 curvilinear -, 937 reductionism, 7 Reeves, Hubert, 203 reference denotator, 403 tonality, 546 referentiality, 259 refinement map, 336 reflection, 982 glide -, 1097 reflexive, 1059 reflexivity, 967 Regener, Eric, 249 region modulatory -, 592 register, 947, 969 regression analysis, 860, 877, 880 regular denotator, 79-85 division of time, 456 form, 76 representation, 1086 structure, 856 regularity time -, 116 rehearsal, 674, 741, 745, 769, 771 Reichhardt, Johann Friedrich, 996 reification, 982 Reissner's membrane, 1038
relation causal -, 985 commutativity -, 1117 cross-semantical -, 745 equivalence -, 305, 1060 K -, 382 Kh -, 382 KI -, 383 temporal -, 985 relative delay, 288 dynamics, 831 homotopy, 1150 importance, 786 motivic topology, 486 symbolic dynamics, 81 tempo, 682, 832 topology, 1146 relative local dynamical sign, 778 tempo, 780 relative punctual dynamical sign, 777 tempo, 780 relevance, principle of -, 17 Rellstab, Ludwig, 20 Remak, Robert, 1090 Remove-Element, 983 renaming, 55 repetition, 140 replay, 140 Repp, Bruno, 871, 872, 876, 898, 927 representable functor, 1120 representation adic -, 1080 auditory -, 240 Fourier -, 899, 1000 geometric -, 946 linear -, 1086 regular -, 1086 score -, 742 textual -, 937
representative first -, 220 module complex, 363 reprise, 964 reset, 71 resolution, 434, 999, 1009 cohomology, 432 functor, 358 of a global composition, 358, 393 response EEG -, 637 responsibility collective -, 770 restriction cellular hierarchy -, 718 of modulators, 606 scalar -, 127, 1084 resultant class, 381 resulting divisor, 382 retard final -, 738 Reti, Rudolph, 201, 275, 456, 465, 490, 816, 873 Reti-motif, 493 retracted module complex, 352 retraction, 1116 retrograde, 15, 25, 142, 152, 160, 254, 302 address involution, 150 inversion, 73, 144, 161 of a melody, 137 retrogression, 253 reverberation time, 1016 reversed order score played in -, 143 tape played in -, 145 sound, 145 revolution experimental -, 32 RH, 639, 764 rhetorical expression, 692 expressivity, 674 shaping, 674 rhetorics, 996
rhythm, 140, 152, 455-463, 974 local -, 116, 127 Vuza -, 380 rhythmic germ, 152, 326 mode, 611 motif, 613 scale, 613 rhythmical group, 977 modulation, 576, 610, 613, 959 theory, 612 structure, 958 rhythms, 114 classification of -, 380 local isomorphism classes of -, 221 Richards, Whitman, 739 richness semantic -, 692 Richter, Sviatoslav, 756 Riemann algebra, 586 dichotomy, 636 graph, 821 harmony, 322 index quiver, 589 locus, 820 matrix, 544, 586, 819 quiver, 586 transformation, 384 Riemann, Bernhard, 307 Riemann, Hugo, 116, 147, 194, 245, 250, 307, 455, 502, 506, 531, 543, 546, 564, 571, 586, 590, 619, 636, 814, 819, 841, 866, 873 Ries, Ferdinand, 993, 995 right action, 1066 adjoint, 1120 coset, 1067 ideal, 1076 inverse, 1066 right-absorbing endomorphism, 524
rigid, 321, 340, 567, 571, 576, 816 difference shape type, 469 marked dichotomy, 631 shape type, 469 ring, 1075 anti-homomorphism, 1076 endomorphism -, 1083 homomorphism, 1075 isomorphism, 1075 limit -, 1078 local -, 1089 product -, 1077 quotient -, 1076 self-injective -, 1103 simple -, 1077 ringed space, 1107 ritardando, 739, 782 RMI, 810 Roederer chromatic, 582 Roland R-8M, 269, 955 role exchange, 72 of a music critic, 906 rotation, 246, 253, 254, 949 Amuedo's decimal normal -, 256 roughness, 1051 round window, 1041 Rousseau, Jean-Jacques, 303 row-class, 255 RUBATO®, 47, 457, 764, 788, 789, 807-811, 871, 895, 916 concept, 807 Distributed -, 922 layer, 810 rubato Chopin -, 667, 682, 698, 759, 924 encoding formula, 751 RUBETTE®, 808, 813-832 Rufer, Joseph, 150, 162 rule -based procedure, 747
grammar, 748 contravariant-covariant -, 972 Fux -, 657 learning procedure, 747 preference - (=PR), 457 well-formedness - (=WFR), 457 rules differentiation -, 742 ensemble -, 742 grouping -, 742 Rulle, 742 Runge-Kutta-Fehlberg, 792, 829 Russian Quartets, 995 Ruwet, Nicolas, 272, 304, 940
S
Sabine's formula, 1017 Sachs, Klaus-Jürgen, 645, 646 Salzer, Friedrich, 503 Sands' algorithm, 377 Sands, Arthur, 377 Sarabande Nr. 52, 231 Sarcasmes, 223 satellite, 449 note, 760 saturation, 526, 1102 sheaf, 526, 540 Saussure, Ferdinand de, 17, 194, 242, 272, 305, 574 Sawada, Hideyuki, 738 SC, 253 scala media, 1038 tympani, 1037 vestibuli, 1038 scalar, 1083 multiplication, 72, 1083 restriction, 127, 1084 ScalarOperator, 829 scale, 112, 538 12-tempered -, 318 major -, 575 melodic minor -, 575, 578 altered -, 585
chromatic -, 506 diatonic -, 658 harmonic minor -, 575, 577 harmonical-rhythmical -, 959 just -, 113, 318 justest -, 325 major -, 321, 576 Messiaen -, 959 minor harmonic -s, 321 melodic -, 321 rhythmic -, 613 whole-tone -, 657 SCALE-FINDER, 256, 535 SCALE-MONITOR, 256, 535 scales common 12-tempered -, 113 scatterplot, 862 Schönberg, Arnold, 33, 106, 137, 150, 162, 223, 243, 245, 248, 249, 301, 310, 321, 394, 501, 512, 563, 565, 567, 611, 936, 940 Schäfer, Sabine, 833 Schaeffer, Pierre, 306 scheme, 1111 diagram -, 1117 Grassmann -, 1113 matrilineal -, 762 mental -, 14 Molino's -, 12 sonata -, 613 Schenker, Heinrich, 331, 400, 503 scherzo, 956 Schmidt, Erhard, 1090 school KTH -, 750, 755 Pythagorean -, 114, 413 Zurich -, 744 School of Athens, 186, 201 Schubert, Franz, 262, 283, 956 Schumann, Robert, 495, 764, 765, 818, 860, 947, 996 Schweizer, Albert, 834 science cognitive -, 743
152, 259, 538,
849,
computer -, 188 contemplative -, 29 doing -, 30 experimental -, 32 paper -, 176 scientific bankruptcy, 24 score, 12, 14, 71, 414, 946 concept, 307, 693, 909, 978 European - notation, 79 exterior -, 694 generic -, 665 global -, 307, 946 inner -, 665, 694 interior -, 14 local -, 307, 946 played in reversed order, 143 representation, 742 semantics, 696 transformation -, 948 score-following, 918 score-performance matching, 918 Scriabin, Alexander, 222, 587, 964 SEA, 467 Second Book of Preludes, 756 section, 1109, 1116 global -, 350, 1121 segment class, 252, 255 pitch -, 252 selection axis of -, 138, 260 field -, 969 method -, 971 stepwise forward -, 881 self-addressed arrow, 626 chord, 225 contrapuntal intervals, 626 denotator, 82 onset, 83 pitch-class, 83 self-injective ring, 1103 self-modulating, 1022 self-referential, 22, 176
self-similar time structure, 964 Selibidache, Sergiu, 696 semantic atom, 538 charge, 490 of EEG, 638, 640 completion, 57 depth, 465 differential, 198 interpretation, 598 loading, 48 manifold, 295 richness, 692 semantics functional -, 541 gestural -, 908 harmonic -, 531 incomplete, 99 musical -, 162 of weights, 497 score -, 696 semi-simple module, 1087 semidirect product, 1068 semigroup, 1063 semiosis, 10 extroversive -, 400 incomplete -, 401 introversive -, 400 open -, 401 paratextual -, 424 textual -, 406, 424 semiotic constraints, 284 marker visual -, 981 of [-forms, 1138 semiotical symmetry, 161 semiotics, 6, 16 of sound classification, 294 semitone, 74 sender, 13, 259 sense of motion, 739 sentence, 1131, 1136 valid -, 1132 sentic state, 734
separating module complex, 360 sequence Cauchy -, 1154 chord -, 591 contrapuntal -, 646 Fibonacci -, 413 interspace -, 234 of a local composition, 234 sequencer, 953 sequentialization, 937 serial motif, 149 integrated -, 237 derived -, 237 technique, 152-154 serialism, 245 series all-interval -, 237, 244 basic -, 150 dodecaphonic -, 149, 197, 236, 301, 309, 394 fundamental -, 137 (k,n) -, 149, 236 n-phonic -, 149, 236 time -, 856 set, 1057 cadential -, 554 circular -, 79 class, 253, 254 closed -, 1145 complex, 249, 382 theory, 248 concept, 176 empty -, 176 fuzzy -, 198 in AST, 248 index -, 1060 initial -, 696, 712, 725 minimal cadential -, 554 of operations, 255 open -, 278, 1145 pitch-class -, 248 small -, 1116 source -, 248
support -, 309 theory, 305 SET-SLAVE, 255, 535 set-theoretic orbit, 1114 seventh dominant -, 508 natural -, 513 subdominant -, 508 tonic -, 508 sexual propagation, 763, 773 SGC, 255 Shakespeare, William, 773 shape, 492 shape type, 468 contrapuntal motion -, 470 diastematic index -, 470 diastematic -, 470 elastic -, 471 reduced diastematic -, 470 rigid difference -, 469 rigid -, 469 toroidal sequence -, 471 toroidal -, 471 shaping module, 807 rhetorical -, 674 vector, 876 sharp, 130 sheaf, 1111, 1130 on a base, 1108 saturation -, 526 sheafification, 1130 shearing, 144, 147, 160, 246, 1096 sheaves category of -, 1130 Sherman, Robert, 218 shift, 129 constant -, 129 elementary -, 129 ontological -, 171 shifter, 696, 701
esthesic -, 419 poietic -, 418 sieve, 1126 closed -, 1130 covering -, 1129 sight-reading, 674, 875 sign, 16 deictic -, 18 lexical -, 18 shifter -, 18 signature, 720 key -, 768 time -, 768 significant, 16 significate, 16 signification, 16, 17, 410 process, 16 signs of coefficients, 887 system of -, 6 similarity, 194, 198, 276, 279 metrical -, 199, 472 simple form, 66 simplify to a -, 75 frame, 968 group, 1067 module, 1087 ring, 1077 simple forms ordering on -, 90 simplex, 725, 1148 closed -, 1149 dimension of -, 1148 motivic -, 467 singular -, 1150 standard -, 1150 simplicial cochain complex, 1150 complex, 940, 1148 map, 1148 metrical weight, 329 pair, 1150
weight, 346 simplify to a simple form, 75 Simula, 968 simultaneous local meters, 609 singleton interpretation, 336 singular cochain, 1150 simplex, 1150 sink potential, 739 Siron, Jacques, 694 sister, 802 site, 1129 Zariski -, 1112 skeleton, 1148 category, 1117 sketchy interpretation, 757 skew field, 1075 slave tempo -, 759 Slawson, Wayne, 290 slice, 121 identity -, 336 f-slice, 336 Sloboda, John, 807 slope global -, 822 slot, 968 slur, 768 SMAC, 744 small gestalt, 483 infinitely -, 692 set, 1116 smallness, 290 Smith III, Julius O., 1027 smooth piecewise -, 1018 smoothing hierarchical -, 857 kernel, 857 kernel -, 856, 874 operator, 874 SMPTE, 946
SNSF, 744, 807 sober, 1110 weight, 460 socle, 1089 software development, 723 engineering, 184 for AST, 255 music -, 307 solution global -, 1159 local -, 1156 sonata form, 304, 603, 956 principle, 163 scheme, 613 theory, 603 Sonatine, 223 sound, 1013 classification, 284 color, 194 colors space of -, 290 conceptualization of -, 15 generator, 849 grouping, 88 meaning of -, 295 natural -, 11 physical -, 84 pressure level, 1029 reversed -, 145 speech, 996 transformation, 145 Sound Pattern of English (=SPE), 286 sounding analysis, 842 source open -, 808 set, 248 space, 1139 ambient -, 107 Banach -, 1154 basis -, 689, 715, 763 color -, 1000 compositional -, 249
contour -, 251 Euler -, 1031 form -, 64 fundamental -, 715 hierarchy, 715 indecomposable -, 715 instrument -, 726 locally ringed -, 1108 motif -, 467 of sound colors, 290 orbit -, 1066 parallel -, 718 parameter -, 434 pianola -, 689, 715, 763 pre-Hilbert -, 1020 ringed -, 1107 tangent -, 669 top -, 715 topological -, 1145 vector -, 1085 span, 633, 817 time -, 83 SPE, 286 special linear algebra, 1105 specialization, 196, 267, 282, 488, 719 abstract -, 488 abstract gestalt -, 488 basis -, 797 co-inherited -, 489 gestalt -, 488 Hasse diagram, 269 inherited -, 489 pianola -, 801 topology, 489 specialize, 278 species, 293 spectral analysis, 638 decomposition, 856 participation vector, 638, 639 power, 640 vector, 1001 spectrum, 874
amplitude -, 1020 energy -, 1020 of poetical functions, 266 phase -, 291, 1020 prime -, 278, 293, 1108 speculum mundi, 41 speech, 19 sound -, 996 SPL, 1029 split exact sequence, 1069 local commutative composition, 215 SplitOperator, 827 splitting, 764 operator, 788 spring equation, 1020 SQL, 811 Staatliche Hochschule für Musik, 764, 765, 833 stability gestalt -, 276 stabilizer, 1066 stable concept, 276 staccato, 783 stalk, 1107 standard global - composition, 357 atlas, 357 chord, 531 hierarchy, 717 local - composition, 357 of basic musicological concepts, 108 simplex, 1150 composition, 211 standardized tempo, 877 Stange-Elbe, Joachim, viii, 764, 833 state sentic -, 734 stationary voice, 608 statistical approach, 855 method, 745, 818 procedure, 242 Steibelt, Daniel, 145, 161 Steinway, 955
stemma, 674, 745, 752, 755-772 continuous -, 803 leaves of a -, 764 prime -, 764 quiver, 801 tempo -, 758 theory, 895, 911 tree, 802 stemmatic cross-correlation, 771 grouping, 770 Stengers, Isabelle, 30, 664 stepwise forward selection, 881 stereocilia, 1038 stereotactic depth EEG, 638 stirrup, 1037 Stockhausen, Karlheinz, 70, 152, 286, 385 Stolberg, Leopold, 262, 283, 956 Stone, Peter, 137 Stopper, Bernhard, 110 strategy experimental -, 841, 851 paradigmatic -, 940 target-driven -, 841, 851 Straub, Hans, viii, 230, 347, 537, 553, 555, 1188 Straus' zero normal form, 256 strength factor, 742 stretching, 246 time -, 789 strict extension, 539 style, 656 reduced -, 657 STRING, 50 string empty, 52 of operations, 255 quartet, 82, 295, 934, 993 prehistory of the -, 994 theory, 993 strip harmonic -, 310, 321, 538 Möbius -, 549, 579, 941 strong marked dichotomy, 631
structural constant, 1105 homomorphism, 1075 rationale of performance, 395 structuralist linguistics, 305 structure formal -, 967 frame -, 718 internal -, 311 interspace -, 234 limbic -, 638 local vs. global -, 106 locally trivial -, 307 modal -, 383 of fibers, 913 regular -, 856 rhythmical -, 958 transitional -, 564 structures matching of -, 869 Structures pour piano, 152 structuring module, 807 Stucki, Peter, ix style, 869 strict -, 656 sub-complex Kh, 257 sub-path operator, 588, 1079 subbase for a topology, 1146 subcategory, 1118 full -, 1118 Yoneda -, 1137 subclass, 968 subcomplex, 1148 full -, 1148 subconscious, 643 subdivision, 757 subdominant, 323, 502, 541, 545 seventh, 508 subgroup normal -, 1067 subject, 24 subjectivity, 32 subobject, 1126 classifier, 1126
relation, 91 substance, 50 substitution theory, 1048 subtyping, 971 succession, 935 interval -, 252 successive interval, 640 successively increased alteration, 952 successor admitted -, 647 pairing allowed -, 646 sum direct -, 74 disjoint -, 1121 fiber -, 1121 of perspectives, 394 SUN, 810 Sundberg, Johan, 671, 738, 739, 741, 747 super-summativity, 203, 276, 332 superclass, 968 supersensitivity, 834 support, 410 functor, 314 of a local composition, 107 operator, 1015 set, 309 supporting valence, 1047 surface integral -, 1161 surgery concept -, 770 surgical epilepsy therapy, 638 surjective, 1059 suspension, 875 sweeping orientation, 128, 619 Swing, 922 switch vocabulary -, 293 Sylow decomposition, 95, 542, 620 group, 218 Sylow, Ludwig, 1069 symbol logical connective -, 1131
symbolic absolute dynamics, 81 computation, 967 kernel, 712 operator, 749, 789 pitch, 80 relative dynamics, 81 Symbolic Composer, 137 SymbolicOperator, 829 symmetric, 1059 group -, 1066 SYMMETRICA, 379 symmetries in music, 15, 137-154 musical meaning of -, 159 semantical paradigm for -, 159 symmetry, 108, 116, 135, 196, 1096 of parameter roles, 152 breaking, 936 codification of a -, 154 contrapuntal -, 647 degree of -, 254 form of a -, 135 function of a -, 136 group, 174, 220, 571, 816 conjugation class of the -, 220 hidden -, 136 inner of C-major, 147 local -, 648, 649 non-invertible, 153 semantical function of -, 135 semiotical -, 161 transformation, 305, 306 underlying -, 155 synchronic, 17 axis, 399, 575 normalization, 909 synonym form, 66 synonymy circular -, 76 type, 54 syntagm, 18
syntagmatic equivalence, 263 syntax modification, 982 Synthesis, 268, 576, 610, 613, 940, 941, 950, 955-964 synthesis, 1018 modal -, 1027 synthetic performance, 741 syntonic comma, 115 system auditory -, 11 coefficient -, 1150 Hess -, 638 limbic -, 642, 737, 1045 meta -, 19 object -, 19 of signs, 6 non-linguistic -, 16 parameter, 575 vestibular -, 739 weight -, 768 systematic approach, 574 musicology, 399 understanding, 994
T
t-fold tangent composition, 669 morphism, 669 t-fold differentiable tangent morphism, 669 t-gestalt, 474 tönend bewegte Formen, 307, 935 tactus, 457 Take Five, 218 tangent, 128, 621 bundle, 1155 composition, 669 basis of a -, 669 global -, 675 interpretation, 677 morphism, 669, 676 space, 669 Zariski -, 1112 torus, 621 tape music, 306
target-driven strategy, 841, 851 taste common -, 907 musical -, 643 tautology, 1132 Taylor, Cecil, 664, 963 technical parameter, 289 technique instrumental -, 1002 tectorial membrane, 1039 telling time, 934 tempered, 111 class chord, 111 scale space, 113 tuning, 1032 template fitting model, 1045 tempo, 664, 668, 670, 682 absolute -, 414, 682, 780 curve, 247, 270, 682, 738, 758, 877, 947 adapted -, 699 deformation of -, 699 daughter -, 682 discrete -, 31 field, 683 hierarchy, 758 istesso -, 673 mean -, 881 mother -, 682 musical -, 30, 31 operator, 793 relative -, 682, 832 relative local -, 780 relative punctual -, 780 slave, 759 standardized -, 877 stemma, 758 weight -, 794 tempo-intonation field, 686 TempoOperator, 829 temporal box, 979 period, 456 relation, 985 tenor tone, 319
tensed movement, 646 tension, 503, 786 contrapuntal -, 646 global -, 822 harmonic -, 586, 587 tensor product, 1078 affine -, 1094 Terhardt, Ernst, 1053 terminal, 1121 terminology, 248 territory, 719 tesselating chord, 377 tesselation isotypic -, 376 test EEG -, 638 operator, 791 Turing -, 955 Wilcoxon -, 640 tetractys, 33, 1049 tetradic interpretation, 337 tetrahedron, 4, 7 text analysis, 741 textual abstraction, 440 meaning, 400 predicate, 544, 552 representation, 937 semioses category of -, 409 textuality, 406-424 texture four part -, 995 The Sonic Language of Myth, 458 theme, 331, 503 basic -, 246 paradigmatic -, 272, 473 Reti's definition of a -, 491 theorem, 1133 complement -, 254 counterpoint -, 649, 653 Fourier's -, 1019 Mason's -, 130 Mason-Mazzola -, 130
modulation -, 572 theory American jazz -, 534 catastrophe -, 277, 604 classification -, 999 contour -, 332 counterpoint -, 936, 1008 degree -, 531 function -, 531 global -, 269 group -, 259 music -, 813 of ambiguity, 300 of narrativity, 934 Palestrina-Fux -, 655 performance -, 387, 393 rhythmical modulation -, 612 set -, 305 sonata -, 603 stemma -, 895, 911 string quartet -, 993 substitution -, 1048 valence -, 1035 wavelet -, 1025 thesis world-antiworld -, 604 Thiele, Bob, 733 thinking, 24 by doing, 31, 33 music, 24 operationalized -, 196 thinking music, 25 third, 502 axis, 113 chain, 318, 532, 820 closure, 319 interpretation, 319 minimal -, 319 weak -, 319 comma, 325 class, 325 coordinate, 1032
degree tonality, 548 distance, 622 major -, 73, 1031 weight, 820 Thom, René, 196, 277 3D vision, 439 threshold global -, 819 local -, 819 tie, 720 Tierny, Miles, 180, 435 tiling lattice, 517 timbral poetics, 295 time, 5, 411, 1029 -slice, 307 -span reduction (=TSR), 457 dilatation, 83 event, 674 generator, 936 grouping, 118 interval, 83 logical -, 611 material -, 611 mental -, 664 onset -, 1013 ontology, 936 physical -, 664 regularity, 116 reverberation -, 1016 series, 856 signature, 82, 768 span, 83 reduction, 752 stretching, 789 structure self-similar -, 964 telling -, 934 told -, 934 timing micro -, 270 microstructure, 871 Tinctoris, Johannes, 629 Todd operator, 752 Todd, Neil McAngus, 674, 739, 742, 744, 755
told time, 934 tolerance, 779, 827 Ton, 618 tonal ambiguity, 601 function, 304, 323, 544 value, 544 inversion, 148, 952 tonalities admitted -, 566 tonality, 304, 323, 502, 531, 544, 551 harmonic minor -, 560 major -, 560, 582 melodic minor -, 560 minor -, 582 natural minor -, 560 Pythagorean -, 561 reference -, 546 third degree -, 548 tone recitation -, 319 space notched -, 1048 tenor -, 319 tone parameters mental -, 79 physical -, 81 tonic, 319, 502, 541, 545 seventh, 508 tonical, 323 Tonort, 618 tonotopy, 1045 tool paradigmatic -, 953 top space, 715 top-down, 757 topic, 43, 400 topographic cube, 19, 36 topographical navigation, 21 topography, 9, 137 local -, 19 local character of -, 27 musical -, 19 topological
meaning, 192 space, 1145 irreducible -, 1110 topology, 43, 191, 199, 275 algebraic -, 200 associated -, 1154 base for a -, 1146 coherent -, 1146 coinduced -, 1146 colimit -, 1146 combinatorial -, 310 discrete -, 1145 dominance -, 283, 488 epsilon -, 483 epsilon gestalt -, 484 extension -, 521 finite cover -, 430 Grothendieck -, 180, 430, 1129 harmonic -,538 Hausdorff -, 1147 indiscrete -, 1145 Lawvere-Tierny -,435 limit -, 1146 maximal meter nerve -,460 maximal meter -, 329, 459 on gestalt spaces, 479 on motif spaces, 479 product -, 1146 quotient -, 1146 quotient dominance -, 283 relative -, 1146 relative motivic -, 486 specialization -, 489 subbase for a -, 1146 uniform -, 1147 weak -, 1146 Zariski -, 199, 293 topor, 1139 topos, 3, 10, 23, 1128 Boolean -, 1134 Grothendieck -, 1130 hyperouranios -, 23 logic, 530 of conversation, 995
Platonic -, 178 topos-theoretic background of modulation, 568 toroidal sequence shape type, 471 shape type, 471 torsion group, 1070 torsion-free rank, 1070 torus tangent -, 621 TOS, 736 total, 1059 Träumerei, 818, 857, 860, 899, 927 trace objective -, 121 track, 307 tradition, 401 contrapuntal -, 243, 1052 performance -, 907 Pythagorean -, 24, 1049 traditional musicology, 24 transcendence, 23 transform Fourier -, 1025 TransforMaster, 953 transformation, 492, 495, 935 control of -, 244 natural -, 1118 of sound, 145 plane -, 949 Riemann -, 384 score, 948 symmetry -, 305, 306 transformational approach, 249 invariance, 276, 332 meaning, 193 Transición II, 152, 394 transitional structure, 564 transitive, 1059 action, 1067 transitivity, 280 translation, 159, 1090
part, 1090 transposability, 203 transposition, 139, 160, 276, 624 limited -, 151 transvection, 144, 160 tree, 407 stemma -, 802 triad, 106, 502 augmented -, 321 diminished -, 321 major -, 321 minor, 321 triadic degree interpretation, 320 interpretation, 337, 548, 553, 566 trigeneric monoid, 540 trigger motion -, 738 trill, 88, 760 True, 1132 truth denotator, 407
TTO operator, 253 tube Eustachian -, 1037 Tudor, David, 306 tuning, 304 just -, 1032 just-tempered -, 1033 justest -, 560 mediante -, 1033 Pythagorean -, 325, 581, 1032 tempered -, 1032 12-tempered -, 106 well-tempered -, 1032 turbidity, 147 Turing machine, 670 test, 955 turning point, 565 12-tempered scales common -, 113 tuning, 106
two-dimensional alteration, 950 TX7 Yamaha -, 639 type, 50, 1139 casting, 405 change, 402 coproduct -, 53 form -, 64 of a cellular hierarchy, 716 powerset -, 54 product -, 52 shape -, 468 synonymy -, 54 types ordering on -, 90 typology of forms, 65 recursive, 56 recursive -, 48 U Uhde, Jürgen, 302, 567, 604 Unbewusstes, 643 uncertainty relation, 299, 516 uncontrolled paradigmatics, 201 underlying symmetry, 155 understanding, 395, 997 musical works, 393 systematic -, 994 unfolding, 937 unicorned view, 906 uniform topology, 1147 uniformity, 1147 union, 1058 unit musical -, 106 unity, 41, 48, 56 universal ordering, 44 universe, 1116 of structure, 400 of topics, 400 unordered p-space interval, 252 pc interval, 253
Ursatz, 400 Ussachevsky, Vladimir, 306 Utai, 416 utai, 14
V Valéry, Paul, 15, 47, 187, 663, 670, 681, 696, 711, 774 valence, 1046 supporting -, 1047 theory, 1035 valid sentence, 1132 validation operator, 424 valuation interpretative -, 15 value change, 769 initial -, 683 matrix, 925 participation -, 639 Varèse, Edgar, 392, 394 variable bound, 1136 causal-final -, 927 explanatory -, 877 free, 1136 inclusion frequency of -, 888 individual -, 1135 predicate -, 1135 propositional -, 1131 variable address, 61 variation, 492, 495, 618, 950 melodic -, 959 of the perspective, 182 pressure -, 1013 principle, 394 Variationen für Klavier, 394, 860 variations cycle of -, 956 varieties of sounds, 284 variety instrumental -, 673 vector, 1083 analytical -, 876
field, 1156 instrumental -, 1005 interval -, 253, 257 interval-class -, 249 invariance -, 254 prime -, 1033 shaping -, 876 space, 1085 spectral participation -, 639 spectral -, 1001 spectral participation -, 638 velocity, 739, 948, 1030 instantaneous -, 30 bow -, 1002 concept of instantaneous -, 30 physical -, 30 verbal description, 756 Vercoe, Barry, 918 Verdier, Jean-Louis, 431 Verillo, Ronald, 738 Vers la flamme, 587, 964 verse matrix, 261 poetics, 303 vertex, 1062, 1148 final -, 802 vertical grid vector, 951 poetical function, 942 poeticity, 261 vestibular system, 739 vibrato, 288, 290, 738, 1002 parameter, 1003 Vieru, Anatol, 257, 383 view, 968 kernel -, 826 unicorned -, 906 Villon, François, 138, 242, 243 viola, 993 violin, 993 family, 295, 997, 1009 hierarchy, 722
violoncello, 993 virtual acoustics, 850 visual navigation, 439 programming language, 967 semiotic marker, 981 visualization, 917, 918 vocabulary dodecaphonic -, 243 extension, 45 switch, 250, 293 Vogel chromatic, 582 Vogel, Martin, 115, 506, 512, 517, 576, 582, 1032, 1167 voice, 619 crossing, 619 instrumental -, 269 leading, 304 pedal -, 608 stationary -, 608 Voisin, Frédéric, 986 Volkswagen Foundation, 807 volume, 230 Vordergrund, 503 vowel, 286 Vuza class, 380 rhythm, 380 Vuza, Dan Tudor, 83, 257, 328, 376, 380
W W, 605 Wagner, Richard, 259, 814 walking, 738 wall bottom -, 768 Ward, Artemus, 747 wave, 1018 waveguide, 1027 wavelet, 289, 1025 frame, 290 Meyer -, 1026
Morlet -, 1025 Murenzi -, 1025 theory, 1025 wavelet-transformed, 1025 weak topology, 1146 Weber-Fechner law, 1029 Webern, Anton von, 149, 150, 152, 198, 248, 301, 394, 860, 907 Wedderburn, Joseph, 1088 wedge crescendo -, 778 Wegner, Peter, 29 weight, 726, 742, 744, 752, 775, 818 analytical -, 666, 671, 785 class -, 346 combination, 827 continuous -, 775 function default -, 587 Pólya -, 232 product -, 232 harmonic -, 587, 786 induced nerve -, 460 inverted -, 827 metrical -, 455, 456, 785 mixed -, 815 motivic -, 496, 785 nerve -, 460 onset -, 116 profile, 267 recombination, 776 simplicial -, 346 simplicial metrical -, 329 sober -, 460 system, 768 tempo, 794 third -, 820 watcher, 827 weighted graph, 292 well-ordered, 1060 well-tempered modulation, 571 tuning, 1032 Well-Tempered Piano, 303 Weyl, Hermann, 196
WFR, 457 whatness, 23 whereness, 23 White, Andrew, 694 whole, 301, 334 whole-tone scale, 657 Whymper, Edward, 183 Wicinski, A.A., 756 Widmer, Gerhard, 744 Wieland, Renate, 302 Wieser, Heinz-Gregor, 637 Wilcoxon test, 640 wild problem, 913 Wille, Rudolf, 3, 135, 551 window oval -, 1037, 1041 power -, 638 round -, 1041 Winson, Jonathan, 642 Wittgenstein, Ludwig, 43, 397 WLOG, 131 Wohltemperiertes Klavier, 141 Wolff, Christian, 306 word, 71, 1064 monoid, 1064 work, 12, 14 identity of a -, 16 motivic -, 338 production of a -, 13 world, 560, 605 world-antiworld thesis, 604 wrap form morphisms, 402 wrapped as local composition, 108 wreath product, 1069 writing field -, 969 Wulf, Bill, 35, 809 Wyschnegradsky, Ivan, 110
X Xenakis, Iannis, 33, 258
Y YAMAHA, 1027
Yamaha, 834, 849, 1022 CX5M, 639 RX5, 269, 955 TX7, 639 TX802, 269, 289, 293, 955 Yoneda embedding, 1091, 1120 lemma, 171, 341, 393 pair, 1137 philosophy, 109, 175, 184, 566, 997 subcategory, 1137 Yoneda, Nobuo, 175, 299, 392, 997 Z Z-addressed motives contrapuntal meaning of -, 120 Z-relation, 257 Zahorka, Oliver, 764, 807, 833 Zariski site, 1112 tangent, 128 space, 1112 topology, 199, 293 Zariski, Oskar, 199 Zarlino, Gioseffo, 147 Zauberflöte, 598 Zermelo, Ernst, 1061 zero address, 61, 62 zig-zag motivic -, 339, 941 Zurechthören, 1035 Zurich school, 744