11.07.2015 Views

Lectures on Modular Forms and Hecke Operators - William Stein

Lectures on Modular Forms and Hecke Operators - William Stein

Lectures on Modular Forms and Hecke Operators - William Stein

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<str<strong>on</strong>g>Lectures</str<strong>on</strong>g> <strong>on</strong> <strong>Modular</strong> <strong>Forms</strong> <strong>and</strong> <strong>Hecke</strong> <strong>Operators</strong>Kenneth A. Ribet<strong>William</strong> A. <strong>Stein</strong>October 5, 2011


C<strong>on</strong>tentsPreface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 The Main Objects 31.1 Torsi<strong>on</strong> points <strong>on</strong> elliptic curves . . . . . . . . . . . . . . . . . . . . 31.1.1 The Tate module . . . . . . . . . . . . . . . . . . . . . . . . 31.2 Galois representati<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . 41.3 <strong>Modular</strong> forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51.4 <strong>Hecke</strong> operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 <strong>Modular</strong> Representati<strong>on</strong>s <strong>and</strong> Algebraic Curves 72.1 <strong>Modular</strong> forms <strong>and</strong> Arithmetic . . . . . . . . . . . . . . . . . . . . 72.2 Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82.3 Parity c<strong>on</strong>diti<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92.4 C<strong>on</strong>jectures of Serre (mod l versi<strong>on</strong>) . . . . . . . . . . . . . . . . . 92.5 General remarks <strong>on</strong> mod p Galois representati<strong>on</strong>s . . . . . . . . . . 102.6 Serre’s c<strong>on</strong>jecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102.7 Wiles’s perspective . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 <strong>Modular</strong> <strong>Forms</strong> of Level 1 133.1 The Definiti<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133.2 Some examples <strong>and</strong> c<strong>on</strong>jectures . . . . . . . . . . . . . . . . . . . . 143.3 <strong>Modular</strong> forms as functi<strong>on</strong>s <strong>on</strong> lattices . . . . . . . . . . . . . . . . 153.4 <strong>Hecke</strong> operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173.5 <strong>Hecke</strong> operators directly <strong>on</strong> q-expansi<strong>on</strong>s . . . . . . . . . . . . . . . 183.5.1 Explicit descripti<strong>on</strong> of sublattices . . . . . . . . . . . . . . . 183.5.2 <strong>Hecke</strong> operators <strong>on</strong> q-expansi<strong>on</strong>s . . . . . . . . . . . . . . . 193.5.3 The <strong>Hecke</strong> algebra <strong>and</strong> eigenforms . . . . . . . . . . . . . . 213.5.4 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213.6 Two C<strong>on</strong>jectures about <strong>Hecke</strong> operators <strong>on</strong> level 1 modular forms . 22


ivC<strong>on</strong>tents3.6.1 Maeda’s c<strong>on</strong>jecture . . . . . . . . . . . . . . . . . . . . . . . 223.6.2 The Gouvea-Mazur c<strong>on</strong>jecture . . . . . . . . . . . . . . . . 223.7 An Algorithm for computing characteristic polynomials of <strong>Hecke</strong>operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243.7.1 Review of basic facts about modular forms . . . . . . . . . 243.7.2 The Naive approach . . . . . . . . . . . . . . . . . . . . . . 253.7.3 The Eigenform method . . . . . . . . . . . . . . . . . . . . 253.7.4 How to write down an eigenvector over an extensi<strong>on</strong> field . 273.7.5 Simple example: weight 36, p = 3 . . . . . . . . . . . . . . . 284 Duality, Rati<strong>on</strong>ality, <strong>and</strong> Integrality 294.1 <strong>Modular</strong> forms for SL 2 (Z) <strong>and</strong> Eisenstein series . . . . . . . . . . . 294.2 Inner product . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304.3 Eigenforms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 314.4 Integrality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 314.5 A Result from Victor Miller’s thesis . . . . . . . . . . . . . . . . . 324.6 The Peterss<strong>on</strong> inner product . . . . . . . . . . . . . . . . . . . . . 335 Analytic Theory of <strong>Modular</strong> Curves 375.1 The <strong>Modular</strong> group . . . . . . . . . . . . . . . . . . . . . . . . . . . 375.1.1 The Upper half plane . . . . . . . . . . . . . . . . . . . . . 375.2 Points <strong>on</strong> modular curves parameterize elliptic curves with extrastructure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 385.3 The Genus of X(N) . . . . . . . . . . . . . . . . . . . . . . . . . . 416 <strong>Modular</strong> Curves 456.1 Cusp <strong>Forms</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 456.2 <strong>Modular</strong> curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 466.3 Classifying Γ(N)-structures . . . . . . . . . . . . . . . . . . . . . . 466.4 More <strong>on</strong> integral <strong>Hecke</strong> operators . . . . . . . . . . . . . . . . . . . 476.5 Complex c<strong>on</strong>jugati<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . 476.6 Isomorphism in the real case . . . . . . . . . . . . . . . . . . . . . 476.7 The Eichler-Shimura isomorphism . . . . . . . . . . . . . . . . . . 486.8 The Petters<strong>on</strong> inner product is <strong>Hecke</strong> compatible . . . . . . . . . . 497 <strong>Modular</strong> Symbols 517.1 <strong>Modular</strong> symbols . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517.2 Manin symbols . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 527.2.1 Using c<strong>on</strong>tinued fracti<strong>on</strong>s to obtain surjectivity . . . . . . . 537.2.2 Triangulating X(G) to obtain injectivity . . . . . . . . . . . 547.3 <strong>Hecke</strong> operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 577.4 <strong>Modular</strong> symbols <strong>and</strong> rati<strong>on</strong>al homology . . . . . . . . . . . . . . . 587.5 Special values of L-functi<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . 598 <strong>Modular</strong> <strong>Forms</strong> of Higher Level 638.1 <strong>Modular</strong> <strong>Forms</strong> <strong>on</strong> Γ 1 (N) . . . . . . . . . . . . . . . . . . . . . . . 638.2 The Diam<strong>on</strong>d bracket <strong>and</strong> <strong>Hecke</strong> operators . . . . . . . . . . . . . 648.2.1 Diam<strong>on</strong>d bracket operators . . . . . . . . . . . . . . . . . . 648.2.2 <strong>Hecke</strong> <strong>Operators</strong> <strong>on</strong> q-expansi<strong>on</strong>s . . . . . . . . . . . . . . . 668.3 Old <strong>and</strong> new subspaces . . . . . . . . . . . . . . . . . . . . . . . . . 66


C<strong>on</strong>tentsv9 Newforms <strong>and</strong> Euler Products 699.1 Atkin-Lehner-Li theory . . . . . . . . . . . . . . . . . . . . . . . . 699.2 The U p operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . 739.2.1 A C<strong>on</strong>necti<strong>on</strong> with Galois representati<strong>on</strong>s . . . . . . . . . . 749.2.2 When is U p semisimple? . . . . . . . . . . . . . . . . . . . . 759.2.3 An Example of n<strong>on</strong>-semisimple U p . . . . . . . . . . . . . . 759.3 The Cusp forms are free of rank <strong>on</strong>e over T C . . . . . . . . . . . . 759.3.1 Level 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 759.3.2 General level . . . . . . . . . . . . . . . . . . . . . . . . . . 779.4 Decomposing the anemic <strong>Hecke</strong> algebra . . . . . . . . . . . . . . . 7810 Some Explicit Genus Computati<strong>on</strong>s 8110.1 Computing the dimensi<strong>on</strong> of S k (Γ) . . . . . . . . . . . . . . . . . . 8110.2 Applicati<strong>on</strong> of Riemann-Hurwitz . . . . . . . . . . . . . . . . . . . 8210.3 Explicit genus computati<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . 8310.4 The Genus of X(N) . . . . . . . . . . . . . . . . . . . . . . . . . . 8310.5 The Genus of X 0 (N) . . . . . . . . . . . . . . . . . . . . . . . . . . 8310.6 <strong>Modular</strong> forms mod p . . . . . . . . . . . . . . . . . . . . . . . . . 8411 The Field of Moduli 8711.1 Digressi<strong>on</strong> <strong>on</strong> moduli . . . . . . . . . . . . . . . . . . . . . . . . . . 8811.2 When is ρ E surjective? . . . . . . . . . . . . . . . . . . . . . . . . . 8811.3 Observati<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8911.4 A descent problem . . . . . . . . . . . . . . . . . . . . . . . . . . . 9011.5 Sec<strong>on</strong>d look at the descent exercise . . . . . . . . . . . . . . . . . . 9111.6 Acti<strong>on</strong> of GL 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9212 <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dences 9512.1 The Definiti<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9512.2 Maps induced by corresp<strong>on</strong>dences . . . . . . . . . . . . . . . . . . . 9712.3 Induced maps <strong>on</strong> Jacobians of curves . . . . . . . . . . . . . . . . . 9812.4 More <strong>on</strong> <strong>Hecke</strong> operators . . . . . . . . . . . . . . . . . . . . . . . . 9912.5 <strong>Hecke</strong> operators acting <strong>on</strong> Jacobians . . . . . . . . . . . . . . . . . 9912.5.1 The Albanese Map . . . . . . . . . . . . . . . . . . . . . . . 10012.5.2 The <strong>Hecke</strong> algebra . . . . . . . . . . . . . . . . . . . . . . . 10112.6 The Eichler-Shimura relati<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . 10112.7 Applicati<strong>on</strong>s of the Eichler-Shimura relati<strong>on</strong> . . . . . . . . . . . . . 10512.7.1 The Characteristic polynomial of Frobenius . . . . . . . . . 10512.7.2 The Cardinality of J 0 (N)(F p ) . . . . . . . . . . . . . . . . . 10713 Abelian Varieties 10913.1 Abelian varieties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10913.2 Complex tori . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11013.2.1 Homomorphisms . . . . . . . . . . . . . . . . . . . . . . . . 11013.2.2 Isogenies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11213.2.3 Endomorphisms . . . . . . . . . . . . . . . . . . . . . . . . 11313.3 Abelian varieties as complex tori . . . . . . . . . . . . . . . . . . . 11313.3.1 Hermitian <strong>and</strong> Riemann forms . . . . . . . . . . . . . . . . 11413.3.2 Complements, quotients, <strong>and</strong> semisimplicity of the endomorphismalgebra . . . . . . . . . . . . . . . . . . . . . . . . . . 115


viC<strong>on</strong>tents13.3.3 Theta functi<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . 11713.4 A Summary of duality <strong>and</strong> polarizati<strong>on</strong>s . . . . . . . . . . . . . . . 11713.4.1 Sheaves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11813.4.2 The Picard group . . . . . . . . . . . . . . . . . . . . . . . . 11813.4.3 The Dual as a complex torus . . . . . . . . . . . . . . . . . 11813.4.4 The Nér<strong>on</strong>-Severi group <strong>and</strong> polarizati<strong>on</strong>s . . . . . . . . . . 11913.4.5 The Dual is functorial . . . . . . . . . . . . . . . . . . . . . 11913.5 Jacobians of curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 11913.5.1 Divisors <strong>on</strong> curves <strong>and</strong> linear equivalence . . . . . . . . . . 12013.5.2 Algebraic definiti<strong>on</strong> of the Jacobian . . . . . . . . . . . . . 12113.5.3 The Abel-Jacobi theorem . . . . . . . . . . . . . . . . . . . 12213.5.4 Every abelian variety is a quotient of a Jacobian . . . . . . 12313.6 Nér<strong>on</strong> models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12513.6.1 What are Nér<strong>on</strong> models? . . . . . . . . . . . . . . . . . . . 12513.6.2 The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture <strong>and</strong> Nér<strong>on</strong> models12713.6.3 Functorial properties of Ner<strong>on</strong> models . . . . . . . . . . . . 12914 Abelian Varieties Attached to <strong>Modular</strong> <strong>Forms</strong> 13114.1 Decompositi<strong>on</strong> of the <strong>Hecke</strong> algebra . . . . . . . . . . . . . . . . . 13114.1.1 The Dimensi<strong>on</strong> of the algebras L f . . . . . . . . . . . . . . 13214.2 Decompositi<strong>on</strong> of J 1 (N) . . . . . . . . . . . . . . . . . . . . . . . . 13314.2.1 Aside: intersecti<strong>on</strong>s <strong>and</strong> c<strong>on</strong>gruences . . . . . . . . . . . . . 13414.3 Galois representati<strong>on</strong>s attached to A f . . . . . . . . . . . . . . . . 13514.3.1 The Weil pairing . . . . . . . . . . . . . . . . . . . . . . . . 13614.3.2 The Determinant . . . . . . . . . . . . . . . . . . . . . . . . 13814.4 Remarks about the modular polarizati<strong>on</strong> . . . . . . . . . . . . . . . 13915 <strong>Modular</strong>ity of Abelian Varieties 14115.1 <strong>Modular</strong>ity over Q . . . . . . . . . . . . . . . . . . . . . . . . . . . 14115.2 <strong>Modular</strong>ity of elliptic curves over Q . . . . . . . . . . . . . . . . . 14415.3 <strong>Modular</strong>ity of abelian varieties over Q . . . . . . . . . . . . . . . . 14416 L-functi<strong>on</strong>s 14716.1 L-functi<strong>on</strong>s attached to modular forms . . . . . . . . . . . . . . . . 14716.1.1 Analytic c<strong>on</strong>tinuati<strong>on</strong> <strong>and</strong> functi<strong>on</strong>al equati<strong>on</strong>s . . . . . . . 14816.1.2 A C<strong>on</strong>jecture about n<strong>on</strong>vanishing of L(f, k/2) . . . . . . . . 15016.1.3 Euler products . . . . . . . . . . . . . . . . . . . . . . . . . 15016.1.4 Visualizing L-functi<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . 15117 The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture 15317.1 The Rank c<strong>on</strong>jecture . . . . . . . . . . . . . . . . . . . . . . . . . . 15317.2 Refined rank zero c<strong>on</strong>jecture . . . . . . . . . . . . . . . . . . . . . . 15517.2.1 The Number of real comp<strong>on</strong>ents . . . . . . . . . . . . . . . 15617.2.2 The Manin index . . . . . . . . . . . . . . . . . . . . . . . . 15617.2.3 The Real volume Ω A . . . . . . . . . . . . . . . . . . . . . . 15717.2.4 The Period mapping . . . . . . . . . . . . . . . . . . . . . . 15817.2.5 The Manin-Drinfeld theorem . . . . . . . . . . . . . . . . . 15817.2.6 The Period lattice . . . . . . . . . . . . . . . . . . . . . . . 15817.2.7 The Special value L(A, 1) . . . . . . . . . . . . . . . . . . . 15917.2.8 Rati<strong>on</strong>ality of L(A, 1)/Ω A . . . . . . . . . . . . . . . . . . . 159


C<strong>on</strong>tentsvii17.3 General refined c<strong>on</strong>jecture . . . . . . . . . . . . . . . . . . . . . . . 16117.4 The C<strong>on</strong>jecture for n<strong>on</strong>-modular abelian varieties . . . . . . . . . . 16117.5 Visibility of Shafarevich-Tate groups . . . . . . . . . . . . . . . . . 16217.5.1 Definiti<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . . . 16317.5.2 Every element of H 1 (K, A) is visible somewhere . . . . . . . 16417.5.3 Visibility in the c<strong>on</strong>text of modularity . . . . . . . . . . . . 16417.5.4 Future directi<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . 16617.6 Kolyvagin’s Euler system of Heegner points . . . . . . . . . . . . . 16717.6.1 A Heegner point when N = 11 . . . . . . . . . . . . . . . . 17617.6.2 Kolyvagin’s Euler system for curves of rank at least 2 . . . 17718 The Gorenstein Property for <strong>Hecke</strong> Algebras 17918.1 Mod l representati<strong>on</strong>s associated to modular forms . . . . . . . . . 17918.2 The Gorenstein property . . . . . . . . . . . . . . . . . . . . . . . . 18218.3 Proof of the Gorenstein property . . . . . . . . . . . . . . . . . . . 18418.3.1 Vague comments . . . . . . . . . . . . . . . . . . . . . . . . 18718.4 Finite flat group schemes . . . . . . . . . . . . . . . . . . . . . . . 18718.5 Reformulati<strong>on</strong> of V = W problem . . . . . . . . . . . . . . . . . . . 18818.6 Dieud<strong>on</strong>né theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18918.7 The proof: part II . . . . . . . . . . . . . . . . . . . . . . . . . . . 18918.8 Key result of Bost<strong>on</strong>-Lenstra-Ribet . . . . . . . . . . . . . . . . . . 19219 Local Properties of ρ λ 19319.1 Definiti<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19319.2 Local properties at primes p ∤ N . . . . . . . . . . . . . . . . . . . 19419.3 Weil-Deligne Groups . . . . . . . . . . . . . . . . . . . . . . . . . . 19419.4 Local properties at primes p | N . . . . . . . . . . . . . . . . . . . 19419.5 Definiti<strong>on</strong> of the reduced c<strong>on</strong>ductor . . . . . . . . . . . . . . . . . . 19520 Adelic Representati<strong>on</strong>s 19720.1 Adelic representati<strong>on</strong>s associated to modular forms . . . . . . . . . 19720.2 More local properties of the ρ λ . . . . . . . . . . . . . . . . . . . . . 20020.2.1 Possibilities for π p . . . . . . . . . . . . . . . . . . . . . . . 20120.2.2 The case l = p . . . . . . . . . . . . . . . . . . . . . . . . . 20220.2.3 Tate curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 20321 Serre’s C<strong>on</strong>jecture 20521.1 The Family of λ-adic representati<strong>on</strong>s attached to a newform . . . . 20621.2 Serre’s C<strong>on</strong>jecture A . . . . . . . . . . . . . . . . . . . . . . . . . . 20621.2.1 The Field of definiti<strong>on</strong> of ρ . . . . . . . . . . . . . . . . . . 20721.3 Serre’s C<strong>on</strong>jecture B . . . . . . . . . . . . . . . . . . . . . . . . . . 20821.4 The Level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20821.4.1 Remark <strong>on</strong> the case N(ρ) = 1 . . . . . . . . . . . . . . . . . 20921.4.2 Remark <strong>on</strong> the proof of C<strong>on</strong>jecture B . . . . . . . . . . . . 21021.5 The Weight . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21121.5.1 The Weight modulo l − 1 . . . . . . . . . . . . . . . . . . . 21121.5.2 Tameness at l . . . . . . . . . . . . . . . . . . . . . . . . . . 21121.5.3 Fundamental characters of the tame extensi<strong>on</strong> . . . . . . . 21221.5.4 The Pair of characters associated to ρ . . . . . . . . . . . . 21321.5.5 Recipe for the weight . . . . . . . . . . . . . . . . . . . . . 214


viiiC<strong>on</strong>tents21.5.6 The World’s first view of fundamental characters . . . . . . 21521.5.7 F<strong>on</strong>taine’s theorem . . . . . . . . . . . . . . . . . . . . . . . 21521.5.8 Guessing the weight (level 2 case) . . . . . . . . . . . . . . 21521.5.9 θ-cycles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21621.5.10 Edixhoven’s paper . . . . . . . . . . . . . . . . . . . . . . . 21821.6 The Character . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21821.6.1 A Counterexample . . . . . . . . . . . . . . . . . . . . . . . 22021.7 The Weight revisited: level 1 case . . . . . . . . . . . . . . . . . . . 22121.7.1 Compani<strong>on</strong> forms . . . . . . . . . . . . . . . . . . . . . . . . 22121.7.2 The Weight: the remaining level 1 case . . . . . . . . . . . . 22221.7.3 Finiteness . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22322 Fermat’s Last Theorem 22522.1 The applicati<strong>on</strong> to Fermat . . . . . . . . . . . . . . . . . . . . . . . 22522.2 <strong>Modular</strong> elliptic curves . . . . . . . . . . . . . . . . . . . . . . . . . 22723 Deformati<strong>on</strong>s 22923.1 Introducti<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22923.2 C<strong>on</strong>diti<strong>on</strong> (∗) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23023.2.1 Finite flat representati<strong>on</strong>s . . . . . . . . . . . . . . . . . . . 23123.3 Classes of liftings . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23123.3.1 The case p ≠ l . . . . . . . . . . . . . . . . . . . . . . . . . 23123.3.2 The case p = l . . . . . . . . . . . . . . . . . . . . . . . . . 23223.4 Wiles’s <strong>Hecke</strong> algebra . . . . . . . . . . . . . . . . . . . . . . . . . 23324 The <strong>Hecke</strong> Algebra T Σ 23524.1 The <strong>Hecke</strong> algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . 23524.2 The Maximal ideal in R . . . . . . . . . . . . . . . . . . . . . . . . 23724.2.1 Strip away certain Euler factors . . . . . . . . . . . . . . . . 23724.2.2 Make into an eigenform for U l . . . . . . . . . . . . . . . . 23824.3 The Galois representati<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . . . 23924.3.1 The Structure of T m . . . . . . . . . . . . . . . . . . . . . . 24024.3.2 The Philosophy in this picture . . . . . . . . . . . . . . . . 24024.3.3 Massage ρ . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24024.3.4 Massage ρ ′ . . . . . . . . . . . . . . . . . . . . . . . . . . . 24124.3.5 Representati<strong>on</strong>s from modular forms mod l . . . . . . . . . 24224.3.6 Representati<strong>on</strong>s from modular forms mod l n . . . . . . . . 24224.4 ρ ′ is of type Σ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24324.5 Isomorphism between T m <strong>and</strong> R mR . . . . . . . . . . . . . . . . . . 24424.6 Deformati<strong>on</strong>s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24524.7 Wiles’s main c<strong>on</strong>jecture . . . . . . . . . . . . . . . . . . . . . . . . 24624.8 T Σ is a complete intersecti<strong>on</strong> . . . . . . . . . . . . . . . . . . . . . 24824.9 The Inequality #O/η ≤ #℘ T /℘ 2 T ≤ #℘ R/℘ 2 R . . . . . . . . . . . . 24824.9.1 The Definiti<strong>on</strong>s of the ideals . . . . . . . . . . . . . . . . . . 24924.9.2 Aside: Selmer groups . . . . . . . . . . . . . . . . . . . . . . 25024.9.3 Outline of some proofs . . . . . . . . . . . . . . . . . . . . . 25025 Computing with <strong>Modular</strong> <strong>Forms</strong> <strong>and</strong> Abelian Varieties 25326 The <strong>Modular</strong> Curve X 0 (389) 255


C<strong>on</strong>tentsix26.1 Factors of J 0 (389) . . . . . . . . . . . . . . . . . . . . . . . . . . . 25626.1.1 Newforms of level 389 . . . . . . . . . . . . . . . . . . . . . 25626.1.2 Isogeny structure . . . . . . . . . . . . . . . . . . . . . . . . 25726.1.3 Mordell-Weil ranks . . . . . . . . . . . . . . . . . . . . . . . 25726.2 The <strong>Hecke</strong> algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . 25826.2.1 The Discriminant is divisible by p . . . . . . . . . . . . . . 25826.2.2 C<strong>on</strong>gruences primes in S p+1 (Γ 0 (1)) . . . . . . . . . . . . . . 25926.3 Supersingular points in characteristic 389 . . . . . . . . . . . . . . 26026.3.1 The Supersingular j-invariants in characteristic 389 . . . . 26026.4 Miscellaneous . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26026.4.1 The Shafarevich-Tate group . . . . . . . . . . . . . . . . . . 26026.4.2 Weierstrass points <strong>on</strong> X 0 + (p) . . . . . . . . . . . . . . . . . 26126.4.3 A Property of the plus part of the integral homology . . . . 26126.4.4 The Field generated by points of small prime order <strong>on</strong> anelliptic curve . . . . . . . . . . . . . . . . . . . . . . . . . . 261References 263


PrefaceThis book began when the sec<strong>on</strong>d author typed notes for the first author’s 1996Berkeley course <strong>on</strong> modular forms with a view toward explaining some of the keyideas in Wiles’s celebrated proof of Fermat’s Last Theorem. The sec<strong>on</strong>d authorthen exp<strong>and</strong>ed <strong>and</strong> rewrote the notes while teaching a course at Harvard in 2003<strong>on</strong> modular abelian varieties.The intended audience of this book is advanced graduate students <strong>and</strong> mathematicalresearchers. This book is more advanced than [LR11, Ste07, DS05], <strong>and</strong>at a relatively similar level to [DI95], though with more details. It should be substantiallymore accessible than a typical research paper in the area.Some things about how this book is (or will be!) fully “modern” in that it takesinto account:• The full modularity theorem.• The proof of the full Serre c<strong>on</strong>jecture• Computati<strong>on</strong>al techniques (algorithms, Sage)Notati<strong>on</strong>∼= an isomorphism≈ a n<strong>on</strong>can<strong>on</strong>ical isomorphismAcknowledgementJoe Wetherell wrote the first versi<strong>on</strong> of Secti<strong>on</strong> 2.5.


2 C<strong>on</strong>tentsC<strong>on</strong>tactKenneth A. Ribet (ribet@math.berkeley.edu)<strong>William</strong> A. <strong>Stein</strong> (wstein@gmail.com)


1The Main Objects1.1 Torsi<strong>on</strong> points <strong>on</strong> elliptic curvesThe main geometric objects that we will study in this book are elliptic curves,which are curves of genus <strong>on</strong>e equipped with a distinguished point. More generally,we c<strong>on</strong>sider certain algebraic curves of larger genus called modular curves, which inturn give rise via the Jacobian c<strong>on</strong>structi<strong>on</strong> to higher-dimensi<strong>on</strong>al abelian varietiesfrom which we will obtain representati<strong>on</strong>s of the Galois group Gal(Q/Q) of therati<strong>on</strong>al numbers.It is c<strong>on</strong>venient to view the group of complex points E(C) <strong>on</strong> an elliptic curve Eover the complex numbers C as a quotient C/L. Here{∫}L = ω : γ ∈ H 1 (E(C), Z)γis a lattice attached to a n<strong>on</strong>zero holomorphic differential ω <strong>on</strong> E, <strong>and</strong> the homologyH 1 (E(C), Z) ≈ Z×Z is the abelian group of smooth closed paths <strong>on</strong> E(C) modulothe homology relati<strong>on</strong>s.Viewing E as C/L immediately gives us informati<strong>on</strong> about the structure of thegroup of torsi<strong>on</strong> points <strong>on</strong> E, which we exploit in the next secti<strong>on</strong> to c<strong>on</strong>structtwo-dimensi<strong>on</strong>al representati<strong>on</strong>s of Gal(Q/Q).1.1.1 The Tate moduleIn the 1940s, Andre Weil studied the analogous situati<strong>on</strong> for elliptic curves definedover a finite field k. He desperately wanted to find an algebraic way to describethe above relati<strong>on</strong>ship between elliptic curves <strong>and</strong> lattices. He found an algebraicdefiniti<strong>on</strong> of L/nL, when n is prime to the characteristic of k.LetE[n] := {P ∈ E(k) : nP = 0}.


4 1. The Main ObjectsWhen E is defined over C,( ) 1E[n] =n L /L ∼ = L/nL ≈ (Z/nZ) × (Z/nZ),so E[n] is a purely algebraic object can<strong>on</strong>ically isomorphic to L/nL.Now suppose E is defined over an arbitrary field k. For any prime l, letE[l ∞ ] := {P ∈ E(k) : l ν P = 0, some ν ≥ 1}∞⋃= E[l ν ] = lim E[l ν ]. −→ν=1In an analogous way, Tate c<strong>on</strong>structed a rank 2 free Z l -moduleT l (E) := lim ←−E[l ν ],where the map E[l ν ] → E[l ν−1 ] is multiplicati<strong>on</strong> by l. The Z/l ν Z-module structureof E[l ν ] is compatible with the maps E[l ν ] l −→ E[l ν−1 ] (see, e.g., [Sil92, III.7]).If l is coprime to the characteristic of the base field k, then T l (E) is free of rank 2over Z l , <strong>and</strong>V l (E) := T l (E) ⊗ Q lis a 2-dimensi<strong>on</strong>al vector space over Q l .1.2 Galois representati<strong>on</strong>sNumber theory is largely c<strong>on</strong>cerned with the Galois group Gal(Q/Q), which isoften studied by c<strong>on</strong>sidering c<strong>on</strong>tinuous linear representati<strong>on</strong>sρ : Gal(Q/Q) → GL n (K)where K is a field <strong>and</strong> n is a positive integer, usually 2 in this book. Artin, Shimura,Taniyama, <strong>and</strong> Tate pi<strong>on</strong>eered the study of such representati<strong>on</strong>s.Let E be an elliptic curve defined over the rati<strong>on</strong>al numbers Q. Then Gal(Q/Q)acts <strong>on</strong> the set E[n], <strong>and</strong> this acti<strong>on</strong> respects the group operati<strong>on</strong>s, so we obtain arepresentati<strong>on</strong>ρ : Gal(Q/Q) → Aut(E[n]) ≈ GL 2 (Z/nZ).Let K be the field cut out by the ker(ρ), i.e., the fixed field of ker(ρ). Then K isa finite Galois extensi<strong>on</strong> of Q. SinceGal(K/Q) ∼ = Gal(Q/Q)/ ker ρ ∼ = Im ρ ↩→ GL 2 (Z/nZ)we obtain, in this way, subgroups of GL 2 (Z/nZ) as Galois groups.Shimura showed that if we start with the elliptic curve E defined by the equati<strong>on</strong>y 2 + y = x 3 − x 2 then for “most” n the image of ρ is all of GL 2 (Z/nZ). Moregenerally, the image is “most” of GL 2 (Z/nZ) when E does not have complexmultiplicati<strong>on</strong>. (We say E has complex multiplicati<strong>on</strong> if its endomorphism ringover C is strictly larger than Z.)


1.3 <strong>Modular</strong> forms 51.3 <strong>Modular</strong> formsMany spectacular theorems <strong>and</strong> deep c<strong>on</strong>jectures link Galois representati<strong>on</strong>s withmodular forms. <strong>Modular</strong> forms are extremely symmetric analytic objects, whichwe will first view as holomorphic functi<strong>on</strong>s <strong>on</strong> the complex upper half plane thatbehave well with respect to certain groups of transformati<strong>on</strong>s.Let SL 2 (Z) be the group of 2 × 2 integer matrices with determinant 1. For anypositive integer N, c<strong>on</strong>sider the subgroup{( )}a bΓ 1 (N) := ∈ SLc d 2 (Z) : a ≡ d ≡ 1, c ≡ 0 (mod N)( )1 ∗of matrices in SL 2 (Z) that are of the form when reduced modulo N.0 1The space S k (N) of cusp forms of weight k <strong>and</strong> level N for Γ 1 (N) c<strong>on</strong>sists ofall holomorphic functi<strong>on</strong>s f(z) <strong>on</strong> the complex upper half planeh = {z ∈ C : Im(z) > 0}that vanish at the cusps (see below) <strong>and</strong> satisfy the equati<strong>on</strong>( )( )az + bf = (cz + d) k a bf(z) for all ∈ Γcz + dc d 1 (N) <strong>and</strong> z ∈ h.Thus f(z + 1) = f(z), so f determines a functi<strong>on</strong> F of q(z) = e 2πiz such thatF (q) = f(z). Viewing F as a functi<strong>on</strong> <strong>on</strong> {z : 0 < |z| < 1}, the c<strong>on</strong>diti<strong>on</strong> thatf(z) is holomorphic <strong>and</strong> vanishes at infinity is that F (z) extends to a holomorphicfuncti<strong>on</strong> <strong>on</strong> {z : |z| < 1} <strong>and</strong> F (0) = 0. In this case, f is determined by its Fourierexpansi<strong>on</strong>∞∑f(q) = a n q n .n=1It is also useful to c<strong>on</strong>sider the space M k (N) of modular forms of level N, whichis defined in the same way as S k (N), except that the c<strong>on</strong>diti<strong>on</strong> that F (0) = 0 isrelaxed, <strong>and</strong> we require <strong>on</strong>ly that F extends to a holomorphic functi<strong>on</strong> at 0 (<strong>and</strong>there is a similar c<strong>on</strong>diti<strong>on</strong> at the cusps other than ∞).The spaces M k (N) <strong>and</strong> S k (N) are finite dimensi<strong>on</strong>al.Example 1.3.1. We compute dim(M 5 (30)) <strong>and</strong> dim(S 5 (30)) in Sage:sage : <strong>Modular</strong><strong>Forms</strong> ( Gamma1 (30) ,5). dimensi<strong>on</strong> ()112sage : Cusp<strong>Forms</strong> ( Gamma1 (30) ,5). dimensi<strong>on</strong> ()80For example, the space S 12 (1) has dimensi<strong>on</strong> <strong>on</strong>e <strong>and</strong> is spanned by the famouscusp form∞∏∞∑∆ = q (1 − q n ) 24 = τ(n)q n .n=1The coefficients τ(n) define the Ramanujan τ-functi<strong>on</strong>. A n<strong>on</strong>-obvious fact is that τis multiplicative <strong>and</strong> for every prime p <strong>and</strong> positive integer ν, we haven=1τ(p ν+1 ) = τ(p)τ(p ν ) − p 11 τ(p ν−1 ).


6 1. The Main ObjectsExample 1.3.2. We draw a plot of the ∆ functi<strong>on</strong> (using 20 terms of the q-expansi<strong>on</strong>) <strong>on</strong> the upper half plane. Notice the symmetry ∆(z) = ∆(z + 1):0.70.60.50.40.30.20.1-2 -1.5 -1 -0.5 0.5 1 1.5 2sage : z = var (’z’); q = exp (2* pi*i*z)sage : D = delta_qexp (20)( q)sage : complex_plot (D, ( -2 ,2) , (0 ,.75) , plot_points =200)1.4 <strong>Hecke</strong> operatorsMordell defined operators T n , n ≥ 1, <strong>on</strong> S k (N) which are called <strong>Hecke</strong> operators.These proved very fruitful. The set of such operators forms a commuting family ofendomorphisms <strong>and</strong> is hence “almost” simultaneously diag<strong>on</strong>alizable.Often there is a basis f 1 , . . . , f r of S k (N) such that each f = f i = ∑ ∞n=1 a nq nis a simultaneous eigenvector for all the <strong>Hecke</strong> operators T n normalized so thatT n f = a n f, i.e., so the coefficient of q is 1. In this situati<strong>on</strong>, the eigenvalues a n arenecessarily algebraic integers <strong>and</strong> the fieldK = K f = Q(. . . , a n , . . .)generated by all a n is finite over Q.The a n exhibit remarkable properties. For example,τ(n) ≡ ∑ d|nd 11 (mod 691).We can check this c<strong>on</strong>gruence for n = 30 in Sage as follows:sage : n =30; t = delta_qexp (n +1)[ n]; t-29211840sage : sigma (n ,11)17723450167663752sage : ( sigma (n ,11) - t )%6910The key to studying <strong>and</strong> interpreting the a n is to underst<strong>and</strong> the deep c<strong>on</strong>necti<strong>on</strong>sbetween Galois representati<strong>on</strong>s <strong>and</strong> modular forms that were discovered bySerre, Shimura, Eichler <strong>and</strong> Deligne.


2<strong>Modular</strong> Representati<strong>on</strong>s <strong>and</strong> AlgebraicCurves2.1 <strong>Modular</strong> forms <strong>and</strong> ArithmeticC<strong>on</strong>sider a cusp formf =∞∑a n q n ∈ S k (N)n=1which is an eigenform for all of the <strong>Hecke</strong> operators T p , <strong>and</strong> assume f is normalizedso a 1 = 1. Then the Mellin transform of f is the L-functi<strong>on</strong>L(f, s) =∞∑n=1a nn s .<strong>Hecke</strong> proved that L(f, s) extends uniquely to a holomorphic functi<strong>on</strong> <strong>on</strong> C thatsatisfies a functi<strong>on</strong>al equati<strong>on</strong>.Let K = Q(a 1 , a 2 , . . .) be the number field generated by the Fourier coefficientsof f. One can show that the a n are algebraic integers <strong>and</strong> that K is a number field.When k = 2, Shimura associated to f an abelian variety A f over Q of dimensi<strong>on</strong>[K : Q] <strong>on</strong> which Z[a 1 , a 2 , . . .] acts [Shi94, Theorem 7.14].Example 2.1.1 (<strong>Modular</strong> Elliptic Curves). Suppose now that all coefficients a n of flie in Q so that [K : Q] = 1 <strong>and</strong> hence A f is a <strong>on</strong>e dimensi<strong>on</strong>al abelian variety, i.e.,an elliptic curve. An elliptic curve isogenous to <strong>on</strong>e arising via this c<strong>on</strong>structi<strong>on</strong> iscalled modular.Elliptic curves E 1 <strong>and</strong> E 2 are isogenous if there is a morphism E 1 → E 2 ofalgebraic groups, having finite kernel.The following theorem motivates much of the theory discussed in this course. Itis a theorem of Breuil, C<strong>on</strong>rad, Diam<strong>on</strong>d, Taylor, <strong>and</strong> Wiles (see [BCDT01]).Theorem 2.1.2 (<strong>Modular</strong>ity Theorem). Every elliptic curve over Q is modular,that is, isogenous to a curve c<strong>on</strong>structed in the above way.


8 2. <strong>Modular</strong> Representati<strong>on</strong>s <strong>and</strong> Algebraic CurvesFor k ≥ 2 Serre <strong>and</strong> Deligne discovered a way to associate to f a family of l-adicrepresentati<strong>on</strong>s. Let l be a prime number <strong>and</strong> K = Q(a 1 , a 2 , . . .) be as above.Then it is well known that∏K ⊗ Q Q l∼ = K λ .One can associate to f a representati<strong>on</strong>λ|lρ l,f : G = Gal(Q/Q) → GL 2 (K ⊗ Q Q l )unramified at all primes p ∤ lN. Let Z be the ring of all algebraic integers. For ρ l,fto be unramified at p we mean that for all primes P of Z lying over p, the inertiasubgroup of the decompositi<strong>on</strong> group at P is c<strong>on</strong>tained in the kernel of ρ l,f . Thedecompositi<strong>on</strong> group D P at P is the set of those g ∈ G which fix P . Let k bethe residue field Z/P <strong>and</strong> k = F p . Then the inertia group I P is the kernel of thesurjective map D P → Gal(k/k).Now I P ⊂ D P ⊂ Gal(Q/Q) <strong>and</strong> D P /I P is pro-cyclic (being isomorphic to theGalois group Gal(k/k)), so it is generated by a Frobenious automorphism Frob plying over p. One hastr(ρ l,f (Frob p )) = a p ∈ K ⊂ K ⊗ Q l<strong>and</strong>det(ρ l,f ) = χ k−1lεwhere, as explained below, χ l is the lth cyclotomic character <strong>and</strong> ε is the Dirichletcharacter associated to f.There is an incredible amount of “abuse of notati<strong>on</strong>” packed into the abovestatement. Let M = Q ker(ρ l,f )be the field fixed by the kernel of ρl,f . Then theFrobenius element Frob P (note P not p) is well defined as an element of Gal(M/Q),<strong>and</strong> the element Frob p is then <strong>on</strong>ly well defined up to c<strong>on</strong>jugacy. But this worksout since ρ l,f is well-defined <strong>on</strong> Gal(M/Q) (it kills Gal(Q/M)) <strong>and</strong> trace is welldefined<strong>on</strong> c<strong>on</strong>jugacy classes (tr(AB) = tr(BA) so tr(ABA −1 ) = T r(B)).2.2 CharactersLet f ∈ S k (N) be an eigenform for all <strong>Hecke</strong> operators. Then for all ( )a bc d ∈ SL2 (Z)with c ≡ 0 mod N we have( ) az + bf = (cz + d) k ε(d)f(z),cz + dwhere ε : (Z/NZ) ∗ → C ∗ is a Dirichlet character mod N. If f is also normalizedso that a 1 = 1, as in Secti<strong>on</strong> 1.4, then ε actually takes values in K ∗ .Let G = Gal(Q/Q), <strong>and</strong> let ϕ N be the mod N cyclotomic character so thatϕ N : G → (Z/NZ) ∗ takes g ∈ G to the automorphism induced by g <strong>on</strong> theNth cyclotomic extensi<strong>on</strong> Q(µ N ) of Q (where we identify Gal(Q(µ N )/Q) with(Z/NZ) ∗ ). Then what we called ε above in the formula det(ρ l ) = χ k−1lε is reallythe compositi<strong>on</strong>G ϕ N−−→ (Z/NZ) ∗ ε−→ C ∗ .


2.3 Parity c<strong>on</strong>diti<strong>on</strong>s 9For each positive integer ν we c<strong>on</strong>sider the l ν th cyclotomic character <strong>on</strong> G,ϕ l ν : G → (Z/l ν Z) ∗ .Putting these together gives the l-adic cyclotomic characterχ l : G → Z ∗ l .2.3 Parity c<strong>on</strong>diti<strong>on</strong>sLet c ∈ Gal(Q/Q) be complex c<strong>on</strong>jugati<strong>on</strong>. Then ϕ N (c) = −1 so ε(c) = ε(−1)<strong>and</strong> χ k−1l(c) = (−1) k−1 . Letting ( ) (a b = −1 0)0 −1 , for f ∈ Sk (N), we haveso (−1) k ε(−1) = 1. Thusc ′ df(z) = (−1) k ε(−1)f(z),det(ρ l,f (c)) = ε(−1)(−1) k−1 = −1.We say a representati<strong>on</strong> is odd if the determinant of complex c<strong>on</strong>jugati<strong>on</strong> is −1.Thus the representati<strong>on</strong> ρ l,f is odd.Remark 2.3.1 (Vague Questi<strong>on</strong>). How can <strong>on</strong>e recognize representati<strong>on</strong>s like ρ l,f“in nature”? F<strong>on</strong>taine <strong>and</strong> Mazur have made relevant c<strong>on</strong>jectures. The modularitytheorem can be reformulated by saying that for any representati<strong>on</strong> ρ l,E comingfrom an elliptic curve E there is an f so that ρ l,E∼ = ρl,f .2.4 C<strong>on</strong>jectures of Serre (mod l versi<strong>on</strong>)Suppose f is a modular form, l ∈ Z prime, λ a prime lying over l, <strong>and</strong> therepresentati<strong>on</strong>ρ λ,f : G → GL 2 (K λ )(c<strong>on</strong>structed by Serre-Deligne) is irreducible. Then ρ λ,f is c<strong>on</strong>jugate to a representati<strong>on</strong>with image in GL 2 (O λ ), where O λ is the ring of integers of K λ (seeSecti<strong>on</strong> 2.5 below). Reducing mod λ gives a representati<strong>on</strong>ρ λ,f : G → GL 2 (F λ )which has a well-defined trace <strong>and</strong> det, i.e., the det <strong>and</strong> trace do not depend <strong>on</strong>the choice of c<strong>on</strong>jugate representati<strong>on</strong> used to obtain the reduced representati<strong>on</strong>.One knows from representati<strong>on</strong> theory (the Brauer-Nesbitt theorem – see [CR62])that if such a representati<strong>on</strong> is semisimple then it is completely determined by itstrace <strong>and</strong> det (more precisely, it is determined by the characteristic polynomialsof all of its elements). Thus if ρ λ,f is irreducible (<strong>and</strong> hence semisimple) then it isunique in the sense that it does not depend <strong>on</strong> the choice of c<strong>on</strong>jugate.


10 2. <strong>Modular</strong> Representati<strong>on</strong>s <strong>and</strong> Algebraic Curves2.5 General remarks <strong>on</strong> mod p Galois representati<strong>on</strong>sFirst, what are semisimple <strong>and</strong> irreducible representati<strong>on</strong>s? Remember that a representati<strong>on</strong>ρ is a map from a group G to the endomorphisms of some vector spaceW (or a free module M if we are working over a ring instead of a field, but let’snot worry about that for now). A subspace W ′ of W is said to be invariant underρ if ρ takes W ′ back into itself. (The point is that if W ′ is invariant, then ρinduces representati<strong>on</strong>s <strong>on</strong> both W ′ <strong>and</strong> W/W ′ .) An irreducible representati<strong>on</strong> is<strong>on</strong>e whose <strong>on</strong>ly invariant subspaces are {0} <strong>and</strong> W . A semisimple representati<strong>on</strong>is <strong>on</strong>e where for every invariant subspace W ′ there is a complementary invariantsubspace W ′′ – that is, you can write ρ as the direct sum of ρ| W ′ <strong>and</strong> ρ| W ′′.Another way to say this is that if W ′ is an invariant subspace then we get ashort exact sequence0 → ρ| W/W ′ → ρ → ρ| W ′ → 0.Furthermore ρ is semisimple if <strong>and</strong> <strong>on</strong>ly if every such sequence splits.Note that irreducible representati<strong>on</strong>s are semisimple. As menti<strong>on</strong>ed above, twodimensi<strong>on</strong>alsemisimple Galois representati<strong>on</strong>s are uniquely determined (up to isomorphismclass) by their trace <strong>and</strong> determinant. In the case we are c<strong>on</strong>sidering,G = Gal(Q/Q) is compact, so the image of any Galois representati<strong>on</strong> ρ intoGL 2 (K λ ) is compact. Thus we can c<strong>on</strong>jugate it into GL 2 (O λ ). Irreducibility is notneeded for this.Now that we have a representati<strong>on</strong> into GL 2 (O λ ), we can reduce to get a representati<strong>on</strong>ρ to GL 2 (F λ ). This reduced representati<strong>on</strong> is not uniquely determinedby ρ, since we made a choice of basis (via c<strong>on</strong>jugati<strong>on</strong>) so that ρ would have imagein GL 2 (O λ ), <strong>and</strong> a different choice may lead to a n<strong>on</strong>-isomorphic representati<strong>on</strong>mod λ. However, the trace <strong>and</strong> determinant of a matrix are invariant under c<strong>on</strong>jugati<strong>on</strong>,so the trace <strong>and</strong> determinant of the reduced representati<strong>on</strong> ρ are uniquelydetermined by ρ.So we know the trace <strong>and</strong> determinant of the reduced representati<strong>on</strong>. If we alsoknew that it was semisimple, then we would know its isomorphism class, <strong>and</strong> wewould be d<strong>on</strong>e. So we would be happy if the reduced representati<strong>on</strong> is irreducible.And in fact, it is easy to see that if the reduced representati<strong>on</strong> is irreducible, thenρ must also be irreducible. Most ρ of interest to us in this book will be irreducible.Unfortunately, the opposite implicati<strong>on</strong> does not hold: ρ irreducible need not implythat ρ is irreducible.2.6 Serre’s c<strong>on</strong>jectureSerre has made the following c<strong>on</strong>jecture which is now a theorem (see [KW08]).C<strong>on</strong>jecture 2.6.1 (Serre). All 2-dimensi<strong>on</strong>al irreducible representati<strong>on</strong> of G overa finite field which are odd, i.e., det(σ(c)) = −1, c complex c<strong>on</strong>jugati<strong>on</strong>, are of theform ρ λ,f for some representati<strong>on</strong> ρ λ,f c<strong>on</strong>structed as above.Example 2.6.2. Let E/Q be an elliptic curve <strong>and</strong> let σ l : G → GL 2 (F l ) be therepresentati<strong>on</strong> induced by the acti<strong>on</strong> of G <strong>on</strong> the l-torsi<strong>on</strong> of E. Then det σ l = ϕ l isodd <strong>and</strong> σ l is usually irreducible, so Serre’s c<strong>on</strong>jecture implies that σ l is modular.From this <strong>on</strong>e can, assuming Serre’s c<strong>on</strong>jecture, prove that E is itself modular (see[Rib92]).


2.7 Wiles’s perspective 11Definiti<strong>on</strong> 2.6.3 (<strong>Modular</strong> representati<strong>on</strong>). Let σ : G → GL 2 (F) (F is a finitefield) be an irreducible representati<strong>on</strong> of the Galois group G. Then we say thatthe representati<strong>on</strong> σ is modular if there is a modular form f, a prime λ, <strong>and</strong> anembedding F ↩→ F λ such that σ ∼ = ρ λ,f over F λ .For more details, see Chapter 21 <strong>and</strong> [RS01].2.7 Wiles’s perspectiveSuppose E/Q is an elliptic curve <strong>and</strong> ρ l,E : G → GL 2 (Z l ) the associated l-adic representati<strong>on</strong> <strong>on</strong> the Tate module T l . Then by reducing we obtain a mod lrepresentati<strong>on</strong>ρ l,E = σ l,E : G → GL 2 (F l ).If we can show this representati<strong>on</strong> is modular for infinitely many l then we willknow that E is modular.Theorem 2.7.1 (Langl<strong>and</strong>s <strong>and</strong> Tunnel). If σ 2,E <strong>and</strong> σ 3,E are irreducible, thenthey are modular.This is proved by using that GL 2 (F 2 ) <strong>and</strong> GL 2 (F 3 ) are solvable so we may applysomething called “base change for GL 2 .”Theorem 2.7.2 (Wiles). If ρ is an l-adic representati<strong>on</strong> which is irreducible <strong>and</strong>modular mod l with l > 2 <strong>and</strong> certain other reas<strong>on</strong>able hypothesis are satisfied,then ρ itself is modular.


12 2. <strong>Modular</strong> Representati<strong>on</strong>s <strong>and</strong> Algebraic Curves


3<strong>Modular</strong> <strong>Forms</strong> of Level 1In this chapter, we view modular forms of level 1 both as holomorphic functi<strong>on</strong>s<strong>on</strong> the upper half plane <strong>and</strong> functi<strong>on</strong>s <strong>on</strong> lattices. We then define <strong>Hecke</strong> operators<strong>on</strong> modular forms, <strong>and</strong> derive explicit formulas for the acti<strong>on</strong> of <strong>Hecke</strong> operators<strong>on</strong> q-expansi<strong>on</strong>s. An excellent reference for the theory of modular forms of level 1is Serre [Ser73, Ch. 7].3.1 The Definiti<strong>on</strong>Let k be an integer. The space S k = S k (1) of cusp forms of level 1 <strong>and</strong> weight kc<strong>on</strong>sists of all functi<strong>on</strong>s f that are holomorphic <strong>on</strong> the upper half plane h <strong>and</strong> suchthat for all ( )a bc d ∈ SL2 (Z) <strong>on</strong>e has( ) aτ + bf = (cτ + d) k f(τ), (3.1.1)cτ + d<strong>and</strong> f vanishes at infinity, in a sense which we will now make precise. The matrix( 1 0 11 ) is in SL 2(Z), so f(τ + 1) = f(τ). Thus f passes to a well-defined functi<strong>on</strong>of q(τ) = e 2πiτ . Since for τ ∈ h we have |q(τ)| < 1, we may view f(z) = F (q) asa functi<strong>on</strong> of q <strong>on</strong> the punctured open unit disc {q : 0 < |q| < 1}. The c<strong>on</strong>diti<strong>on</strong>that f(τ) vanishes at infinity means that F (q) extends to a holomorphic functi<strong>on</strong><strong>on</strong> the open disc {q : |q| < 1} so that F (0) = 0. Because holomorphic functi<strong>on</strong>sare represented by power series, there is a neighborhood of 0 such thatf(q) =∞∑a n q n ,n=1so for all τ ∈ h with sufficiently large imaginary part (but see Remark 3.1.1 below),f(τ) = ∑ ∞n=1 a ne 2πinτ .


14 3. <strong>Modular</strong> <strong>Forms</strong> of Level 1It will also be useful to c<strong>on</strong>sider the slightly larger space M k (1) of holomorphicfuncti<strong>on</strong>s <strong>on</strong> h that transform as above <strong>and</strong> are merely required to be holomorphicat infinity.Remark 3.1.1. In fact, the series ∑ ∞n=1 a ne 2πinτ c<strong>on</strong>verges for all τ ∈ h. This isbecause the Fourier coefficients a n are O(n k/2 ) (see [Miy89, Cor. 2.1.6, pg. 43]).Remark 3.1.2. In [Ser73, Ch. 7], the weight is defined in the same way, but in thenotati<strong>on</strong> our k is twice his k.3.2 Some examples <strong>and</strong> c<strong>on</strong>jecturesThe space S k (1) of cusp forms is a finite-dimensi<strong>on</strong>al complex vector space. For keven we have dim S k (1) = ⌊k/12⌋ if k ≢ 2 (mod 12) <strong>and</strong> ⌊k/12⌋ − 1 if k ≡ 2(mod 12), except when k = 2 in which case the dimensi<strong>on</strong> is 0. For even k, thespace M k (1) has dimensi<strong>on</strong> 1 more than the dimensi<strong>on</strong> of S k (1), except when k = 2when both have dimensi<strong>on</strong> 0. (For proofs, see, e.g., [Ser73, Ch. 7, §3].)By the dimensi<strong>on</strong> formula menti<strong>on</strong>ed above, the first interesting example is thespace S 12 (1), which is a 1-dimensi<strong>on</strong>al space spanned by∆(q) = q∞∏(1 − q n ) 24n=1= q − 24q 2 + 252q 3 − 1472q 4 + 4830q 5 − 6048q 6 − 16744q 7 + 84480q 8− 113643q 9 − 115920q 10 + 534612q 11 − 370944q 12 − 577738q 13 + · · ·That ∆ lies in S 12 (1) is proved in [Ser73, Ch. 7, §4.4] by expressing ∆ in termsof elements of M 4 (1) <strong>and</strong> M 6 (1), <strong>and</strong> computing the q-expansi<strong>on</strong> of the resultingexpressi<strong>on</strong>.Example 3.2.1. We compute the q-expansi<strong>on</strong> of ∆ in Sage:sage : delta_qexp (7)q - 24* q^2 + 252* q^3 - 1472* q^4 + 4830* q^5 - 6048* q^6 + O(q ^7)In Sage, computing delta_qexp(10^6) <strong>on</strong>ly takes a few sec<strong>on</strong>ds, <strong>and</strong> computing upto precisi<strong>on</strong> 10^8 is even reas<strong>on</strong>able. Sage does not use the formula q ∏ ∞n=1 (1−qn ) 24given above, which would take a very l<strong>on</strong>g time to directly evaluate, but insteaduses the identity⎛⎞∆(q) = ⎝ ∑ 8(−1) n (2n + 1)q n(n+1)/2 ⎠ ,n≥0<strong>and</strong> computes the 8th power using asymptotically fast polynomial arithmetic inZ[q], which involves a discrete fast Fourier transform (implemented in [HH]).The Ramanujan τ functi<strong>on</strong> τ(n) assigns to n the nth coefficient of ∆(q).C<strong>on</strong>jecture 3.2.2 (Lehmer). τ(n) ≠ 0 for all n ≥ 1.This c<strong>on</strong>jecture has been verified for n ≤ 22798241520242687999 (Bosman, 2007– see http://en.wikipedia.org/wiki/Tau-functi<strong>on</strong>).


3.3 <strong>Modular</strong> forms as functi<strong>on</strong>s <strong>on</strong> lattices 15Theorem 3.2.3 (Edixhoven et al.). Let p be a prime. There is a polynomial timealgorithm to compute τ(p), polynomial in the number of digits of p.Edixhoven’s idea is to use l-adic cohomology <strong>and</strong> Arakelov theory to find ananalogue of the Schoof-Elkies-Atkin algorithm (which counts the number N q ofpoints <strong>on</strong> an elliptic curves over a finite field F q by computing N q mod l formany primes l). Here’s some of what Edixhoven has to say about his result:“You need to compute <strong>on</strong> varying curves such as X 1 (l) for l up tolog(p) say. An important by-product of my method is the computati<strong>on</strong>of the mod l Galois representati<strong>on</strong>s associated to ∆ in time polynomialin l. So, it should be seen as an attempt to make the Langl<strong>and</strong>scorresp<strong>on</strong>dence for GL 2 over Q available computati<strong>on</strong>ally.”If f ∈ M k (1) <strong>and</strong> g ∈ M k ′(1), then it is easy to see from the definiti<strong>on</strong>s thatfg ∈ M k+k ′(1). Moreover, ⊕ k≥0 M k(1) is a commutative graded ring generatedfreely by E 4 = 1+240 ∑ ∞n=1 σ 3(n)q n <strong>and</strong> E 6 = 1−504 ∑ ∞n=1 σ 5(n)q n , where σ d (n)is the sum of the dth powers of the positive divisors of n (see [Ser73, Ch.7, §3.2]).Example 3.2.4. Because E 4 <strong>and</strong> E 6 generate, it is straightforward to write downa basis for any space M k (1). For example, the space M 36 (1) has basisf 1 = 1 + 6218175600q 4 + 15281788354560q 5 + · · ·f 2 = q + 57093088q 4 + 37927345230q 5 + · · ·f 3 = q 2 + 194184q 4 + 7442432q 5 + · · ·f 4 = q 3 − 72q 4 + 2484q 5 + · · ·3.3 <strong>Modular</strong> forms as functi<strong>on</strong>s <strong>on</strong> latticesIn order to define <strong>Hecke</strong> operators, it will be useful to view modular forms asfuncti<strong>on</strong>s <strong>on</strong> lattices in C.Definiti<strong>on</strong> 3.3.1 (Lattice). A lattice L ⊂ C is a subgroup L = Zω 1 + Zω 2 forwhich ω 1 , ω 2 ∈ C are linearly independent over R.We may assume that ω 1 /ω 2 ∈ h = {z ∈ C : Im(z) > 0}. Let R be the set of alllattices in C. Let E be the set of isomorphism classes of pairs (E, ω), where E isan elliptic curve over C <strong>and</strong> ω ∈ Ω 1 E is a n<strong>on</strong>zero holomorphic differential 1-form<strong>on</strong> E. Two pairs (E, ω) <strong>and</strong> (E ′ , ω ′ ) are isomorphic if there is an isomorphismϕ : E → E ′ such that ϕ ∗ (ω ′ ) = ω.Propositi<strong>on</strong> 3.3.2. There is a bijecti<strong>on</strong> between R <strong>and</strong> E under which L ∈ Rcorresp<strong>on</strong>ds to (C/L, dz) ∈ E.Proof. We describe the maps in each directi<strong>on</strong>, but leave the proof that they inducea well-defined bijecti<strong>on</strong> as an exercise for the reader [[add ref to actual exercise]].Given L ∈ R, by Weierstrass theory the quotient C/L is an elliptic curve, which isequipped with the distinguished differential ω induced by the differential dz <strong>on</strong> C.C<strong>on</strong>versely, if E is an elliptic curve over C <strong>and</strong> ω ∈ Ω 1 E is a n<strong>on</strong>zero differential,we obtain a lattice L in C by integrating homology classes:{∫}L = L ω = ω : γ ∈ H 1 (E(C), Z) .γ


16 3. <strong>Modular</strong> <strong>Forms</strong> of Level 1LetB = {(ω 1 , ω 2 ) : ω 1 , ω 2 ∈ C, ω 1 /ω 2 ∈ h} ,be the set of ordered basis of lattices in C, ordered so that ω 1 /ω 2 ∈ h. There is aleft acti<strong>on</strong> of SL 2 (Z) <strong>on</strong> B given by( a bc d)(ω1 , ω 2 ) ↦→ (aω 1 + bω 2 , cω 1 + dω 2 )<strong>and</strong> SL 2 (Z)\B ∼ = R. (The acti<strong>on</strong> is just the left acti<strong>on</strong> of matrices <strong>on</strong> columnvectors, except we write (ω 1 , ω 2 ) as a row vector since it takes less space.)Give a modular form f ∈ M k (1), associate to f a functi<strong>on</strong> F : R → C as follows.First, <strong>on</strong> lattices of the special form Zτ + Z, for τ ∈ h, let F (Zτ + Z) = f(τ).In order to extend F to a functi<strong>on</strong> <strong>on</strong> all lattices, note that F satisfies thehomogeneity c<strong>on</strong>diti<strong>on</strong> F (λL) = λ −k F (L), for any λ ∈ C <strong>and</strong> L ∈ R. ThenF (Zω 1 + Zω 2 ) = ω −k2 F (Zω 1/ω 2 + Z) := ω −k2 f(ω 1/ω 2 ).That F is well-defined exactly amounts to the transformati<strong>on</strong> c<strong>on</strong>diti<strong>on</strong> (3.1.1)that f satisfies.Lemma 3.3.3. The lattice functi<strong>on</strong> F : R → C associated to f ∈ M k (1) is welldefined.Proof. Suppose Zω 1 + Zω 2 = Zω 1 ′ + Zω 2 ′ with ω 1 /ω 2 <strong>and</strong> ω 1/ω ′ 2 ′ both in h. Wemust verify that ω2 −k f(ω 1/ω 2 ) = (ω 2) ′ −k f(ω 1/ω ′ 2). ′ There exists ( )a bc d ∈ SL2 (Z)such ( that ω 1 ′ = aω 1 + bω 2 <strong>and</strong> ω 2 ′ = cω 1 + dω 2 . Dividing, we see that ω 1/ω ′ 2 ′ =a b)c d (ω1 /ω 2 ). Because f is a weight k modular form, we have( ) (( ) ( )) (ω′f 1 a b ω1= f= c ω ) k ( )1 ω1+ d f .c d ω 2 ω 2 ω 2ω ′ 2Multiplying both sides by ω2 k yields( ) ωω2 k ′f 1= (cω 1 + dω 2 ) k fω ′ 2(ω1ω 2).Observing that ω ′ 2 = cω 1 + dω 2 <strong>and</strong> dividing again completes the proof.Since f(τ) = F (Zτ +Z), we can recover f from F , so the map f ↦→ F is injective.Moreover, it is surjective in the sense that if F is homogeneous of degree −k, thenF arises from a functi<strong>on</strong> f : h → C that transforms like a modular form. Moreprecisely, if F : R → C satisfies the homogeneity c<strong>on</strong>diti<strong>on</strong> F (λL) = λ −k F (L),then the functi<strong>on</strong> f : h → C defined by f(τ) = F (Zτ +Z) transforms like a modularform of weight k, as the following computati<strong>on</strong> shows: For any ( )a bc d ∈ SL2 (Z) <strong>and</strong>τ ∈ h, we havef( ) (aτ + b= F Z aτ + b )cτ + d cτ + d + Z= F ((cτ + d) −1 (Z(aτ + b) + Z(cτ + d)))= (cτ + d) k F (Z(aτ + b) + Z(cτ + d))= (cτ + d) k F (Zτ + Z)= (cτ + d) k f(τ).


3.4 <strong>Hecke</strong> operators 17Say that a functi<strong>on</strong> F : R → C is holomorphic <strong>on</strong> h ∪ {∞} if the functi<strong>on</strong> f(τ) =F (Zτ + Z) is. We summarize the above discussi<strong>on</strong> in a propositi<strong>on</strong>.Propositi<strong>on</strong> 3.3.4. There is a bijecti<strong>on</strong> between M k (1) <strong>and</strong> functi<strong>on</strong>s F : R →C that are homogeneous of degree −k <strong>and</strong> holomorphic <strong>on</strong> h ∪ {∞}. Under thisbijecti<strong>on</strong> F : R → C corresp<strong>on</strong>ds to f(τ) = F (Zτ + Z).3.4 <strong>Hecke</strong> operatorsDefine a map T n from the free abelian group generated by all C-lattices into itselfbyT n (L) =∑ L ′ ⊂L[L:L ′ ]=nwhere the sum is over all sublattices L ′ ⊂ L of index n. For any functi<strong>on</strong> F : R → C<strong>on</strong> lattices, define T n (F ) : R → C by(T n (F ))(L) = n k−1 ∑L ′ ,L ′ ⊂L[L:L ′ ]=nF (L ′ ).If F is homogeneous of degree −k, then T n (F ) is also homogeneous of degree −k.We will next show that (n, m) = 1 implies T n T m = T nm <strong>and</strong> T p k is a polynomialin Z[T p ] (see [Ser73, Cor. 1, pg. 99]); the essential case to c<strong>on</strong>sider is n prime.Suppose L ′ ⊂ L with [L : L ′ ] = n. Then every element of L/L ′ has orderdividing n, so nL ⊂ L ′ ⊂ L <strong>and</strong>L ′ /nL ⊂ L/nL ≈ (Z/nZ) 2 .Thus the subgroups of (Z/nZ) 2 of order n corresp<strong>on</strong>d to the sublattices L ′ of L ofindex n. When n = l is prime, there are l + 1 such subgroups, since the subgroupscorresp<strong>on</strong>d to n<strong>on</strong>zero vectors in F l modulo scalar equivalence, <strong>and</strong> there are(l 2 − 1)/(l − 1) = l + 1 of them.Recall from Propositi<strong>on</strong> 3.3.2 that there is a bijecti<strong>on</strong> between the set R oflattices in C <strong>and</strong> the set E of isomorphism classes of pairs (E, ω), where E is anelliptic curve over C <strong>and</strong> ω is a n<strong>on</strong>zero differential <strong>on</strong> E.Suppose F : R → C is homogeneous of degree −k, so F (λL) = λ −k F (L). Thenwe may also view T l as a sum over lattices that c<strong>on</strong>tain L with index l, as follows.Suppose L ′ ⊂ L is a sublattice of index l <strong>and</strong> set L ′′ = l −1 L ′ . Then we have achain of inclusi<strong>on</strong>slL ⊂ L ′ ⊂ L ⊂ l −1 L ′ = L ′′ .Since [l −1 L ′ : L ′ ] = l 2 <strong>and</strong> [L : L ′ ] = l, it follows that [L ′′ : L] = l. Because F ishomogeneous of degree −k,T l (F )(L) = l k−1∑[L:L ′ ]=lF (L ′ ) = 1 l∑[L ′′ :L]=lF (L ′′ ). (3.4.1)


18 3. <strong>Modular</strong> <strong>Forms</strong> of Level 13.5 <strong>Hecke</strong> operators directly <strong>on</strong> q-expansi<strong>on</strong>sRecall that the nth <strong>Hecke</strong> operator T n of weight k <strong>on</strong> the free abelian group <strong>on</strong>lattices is given by∑T n (L) = n k−1 L ′ .L ′ ⊂L[L:L ′ ]=n<strong>Modular</strong> forms of weight k corresp<strong>on</strong>d to holomorphic functi<strong>on</strong>s of degree −k <strong>on</strong>lattices, <strong>and</strong> each T n extends to an operator <strong>on</strong> these functi<strong>on</strong>s <strong>on</strong> lattices, so T ndefines <strong>on</strong> operator <strong>on</strong> M k (1).A holomorphic functi<strong>on</strong> <strong>on</strong> the unit disk is determined by its Fourier expansi<strong>on</strong>,so Fourier expansi<strong>on</strong> defines an injective map M k (1) ↩→ C[[q]]. In this secti<strong>on</strong>, wedescribe T n ( ∑ a m q m ) explicitly as a q-expansi<strong>on</strong>.3.5.1 Explicit descripti<strong>on</strong> of sublatticesIn order to describe T n more explicitly, we enumerate the sublattices L ′ ⊂ L ofindex n. More precisely, we give a basis for each L ′ in terms of a basis for L. Notethat L/L ′ is a group of order n <strong>and</strong>L ′ /nL ⊂ L/nL = (Z/nZ) 2 .Write L = Zω 1 + Zω 2 , let Y 2 be the cyclic subgroup of L/L ′ generated by ω 2 <strong>and</strong>let d = #Y 2 . If Y 1 = (L/L ′ )/Y 2 , then Y 1 is generated by the image of ω 1 , so it is acyclic group of order a = n/d. Our goal is to exhibit a basis of L ′ . Let ω 2 ′ = dω 2 ∈ L ′<strong>and</strong> use that Y 1 is generated by the image of ω 1 to write aω 1 = ω 1 ′ − bω 2 for someinteger b <strong>and</strong> some ω 1 ′ ∈ L ′ . Since b is <strong>on</strong>ly well-defined modulo d we may assume0 ≤ b ≤ d − 1. Thus( ω′ ) (1 a b)( ω1)=0 d ω 2<strong>and</strong> the change of basis matrix has determinant ad = n. Sinceω ′ 2Zω ′ 1 + Zω ′ 2 ⊂ L ′ ⊂ L = Zω 1 + Zω 2<strong>and</strong> [L : Zω ′ 1 + Zω ′ 2] = n (since the change of basis matrix has determinant n) <strong>and</strong>[L : L ′ ] = n we see that L ′ = Zω ′ 1 + Zω ′ 2.Propositi<strong>on</strong> 3.5.1. Let n be a positive integer. There is a <strong>on</strong>e-to-<strong>on</strong>e corresp<strong>on</strong>dencebetween sublattices L ′ ⊂ L of index n <strong>and</strong> matrices ( )a b0 d with ad = n <strong>and</strong>0 ≤ b ≤ d − 1.Proof. The corresp<strong>on</strong>dence is described above. To check that it is a bijecti<strong>on</strong>, wejust need to show that if γ = ( )a b0 d <strong>and</strong> γ ′ = ( )a ′ b ′0 d are two matrices satisfying′the listed c<strong>on</strong>diti<strong>on</strong>s, <strong>and</strong>Z(aω 1 + bω 2 ) + Zdω 2 = Z(aω ′ 1 + bω ′ 2) + Zdω ′ 2,then γ = γ ′ . Equivalently, if σ ∈ SL 2 (Z) <strong>and</strong> σγ = γ ′ , then σ = 1. To see this, wecomputeσ = γ ′ γ −1 = 1 ( )a ′ d ab ′ − a ′ bn 0 ad ′ .


3.5 <strong>Hecke</strong> operators directly <strong>on</strong> q-expansi<strong>on</strong>s 19Since σ ∈ SL 2 (Z), we have n | a ′ d, <strong>and</strong> n | ad ′ , <strong>and</strong> aa ′ dd ′ = n 2 . If a ′ d > n, thenbecause aa ′ dd ′ = n 2 , we would have ad ′ < n, which would c<strong>on</strong>tradict the factthat n | ad ′ ; also, a ′ d < n is impossible since n | a ′ d. Thus a ′ d = n <strong>and</strong> likewisead ′ = n. Since ad = n as well, it follows that a ′ = a <strong>and</strong> d ′ = d, so σ = ( 1 0 1 t ) forsome t ∈ Z. Then σγ = ( )a b+dt0 d , which implies that t = 0, since 0 ≤ b ≤ d − 1<strong>and</strong> 0 ≤ b + dt ≤ d − 1.Remark 3.5.2. As menti<strong>on</strong>ed earlier, when n = l is prime, there are l+1 sublatticesof index l. In general, the number of such sublattices is the sum of the positivedivisors of n (exercise) 1 . 13.5.2 <strong>Hecke</strong> operators <strong>on</strong> q-expansi<strong>on</strong>sRecall that if f ∈ M k (1), then f is a holomorphic functi<strong>on</strong> <strong>on</strong> h ∪ {∞} such thatf(τ) = f( ) aτ + b(cτ + d) −kcτ + dfor all ( a bc d)∈ SL2 (Z). Using Fourier expansi<strong>on</strong> we write∞∑f(τ) = c m e 2πiτm ,m=0<strong>and</strong> say f is a cusp form if c 0 = 0. Also, there is a bijecti<strong>on</strong> between modularforms f of weight k <strong>and</strong> holomorphic lattice functi<strong>on</strong>s F : R → C that satisfyF (λL) = λ −k F (L); under this bijecti<strong>on</strong> F corresp<strong>on</strong>ds to f(τ) = F (Zτ + Z).Now assume f(τ) = ∑ ∞m=0 c mq m is a modular form with corresp<strong>on</strong>ding latticefuncti<strong>on</strong> F . Using the explicit descripti<strong>on</strong> of sublattices from Secti<strong>on</strong> 3.5.1 above,we can describe the acti<strong>on</strong> of the <strong>Hecke</strong> operator T n <strong>on</strong> the Fourier expansi<strong>on</strong> of1 Put reference to actual exercise


20 3. <strong>Modular</strong> <strong>Forms</strong> of Level 1f(τ), as follows:T n F (Zτ + Z) = n k−1∑a,b,dad=n0≤b≤d−1F ((aτ + b)Z + dZ)= n ∑ ( aτ + bk−1 d −k Fd= n ∑ ( ) aτ + bk−1 d −k fd∑= n k−1a,d,b,m= n k−1 ∑a,d,m)Z + Zd −k aτ+bc m e2πi( d )md 1−k c m e 2πiamτd1d= n ∑ k−1 d 1−k c dm ′e 2πiam′ τad=nm ′ ≥0= ∑a k−1 c dm ′q am′ .ad=nm ′ ≥0∑d−1(b=0e 2πimdIn the sec<strong>on</strong>d to the last expressi<strong>on</strong> we let m = dm ′ for m ′ ≥ 0, then use that thesum 1 ∑ d−1 2πimd b=0(e d ) b is <strong>on</strong>ly n<strong>on</strong>zero if d | m, in which case the sum equals 1.ThusT n f(q) = ∑a k−1 c dm q am .ad=nm≥0Put another way, if µ is a n<strong>on</strong>negative integer, then the coefficient of q µ is∑a|na|µa k−1 c nµa 2 .(To see this, let m = a/µ <strong>and</strong> d = n/a <strong>and</strong> substitute into the formula above.)Remark 3.5.3. When k ≥ 1 the coefficients of q µ for all µ bel<strong>on</strong>g to the Z-modulegenerated by the c m .Remark 3.5.4. Setting µ = 0 gives the c<strong>on</strong>stant coefficient of T n f which is∑a k−1 c 0 = σ k−1 (n)c 0 .a|nThus if f is a cusp form so is T n f. (T n f is holomorphic since its original definiti<strong>on</strong>is as a finite sum of holomorphic functi<strong>on</strong>s.)Remark 3.5.5. Setting µ = 1 shows that the coefficient of q in T n f is ∑ a|1 ak−1 c n =c n . As an immediate corollary we have the following important result.Corollary 3.5.6. If f is a cusp form such that T n f has 0 as coefficient of q forall n ≥ 1, then f = 0.) b


3.5 <strong>Hecke</strong> operators directly <strong>on</strong> q-expansi<strong>on</strong>s 21In the special case when n = p is prime, the acti<strong>on</strong> acti<strong>on</strong> of T p <strong>on</strong> the q-expansi<strong>on</strong> of f is given by the following formula:T p f = ∑ µ≥0∑a|pa|µa k−1 c nµa 2 qµ .Since p is prime, either a = 1 or a = p. When a = 1, c pµ occurs in the coefficientof q µ <strong>and</strong> when a = p, we can write µ = pλ <strong>and</strong> we get terms p k−1 c λ in q pλ . ThusT p f = ∑ µ≥0c pµ q µ + p k−1 ∑ λ≥0c λ q pλ .3.5.3 The <strong>Hecke</strong> algebra <strong>and</strong> eigenformsDefiniti<strong>on</strong> 3.5.7 (<strong>Hecke</strong> Algebra). The <strong>Hecke</strong> algebra T associated to M k (1) isthe subring of End(M k (1)) generated by the operators T n for all n. Similarly, the<strong>Hecke</strong> algebra associated to S k (1) is the subring of End(S k (1)) generated by all<strong>Hecke</strong> operators T n .The <strong>Hecke</strong> algebra is commutative because T p ν is a polynomial in T p <strong>and</strong> whengcd(n, m) = 1 we have T n T m = T nm = T mn = T m T n . Also, T is of finite rankover Z, because of Remark 3.5.3 <strong>and</strong> that the finite dimensi<strong>on</strong>al space S k (1) has abasis with q-expansi<strong>on</strong>s in Z[[q]].Definiti<strong>on</strong> 3.5.8 (Eigenform). An eigenform f ∈ M k (1) is a n<strong>on</strong>zero elementsuch that f is an eigenvector for every <strong>Hecke</strong> operator T n . If f ∈ S k (1) is aneigenform, then f is normalized if the coefficient of q in the q-expansi<strong>on</strong> of f is 1.We sometimes called a normalized cuspidal eigenform a newform.If f = ∑ ∞n=1 c nq n is a normalized eigenform, then Remark 3.5.5 implies thatT n (f) = c n f. Thus the coefficients of a newform are exactly the system of eigenvaluesof the <strong>Hecke</strong> operators acting <strong>on</strong> the newform.Remark 3.5.9. It follows from Victor Miller’s thesis [[ref my modform book??]]that T 1 , . . . , T n generate T ⊂ End(S k (1)), where n = dim S k (1).3.5.4 ExamplesWe compute the space of weight 12 modular forms of level 1, al<strong>on</strong>g with its cuspidalsubspace:sage : M = <strong>Modular</strong><strong>Forms</strong> (1 ,12 , prec =3)sage : M. basis ()[q - 24* q^2 + O(q^3) ,1 + 65520/691* q + 134250480/691* q^2 + O(q ^3)]sage : M. hecke_matrix (2)[ -24 0][ 0 2049]sage : S = M. cuspidal_subspace ()sage : S. hecke_matrix (2)


22 3. <strong>Modular</strong> <strong>Forms</strong> of Level 1[ -24]sage : factor (M. hecke_polynomial (2))(x - 2049) * (x + 24)We also compute the space of forms of weight 40:sage : M = <strong>Modular</strong><strong>Forms</strong> (1 ,40)sage : M. basis ()[q + 19291168* q^4 + 37956369150* q^5 + O(q^6) ,q^2 + 156024* q^4 + 57085952* q^5 + O(q^6) ,q^3 + 168* q^4 - 12636* q^5 + O(q^6) ,1 + 1082400/261082718496449122051* q + ...]sage : M. hecke_matrix (2)[ 0 549775105056 14446985236992 0][ 1 156024 1914094476 0][ 0 168 392832 0][ 0 0 0 549755813889]sage : factor (M. hecke_polynomial (2))(x - 549755813889) *(x^3 - 548856* x^2 - 810051757056* x + 213542160549543936)3.6 Two C<strong>on</strong>jectures about <strong>Hecke</strong> operators <strong>on</strong> level 1modular forms3.6.1 Maeda’s c<strong>on</strong>jectureC<strong>on</strong>jecture 3.6.1 (Maeda). Let k be a positive integer such that S k (1) has positivedimensi<strong>on</strong> <strong>and</strong> let T ⊂ End(S k (1)) be the <strong>Hecke</strong> algebra. Then there is <strong>on</strong>ly <strong>on</strong>eGal(Q/Q) orbit of normalized eigenforms of level 1.There is some numerical evidence for this c<strong>on</strong>jecture. It is true for k ≤ 2000,according to [FJ02]. The MathSciNet reviewer of [FJ02] said “In the present paperthe authors take a big step forward towards proving Maeda’s c<strong>on</strong>jecture in theaffirmative by establishing that the <strong>Hecke</strong> polynomial T p,k (x) is irreducible <strong>and</strong>has full Galois group over Q for k ≤ 2000 <strong>and</strong> p < 2000, p prime.” Using Sage,Alex Ghitza verified the c<strong>on</strong>jecture for k ≤ 4096 (see [Ghi]). Buzzard shows in[Buz96] that for the weights k ≤ 228 with k/12 a prime, the Galois group of thecharacteristic polynomial of T 2 is the full symmetric group, <strong>and</strong> is, in particular,irreducible.3.6.2 The Gouvea-Mazur c<strong>on</strong>jectureFix a prime p, <strong>and</strong> let F p,k ∈ Z[x] be the characteristic polynomial of T p acting <strong>on</strong>M k (1). The slopes of F p,k are the p-adic valuati<strong>on</strong>s ord p (α) ∈ Q of the roots α ∈ Q p


3.6 Two C<strong>on</strong>jectures about <strong>Hecke</strong> operators <strong>on</strong> level 1 modular forms 23of F p,k . They can be computed easily using Newt<strong>on</strong> polyg<strong>on</strong>s. 2 For example, the 2p = 5 slopes for F 5,12 are 0, 1, 1, for F 5,12+4·5 they are 0, 1, 1, 4, 4, <strong>and</strong> for F 5,12+4·5 2they are 0, 1, 1, 5, 5, 5, 5, 5, 5, 10, 10, 11, 11, 14, 14, 15, 15, 16, 16.sage : def s(k,p):... M = <strong>Modular</strong><strong>Forms</strong> (1 ,k)... v = M. hecke_polynomial (p). newt<strong>on</strong>_slopes (p)... return list ( sorted (v))sage : s (12 ,5)[0 , 1]sage : s (12 + 4*5 , 5)[0 , 1, 4]sage : s (12 + 4*5^2 , 5)[0 , 1, 5, 5, 5, 10 , 11 , 14 , 15 , 16]sage : s (12 + 4*5^3 , 5) # l<strong>on</strong>g time!! WAY TOO SLOW -- TODO -- see trac 9749 !!Instead, we compute the slopes more directly as follows (this is fast):sage : def s(k,p):... d = dimensi<strong>on</strong>_modular_forms (1 , k)... B = victor_miller_basis (k, p*d +1)... T = hecke_operator_<strong>on</strong>_basis (B, p, k)... return list ( sorted (T. charpoly (). newt<strong>on</strong>_slopes (p )))sage : s (12 ,5)[0 , 1]sage : s (12 + 4*5 , 5)[0 , 1, 4]sage : s (12 + 4*5^2 , 5)[0 , 1, 5, 5, 5, 10 , 11 , 14 , 15 , 16]sage : s (12 + 4*5^3 , 5)[0 , 1, 5, 5, 5, 10 , 11 , 14 , 15 , 16 , 20 , 21 , 24 , 25 , 27 ,30 , 31 , 34 , 36 , 37 , 40 , 41 , 45 , 46 , 47 , 50 , 51 , 55 , 55 ,55 , 59 , 60 , 63 , 64 , 65 , 69 , 70 , 73 , 74 , 76 , 79 , 80 , 83]Let d(k, α, p) be the multiplicity of α as a slope of F p,k .C<strong>on</strong>jecture 3.6.2 (Gouvea-Mazur, 1992). Fix a prime p <strong>and</strong> a n<strong>on</strong>negative rati<strong>on</strong>alnumber α. Suppose k 1 <strong>and</strong> k 2 are integers with k 1 , k 2 ≥ 2α + 2, <strong>and</strong> k 1 ≡ k 2(mod p n (p − 1)) for some integer n ≥ α. Then d(k 1 , α, p) = d(k 2 , α, p).Notice that the above examples, with p = 5 <strong>and</strong> k 1 = 12, are c<strong>on</strong>sistent withthis c<strong>on</strong>jecture. However, it came as a huge surprise that the c<strong>on</strong>jecture is false ingeneral!Frank Calegari <strong>and</strong> Kevin Buzzard [BC04] found the first counterexample, whenp = 59, k 1 = 16, α = 1, <strong>and</strong> k 2 = 16 + 59 · 58 = 3438. We have d(16, 0, 59) = 0,d(16, 1, 59) = 1, <strong>and</strong> d(16, α, 59) = 0 for all other α. However, initial computati<strong>on</strong>sstr<strong>on</strong>gly suggest (but do not prove!) that d(3438, 1, 59) = 2. It is a finite, butdifficult, computati<strong>on</strong> to decide what d(3438, 1, 59) really is (see Secti<strong>on</strong> 3.7). Using2 Jared Weinstein suggests we add some background explaining newt<strong>on</strong> polyg<strong>on</strong>s <strong>and</strong> why theyare helpful.


24 3. <strong>Modular</strong> <strong>Forms</strong> of Level 1a trace formula, Calegari <strong>and</strong> Buzzard at least showed that either d(3438, 1, 59) ≥2 or there exists α < 1 such that d(3438, α, 59) > 0, both of which c<strong>on</strong>tradictC<strong>on</strong>jecture 3.6.2.There are many theorems about more general formulati<strong>on</strong>s of the Gouvea-Mazurc<strong>on</strong>jecture, <strong>and</strong> a whole geometric theory “the Eigencurve” [CM98] that helpsexplain it, but discussing this further is bey<strong>on</strong>d the scope of this book.3.7 An Algorithm for computing characteristicpolynomials of <strong>Hecke</strong> operatorsIn computati<strong>on</strong>al investigati<strong>on</strong>s, it is frequently useful to compute the characteristicpolynomial of the <strong>Hecke</strong> operator T p,k of T p acting <strong>on</strong> S k (1). This can beaccomplished in several ways, each of which has advantages. The Eichler-Selbergtrace formula (see Zagier’s appendix to [Lan95, Ch. III]), can be used to computethe trace of T n,k , for n = 1, p, p 2 , . . . , p d−1 , where d = dim S k (1), <strong>and</strong> from thesetraces it is straightforward to recover the characteristic polynomial of T p,k . Usingthe trace formula, the time required to compute Tr(T n,k ) grows “very quickly”in n (though not in k), so this method becomes unsuitable when the dimensi<strong>on</strong>is large or p is large, since p d−1 is huge. Another alternative is to use modularsymbols of weight k, as in [Mer94], but if <strong>on</strong>e is <strong>on</strong>ly interested in characteristicpolynomials, little is gained over more naive methods (modular symbols are mostuseful for investigating special values of L-functi<strong>on</strong>s).In this secti<strong>on</strong>, we describe an algorithm to compute the characteristic polynomialof the <strong>Hecke</strong> operator T p,k , which is adapted for the case when p > 2. It couldbe generalized to modular forms for Γ 1 (N), given a method to compute a basisof q-expansi<strong>on</strong>s to “low precisi<strong>on</strong>” for the space of modular forms of weight k <strong>and</strong>level N. By “low precisi<strong>on</strong>” we mean to precisi<strong>on</strong> O(q dp+1 ), where T 1 , T 2 , . . . , T dgenerate the <strong>Hecke</strong> algebra T as a ring. The algorithm described here uses nothingmore than the basics of modular forms <strong>and</strong> some linear algebra; in particular, notrace formulas or modular symbols are involved.3.7.1 Review of basic facts about modular formsWe briefly recall the background for this secti<strong>on</strong>. Fix an even integer k. Let M k (1)denote the space of weight k modular forms for SL 2 (Z) <strong>and</strong> S k (1) the subspace ofcusp forms. Thus M k (1) is a C-vector space that is equipped with a ringT = Z[. . . T p,k . . .] ⊂ End(M k (1))of <strong>Hecke</strong> operators. Moreover, there is an injective q-expansi<strong>on</strong> map M k (1) ↩→C[[q]]. For example, when k ≥ 4 there is an Eisenstein series E k , which lies inM k (1). The first two Eisenstein series areE 4 (q) = 1240 + ∑ σ 3 (n)q n <strong>and</strong> E 6 (q) = 1504 + ∑ σ 5 (n)q n ,n≥1n≥1


3.7 An Algorithm for computing characteristic polynomials of <strong>Hecke</strong> operators 25where q = e 2πiz , σ k−1 (n) is the sum of the k − 1st power of the positive divisors.For every prime number p, the <strong>Hecke</strong> operator T p,k acts <strong>on</strong> M k (1) by⎛ ⎞T p,k⎝ ∑ a n q n ⎠ = ∑ a np q n + p k−1 a n q np . (3.7.1)n≥0n≥0Propositi<strong>on</strong> 3.7.1. The set of modular forms E a 4 E b 6 is a basis for M k (1), where a<strong>and</strong> b range through n<strong>on</strong>negative integers such that 4a+6b = k. Moreover, S k (1) isthe subspace of M k (1) of elements whose q-expansi<strong>on</strong>s have c<strong>on</strong>stant coefficient 0.3.7.2 The Naive approachLet k be an even positive integer <strong>and</strong> p be a prime. Our goal is to compute thecharacteristic polynomial of the <strong>Hecke</strong> operator T p,k acting <strong>on</strong> S k (1). In practice,when k <strong>and</strong> p are both reas<strong>on</strong>ably large, e.g., k = 886 <strong>and</strong> p = 59, then the coefficientsof the characteristic polynomial are huge (the roots of the characteristicpolynomial are O(p k/2−1 )). A naive way to compute the characteristic polynomialof T p,k is to use (3.7.1) to compute the matrix [T p,k ] of T p,k <strong>on</strong> the basis of Propositi<strong>on</strong>3.7.1, where E 4 <strong>and</strong> E 6 are computed to precisi<strong>on</strong> p dim M k (1), <strong>and</strong> to thencompute the characteristic polynomial of [T p,k ] using, e.g., a modular algorithm(compute the characteristic polynomial modulo many primes, <strong>and</strong> use the ChineseRemainder Theorem). The difficulty with this approach is that the coefficientsof the q-expansi<strong>on</strong>s of E a 4 E b 6 to precisi<strong>on</strong> p dim M k (1) quickly become enormous,so both storing them <strong>and</strong> computing with them is costly, <strong>and</strong> the comp<strong>on</strong>ents of[T p,k ] are also huge so the characteristic polynomial is difficult to compute. SeeExample 3.2.4 above, where the coefficients of the q-expansi<strong>on</strong>s are already large.3.7.3 The Eigenform methodWe now describe another approach to computing characteristic polynomials, whichgets just the informati<strong>on</strong> required. Recall Maeda’s c<strong>on</strong>jecture from Secti<strong>on</strong> 3.6.1,which asserts that S k (1) is spanned by the Gal(Q/Q)-c<strong>on</strong>jugates of a single eigenformf = ∑ b n q n . For simplicity of expositi<strong>on</strong> below, we assume this c<strong>on</strong>jecture,though the algorithm can probably be modified to deal with the general case. Wewill refer to this eigenform f, which is well-defined up to Gal(Q/Q)-c<strong>on</strong>jugacy, asMaeda’s eigenform.Lemma 3.7.2. The characteristic polynomial of the pth coefficient b p of Maeda’seigenform f, in the field Q(b 1 , b 2 , . . .), is equal to the characteristic polynomial ofT p,k acting <strong>on</strong> S k (1).Proof. The map T ⊗ Q → Q(b 1 , b 2 , . . .) that sends T n → b n is an isomorphism ofQ-algebras.Victor Miller shows in his thesis that S k (1) has a unique basis f 1 , . . . , f d ∈ Z[[q]]with a i (f j ) = δ ij , i.e., the first d × d block of coefficients is the identity matrix.Again, in the general case, the requirement that there is such a basis can be avoided,but for simplicity of expositi<strong>on</strong> we assume there is such a basis. We refer to thebasis f 1 , . . . , f d as Miller’s basis.


26 3. <strong>Modular</strong> <strong>Forms</strong> of Level 1Algorithm 3.7.3. We assume in the algorithm that the characteristic polynomialof T 2 has no multiple roots (this is easy to check, <strong>and</strong> if false, then you have foundan interesting counterexample to the c<strong>on</strong>jecture that the characteristic polynomialof T 2 has Galois group the full symmetric group).1. Using Propositi<strong>on</strong> 3.7.1 <strong>and</strong> Gauss eliminati<strong>on</strong>, we compute Miller’s basisf 1 , . . . , f d to precisi<strong>on</strong> O(q 2d+1 ), where d = dim S k (1). This is exactly theprecisi<strong>on</strong> needed to compute the matrix of T 2 .2. Using Definiti<strong>on</strong> 3.7.1, we compute the matrix [T 2 ] of T 2 with respect toMiller’s basis f 1 , . . . , f d . We compute the matrix with respect to the Millerbasis mainly because it makes the linear algebra much simpler.3. Using Algorithm 3.7.5 below we write down an eigenvector e = (e 1 , . . . , e d ) ∈K d for [T 2 ]. In practice, the comp<strong>on</strong>ents of T 2 are not very large, so thenumbers involved in computing e are also not very large.4. Since e 1 f 1 +· · ·+e d f d is an eigenvector for T 2 , our assumpti<strong>on</strong> that the characteristicpolynomial of T 2 is square free (<strong>and</strong> the fact that T is commutative)implies that e 1 f 1 + · · · + e d f d is also an eigenvector for T p . Normalizing, wesee that up to Galois c<strong>on</strong>jugacy,b p =d∑i=1e ie 1· a p (f i ),where the b p are the coefficients of Maeda’s eigenform f. For example, sincethe f i are Miller’s basis, if p ≤ d thenb p = e pe 1if p ≤ d,since a p (f i ) = 0 for all i ≠ p <strong>and</strong> a p (f p ) = 1.5. Finally, <strong>on</strong>ce we have computed b p , we can compute the characteristic polynomialof T p , because it is the minimal polynomial of b p . We spend the restof this secti<strong>on</strong> discussing how to make this step practical.Computing b p directly in step 4 is extremely costly because the divisi<strong>on</strong>s e i /e 1lead to massive coefficient explosi<strong>on</strong>, <strong>and</strong> the same remark applies to computingthe minimal polynomial of b p . Instead we compute the reducti<strong>on</strong>s b p modulo l<strong>and</strong> the characteristic polynomial of b p modulo l for many primes l, then recover<strong>on</strong>ly the characteristic polynomial of b p using the Chinese Remainder Theorem.Deligne’s bound <strong>on</strong> the magnitude of Fourier coefficients tells us how many primeswe need as moduli (we leave this analysis to the reader) 3 . 3More precisely, the reducti<strong>on</strong> modulo l steps are as follows. The field K can beviewed as Q[x]/(f(x)) where f(x) ∈ Z[x] is the characteristic polynomial of T 2 .We work <strong>on</strong>ly modulo primes such that1. f(x) has no repeated roots modulo l,2. l does not divide any denominator involved in our representati<strong>on</strong> of e, <strong>and</strong>3 Say more later.


3.7 An Algorithm for computing characteristic polynomials of <strong>Hecke</strong> operators 273. the image of e 1 in F l [x]/(f(x)) is invertible.For each such prime, we compute the image b p of b p in the reduced Artin ringF l [x]/(f(x)). Then the characteristic polynomial of T p modulo l equals the characteristicpolynomial of b p . This modular arithmetic is fast <strong>and</strong> requires negligiblestorage. Most of the time is spent doing the Chinese Remainder Theorem computati<strong>on</strong>s,which we do each time we do a few computati<strong>on</strong>s of the characteristicpolynomial of T p modulo l.Remark 3.7.4. If k is really large, so that steps 1 <strong>and</strong> 2 of the algorithm take tool<strong>on</strong>g or require too much memory, steps 1 <strong>and</strong> 2 can be performed modulo theprime l. Since the characteristic polynomial of T p,k modulo l does not depend <strong>on</strong>any choices, we will still be able to recover the original characteristic polynomial.3.7.4 How to write down an eigenvector over an extensi<strong>on</strong> fieldThe following algorithm, which was suggested to the author by H. Lenstra, producesan eigenvector defined over an extensi<strong>on</strong> of the base field.Algorithm 3.7.5. Let A be an n×n matrix over an arbitrary field k <strong>and</strong> supposethat the characteristic polynomial f(x) = x n + · · · + a 1 x + a 0 of A is irreducible.Let α be a root of f(x) in an algebraic closure k of k. Factor f(x) over k(α) asf(x) = (x − α)g(x). Then for any element v ∈ k n the vector g(A)v is either 0 orit is an eigenvector of A with eigenvalue α. The vector g(A)v can be computed byfinding Av, A(Av), A(A(Av)), <strong>and</strong> then using thatg(x) = x n−1 + c n−2 x n−2 + · · · + c 1 x + c 0 ,where the coefficients c i are determined by the recurrencec 0 = − a 0α ,c i = c i−1 − a i.αWe prove below that g(A)v ≠ 0 for all vectors v not in a proper subspace ofk n . Thus with high probability, a “r<strong>and</strong>omly chosen” v will have the property thatg(A)v ≠ 0. Alternatively, if v 1 , . . . v n form a basis for k n , then g(A)v i must ben<strong>on</strong>zero for some i.Proof. By the Cayley-Hamilt<strong>on</strong> theorem [Lan93, XIV.3] we have that f(A) = 0.C<strong>on</strong>sequently, for any v ∈ k n , we have (A − α)g(A)v = 0 so that Ag(A)v =αv. Since f is irreducible it is the polynomial of least degree satisfied by A <strong>and</strong>so g(A) ≠ 0. Therefore g(A)v ≠ 0 for all v not in the proper closed subspaceker(g(A)).


28 3. <strong>Modular</strong> <strong>Forms</strong> of Level 13.7.5 Simple example: weight 36, p = 3We compute the characteristic polynomial of T 3 acting <strong>on</strong> S 36 (1) using the algorithmdescribed above. A basis for M 36 (1) to precisi<strong>on</strong> 6 = 2 dim(S 36 (1)) isE 9 4 = 1 + 2160q + 2093040q 2 + 1198601280q 3 + 449674832880q 4+ 115759487504160q 5 + 20820305837344320q 6 + O(q 7 )E 6 4E 2 6 = 1 + 432q − 353808q 2 − 257501376q 3 − 19281363984q 4+ 28393576094880q 5 + 11565037898063424q 6 + O(q 7 )E 3 4E 4 6 = 1 − 1296q + 185328q 2 + 292977216q 3 − 52881093648q 4− 31765004621280q 5 + 1611326503499328q 6 + O(q 7 )E 6 6 = 1 − 3024q + 3710448q 2 − 2309743296q 3 + 720379829232q 4− 77533149038688q 5 − 8759475843314112q 6 + O(q 7 )The reduced row-echel<strong>on</strong> form (Miller) basis is:f 0 = 1 + 6218175600q 4 + 15281788354560q 5 + 9026867482214400q 6 + O(q 7 )f 1 = q + 57093088q 4 + 37927345230q 5 + 5681332472832q 6 + O(q 7 )f 2 = q 2 + 194184q 4 + 7442432q 5 − 197264484q 6 + O(q 7 )f 3 = q 3 − 72q 4 + 2484q 5 − 54528q 6 + O(q 7 )The matrix of T 2 with respect to the basis f 1 , f 2 , f 3 is⎛0 34416831456⎞5681332472832[T 2 ] = ⎝ 1 194184 −197264484⎠0 −72 −54528This matrix has (irreducible) characteristic polynomialg = x 3 − 139656x 2 − 59208339456x − 1467625047588864.If a is a root of this polynomial, then <strong>on</strong>e finds thate = (2a + 108984, 2a 2 + 108984a, a 2 − 394723152a + 11328248114208)is an eigenvector with eigenvalue a. The characteristic polynomial of T 3 is thenthe characteristic polynomial of e 3 /e 1 , which we can compute modulo l for anyprime l such that g ∈ F l [x] is square free. For example, when l = 11,which has characteristic polynomiale 3= a2 + a + 3e 1 2a 2 = 9a 2 + 2a + 3,+ 7x 3 + 10x 2 + 8x + 2.If we repeat this process for enough primes l <strong>and</strong> use the Chinese remaindertheorem, we find that the characteristic polynomial of T 3 acting <strong>on</strong> S 36 (1) isx 3 + 104875308x 2 − 144593891972573904x − 21175292105104984004394432.


4Duality, Rati<strong>on</strong>ality, <strong>and</strong> Integrality4.1 <strong>Modular</strong> forms for SL 2 (Z) <strong>and</strong> Eisenstein seriesLet Γ = Γ 1 (1) = SL 2 (Z) <strong>and</strong> for k ≥ 0 let{}∞∑M k = f = a n q n : f is a modular form of weight k for Γ⊃ S k =n=0{f =}∞∑a n q n : f ∈ M kn=1These are finite dimensi<strong>on</strong>al C-vector spaces whose dimensi<strong>on</strong>s are easily computed.Furthermore, they are generated by familiar elements (see Serre [Ser73] orLang [Lan95].) The main tool is the formula∑p∈D∪{∞}1e(p) ord p(f) = k 12where D is the st<strong>and</strong>ard fundamental domain for Γ <strong>and</strong>⎧⎪⎨ 1 otherwisee(p) = 2 if p = i⎪⎩3 if p = ρ = e 2πi/3One can alternatively define e(p) as follows. If p = τ <strong>and</strong> E = C/(Zτ + Z) thene(p) = 1 2 # Aut(E).For k ≥ 4 we define the Eisenstein series G k byG k (q) = 1 ∞ 2 ζ(1 − k) + ∑σ k−1 (n)q n .n=1


30 4. Duality, Rati<strong>on</strong>ality, <strong>and</strong> IntegralityThe mapτ ↦→∑(m,n)≠(0,0)m,n∈Z1(mτ + n) kdiffers from G k by a c<strong>on</strong>stant (see [Ser73, §VII.4.2]). Also, ζ(1 − k) ∈ Q <strong>and</strong> <strong>on</strong>e∞∑may say, symbolically at least, “ζ(1 − k) = d k−1 = σ k−1 (0).” The nth Bernoullinumber B n is defined by the equati<strong>on</strong>d=1x∞e x − 1 = ∑n=0B n x n.n!One can show, using the functi<strong>on</strong>al equati<strong>on</strong> for ζ <strong>and</strong> [Ser73, §VII.4.1], thatζ(1 − k) = − B kkso the c<strong>on</strong>stant coefficient of G k is − B k2kwhich is rati<strong>on</strong>al.4.2 Inner productIn what follows we assume k ≥ 2 to avoid trivialities.. The <strong>Hecke</strong> operators T nacts <strong>on</strong> the space M k . Fix a subspace V ⊂ M k which is stable under the acti<strong>on</strong> ofthe T n . Let T(V ) be the C-algebra generated by the endomorphism T n acting <strong>on</strong>V <strong>and</strong> note that T(V ) is actually a finite dimensi<strong>on</strong>al C-vector space since it is asubspace of End(V ) <strong>and</strong> V is finite dimensi<strong>on</strong>al. Recall that T is commutative.For any <strong>Hecke</strong> operator T ∈ T <strong>and</strong> f ∈ V , to make certain arguments notati<strong>on</strong>allymore natural, we will often write T (f) = f|T . The ring T is commutative, soit is harmless to use this notati<strong>on</strong>, which suggests both a left <strong>and</strong> right modulestructure.There is a bilinear formT × V → C〈T, f〉 ↦→ a 1 (f|T )where f|T = ∑ ∞n=0 a n(f|T )q n . We thus obtain mapsV → Hom(T, C) = T ∗T → Hom(V, C) = V ∗ .Theorem 4.2.1. The above maps are isomorphisms.Proof. It suffices to show that each map is injective. Then since a finite dimensi<strong>on</strong>alvector space <strong>and</strong> its dual have the same dimensi<strong>on</strong> the result follows. First supposef ↦→ 0 ∈ Hom(T, C). Then a 1 (f|T ) = 0 for all T ∈ T, so a n = a 1 (f|T n ) = 0 forall n ≥ 1. Thus f is a c<strong>on</strong>stant; since k ≥ 2 this implies f = 0.Next suppose T ↦→ 0 ∈ Hom(V, C), then a 1 (f|T ) = 0 for all f ∈ V . Substitutingf|T n for f <strong>and</strong> using the commutativity of T we havea 1 ((f|T n )|T ) = 0 for all f, n ≥ 1a 1 ((f|T )|T n ) = 0a n (f|T ) = 0 n ≥ 1f|T = 0by commutativitysince k ≥ 2, as above


4.3 Eigenforms 31Thus T = 0 which completes the proof.Remark 4.2.2. The above isomorphisms are T-equivariant, in the following sense.We endow Hom(T, C) with a T-module structure by letting T ∈ T act <strong>on</strong> ϕ ∈Hom(T, C) by (T · ϕ)(T ′ ) = ϕ(T T ′ ). If α : V → Hom(T, C) is the above isomorphism(so α : f ↦→ ϕ f := (T ′ ↦→ a 1 (f|T ′ ))), then equivariance is the statementthat α(T f) = T α(f). This follows sinceSimilar remarks hold for T → V ∗ .α(T f)(T ′ ) = ϕ T f (T ′ ) = a 1 (T f|T ′ ) = a 1 (f|T ′ T )= ϕ f (T ′ T ) = T ϕ f (T ′ ) = T α(f)(T ′ ).4.3 EigenformsWe c<strong>on</strong>tinue to assume that k ≥ 2. A modular form f ∈ M k is an eigenform forT if f|T n = λ n f for all n ≥ 1 <strong>and</strong> some complex numbers λ n . Suppose f is aneigenform, so a n (f) = a 1 (f|T n ) = λ n a 1 (f). Thus if a 1 (f) = 0, then a n (f) = 0 forall n ≥ 1, so since k ≥ 2 this implies that f = 0. Thus a 1 (f) ≠ 0, <strong>and</strong> we may aswell divide through by a 1 (f) to obtain the normalized eigenform 1a 1(f)f. We thusassume that a 1 (f) = 1, then the formula becomes a n (f) = λ n , so f|T n = a n (f)f,for all n ≥ 1.Theorem 4.3.1. Let f ∈ V <strong>and</strong> let ψ be the image of f in Hom(T, C), soψ(T ) = a 1 (f|T ). Then f is a normalized eigenform if <strong>and</strong> <strong>on</strong>ly if ψ is a ringhomomorphism.Proof. First suppose f is a normalized eigenform, so f|T n = a n (f)f. Thenψ(T n T m ) = a 1 (f|T n T m ) = a m (f|T n )= a m (a n (f)f) = a m (f)a n (f)= ψ(T n )ψ(T m ),so ψ is a homomorphism.C<strong>on</strong>versely, assume ψ is a homomorphism. Then f|T n = ∑ a m (f|T n )q m , so toshow that f|T n = a n (f)f we must show that a m (f|T n ) = a n (f)a m (f). Recall thatψ(T n ) = a 1 (f|T n ) = a n , soas desired.a n (f)a m (f) = a 1 (f|T n )a 1 (f|T m ) = ψ(T n )ψ(T m )= ψ(T n T m ) = a 1 (f|T n |T m )= a m (f|T n )4.4 IntegralityIn the previous secti<strong>on</strong>s, we looked at subspaces V ⊂ M k ⊂ C[[q]], with k ≥ 2, <strong>and</strong>c<strong>on</strong>sidered the space T = T(V ) = C[. . . , T n , . . .] ⊂ End C V of <strong>Hecke</strong> operators


32 4. Duality, Rati<strong>on</strong>ality, <strong>and</strong> Integrality<strong>on</strong> V . We defined a pairing T × V → C by (T, f) ↦→ a 1 (f|T ) <strong>and</strong> showed thispairing is n<strong>on</strong>degenerate <strong>and</strong> that it induces isomorphisms T ∼ = Hom(V, C) <strong>and</strong>V ∼ = Hom(T, C).Fix k ≥ 4 <strong>and</strong> let S = S k be the space of weight k cusp forms with respect tothe acti<strong>on</strong> of SL 2 (Z). LetS(Q) = S k ∩ Q[[q]]S(Z) = S k ∩ Z[[q]].Theorem 4.4.1. There is a C-basis of M k c<strong>on</strong>sisting of forms with integer coefficients.Proof. We see this by exhibiting a basis. Recall that for all k ≥ 4G k = − B ∞ k2k + ∑ ∑d k−1 q nk=1 d|kis the kth Eisenstein series, which is a modular form of weight k, <strong>and</strong>E k = − 2kB k· G k = 1 + · · ·is the normalizati<strong>on</strong> that starts with 1. Since the Bernoulli numbers B 2 , . . . , B 8have 1 as numerator (this isn’t always the case, B 10 = 5 66 ) we see that E 4 <strong>and</strong>E 6 have coefficients in Z <strong>and</strong> c<strong>on</strong>stant term 1. Furthermore <strong>on</strong>e shows (see [Ser73,§VII.3.2]) by dimensi<strong>on</strong> <strong>and</strong> independence arguments that the modular formsform a basis for M k .{E a 4 E b 6 : 4a + 6b = k}4.5 A Result from Victor Miller’s thesisSet d = dim C S k . Victor Miller showed in his thesis (see [Lan95], Ch. X, Theorem4.4) that there existsf 1 , . . . , f d ∈ S k (Z) such that a i (f j ) = δ ijfor 1 ≤ i, j ≤ d. The f i then form a basis for S k (Z).Example 4.5.1. The space S 36 (Z) has basisf 1 = q + 57093088q 4 + 37927345230q 5 + 5681332472832q 6 + · · ·f 2 = q 2 + 194184q 4 + 7442432q 5 − 197264484q 6 + 722386944q 7 · · ·f 3 = q 3 − 72q 4 + 2484q 5 − 54528q 6 + 852426q 7 − 10055232q 8 + · · ·Let T = Z[. . . , T n , . . .] ⊂ End(S k ) be the <strong>Hecke</strong> algebra associated to S k . Miller’sthesis implies the following result about T.Propositi<strong>on</strong> 4.5.2. We have T = ⊕ di=1 ZT i, as Z-modules.


4.6 The Peterss<strong>on</strong> inner product 33Proof. To see that T 1 , . . . , T d ∈ T = T(S k ) are linearly independent over C,suppose ∑ di=1 c iT i = 0. Then(0 = a 1 f j | ∑ )c i T i = ∑ c i a i (f j ) = ∑ c i δ ij = c j .iiFrom the isomorphism T ∼ = Hom(S k , C) we know that dim C T = d, so we canwrite any T n as a C-linear combinati<strong>on</strong>ButT n =Z ∋ a n (f j ) = a 1 (f j |T n ) =d∑c ni T i , c ni ∈ C.i=1d∑c ni a 1 (f j |T i ) =i=1so the c ni all lie in Z, which completes the proof.d∑c ni a i (f j ) = c njThus T is an integral <strong>Hecke</strong> algebra of finite rank d over Z. We have a mapwhich induces an embeddingS(Z) × T → Z(f, T ) ↦→ a 1 (f|T )S(Z) ↩→ Hom(T, Z) ≈ Z d .Remark 4.5.3. The map S(Z) ↩→ Hom(T, Z) is an isomorphism of T-modules,since if ϕ ∈ Hom(T, Z), then f = ∑ dj=1 ϕ(T j)f j ∈ S(Z) maps to ϕ.i=14.6 The Peterss<strong>on</strong> inner productThe main theorem isTheorem 4.6.1. The T n ∈ T(S k ) are all diag<strong>on</strong>alizable over C.To prove this, we note that S k supports a n<strong>on</strong>degenerate positive definite Hermiteaninner product (the Peterss<strong>on</strong> inner product)such that(f, g) ↦→ 〈f, g〉 ∈ C〈f|T n , g〉 = 〈f, g|T n 〉. (4.6.1)We need some background facts.An operator T is normal if it commutes with its adjoint, thus T T ∗ = T ∗ T . Itfollows from (4.6.1) that T ∗ n = T n , so T n is clearly normal.Theorem 4.6.2. A normal operator is diag<strong>on</strong>alizable.Thus each T n is diag<strong>on</strong>alizable.


34 4. Duality, Rati<strong>on</strong>ality, <strong>and</strong> IntegralityTheorem 4.6.3. A commuting family of semisimple (=diag<strong>on</strong>alizable) operatorscan be simultaneously diag<strong>on</strong>alized.Since the T n commute this implies S k has a basis c<strong>on</strong>sisting of normalized eigenformsf. Their eigenvalues are real sincea n (f)〈f, f〉 = 〈a n (f)f, f〉 = 〈f|T n , f〉 (4.6.2)= 〈f, a n (f)f〉 = a n (f)〈f, f〉. (4.6.3)Propositi<strong>on</strong> 4.6.4. The coefficients a n of the eigenforms are totally real algebraicintegers.Proof. Theorem 4.4.1 implies that there is a basis for S k with integer coefficients.With respect to a basis for S k (Z) = S k ∩ Z[[q]], the matrices of the <strong>Hecke</strong> opeatorsT n all have integer entries, so their characteristic polynomials are m<strong>on</strong>ic polynomialswith integer coefficients. The roots of these polynomials are all real (by (4.6.2)),so the roots are totally real algebraic integers.Leth = {x + iy : x, y ∈ R, <strong>and</strong> y > 0}be the upper half plane.If α = ( )a bc d ∈ GL+2 (R) then ( )a bc d acts <strong>on</strong> h by( )a b: z ↦→ az + bc d cz + d .Lemma 4.6.5.Im( ) az + b= det(α)cz + d |cz + d| 2 y.Proof. Apply the identity Im(z) = 1 2i(z − z) to both sides of the asserted equality<strong>and</strong> simplify.Propositi<strong>on</strong> 4.6.6. The volume form dx∧dyy 2is invariant under the acti<strong>on</strong> ofProof. Differentiating az+bcz+d gives( ) az + bd =cz + dGL + 2 (R) = {M ∈ GL 2(R) : det(M) > 0}.=a(cz + d)dz − c(az + b)dz(cz + d) 2(ad − bc)dz(cz + d) 2= det(α)(cz + d) 2 dzThus, under the acti<strong>on</strong> of α, dz ∧ dz takes <strong>on</strong> a factor ofdet(α) 2(cz + d) 2 (cz + d) 2 = ( det(α)|cz + d| 2 ) 2.


The Peterss<strong>on</strong> inner product of forms f, g ∈ S k is defined by∫〈f, g〉 = (f(z)g(z)y k dx ∧ dy)y 2 ,Γ\h4.6 The Peterss<strong>on</strong> inner product 35where Γ = SL 2 (Z).Integrating over Γ\h can be taken to mean integrating over a fundamental domainfor the acti<strong>on</strong> of h. Showing that the operators T n are self-adjoint with respectto the Peterss<strong>on</strong> inner product is a harder computati<strong>on</strong> than <strong>on</strong>e might think (see[Lan95, §III.4]).


36 4. Duality, Rati<strong>on</strong>ality, <strong>and</strong> Integrality


5Analytic Theory of <strong>Modular</strong> Curves5.1 The <strong>Modular</strong> groupThis secti<strong>on</strong> very closely follows Secti<strong>on</strong>s 1.1–1.2 of [Ser73]. We introduce themodular group G = PSL 2 (Z), describe a fundamental domain for the acti<strong>on</strong> of G<strong>on</strong> the upper half plane, <strong>and</strong> use it to prove that G is generated byS =( ) 0 − 11 0<strong>and</strong> T =( ) 1 1.0 15.1.1 The Upper half planeLeth = {z ∈ C : Im(z) > 0}be the open complex upper half plane. The group{( ) a bSL 2 (R) =c d}: a, b, c, d ∈ R <strong>and</strong> ad − bc = 1iFIGURE 5.1.1. The upper half plane hR


38 5. Analytic Theory of <strong>Modular</strong> Curvesacts by linear fracti<strong>on</strong>al transformati<strong>on</strong>s (z ↦→ (az + b)/(cz + d)) <strong>on</strong> C ∪ {∞}.Suppose g ∈ SL 2 (R) <strong>and</strong> z ∈ h. Then Lemma 4.6.5 implies that Im(gz) = Im(z)|cz+d|, 2so SL 2 (R) acts <strong>on</strong> h.The <strong>on</strong>ly element of SL 2 (R) that acts trivially <strong>on</strong> h is −1, soG = PSL 2 (Z) = SL 2 (Z)/〈−1〉acts faithfully <strong>on</strong> h. Let S <strong>and</strong> T be as above <strong>and</strong> note that S <strong>and</strong> T induce thelinear fracti<strong>on</strong>al transformati<strong>on</strong>s z ↦→ −1/z <strong>and</strong> z ↦→ z + 1, respectively. We provebelow that S <strong>and</strong> T generate G.5.2 Points <strong>on</strong> modular curves parameterize ellipticcurves with extra structureThe classical theory of the Weierstass ℘-functi<strong>on</strong> sets up a bijecti<strong>on</strong> betweenisomorphism classes of elliptic curves over C <strong>and</strong> isomorphism classes of <strong>on</strong>edimensi<strong>on</strong>alcomplex tori C/Λ. Here Λ is a lattice in C, i.e., a free abelian groupZω 1 + Zω 2 of rank 2 such that Rω 1 + Rω 2 = C.Any homomorphism ϕ of complex tori C/Λ 1 → C/Λ 2 is determined by a C-linear map T : C → C that sends Λ 1 into Λ 2 .Lemma 5.2.1. Suppose ϕ : C/Λ 1 → C/Λ 2 is n<strong>on</strong>zero. Then the kernel of ϕ isisomorphic to Λ 2 /T (Λ 1 ).Lemma 5.2.2. Two complex tori C/Λ 1 <strong>and</strong> C/Λ 2 are isomorphic if <strong>and</strong> <strong>on</strong>ly ifthere is a complex number α such that αΛ 1 = Λ 2 .Proof. Any C-linear map C → C is multiplicati<strong>on</strong> by a scalar α ∈ C.Suppose Λ = Zω 1 + Zω 2 is a lattice in C, <strong>and</strong> let τ = ω 1 /ω 2 . Then Λ τ = Zτ + Zdefines an elliptic curve that is isomorphic to the elliptic curve determined by Λ.By replacing ω 1 by −ω 1 , if necessary, we may assume that τ ∈ h. Thus everyelliptic curve is of the form E τ = C/Λ τ for some τ ∈ h <strong>and</strong> each τ ∈ h determinesan elliptic curve.Propositi<strong>on</strong> 5.2.3. Suppose τ, τ ′ ∈ h. Then E τ ≈ E τ ′ if <strong>and</strong> <strong>on</strong>ly if there existsg ∈ SL 2 (Z) such that τ = g(τ ′ ). Thus the set of isomorphism classes of ellipticcurves over C is in natural bijecti<strong>on</strong> with the orbit space SL 2 (Z)\h.Proof. Suppose E τ ≈ E τ ′. Then there exists α ∈ C such that αΛ τ = Λ τ ′, soατ = aτ ′ + b <strong>and</strong> α1 = cτ ′ + d for some a, b, c, d ∈ Z. The matrix g = ( )a bc d hasdeterminant ±1 since aτ ′ +b <strong>and</strong> cτ ′ +d form a basis for Zτ ′ +Z; this determinantis positive because g(τ ′ ) = τ <strong>and</strong> τ, τ ′ ∈ h. Thus det(g) = 1, so g ∈ SL 2 (Z).C<strong>on</strong>versely, suppose τ, τ ′ ∈ h <strong>and</strong> g = ( )a bc d ∈ SL2 (Z) is such thatτ = g(τ ′ ) = aτ ′ + bcτ ′ + d .Let α = cτ ′ + d, so ατ = aτ ′ + b. Since det(g) = 1, the scalar α defines anisomorphism from Λ τ to Λ τ ′, so E τ∼ = E′τ , as claimed.


5.2 Points <strong>on</strong> modular curves parameterize elliptic curves with extra structure 39Let E = C/Λ be an elliptic curve over C <strong>and</strong> N a positive integer. UsingLemma 5.2.1, we see that( )E[N] := {x ∈ E : Nx = 0} ∼ 1=N Λ /Λ ∼ = (Z/NZ) 2 .If Λ = Λ τ = Zτ + Z, this means that τ/N <strong>and</strong> 1/N are a basis for E[N].Suppose τ ∈ h <strong>and</strong> recall that E τ = C/Λ τ = C/(Zτ + Z). To τ, we associatethree “level N structures”. First, let C τ be the subgroup of E τ generated by 1/N.Sec<strong>on</strong>d, let P τ be the point of order N in E τ defined by 1/N ∈ 1 N Λ τ . Third, letQ τ be the point of order N in E τ defined by τ/N, <strong>and</strong> c<strong>on</strong>sider the basis (P τ , Q τ )for E[N].In order to describe the third level structure, we introduce the Weil pairinge : E[N] × E[N] → Z/NZas follows. If E = C/(Zω 1 + Zω 2 ) with τ = ω 1 /ω 2 ∈ h, <strong>and</strong> P = aω 1 /N + bω 2 /N,Q = cω 1 /N + dω 2 /N, thene(P, Q) = ad − bc ∈ Z/NZ.Also if C/Λ ∼ = C/Λ ′ via multiplicati<strong>on</strong> by α, <strong>and</strong> P, Q ∈ (C/Λ)[N], then we havee(α(P ), α(Q)) = e(P, Q), so e does not depend <strong>on</strong> the choice of Λ or a basis for it.In particular, P τ <strong>and</strong> Q τ map to P, Q via the map E τ → E given by multiplicati<strong>on</strong>by ω 2 , so e(P τ , Q τ ) = −1 ∈ Z/NZ.Remark 5.2.4. There is a can<strong>on</strong>ical Nth root of 1 in C, namely ζ = e 2πi/N . Usingζ as a can<strong>on</strong>ical generator of µ N , we can view the transcendental Weil pairingindifferently as a map with values in Z/NZ or as a map with values in µ N . However,for generalizati<strong>on</strong>s it is important to use µ N rather than Z/NZ. There are severalintrinsic algebraic definiti<strong>on</strong>s of the Weil pairing <strong>on</strong> N-divisi<strong>on</strong> points for an ellipticcurve (or, more generally, an abelian variety) over a field k whose characteristic isprime to N. In all cases, the Weil pairing takes values in the group of Nth roots ofunity with values in the algebraic closure of k. The various definiti<strong>on</strong>s all coincide“up to sign” in the sense that any two of them either coincide or are inverse toeach other. There is a discussi<strong>on</strong> of the Weil pairing in [Kat81, §5.2].We next c<strong>on</strong>sider the three modular curves X 0 (N), X 1 (N), <strong>and</strong> X(N), associatedto the c<strong>on</strong>gruence subgroups Γ 0 (N), Γ 1 (N) <strong>and</strong> Γ(N). Recall that Γ 0 (N) isthe subgroup of ( )a bc d ∈ SL2 (Z) c<strong>on</strong>gruent to ( ∗ 0 ∗∗ ) modulo N, that Γ 1 (N) c<strong>on</strong>sistsof matrices c<strong>on</strong>gruent to ( 1 0 ∗1 ) modulo N, <strong>and</strong> Γ(N) c<strong>on</strong>sists of matrices c<strong>on</strong>gruentto ( 1 0 01 ) modulo N.Theorem 5.2.5. Let N be a positive integer.1. The n<strong>on</strong>-cuspidal points <strong>on</strong> X 0 (N) corresp<strong>on</strong>d to isomorphism classes ofpairs (E, C) where C is a cyclic subgroup of E of order N. (Two pairs(E, C), (E ′ , C ′ ) are isomorphic if there is an isomorphism ϕ : E → E ′such that ϕ(C) = C ′ .)2. The n<strong>on</strong>-cuspidal points <strong>on</strong> X 1 (N) corresp<strong>on</strong>d to isomorphism classes ofpairs (E, P ) where P is a point <strong>on</strong> E of exact order N. (Two pairs (E, P )<strong>and</strong> (E ′ , P ′ ) are isomorphic if there is an isomorphism ϕ : E → E ′ such thatϕ(P ) = P ′ .)


40 5. Analytic Theory of <strong>Modular</strong> Curves3. The n<strong>on</strong>-cuspidal points <strong>on</strong> X(N) corresp<strong>on</strong>d to isomorphism classes of triples(E, P, Q) where P, Q is a basis for E[N] such that e(P, Q) = −1 ∈ Z/NZ.(Triples (E, P, Q) <strong>and</strong> (E, P ′ , Q ′ ) are isomorphic if there is an isomorphismϕ : E → E ′ such that ϕ(P ) = P ′ <strong>and</strong> ϕ(Q) = Q ′ .)This theorem follows from Propositi<strong>on</strong>s 5.2.8 <strong>and</strong> 5.2.9 below, whose proofs makeextensive use of the following lemma.Lemma 5.2.6. The reducti<strong>on</strong> map SL 2 (Z) → SL 2 (Z/NZ) is surjective.Proof. By c<strong>on</strong>sidering the bottom two entries of an element of SL 2 (Z/NZ) <strong>and</strong>using the extended Euclidean algorithm, it suffices to prove that if c, d ∈ Z <strong>and</strong>gcd(c, d, N) = 1, then there exists α ∈ Z such that gcd(c, d + αN) = 1. Letc 0 = ∏ l|gcd(c,d) lord l(c) <strong>and</strong> c 1 = c/c 0 . We have c<strong>on</strong>structed c 0 <strong>and</strong> c 1 so thatgcd(d, c 1 ) = 1 <strong>and</strong> gcd(c 0 , c 1 ) = 1; also, gcd(c 0 , N) = 1 since if l | gcd(c 0 , N),then l | gcd(c, d, N) = 1. Because gcd(Nc 1 , c 0 ) = 1, the class of Nc 1 generatesthe additive group Z/c 0 Z, so there exists m such that m · Nc 1 ≡ 1 − d (mod c 0 ).Thus d + mc 1 N ≡ 1 (mod c 0 ) <strong>and</strong> d + mc 1 N ≡ d (mod c 1 ), so d + mc 1 N is a unitmodulo both c 0 <strong>and</strong> c 1 , hence gcd(d + mc 1 N, c) = 1, which proves the lemma.Remark 5.2.7. The analogue of Lemma 5.2.6 is also true for SL m (Z) → SL m (Z/NZ)for any integer m. See [Shi94, ?].Propositi<strong>on</strong> 5.2.8. Let E be an elliptic curve over C. If C is a cyclic subgroup ofE of order N, then there exists τ ∈ h such that (E, C) is isomorphic to (E τ , C τ ).If P is a point <strong>on</strong> E of order N, then there exists τ ∈ C such that (E, P ) isisomorphic to (E τ , P τ ). If P, Q is a basis for E[N] <strong>and</strong> e(P, Q) = −1 ∈ Z/NZ,then there exists τ ∈ C such that (E, P, Q) is isomorphic to (E τ , P τ , Q τ ).Proof. Write E = C/Λ with Λ = Zω 1 + Zω 2 <strong>and</strong> ω 1 /ω 2 ∈ h.Suppose P = aω 1 /N + bω 2 /N is a point of order N. Then gcd(a, b, N) = 1,since otherwise P would have order strictly less than N, a c<strong>on</strong>tradicti<strong>on</strong>. As inLemma 5.2.6, we can modify a <strong>and</strong> b by adding multiples of N to them, so thatP = aω 1 /N +bω 2 /N <strong>and</strong> gcd(a, b) = 1. There exists c, d ∈ Z such that ad−bc = 1,so ω 1 ′ = aω 1 + bω 2 <strong>and</strong> ω 2 ′ = cω 1 + dω 2 form a basis for Λ, <strong>and</strong> C is generated byP = ω 1/N. ′ If necessary, replace ω 2 ′ by −ω 2 ′ so that τ = ω 2/ω ′ 1 ′ ∈ h. Then (E, P ) isisomorphic to (E τ , P τ ). Also, if C is the subgroup generated by P , then (E, C) isisomorphic to (E τ , C τ ).Suppose P = aω 1 /N + bω 2 /N <strong>and</strong> Q = cω 1 /N + dω 2 /N are a basis for E[N]with e(P, Q) = −1. Then the matrix ( )a b−c −d has determinant 1 modulo N, sobecause the map SL 2 (Z) → SL 2 (Z/NZ) is surjective, we can replace a, b, c, d byintegers that are equivalent to them modulo N (so P <strong>and</strong> Q are unchanged) sothat ad − bc = −1. Thus ω 1 ′ = aω 1 + bω 2 <strong>and</strong> ω 2 ′ = cω 1 + dω 2 form a basis for Λ.Letτ = ω 2/ω ′ 1 ′ = c ω1ω 2+ da ω1ω 2+ b .Then τ ∈ h since ω 1 /ω 2 ∈ h <strong>and</strong> ( c da b)has determinant +1. Finally, divisi<strong>on</strong> byω ′ 1 defines an isomorphism E → E τ that sends P to 1/N <strong>and</strong> Q to τ/N.The following propositi<strong>on</strong> completes the proof of Theorem 5.2.5.


5.3 The Genus of X(N) 41Propositi<strong>on</strong> 5.2.9. Suppose τ, τ ′ ∈ h. Then (E τ , C τ ) is isomorphic (E τ ′, C τ ′) if<strong>and</strong> <strong>on</strong>ly if there exists g ∈ Γ 0 (N) such that g(τ) = τ ′ . Also, (E τ , P τ ) is isomorphic(E τ ′, P τ ′) if <strong>and</strong> <strong>on</strong>ly if there exists g ∈ Γ 1 (N) such that g(τ) = τ ′ . Finally,(E τ , P τ , Q τ ) is isomorphic (E τ ′, P τ ′, Q τ ′) if <strong>and</strong> <strong>on</strong>ly if there exists g ∈ Γ(N) suchthat g(τ) = τ ′ .Proof. We prove <strong>on</strong>ly the first asserti<strong>on</strong>, since the others are proved in a similarway. Suppose (E τ , C τ ) is isomorphic to (E τ ′ , C τ ′ ). Then there is λ ∈ C such thatλΛ τ = Λ τ ′. Thus λτ = aτ ′ + b <strong>and</strong> λ1 = cτ ′ + d with g = ( )a bc d ∈ SL2 (Z) (aswe saw in the proof of Propositi<strong>on</strong> 5.2.3). Dividing the sec<strong>on</strong>d equati<strong>on</strong> by N weget λ 1 N = c N τ ′ + d N , which lies in Λ τ ′ = Zτ ′ + 1 NZ, by hypothesis. Thus c ≡ 0(mod N), so g ∈ Γ 0 (N), as claimed. For the c<strong>on</strong>verse, note that if N | c, thencN τ ′ + d N ∈ Λ τ ′.5.3 The Genus of X(N)Let N be a positive integer. The aim of this secti<strong>on</strong> is to establish some factsabout modular curves associated to c<strong>on</strong>gruence subgroups <strong>and</strong> compute the genusof X(N). Similar methods can be used to compute the genus of X 0 (N) <strong>and</strong> X 1 (N)(see Chapter 10).The groups Γ 0 (1), Γ 1 (1), <strong>and</strong> Γ(1) are all equal to SL 2 (Z), so X 0 (1) = X 1 (1) =X(1) = P 1 . Since P 1 has genus 0, we know the genus for each of these threecases. For general N we obtain the genus by determining the ramificati<strong>on</strong> of thecorresp<strong>on</strong>ding cover of P 1 <strong>and</strong> applying the Hurwitz formula, which we assumethe reader is familiar with, but which we now recall.Suppose f : X → Y is a surjective morphism of Riemann surfaces of degree d.For each point x ∈ X, let e x be the ramificati<strong>on</strong> exp<strong>on</strong>ent at x, so e x = 1 preciselywhen f is unramified at x, which is the case for all but finitely many x. (There is apoint over y ∈ Y that is ramified if <strong>and</strong> <strong>on</strong>ly if the cardinality of f −1 (y) is less thanthe degree of f.) Let g(X) <strong>and</strong> g(Y ) denote the genera of X <strong>and</strong> Y , respectively.Theorem 5.3.1 (Hurwitz Formula). Let f : X → Y be as above. Then2g(X) − 2 = d(2g(Y ) − 2) + ∑ x∈X(e x − 1).If X → Y is Galois, so the e x in the fiber over each fixed y ∈ Y are all equal to afixed value e y , then this formula becomes⎛(2g(X) − 2 = d1 − 1 ) ⎞ ⎠ .e y⎝2g(Y ) − 2 + ∑ y∈YLet X be <strong>on</strong>e of the modular curves X 0 (N), X 1 (N), or X(N) corresp<strong>on</strong>ding to ac<strong>on</strong>gruence subgroup Γ, <strong>and</strong> let Y = X(1) = P 1 . There is a natural map f : X → Ygot by sending the equivalence class of τ modulo the c<strong>on</strong>gruence subgroup Γ to theequivalence class of τ modulo SL 2 (Z). This is “the” map X → P 1 that we meaneverywhere below.Because PSL 2 (Z) acts faithfully <strong>on</strong> h, the degree of f is the index in PSL 2 (Z)of the image of Γ in PSL 2 (Z). Using Lemma 5.2.6 that the map SL 2 (Z) →SL 2 (Z/NZ) is surjective, we compute these indices, <strong>and</strong> obtain the following:


42 5. Analytic Theory of <strong>Modular</strong> CurvesPropositi<strong>on</strong> 5.3.2. Suppose N > 2. The degree of the map X 0 (N) → P 1 isN ∏ p|N (1 + 1/p). The degree of the map X 1(N) → P 1 is 1 2 N 2 ∏ p|N (1 − 1/p2 ).The degree of the map from X(N) → P 1 is 1 2 N 3 ∏ p|N (1 − 1/p2 ). If N = 2, thenthe degrees are 3, 3, <strong>and</strong> 6, respectively.Proof. This follows from the discussi<strong>on</strong> above, <strong>and</strong> the observati<strong>on</strong> that for N > 2the groups Γ(N) <strong>and</strong> Γ 1 (N) do not c<strong>on</strong>tain −1 <strong>and</strong> the group Γ 0 (N) does.Propositi<strong>on</strong> 5.3.3. Let X be X 0 (N), X 1 (N) or X(N). Then the map X → P 1is ramified at most over ∞ <strong>and</strong> the two points corresp<strong>on</strong>ding to elliptic curves withextra automorphisms (i.e., the two elliptic curves with j-invariants 0 <strong>and</strong> 1728).Proof. Since we have a tower X(N) → X 1 (N) → X 0 (N) → P 1 , it suffices to provethe asserti<strong>on</strong> for X = X(N). Since we do not claim that there is no ramificati<strong>on</strong>over ∞, we may restrict to Y (N). By Theorem 5.2.5, the points <strong>on</strong> Y (N) corresp<strong>on</strong>dto isomorphism classes of triples (E, P, Q), where E is an elliptic curveover C <strong>and</strong> P, Q are a basis for E[N]. The map from Y (N) to P 1 sends the isomorphismclass of (E, P, Q) to the isomorphism class of E. The equivalence classof (E, P, Q) also c<strong>on</strong>tains (E, −P, −Q), since −1 : E → E is an isomorphism. The<strong>on</strong>ly way the fiber over E can have cardinality smaller than the degree is if thereis an extra equivalence (E, P, Q) → (E, ϕ(P ), ϕ(Q)) with ϕ an automorphism ofE not equal to ±1. The theory of CM elliptic curves shows that there are <strong>on</strong>ly twoisomorphism classes of elliptic curves E with automorphisms other than ±1, <strong>and</strong>these are the <strong>on</strong>es with j-invariant 0 <strong>and</strong> 1728. This proves the propositi<strong>on</strong>.Theorem 5.3.4. For N > 2, the genus of X(N) isg(X(N)) = 1 + N 2 (N − 6)24∏p|N(1 − 1 p 2 ),where p runs through the prime divisors of N. For N = 1, 2, the genus is 0.Thus if g N = g(X(N)), then g 1 = g 2 = g 3 = g 4 = g 5 = 0, g 6 = 1, g 7 = 3, g 8 = 5,g 9 = 10, g 389 = 2414816, <strong>and</strong> g 2003 = 333832500.Proof. Since Γ(N) is a normal subgroup of SL 2 (Z), it follows that X(N) is a Galoiscovering of X(1) = P 1 , so the ramificati<strong>on</strong> indices e x are all the same for x over afixed point y ∈ P 1 ; we denote this comm<strong>on</strong> index by e y . The fiber over the curvewith j-invariant 0 has size <strong>on</strong>e-third of the degree, since the automorphism groupof the elliptic curve with j-invariant 0 has order 6, so the group of automorphismsmodulo ±1 has order three, hence e 0 = 3. Similarly, the fiber over the curve withj-invariant 1728 has size half the degree, since the automorphism group of theelliptic curve with j-invariant 1728 is cyclic of order 4, so e 1728 = 2.To compute the ramificati<strong>on</strong> degree e ∞ we use the orbit stabilizer theorem.The fiber of X(N) → X(1) over ∞ is exactly the set of Γ(N) equivalence classesof cusps, which is Γ(N)∞, Γ(N)g 2 ∞, . . . , Γ(N)g r ∞, where g 1 = 1, g 2 , . . . , g r arecoset representatives for Γ(N) in SL 2 (Z). By the orbit-stabilizer theorem, thenumber of cusps equals #(Γ(1)/Γ(N))/#S, where S is the stabilizer of Γ(N)∞in Γ(1)/Γ(N) ∼ = SL 2 (Z/NZ). Thus S is the subgroup {± ( 1 n0 1 ) : 0 ≤ n < N − 1},which has order 2N. Since the degree of X(N) → X(1) equals #(Γ(1)/Γ(N))/2,the number of cusps is the degree divided by N. Thus e ∞ = N.


5.3 The Genus of X(N) 43The Hurwitz formula for X(N) → X(1) with e 0 = 3, e 1728 = 2, <strong>and</strong> e ∞ = N, is( (2g(X(N)) − 2 = d 0 − 2 + 1 − 1 3 + 1 − 1 2 + 1 − 1 )),Nwhere d is the degree of X(N) → X(1). Solving for g(X(N)) we obtain(2g(X(N)) − 2 = d 1 − 5 6 − 1 ) ( ) N − 6= d ,N 6Nsog(X(N)) = 1 + d 2( ) N − 66N= d (N − 6) + 1.12NSubstituting the formula for d from Propositi<strong>on</strong> 5.3.2 yields the claimed formula.


44 5. Analytic Theory of <strong>Modular</strong> Curves


6<strong>Modular</strong> Curves6.1 Cusp <strong>Forms</strong>Recall that if N is a positive integer we define the c<strong>on</strong>gruence subgroups Γ(N) ⊂Γ 1 (N) ⊂ Γ 0 (N) byΓ 0 (N) = { ( )a bc d ∈ SL2 (Z) : c ≡ 0 (mod N)}Γ 1 (N) = { ( )a bc d ∈ SL2 (Z) : a ≡ d ≡ 1, c ≡ 0 (mod N)}Γ(N) = { ( )a bc d ∈ SL2 (Z) : ( ) (a bc d ≡ 1 0)0 1 (mod N)}.Let Γ be <strong>on</strong>e of the above subgroups. One can give a c<strong>on</strong>structi<strong>on</strong> of the spaceS k (Γ) of cusp forms of weight k for the acti<strong>on</strong> of Γ using the language of algebraicgeometry. Let X Γ = Γ\H ∗ be the compactificati<strong>on</strong> of the upper half plane (uni<strong>on</strong>the cusps) modulo the acti<strong>on</strong> of Γ. Then X Γ can be given the structure of Riemannsurface <strong>and</strong> S 2 (Γ) = H 0 (X Γ , Ω 1 ) where Ω 1 is the sheaf of differential 1-forms<strong>on</strong> X Γ . This works since an element of H 0 (X Γ , Ω 1 ) is a differential form f(z)dz,holomorphic <strong>on</strong> H <strong>and</strong> the cusps, which is invariant with respect to the acti<strong>on</strong> ofΓ. If γ = ( a bc d)∈ Γ thend(γ(z))/dz = (cz + d) −2sof(γ(z))d(γ(z)) = f(z)dzif <strong>and</strong> <strong>on</strong>ly if f satisfies the modular c<strong>on</strong>diti<strong>on</strong>f(γ(z)) = (cz + d) 2 f(z).There is a similar c<strong>on</strong>structi<strong>on</strong> of S k for k > 2.


46 6. <strong>Modular</strong> Curves6.2 <strong>Modular</strong> curvesOne knows that SL 2 (Z)\h parameterizes isomorphism classes of elliptic curves.The other c<strong>on</strong>gruence subgroups also give rise to similar parameterizati<strong>on</strong>s. ThusΓ 0 (N)\h parameterizes pairs (E, C) where E is an elliptic curve <strong>and</strong> C is a cyclicsubgroup of order N, <strong>and</strong> Γ 1 (N)\H parameterizes pairs (E, P ) where E is anelliptic curve <strong>and</strong> P is a point of exact order N. Note that <strong>on</strong>e can also give a pointof exact order N by giving an injecti<strong>on</strong> Z/NZ ↩→ E[N] or equivalently an injecti<strong>on</strong>µ N ↩→ E[N] where µ N denotes the Nth roots of unity. Γ(N)\h parameterizes pairs(E, {α, β}) where {α, β} is a basis for E[N] ∼ = (Z/NZ) 2 .The above quotients spaces are called moduli spaces for the moduli problem ofdetermining equivalence classes of pairs (E+ extra structure).6.3 Classifying Γ(N)-structuresLet S be an arbitrary scheme. An elliptic curve E/S is a proper smooth curvefESwith geometrically c<strong>on</strong>nected fibers all of genus <strong>on</strong>e, give with a secti<strong>on</strong> “0”.Loosely speaking, proper is a generalizati<strong>on</strong> of projective <strong>and</strong> smooth generalizesn<strong>on</strong>singularity. See Hartshorne [Har77, III.10] for the precise definiti<strong>on</strong>s.Let S be any scheme <strong>and</strong> E/S an elliptic curve. A Γ(N)-structure <strong>on</strong> E/S isa group homomorphismϕ : (Z/NZ) 2 → E[N](S)whose image “generates” E[N](S).A good reference is [KM85, III].Define a functor from the category of Q-schemes to the category of sets bysending a scheme S to the set of isomorphism classes of pairs(E, Γ(N)-structure)where E is an elliptic curve defined over S <strong>and</strong> isomorphisms (preserving theΓ(N)-structure) are taken over S. An isomorphism preserves the Γ(N)-structureif it takes the two distinguished generators to the two distinguished generators inthe image (in the correct order).Theorem 6.3.1. For N ≥ 4 the functor defined above is representable <strong>and</strong> theobject representing it is the modular curve X corresp<strong>on</strong>ding to Γ(N).What this means is that given a Q-scheme S, the setX(S) = Mor Q-schemes (S, X)is isomorphic to the image of the functor’s value <strong>on</strong> S.


6.4 More <strong>on</strong> integral <strong>Hecke</strong> operators 47There is a natural way to map a pair (E, Γ(N)-structure) to an Nth root of unity.If P, Q are the distinguished basis of E[N] we send the pair (E, Γ(N)-structure)toe N (P, Q) ∈ µ Nwhere e N : E[N]×E[N] → µ N is the Weil pairing. For the definiti<strong>on</strong> of this pairingsee [Sil92, III.8]. The Weil pairing is bilinear, alternating, n<strong>on</strong>-degenerate, Galoisinvariant, <strong>and</strong> maps surjectively <strong>on</strong>to µ N .6.4 More <strong>on</strong> integral <strong>Hecke</strong> operatorsWe are c<strong>on</strong>sidering the algebra of integral <strong>Hecke</strong> operators T = T Z <strong>on</strong> the spaceof cusp forms S k (C) with respect to the acti<strong>on</strong> of the full modular group SL 2 (Z).Our goal is to see why T ∼ = Z d where d = dim C S k (C).Suppose A ⊂ C is any subring of C <strong>and</strong> recall thatWe have a natural mapT A = A[. . . , T n , . . .] ⊂ End C S k .T A ⊗ A C → T Cbut we do not yet know that it is an isomorphism.6.5 Complex c<strong>on</strong>jugati<strong>on</strong>We have a c<strong>on</strong>jugate linear map <strong>on</strong> functi<strong>on</strong>sSince (e −2πiτ ) = e 2πiτ , it follows thatn=1f(τ) ↦→ f(−τ).∞∑∞∑a n q n ↦→ a n q nso it is reas<strong>on</strong>able to call this map “complex c<strong>on</strong>jugati<strong>on</strong>”. Furthermore, if we knowthatS k (C) = C ⊗ Q S k (Q)then it follows that complex c<strong>on</strong>jugati<strong>on</strong> takes S k (C) into S k (C). To see this notethat if we have the above equality then every element of S k (C) is a C-linearcombinati<strong>on</strong> of elements of S k (Q) <strong>and</strong> c<strong>on</strong>versely, <strong>and</strong> it is clear that the set ofsuch C-linear combinati<strong>on</strong>s is invariant under the acti<strong>on</strong> of complex c<strong>on</strong>jugati<strong>on</strong>.n=16.6 Isomorphism in the real casePropositi<strong>on</strong> 6.6.1. T R ⊗ R C ∼ = T C , as C-vector spaces.


48 6. <strong>Modular</strong> CurvesProof. Since S k (R) = S k (C)∩R[[q]] <strong>and</strong> since theorem 5.1 assures us that there is aC-basis of S k (C) c<strong>on</strong>sisting of forms with integral coefficients, we see that S k (R) ∼ =R d where d = dim C S k (C). (Any element of S k (R) is a C-linear combinati<strong>on</strong> of theintegral basis, hence equating real <strong>and</strong> imaginary parts, an R-linear combinati<strong>on</strong> ofthe integral basis, <strong>and</strong> the integral basis stays independent over R.) By c<strong>on</strong>sideringthe explicit formula for the acti<strong>on</strong> of the <strong>Hecke</strong> operators T n <strong>on</strong> S k (see secti<strong>on</strong> 3)we see that T R leaves S k (R) invariant, thusT R = R[. . . , T d , . . .] ⊂ End R S k (R).In secti<strong>on</strong> 4 we defined a “perfect” pairingT C × S k (C) → Cwhich allowed us to show that T C∼ = Sk (C). By restricting to R we again get aperfect pairing so we see that T R∼ = Sk (R) ∼ = R d which implies thatT R ⊗ R C ∼ −→ T C .This also shows that S k (C) ∼ = C⊗ R S k (R) so we have complex c<strong>on</strong>jugati<strong>on</strong> overR.6.7 The Eichler-Shimura isomorphismOur goal in this secti<strong>on</strong> is to outline a homological interpretati<strong>on</strong> of S k . For detailssee [Lan95, VI], the original paper [Shi59], or [Shi94, VIII].How is S k (C) sort of isomorphic to H 1 (X Γ , R)? Suppose k = 2 <strong>and</strong> Γ ⊂ SL 2 (Z)is a c<strong>on</strong>gruence subgroup, let X Γ = Γ\H be the Riemann surface obtained by compactifyingthe upper half plane modulo the acti<strong>on</strong> of Γ. Then S k (C) = H 0 (X Γ , Ω 1 )so we have a pairingH 1 (X Γ , Z) × S k (C) → Cgiven by integrati<strong>on</strong>This gives an embedding∫(γ, ω) ↦→Z 2d ∼ = H1 (X Γ , Z) ↩→ Hom C (S k (C), C) ∼ = C dof a “lattice” in C d . (We say “lattice” since there were some comments by Ribetthat Z 2d isn’t a lattice because the rank might be too small since a subring ofC d having Z-rank 2d might not spans C d over C). Passing to the quotient (<strong>and</strong>compactifying) gives a complex torus called the Jacobian of X Γ . Again using theabove pairing we get an embeddingγω.C d ∼ = Sk (C) ↩→ Hom(H 1 (X Γ , Z), C) ∼ = C 2dwhich, up<strong>on</strong> taking the real part, givesS k (C) → Hom(H 1 (X Γ , Z), R) ∼ = H 1 (X Γ , R) ∼ = H 1 p(Γ, R)


6.8 The Petters<strong>on</strong> inner product is <strong>Hecke</strong> compatible 49where H 1 p(Γ, R) denotes the parabolic group cohomology of Γ with respect to thetrivial acti<strong>on</strong>. It is this result, that we may view S k (C) as the cohomology groupH 1 p(Γ, R), that was alluded to above.Shimura generalized this for arbitrary k ≥ 2 so thatS k (C) ∼ = H 1 p(Γ, V k )where V k is a k − 1 dimensi<strong>on</strong>al R-vector space. The isomorphism is (approximately)the following: f ∈ S k (C) is sent to the map∫ γτ0γ ↦→ Re f(τ)τ i dτ, i = 0, . . . , k − 2.τ 0Let W = R ⊕ R, then Γ acts <strong>on</strong> W by( ) (a b x: ↦→c d y)so Γ acts <strong>on</strong>( )ax + bycx + dyV k = Sym k−2 W = W ⊗k−2 /S k−2where S k−2 is the symmetric group <strong>on</strong> k − 2 symbols (note that dim V k = k − 1).LetL = H 1 p(Γ, Sym k−2 (Z ⊕ Z))then under the isomorphismS k (C) ∼ = H 1 p(Γ, R)L is a sublattice of S k (C) of Z-rank 2 which is T n -stable for all n. Thus we havean embeddingT Z = T ↩→ End L<strong>and</strong> so T R ⊂ End R (L ⊗ R) <strong>and</strong> T Z ⊗ Z R ∼ = T R which has rank d.6.8 The Petters<strong>on</strong> inner product is <strong>Hecke</strong> compatibleTheorem 6.8.1. Let Γ = SL 2 (Z), let f, g ∈ S k (C), <strong>and</strong> let∫〈f, g〉 = f(τ)g(τ)y k dxdyy 2 .Γ\HThen this integral is well-defined <strong>and</strong> <strong>Hecke</strong> compatible, that is, 〈f|T n , g〉 = 〈f, g|T n 〉for all n.Proof. See [Lan95, III].


50 6. <strong>Modular</strong> Curves


7<strong>Modular</strong> SymbolsThis chapter is about how to explicitly compute the homology of modular curvesusing modular symbols.We assume the reader is familiar with basic noti<strong>on</strong>s of algebraic topology, includinghomology groups of surfaces <strong>and</strong> triangulati<strong>on</strong>. We also assume that thereader has read XXX about the fundamental domain for the acti<strong>on</strong> of PSL 2 (Z) <strong>on</strong>the upper half plane, <strong>and</strong> XXX about the c<strong>on</strong>structi<strong>on</strong> of modular curves.Some st<strong>and</strong>ard references for modular symbols are [Man72] [Lan95, IV], [Cre97],<strong>and</strong> [Mer94]. Secti<strong>on</strong>s 7.1–7.2 below very closely follow Secti<strong>on</strong> 1 of Manin’s paper[Man72].For the rest of this chapter, let Γ = PSL 2 (Z) <strong>and</strong> let G be a subgroup of Γof finite index. Note that we do not require G to be a c<strong>on</strong>gruence subgroup. Thequotient X(G) = G\h ∗ of h ∗ = h ∪ P 1 (Q) by G has an induced structure of acompact Riemann surface. Let π : h ∗ → X(G) denote the natural projecti<strong>on</strong>. Thematricess =( ) 0 − 11 0<strong>and</strong> t =( ) 1 − 11 0together generate Γ; they have orders 2 <strong>and</strong> 3, respectively.7.1 <strong>Modular</strong> symbolsLet H 0 (X(G), Ω 1 ) denote the complex vector space of holomorphic 1-forms <strong>on</strong>X(G). Integrati<strong>on</strong> of differentials al<strong>on</strong>g homology classes defines a perfect pairinghence an isomorphismH 1 (X(G), R) × H 0 (X(G), Ω 1 ) → C,H 1 (X(G), R) ∼ = Hom C (H 0 (X(G), Ω 1 ), C).


52 7. <strong>Modular</strong> SymbolsFor more details, see [Lan95, §IV.1].Given two elements α, β ∈ h ∗ , integrati<strong>on</strong> from α to β induces a well-definedelement of Hom C (H 0 (X(G), Ω 1 ), C), hence an element{α, β} ∈ H 1 (X(G), R).Definiti<strong>on</strong> 7.1.1 (<strong>Modular</strong> symbol). The homology class {α, β} ∈ H 1 (X(G), R)associated to α, β ∈ h ∗ is called the modular symbol attached to α <strong>and</strong> β.Propositi<strong>on</strong> 7.1.2. The symbols {α, β} have the following properties:1. {α, α} = 0, {α, β} = −{β, α}, <strong>and</strong> {α, β} + {β, γ} + {γ, α} = 0.2. {gα, gβ} = {α, β} for all g ∈ G3. If X(G) has n<strong>on</strong>zero genus, then {α, β} ∈ H 1 (X(G), Z) if <strong>and</strong> <strong>on</strong>ly if Gα =Gβ (i.e., we have π(α) = π(β)).Remark 7.1.3. We <strong>on</strong>ly have {α, β} = {β, α} if {α, β} = 0, so the modular symbolsnotati<strong>on</strong>, which suggests “unordered pairs,” is actively misleading.Propositi<strong>on</strong> 7.1.4. For any α ∈ h ∗ , the map G → H 1 (X(G), Z) that sends g to{α, gα} is a surjective group homomorphism that does not depend <strong>on</strong> the choiceof α.Proof. If g, h ∈ G <strong>and</strong> α ∈ h ∗ , then{α, gh(α)} = {α, gα} + {gα, ghα} = {α, gα} + {α, hα},so the map is a group homomorphism. To see that the map does not depend <strong>on</strong>the choice of α, suppose β ∈ h ∗ . By Propositi<strong>on</strong> 7.1.2, we have {α, β} = {gα, gβ}.Thus{α, gα} + {gα, β} = {gα, β} + {β, gβ},so cancelling {gα, β} from both sides proves the claim.The fact that the map is surjective follows from general facts from algebraictopology. Let h 0 be the complement of Γi ∪ Γρ in h, fix α ∈ h 0 , <strong>and</strong> let X(G) 0 =π(h 0 ). The map h 0 → X(G) 0 is an unramified covering of (n<strong>on</strong>compact) Riemannsurfaces with automorphism group G. Thus α determines a group homomorphismπ 1 (X(G) 0 , π(α)) → G. When composed with the morphism G → H 1 (X(G), Z)above, the compositi<strong>on</strong>π 1 (X(G) 0 , π(α)) → G → H 1 (X(G), Z)is the can<strong>on</strong>ical map from the fundamental group of X(G) 0 to the homology ofthe corresp<strong>on</strong>ding compact surface, which is surjective. This forces the map G →H 1 (X(G), Z) to be surjective, which proves the claim.7.2 Manin symbolsWe c<strong>on</strong>tinue to assume that G is a finite-index subgroup of Γ = PSL 2 (Z), so theset G\Γ = {Gg 1 , . . . Gg d } of right cosets of G in Γ is finite. Manin symbols are acertain finite subset of modular symbols that are indexed by right cosets of G inΓ.


7.2.1 Using c<strong>on</strong>tinued fracti<strong>on</strong>s to obtain surjectivityLet R = G\Γ be the set of right cosets of G in Γ. Define[ ] : R → H 1 (X(G), R)7.2 Manin symbols 53by [r] = {r0, r∞}, where r0 means the image of 0 under any element of the cosetr (it doesn’t matter which). For g ∈ Γ, we also write [g] = [gG].Propositi<strong>on</strong> 7.2.1. Any element of H 1 (X(G), Z) is a sum of elements of the form[r], <strong>and</strong> the representati<strong>on</strong> ∑ n r {α r , β r } of h ∈ H 1 (X(G), Z) can be chosen so that∑nr (π(β r ) − π(α r )) = 0 ∈ Div(X(G)).Proof. By Propositi<strong>on</strong> 7.1.4, every element h of H 1 (X(G), Z) is of the form {0, g(0)}for some g ∈ G. If g(0) = ∞, then h = [G] <strong>and</strong> π(∞) = π(0), so we may assumeg(0) = a/b ≠ ∞, with a/b in lowest terms <strong>and</strong> b > 0. Also assume a > 0, since thecase a < 0 is treated in the same way. Let0 = p −2q −2= 0 1 , p −1q −1= 1 0 , p 01 = p 0q 0,p 1q 1,p 2q 2, . . . , p nq n= a bdenote the c<strong>on</strong>tinued fracti<strong>on</strong> c<strong>on</strong>vergents of the rati<strong>on</strong>al number a/b. ThenIf we let g j =p j q j−1 − p j−1 q j = (−1) j−1 for − 1 ≤ j ≤ n.( )(−1) j−1 p j p j−1(−1) j−1 , then gq j q j ∈ SL 2 (Z) <strong>and</strong>j−1{0, a }=b==n∑j=−1{pj−1q j−1, p jq j}n∑{g j 0, g j ∞})j=−1r∑[g j ].j=−1For the asserti<strong>on</strong> about the divisor sum equaling zero, notice that the endpointsof the successive modular symbols cancel out, leaving the difference of 0 <strong>and</strong> g(0)in the divisor group, which is 0.Lemma 7.2.2. If x = ∑ tj=1 n j{α j , β j } is a Z-linear combinati<strong>on</strong> of modularsymbols for G <strong>and</strong> ∑ n j (π(β j ) − π(α j )) = 0 ∈ Div(X(G)), then x ∈ H 1 (X(G), Z).Proof. We may assume that each n j is ±1 by allowing duplicati<strong>on</strong>. We may furtherassume that each n j = 1 by using that {α, β} = −{β, α}. Next reorder the sum soπ(β j ) = π(α j+1 ) by using that the divisor is 0, so every β j must be equivalent tosome α j ′, etc. The lemma should now be clear.


54 7. <strong>Modular</strong> SymbolsooE'i 1+irhot 2 E'tE'(1+i)/20FIGURE 7.2.1.17.2.2 Triangulating X(G) to obtain injectivityLet C be the abelian group generated by symbols (r) for r ∈ G\Γ, subject to therelati<strong>on</strong>s(r) + (rs) = 0, <strong>and</strong> (r) = 0 if r = rs.For (r) ∈ C, define the boundary of (r) to be the difference π(r∞) − π(r0) ∈Div(X(G)). Since s swaps 0 <strong>and</strong> ∞, the boundary map is a well-defined map <strong>on</strong>C. Let Z be its kernel.Let B be the subgroup of C generated by symbols (r), for all r ∈ G\Γ thatsatisfy r = rt, <strong>and</strong> by (r) + (rt) + (rt 2 ) for all other r. If r = rt, then rt(0) = r(0),so r(∞) = r(0), so (r) ∈ Z. Also, using (7.2.1) below, we see that for any r, theelement (r) + (rt) + (rt 2 ) lies in Z.The map G\Γ → H 1 (X(G), R) that sends (r) to [r] induces a homomorphismC → H 1 (X(G), R), so by Propositi<strong>on</strong> 7.2.1 we obtain a surjective homomorphismψ : Z/B → H 1 (X(G), Z).Theorem 7.2.3 (Manin). The map ψ : Z/B → H 1 (X(G), Z) is an isomorphism.Proof. We <strong>on</strong>ly have to prove that ψ is injective. Our proof follows the proofof [Man72, Thm. 1.9] very closely. We compute the homology H 1 (X(G), Z) bytriangulating X(G) to obtain a simplicial complex L with homology Z 1 /B 1 , thenembed Z/B in the homology Z 1 /B 1 of X(G). Most of our work is spent describingthe triangulati<strong>on</strong> L.Let E denote the interior of the triangle with vertices 0, 1, <strong>and</strong> ∞, as illustratedin Figure 7.2.1. Let E ′ denote the uni<strong>on</strong> of the interior of the regi<strong>on</strong> bounded bythe path from i to ρ = e πi/3 to 1 + i to ∞ with the indicated path from i to ρ, notincluding the vertex i.When reading the proof below, it will be helpful to look at the following table,which illustrates what s = ( 0 −11 0), t =( 1 −11 0), <strong>and</strong> t 2 do to the vertices in


7.2 Manin symbols 55Figure 7.2.1:1 0 1 ∞ i 1 + i (1 + i)/2 ρs ∞ −1 0 i (−1 + i)/2 −1 + i −ρt ∞ 0 1 1 + i (1 + i)/2 i ρt 2 1 ∞ 0 (1 + i)/2 i 1 + i ρ(7.2.1)Note that each of E ′ , tE ′ , <strong>and</strong> t 2 E ′ is a fundamental domain for Γ, in the sensethat every element of the upper half plane is c<strong>on</strong>jugate to exactly <strong>on</strong>e element inthe closure of E ′ (except for identificati<strong>on</strong>s al<strong>on</strong>g the boundaries). For example,E ′ is obtained from the st<strong>and</strong>ard fundamental domain for Γ, which has verticesρ 2 , ρ, <strong>and</strong> ∞, by chopping it in half al<strong>on</strong>g the imaginary axis, <strong>and</strong> translating thepiece <strong>on</strong> the left side horiz<strong>on</strong>tally by 1.If (0, ∞) is the path from 0 to ∞, then t(0, ∞) = (∞, 1) <strong>and</strong> t 2 (0, ∞) = (1, 0).Also, s(0, ∞) = (∞, 0). Thus each half side of E is Γ-c<strong>on</strong>jugate to the side fromi to ∞. Also, each 1-simplex in Figure 7.2.1, i.e., the sides that c<strong>on</strong>nected twoadjacent labeled vertices such as i <strong>and</strong> ρ, maps homeomorphically into X(Γ). Thisis clear for the half sides, since they are c<strong>on</strong>jugate to a path in the interior of thest<strong>and</strong>ard fundamental domain for Γ, <strong>and</strong> for the medians (lines from midpoints toρ) since the path from i to ρ is <strong>on</strong> an edge of the st<strong>and</strong>ard fundamental domainwith no self identificati<strong>on</strong>s.We now describe our triangulati<strong>on</strong> L of X(G):0-cells The 0 cells are the cusps π(P 1 (Q)) <strong>and</strong> i-elliptic points π(Γi). Note thatthese are the images under π of the vertices <strong>and</strong> midpoints of sides of thetriangles gE, for all g ∈ Γ.1-cells The 1 cells are the images of the half-sides of the triangles gE, for g ∈ Γ,oriented from the edge to the midpoint (i.e., from the cusp to the i-ellipticpoint). For example, if r = Gg is a right coset, thene 1 (r) = π(g(∞), g(i)) ∈ X(G)is a 1 cell in L. Since, as we observed above, every half side is Γ-c<strong>on</strong>jugateto e 1 (G), it follows that every 1-cell is of the form e(r) for some right cosetr ∈ G\Γ.Next observe that if r ≠ r ′ thene 1 (r) = e 1 (r ′ ) implies r ′ = rs. (7.2.2)Indeed, if π(g(∞), g(i)) = π(g ′ (∞), g ′ (i)), then ri = r ′ i (note that the endpointsof a path are part of the definiti<strong>on</strong> of the path). Thus there existsh, h ′ ∈ G such that hg(i) = h ′ g ′ (i). Since the <strong>on</strong>ly n<strong>on</strong>trivial element of Γthat stabilizes i is s, this implies that (hg) −1 h ′ g ′ = s. Thus h ′ g ′ = hgs, soGg ′ = Ggs, so r ′ = rs.2-cells There are two types of 2-cells, those with 2 sides <strong>and</strong> those with 3.2-sided: The 2-sided 2-cells e 2 (r) are indexed by the cosets r = Gg suchthat rt = r. Note that for such an r, we have π(rE ′ ) = π(rtE ′ ) = π(rt 2 E ′ ).The 2-cell e 2 (r) is π(gE ′ ). The image g(ρ, i) of the half median maps to a


56 7. <strong>Modular</strong> Symbolsline from the center of e 2 (r) to the edge π(g(i)) = π(g(1 + i)). Orient e 2 (r)in a way compatible with the e 1 . Since Ggt = Gg,π(g(1 + i), g(∞)) = π(gt 2 (1 + i), gt 2 (∞)) = π(g(i), g(0)) = π(gs(i), gs(∞)),soe 1 (r)−e 1 (rs) = π(g(∞), g(i))+π(gs(i), gs(∞)) = π(g(∞), g(i))+π(g(1+i), g(∞)).Thus∂e 2 (r) = e 1 (r) − e 1 (rs).Finally, note that if r ′ ≠ r also satisfies r ′ t = r ′ , then e 2 (r) ≠ e 2 (r ′ ) (to seethis use that E ′ is a fundamental domain for Γ).3-sided: The 3-sided 2-cells e 2 (r) are indexed by the cosets r = Gg suchthat rt ≠ r. Note that for such an r, the three triangles rE ′ , rtE ′ , <strong>and</strong> rt 2 E ′are distinct (since they are n<strong>on</strong>trivial translates of a fundamental domain).Orient e 2 (r) in a way compatible with the e 1 (so edges go from cusps tomidpoints). Then∂e 2 (r) =2∑(e 1 (rt n ) − e 1 (rt n s)) .n=0We have now defined a complex L that is a triangulati<strong>on</strong> of X(G). Let C 1 , Z 1 ,<strong>and</strong> B 1 be the group of 1-chains, 1-cycles, <strong>and</strong> 1-boundaries of the complex L.Thus C 1 is the abelian group generated by the paths e 1 (r), the subgroup Z 1 is thekernel of the map that sends e 1 (r) = π(r(∞), r(0)) to π(r(0)) − π((∞)), <strong>and</strong> B 1is the subgroup of Z 1 generated by boundaries of 2-cycles.Let C, Z, B be as defined before the statement of the Theorem 7.2.3. We haveH 1 (X(G), Z) ∼ = Z 1 /B 1 , <strong>and</strong> would like to prove that Z/B ∼ = Z 1 /B 1 .Define a map ϕ : C → C 1 by (r) ↦→ e 1 (rs) − e 1 (r). The map ϕ is well definedbecause if r = rs, then clearly (r) ↦→ 0, <strong>and</strong> (r) + (rs) maps to e 1 (rs) − e 1 (r) +e 1 (r) − e 1 (rs) = 0. To see that f is injective, suppose ∑ n r (r) ≠ 0. Since in Cwe have the relati<strong>on</strong>s (r) = −(rs) <strong>and</strong> (r) = 0 if rs = r, we may assume thatn r n rs = 0 for all r. We have(∑ )ϕ nr (r) = ∑ n r (e 1 (rs) − e 1 (r)).If n r ≠ 0 then r ≠ rs, so (7.2.2) implies that e 1 (r) ≠ e 1 (rs). If n r ≠ 0 <strong>and</strong> n r ′ ≠ 0with r ′ ≠ r, then r ≠ rs <strong>and</strong> r ′ ≠ r ′ s, so e 1 (r), e 1 (rs), e 1 (r ′ ), e 1 (r ′ s) are all distinct.We c<strong>on</strong>clude that ∑ n r (e 1 (rs) − e 1 (r)) ≠ 0, which proves that ϕ is injective.Suppose (r) ∈ C. Thensinceϕ(r) + B 1 = ψ(r) = {r(0), r(∞)} ∈ H 1 (X(G), Z) = C 1 /B 1 ,ϕ(r) = e 1 (rs)−e 1 (r) = π(rs(∞), rs(i))−π(r(∞), r(i)) = π(r(0), r(i))−π(r(∞), r(i))bel<strong>on</strong>gs to the homology class {r(0), r(∞)}. Extending linearly, we have, for anyz ∈ C, that ϕ(z) + B 1 = ψ(z).


7.3 <strong>Hecke</strong> operators 57The generators for B 1 are the boundaries of 2-cells e 2 (r). As we saw above, thesehave the form ϕ(r) for all r such that r = rt, <strong>and</strong> ϕ(r) + ϕ(rt) + ϕ(rt 2 ) for ther such that rt ≠ r. Thus B 1 = ϕ(B) ⊂ ϕ(Z), so the map ϕ induces an injecti<strong>on</strong>Z/B ↩→ Z 1 /B 1 . This completes the proof of the theorem.7.3 <strong>Hecke</strong> operatorsIn this secti<strong>on</strong> we will <strong>on</strong>ly c<strong>on</strong>sider the modular curve X 0 (N) associated to thesubgroup Γ 0 (N) of SL 2 (Z) of matrices that are upper triangular modulo N. Muchof what we say will also be true, possibly with slight modificati<strong>on</strong>, for X 1 (N), butnot for arbitrary finite-index subgroups.There is a commutative ringT = Z[T 1 , T 2 , T 3 , . . .]of <strong>Hecke</strong> operators that acts <strong>on</strong> H 1 (X 0 (N), R). We will frequently revisit this ring,which also acts <strong>on</strong> the Jacobian J 0 (N) of X 0 (N), <strong>and</strong> <strong>on</strong> modular forms. The ringT is generated by T p , for p prime, <strong>and</strong> as a free Z-module T is isomorphic to Z g ,where g is the genus of X 0 (N). We will not prove these facts here (see 1 ). 1Suppose{α, β} ∈ H 1 (X 0 (N), R),is a modular symbol, with α, β ∈ P 1 (Q). For g ∈ M 2 (Z), write g({α, β}) ={g(α), g(β)}. This is not a well-defined acti<strong>on</strong> of M 2 (Z) <strong>on</strong> H 1 (X 0 (N), R), since{α ′ , β ′ } = {α, β} ∈ H 1 (X 0 (N), R) does not imply that {g(α ′ ), g(β ′ )} = {g(α), g(β)}.Example 7.3.1. Using Magma we see that the homology H 1 (X 0 (11), R) is generatedby {−1/7, 0} <strong>and</strong> {−1/5, 0}.> M := <strong>Modular</strong>Symbols(11); // Homology relative to cusps,// with Q coefficients.> S := CuspidalSubspace(M); // Homology, with Q coefficients.> Basis(S);[ {-1/7, 0}, {-1/5, 0} ]Also, we have 5{0, ∞} = {−1/5, 0}.> pi := Projecti<strong>on</strong>Map(S); // The natural map M --> S.> M.3;{oo, 0}> pi(M.3);-1/5*{-1/5, 0}Let g = ( 2 0 01 ). Then 5{g(0), g(∞)} is not equal to {g(−1/5), g(0)}, so g does notdefine a well-defined map <strong>on</strong> H 1 (X 0 (11), R).> x := 5*pi(M!);> y := pi(M!);> x;1 Add some references <strong>and</strong> pointers to other parts of this book.


58 7. <strong>Modular</strong> Symbols{-1/5, 0}> y;-1*{-1/7, 0} + -1*{-1/5, 0}> x eq y;falseDefiniti<strong>on</strong> 7.3.2 (<strong>Hecke</strong> operators). We define the <strong>Hecke</strong> operator T p <strong>on</strong> H 1 (X 0 (N), R)as follows. When p is a prime with p ∤ N, we haveT p ({α, β}) =( ) p 0({α, β}) +0 1∑p−1r=0( ) 1 r({α, β}).0 pWhen p | N, the formula is the same, except that the first summ<strong>and</strong>, which involves( p 00 1), is omitted.Example 7.3.3. We c<strong>on</strong>tinue with Example 7.3.1. If we apply the <strong>Hecke</strong> operatorT 2 to both 5{0, ∞} <strong>and</strong> {−1/5, 0}, the “n<strong>on</strong>-well-definedness” cancels out.> x := 5*pi(M! +M! + M!);> x;-2*{-1/5, 0}> y := pi(M!+ M! + M!);> y;-2*{-1/5, 0}Examples 7.3.1 shows that it is not clear that the definiti<strong>on</strong> of T p given abovemakes sense. For example, if {α, β} is replaced by an equivalent modular symbol{α ′ , β ′ }, why does the formula for T p give the same answer? We will not addressthis questi<strong>on</strong> further here, but will revisit it later 2 when we have a more natural 2<strong>and</strong> intrinsic definiti<strong>on</strong> of <strong>Hecke</strong> operators. We <strong>on</strong>ly remark that T p is induced bya “corresp<strong>on</strong>dence” from X 0 (N) to X 0 (N), so T p preserve H 1 (X 0 (N), Z).7.4 <strong>Modular</strong> symbols <strong>and</strong> rati<strong>on</strong>al homologyIn this secti<strong>on</strong> we sketch a beautiful proof, due to Manin, of a result that is crucialto our underst<strong>and</strong>ing of rati<strong>on</strong>ality properties of special values of L-functi<strong>on</strong>s. Forexample, Mazur <strong>and</strong> Swinnert<strong>on</strong>-Dyer write in [MSD74, §6], “The modular symbolis essential for our theory of p-adic Mellin transforms,” right before discussing thisrati<strong>on</strong>ality result. Also, as we will see in the next secti<strong>on</strong>, this result implies that ifE is an elliptic curve over Q, then L(E, 1)/Ω E ∈ Q, which c<strong>on</strong>firms a c<strong>on</strong>sequenceof the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture.Theorem 7.4.1 (Manin). For any α, β ∈ P 1 (Q), we have{α, β} ∈ H 1 (X 0 (N), Q).Proof (sketch). Since {α, β} = {α, ∞} − {β, ∞}, it suffices to show that {α, ∞} ∈H 1 (X 0 (N), Q) for all α ∈ Q. We c<strong>on</strong>tent ourselves with proving that {0, ∞} ∈H 1 (X 0 (N), Z), since the proof for general {0, α} is almost the same.2 Say where, when I write this later.


7.5 Special values of L-functi<strong>on</strong>s 59We will use that the eigenvalues of T p <strong>on</strong> H 1 (X 0 (N), R) have absolute valuebounded by 2 √ p, a fact that was proved by Weil (the Riemann hypothesis forcurves over finite fields). Let p ∤ N be a prime. Thenso∑p−1{ } rT p ({0, ∞}) = {0, ∞} +p , ∞ ∑p−1{ } r= (1 + p){0, ∞} +p , 0 ,r=0∑p−1{(1 + p − T p )({0, ∞}) = 0, r }.pSince p ∤ N, the cusps 0 <strong>and</strong> r/p are equivalent (use the Euclidean algorithmto find a matrix in SL 2 (Z) of the form ( p r ∗∗ )), so the modular symbols {0, r/p},for r = 0, 1, . . . , p − 1 all lie in H 1 (X 0 (N), Z). Since the eigenvalues of T p haveabsolute value at most 2 √ p, the linear transformati<strong>on</strong> 1 + p − T p of H 1 (X 0 (N), Z)is invertible. It follows that some integer multiple of {0, ∞} lies in H 1 (X 0 (N), Z),as claimed.There are general theorems about the denominator of {α, β} in some cases.Example 7.3.1 above dem<strong>on</strong>strated the following theorem in the case N = 11.Theorem 7.4.2 (Ogg [Ogg71]). Let N be a prime. Then the imager=0[{0, ∞}] ∈ H 1 (X 0 (N), Q)/ H 1 (X 0 (N), Z)has order equal to the numerator of (N − 1)/12.r=07.5 Special values of L-functi<strong>on</strong>sThis secti<strong>on</strong> is a preview of <strong>on</strong>e of the central arithmetic results we will discuss inmore generality later in this book. 3 3The celebrated modularity theorem of Wiles et al. asserts that there is a corresp<strong>on</strong>dencebetween isogeny classes of elliptic curves E of c<strong>on</strong>ductor N <strong>and</strong> normalizednew modular eigenforms f = ∑ a n q n ∈ S 2 (Γ 0 (N)) with a n ∈ Z. Thiscorresp<strong>on</strong>dence is characterized by the fact that for all primes p ∤ N, we havea p = p + 1 − #E(F p ).Recall that a modular form for Γ 0 (N) of weight 2 is a holomorphic functi<strong>on</strong>f : h → C that is “holomorphic at the cusps” <strong>and</strong> such that for all ( )a bc d ∈ Γ0 (N),( ) az + bf = (cz + d) 2 f(z).cz + dSuppose E is an elliptic curve that corresp<strong>on</strong>ds to a modular form f. If L(E, s)is the L-functi<strong>on</strong> attached to E, thenL(E, s) = L(f, s) = ∑ a nn s ,so, by a theorem of <strong>Hecke</strong> which we will prove [later] 4 , L(f, s) is holomorphic <strong>on</strong> 43 Where?4 where?


60 7. <strong>Modular</strong> Symbolsall C. Note that L(f, s) is the Mellin transform of the modular form f:∫ i∞L(f, s) = (2π) s Γ(s) −1 (−iz) s f(z) dzz . (7.5.1)The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture c<strong>on</strong>cerns the leading coefficient ofthe series expansi<strong>on</strong> of L(E, s) about s = 1. A special case is that if L(E, 1) ≠ 0,then∏L(E, 1) cp · #X(E)=Ω E #E(Q) 2 .torHere Ω E = | ∫ ω|, where ω is a “Nér<strong>on</strong>” differential 1-form <strong>on</strong> E, i.e., a generatorfor H 0 (E, Ω 1 E/ZE(R)), where E is the Nér<strong>on</strong> model of E. (The Nér<strong>on</strong> model ofE is the unique, up to unique isomorphism, smooth group scheme E over Z, withgeneric fiber E, such that for all smooth schemes S over Z, the natural mapHom Z (S, E) → Hom Q (S × Spec(Q), E) is an isomorphism.) In particular, the c<strong>on</strong>jectureasserts that for any elliptic curve E we have L(E, 1)/Ω E ∈ Q.Theorem 7.5.1. Let E be an elliptic curve over Q. Then L(E, 1)/Ω E ∈ Q.Proof (sketch). By the modularity theorem of Wiles et al., E is modular, so thereis a surjective morphism π E : X 0 (N) → E, where N is the c<strong>on</strong>ductor of E. Thisimplies that there is a newform f that corresp<strong>on</strong>ds to (the isogeny class of) E, withL(f, s) = L(E, s). Also assume, without loss of generality, that E is “optimal” inits isogeny class, which means that if X 0 (N) → E ′ → E is a sequence of morphismwhose compositi<strong>on</strong> is π E <strong>and</strong> E ′ is an elliptic curve, then E ′ = E.By Equati<strong>on</strong> 7.5.1, we haveL(E, 1) = 2π∫ i∞00−izf(z)dz/z. (7.5.2)If q = e 2πiz , then dq = 2πiqdz, so 2πif(z)dz = dq/q, <strong>and</strong> (7.5.2) becomesL(E, 1) = −∫ i∞0f(q)dq.Recall that Ω E = | ∫ ω|, where ω is a Nér<strong>on</strong> differential <strong>on</strong> E. The expressi<strong>on</strong>E(R)f(q)dq defines a differential <strong>on</strong> the modular curve X 0 (N), <strong>and</strong> there is a rati<strong>on</strong>alnumber c, the Manin c<strong>on</strong>stant, such that πE ∗ ω = cf(q)dq. More is true: Edixhovenproved (as did Ofer Gabber) that c ∈ Z; also Manin c<strong>on</strong>jectured that c = 1 <strong>and</strong>Edixhoven proved (unpublished) that if p | c, then p = 2, 3, 5, 7.A st<strong>and</strong>ard fact is that if{∫}L = ω : γ ∈ H 1 (E, Z)γis the period lattice of E associated to ω, then E(C) ∼ = C/L. Note that Ω E iseither the least positive real element of L or twice this least positive element (ifE(R) has two real comp<strong>on</strong>ents).The next crucial observati<strong>on</strong> is that by Theorem 7.4.1, there is an integer n suchthat n{0, ∞} ∈ H 1 (X 0 (N), Z). This is relevant because if{∫}L ′ = f(q)dq : γ ∈ H 1 (X 0 (N), Z) ⊂ C.γ


7.5 Special values of L-functi<strong>on</strong>s 61then L = 1 c L′ ⊂ L ′ . This asserti<strong>on</strong> follows from our hypothesis that E is optimal<strong>and</strong> st<strong>and</strong>ard facts about complex tori <strong>and</strong> Jacobians, which we will prove later[in this course/book].One can show that L(E, 1) ∈ R, for example, by writing down an explicit realc<strong>on</strong>vergent series that c<strong>on</strong>verges to L(E, 1). This series is used in algorithms tocompute L(E, 1), <strong>and</strong> the derivati<strong>on</strong> of the series uses properties of modular formsthat we have not yet developed. Another approach is to use complex c<strong>on</strong>jugati<strong>on</strong>to define an involuti<strong>on</strong> ∗ <strong>on</strong> H 1 (X 0 (N), R), then observe that {0, ∞} is fixed by∗. (The involuti<strong>on</strong> ∗ is given <strong>on</strong> modular symbols by ∗{α, β} = {−α, −β}.)Since L(E, 1) ∈ R, the integral∫n{0,∞}f(q)dq = n∫ i∞0f(q)dq = −nL(E, 1) ∈ L ′lies in the subgroup (L ′ ) + of elements fixed by complex c<strong>on</strong>jugati<strong>on</strong>. If c is theManin c<strong>on</strong>stant, we have cnL(E, 1) ∈ L + . Since Ω E is the least n<strong>on</strong>zero element ofL + (or twice it), it follows that 2cnL(E, 1)/Ω E ∈ Z, which proves the propositi<strong>on</strong>.


62 7. <strong>Modular</strong> Symbols


8<strong>Modular</strong> <strong>Forms</strong> of Higher Level8.1 <strong>Modular</strong> <strong>Forms</strong> <strong>on</strong> Γ 1 (N)Fix integers k ≥ 0 <strong>and</strong> N ≥ 1. Recall that Γ 1 (N) is the subgroup of elements ofSL 2 (Z) that are of the form ( 1 0 ∗1 ) when reduced modulo N.Definiti<strong>on</strong> 8.1.1 (<strong>Modular</strong> <strong>Forms</strong>). The space of modular forms of level N <strong>and</strong>weight k isM k (Γ 1 (N)) = { f : f(γτ) = (cτ + d) k f(τ) all γ ∈ Γ 1 (N) } ,where the f are assumed holomorphic <strong>on</strong> h ∪ {cusps} (see below for the precisemeaning of this). The space of cusp forms of level N <strong>and</strong> weight k is the subspaceS k (Γ 1 (N)) of M k (Γ 1 (N)) of modular forms that vanish at all cusps.Suppose f ∈ M k (Γ 1 (N)). The group Γ 1 (N) c<strong>on</strong>tains the matrix ( 1 0 11 ), sof(z + 1) = f(z),<strong>and</strong> for f to be holomorphic at infinity means that f has a Fourier expansi<strong>on</strong>f =∞∑a n q n .n=0To explain what it means for f to be holomorphic at all cusps, we introducesome additi<strong>on</strong>al notati<strong>on</strong>. For α ∈ GL + 2 (R) <strong>and</strong> f : h → C define another functi<strong>on</strong>f |[α]k as follows:f |[α]k (z) = det(α) k−1 (cz + d) −k f(αz).It is straightforward to check that f |[αα′ ] k= (f |[α]k ) |[α′ ] k. Note that we do not haveto make sense of f |[α]k (∞), since we <strong>on</strong>ly assume that f is a functi<strong>on</strong> <strong>on</strong> h <strong>and</strong>not h ∗ .


64 8. <strong>Modular</strong> <strong>Forms</strong> of Higher LevelUsing our new notati<strong>on</strong>, the transformati<strong>on</strong> c<strong>on</strong>diti<strong>on</strong> required for f : h → Cto be a modular form for Γ 1 (N) of weight k is simply that f be fixed by the [ ] k -acti<strong>on</strong> of Γ 1 (N). Suppose x ∈ P 1 (Q) is a cusp, <strong>and</strong> choose α ∈ SL 2 (Z) such thatα(∞) = x. Then g = f |[α]k is fixed by the [ ] k acti<strong>on</strong> of α −1 Γ 1 (N)α.Lemma 8.1.2. Let α ∈ SL 2 (Z). Then there exists a positive integer h such that( 1 h0 1 ) ∈ α−1 Γ 1 (N)α.Proof. This follows from the general fact that the set of c<strong>on</strong>gruence subgroups ofSL 2 (Z) is closed under c<strong>on</strong>jugati<strong>on</strong> by elements α ∈ SL 2 (Z), <strong>and</strong> every c<strong>on</strong>gruencesubgroup c<strong>on</strong>tains an element of the form ( 1 0 h1). If G is a c<strong>on</strong>gruence subgroup,then Γ(N) ⊂ G for some N, <strong>and</strong> α −1 Γ(N)α = Γ(N), since Γ(N) is normal, soΓ(N) ⊂ α −1 Gα.Letting h be as in the lemma, we have g(z + h) = g(z). Then the c<strong>on</strong>diti<strong>on</strong>that f be holomorphic at the cusp x is thatg(z) = ∑ n≥0b n/h q 1/h<strong>on</strong> the upper half plane. We say that f vanishes at x if b n/h = 0, so a cusp formis a form that vanishes at every cusp.8.2 The Diam<strong>on</strong>d bracket <strong>and</strong> <strong>Hecke</strong> operatorsIn this secti<strong>on</strong> we c<strong>on</strong>sider the spaces of modular forms S k (Γ 1 (N), ε), for Dirichletcharacters ε mod N, <strong>and</strong> explicitly describe the acti<strong>on</strong> of the <strong>Hecke</strong> operators <strong>on</strong>these spaces.8.2.1 Diam<strong>on</strong>d bracket operatorsThe group Γ 1 (N) is a normal subgroup of Γ 0 (N), <strong>and</strong> the quotient Γ 0 (N)/Γ 1 (N)is isomorphic to (Z/NZ) ∗ . From this structure we obtain an acti<strong>on</strong> of (Z/NZ) ∗ <strong>on</strong>S k (Γ 1 (N)), <strong>and</strong> use it to decompose S k (Γ 1 (N)) as a direct sum of more manageablechunks S k (Γ 1 (N), ε).Definiti<strong>on</strong> 8.2.1 (Dirichlet character). A Dirichlet character ε modulo N is ahomomorphismε : (Z/NZ) ∗ → C ∗ .We extend ε to a map ε : Z → C by setting ε(m) = 0 if (m, N) ≠ 1 <strong>and</strong>ε(m) = ε(m mod N) otherwise. If ε : Z → C is a Dirichlet character, the c<strong>on</strong>ductorof ε is the smallest positive integer N such that ε arises from a homomorphism(Z/NZ) ∗ → C ∗ .Remarks 8.2.2.1. If ε is a Dirichlet character modulo N <strong>and</strong> M is a multiple of N then ε inducesa Dirichlet character mod M. If M is a divisor of N then ε is induced by aDirichlet character modulo M if <strong>and</strong> <strong>on</strong>ly if M divides the c<strong>on</strong>ductor of ε.


8.2 The Diam<strong>on</strong>d bracket <strong>and</strong> <strong>Hecke</strong> operators 652. The set of Dirichlet characters forms a group, which is n<strong>on</strong>-can<strong>on</strong>ically isomorphicto (Z/NZ) ∗ (it is the dual of this group).3. The mod N Dirichlet characters all take values in Q(e 2πi/e ) where e is theexp<strong>on</strong>ent of (Z/NZ) ∗ . When N is an odd prime power, the group (Z/NZ) ∗is cyclic, so e = ϕ(ϕ(N)). This double-ϕ can sometimes cause c<strong>on</strong>fusi<strong>on</strong>.4. There are many ways to represent Dirichlet characters with a computer. Ithink the best way is also the simplest—fix generators for (Z/NZ) ∗ in anyway you like <strong>and</strong> represent ε by the images of each of these generators. Assumefor the moment that N is odd. To make the representati<strong>on</strong> more “can<strong>on</strong>ical”,reduce to the prime power case by writing (Z/NZ) ∗ as a product ofcyclic groups corresp<strong>on</strong>ding to prime divisors of N. A “can<strong>on</strong>ical” generatorfor (Z/p r Z) ∗ is then the smallest positive integer s such that s mod p r generates(Z/p r Z) ∗ . Store the character that sends s to e 2πin/ϕ(ϕ(pr )) by storingthe integer n. For general N, store the list of integers n p , <strong>on</strong>e p for each primedivisor of N (unless p = 2, in which case you store two integers n 2 <strong>and</strong> n ′ 2,where n 2 ∈ {0, 1}).Definiti<strong>on</strong> 8.2.3. Let d ∈ (Z/NZ) ∗ <strong>and</strong> f ∈ S k (Γ 1 (N)). The map SL 2 (Z) →SL 2 (Z/NZ) is surjective, so there exists a matrix γ = ( )a bc d ∈ Γ0 (N) such thatd ≡ d (mod N). The diam<strong>on</strong>d bracket d operator is thenf(τ)|〈d〉 = f |[γ]k = f(γτ)(cτ + d) −k .The definiti<strong>on</strong> of 〈d〉 does not depend <strong>on</strong> the choice of lift matrix ( a bc d), sinceany two lifts differ by an element of Γ(N) <strong>and</strong> f is fixed by Γ(N) since it is fixedby Γ 1 (N).For each Dirichlet character ε mod N letS k (Γ 1 (N), ε) = {f : f|〈d〉 = ε(d)f all d ∈ (Z/NZ) ∗ }= {f : f |[γ]k = ε(d γ )f all γ ∈ Γ 0 (N)},where d γ is the lower-left entry of γ.When f ∈ S k (Γ 1 (N), ε), we say that f has Dirichlet character ε. In the literature,sometimes f is said to be of “nebentypus” ε.Lemma 8.2.4. The operator 〈d〉 <strong>on</strong> the finite-dimensi<strong>on</strong>al vector space S k (Γ 1 (N))is diag<strong>on</strong>alizable.Proof. There exists n such that I = 〈1〉 = 〈d n 〉 = 〈d〉 n , so the characteristicpolynomial of 〈d〉 divides the square-free polynomial X n − 1.Note that S k (Γ 1 (N), ε) is the ε(d) eigenspace of 〈d〉. Thus we have a direct sumdecompositi<strong>on</strong>⊕S k (Γ 1 (N)) =S k (Γ 1 (N), ε).ε:(Z/NZ) ∗ →C ∗We have ( −1 00 −1)∈ Γ0 (N), so if f ∈ S k (Γ 1 (N), ε), thenf(τ)(−1) −k = ε(−1)f(τ).Thus S k (Γ 1 (N), ε) = 0, unless ε(−1) = (−1) k , so about half of the direct summ<strong>and</strong>sS k (Γ 1 (N), ε) vanish.


66 8. <strong>Modular</strong> <strong>Forms</strong> of Higher Level8.2.2 <strong>Hecke</strong> <strong>Operators</strong> <strong>on</strong> q-expansi<strong>on</strong>sSuppose<strong>and</strong> let p be a prime. Thenf =∞∑a n q n ∈ S k (Γ 1 (N), ε),n=1⎧ ∞∑∞∑⎪⎨ a np q n + p k−1 ε(p) a n q pn , p ∤ Nn=1n=1f|T p = ∞∑⎪⎩ a np q n + 0. p | N.n=1Note that ε(p) = 0 when p | N, so the sec<strong>on</strong>d part of the formula is redundant.When p | N, T p is often denoted U p in the literature, but we will not do sohere. Also, the ring T generated by the <strong>Hecke</strong> operators is commutative, so it isharmless, though potentially c<strong>on</strong>fusing, to write T p (f) instead of f|T p .We record the relati<strong>on</strong>sT m T n = T mn , (m, n) = 1,{(T p ) k , p | NT p k =T p k−1T p − ε(p)p k−1 T p k−2, p ∤ N.WARNING: When p | N, the operator T p <strong>on</strong> S k (Γ 1 (N), ε) need not be diag<strong>on</strong>alizable.8.3 Old <strong>and</strong> new subspacesNLet M <strong>and</strong> N be positive integers such that M | N <strong>and</strong> let t |M. If f(τ) ∈S k (Γ 1 (M)) then f(tτ) ∈ S k (Γ 1 (N)). We thus have mapsS k (Γ 1 (M)) → S k (Γ 1 (N))for each divisor t | N M. Combining these gives a map⊕ϕ M : S k (Γ 1 (M)) → S k (Γ 1 (N)).t|(N/M)Definiti<strong>on</strong> 8.3.1 (Old Subspace). The old subspace of S k (Γ 1 (N)) is the subspacegenerated by the images of the ϕ M for all M | N with M ≠ N.Definiti<strong>on</strong> 8.3.2 (New Subspace). The new subspace of S k (Γ 1 (N)) is the complementof the old subspace with respect to the Peterss<strong>on</strong> inner product.1 Since I haven’t introduced the Peterss<strong>on</strong> inner product yet, note that the new 1subspace of S k (Γ 1 (N)) is the largest subspace of S k (Γ 1 (N)) that is stable under the<strong>Hecke</strong> operators <strong>and</strong> has trivial intersecti<strong>on</strong> with the old subspace of S k (Γ 1 (N)).1 Remove from book.


8.3 Old <strong>and</strong> new subspaces 67Definiti<strong>on</strong> 8.3.3 (Newform). A newform is an element f of the new subspace ofS k (Γ 1 (N)) that is an eigenvector for every <strong>Hecke</strong> operator, which is normalized sothat the coefficient of q in f is 1.If f = ∑ a n q n is a newform then the coefficient a n are algebraic integers, whichhave deep arithmetic significance. For example, when f has weight 2, there is anassociated∏abelian variety A f over Q of dimensi<strong>on</strong> [Q(a 1 , a 2 , . . .) : Q] such thatL(f σ , s) = L(A f , s), where the product is over the Gal(Q/Q)-c<strong>on</strong>jugates of F .The abelian variety A f was c<strong>on</strong>structed by Shimura as follows. Let J 1 (N) be theJacobian of the modular curve X 1 (N). As we will see tomorrow, the ring T of <strong>Hecke</strong>operators acts naturally <strong>on</strong> J 1 (N). Let I f be the kernel of the homomorphismT → Z[a 1 , a 2 , . . .] that sends T n to a n . ThenA f = J 1 (N)/I f J 1 (N).In the c<strong>on</strong>verse directi<strong>on</strong>, it is a deep theorem of Breuil, C<strong>on</strong>rad, Diam<strong>on</strong>d,Taylor, <strong>and</strong> Wiles that if E is any elliptic curve over Q, then E is isogenous to A ffor some f of level equal to the c<strong>on</strong>ductor N of E.When f has weight greater than 2, Scholl c<strong>on</strong>structs 2 , in an analogous way, 2a Grothendieck motive (=compatible collecti<strong>on</strong> of cohomology groups 3 ) M f at- 3tached to f.2 add reference3 remove


68 8. <strong>Modular</strong> <strong>Forms</strong> of Higher Level


9Newforms <strong>and</strong> Euler ProductsIn this chapter we discuss the work of Atkin, Lehner, <strong>and</strong> W. Li <strong>on</strong> newforms <strong>and</strong>their associated L-series <strong>and</strong> Euler products. Then we discuss explicitly how U p , forp | N, acts <strong>on</strong> old forms, <strong>and</strong> how U p can fail to be diag<strong>on</strong>alizable. Then we describea can<strong>on</strong>ical generator for S k (Γ 1 (N)) as a free module over T C . Finally, we observethat the subalgebra of T Q generated by <strong>Hecke</strong> operators T n with (n, N) = 1 isisomorphic to a product of number fields.9.1 Atkin-Lehner-Li theoryThe results of [Li75] about newforms are proved using many linear transformati<strong>on</strong>sthat do not necessarily preserve S k (Γ 1 (N), ε). Thus we introduce more generalspaces of cusp forms, which these transformati<strong>on</strong>s preserve. These spaces arealso useful because they make precise how the space of cusp forms for the fullc<strong>on</strong>gruence subgroup Γ(N) can be understood in terms of spaces S k (Γ 1 (M), ε) forvarious M <strong>and</strong> ε, which justifies our usual focus <strong>on</strong> these latter spaces. This secti<strong>on</strong>follows [Li75] closely.Let M <strong>and</strong> N be positive integers <strong>and</strong> define{( )}a bΓ 0 (M, N) = ∈ SLc d 2 (Z) : M | c, N | b ,<strong>and</strong>{( ) a bΓ(M, N) = ∈ Γc d 0 (M, N) : a ≡ d ≡ 1}(mod MN) .Note that Γ 0 (M, 1) = Γ 0 (M) <strong>and</strong> Γ(M, 1) = Γ 1 (M). Let S k (M, N) denote thespace of cusp forms for Γ(M, N).If ε is a Dirichlet character modulo MN such that ε(−1) = (−1) k , let S k (M, N, ε)denote the space of all cups forms for Γ(M, N) of weight k <strong>and</strong> character ε. This


70 9. Newforms <strong>and</strong> Euler Productsis the space of holomorphic functi<strong>on</strong>s f : h → C that satisfy the usual vanishingc<strong>on</strong>diti<strong>on</strong>s at the cusps <strong>and</strong> such that for all ( )a bc d ∈ Γ0 (M, N),( ) a bf| = ε(d)f.c dWe haveS k (M, N) = ⊕ ε S k (M, N, ε).We now introduce operators between various S k (M, N). Note that, except whenotherwise noted, the notati<strong>on</strong> we use for these operators below is as in [Li75],which c<strong>on</strong>flicts with notati<strong>on</strong> in various other books. When in doubt, check thedefiniti<strong>on</strong>s.Let ( )( )a baτ + bf| (τ) = (ad − bc) k/2 (cτ + d) −k f .c dcτ + dThis is like before, but we omit the weight k from the bar notati<strong>on</strong>, since k will befixed for the whole discussi<strong>on</strong>.For any d <strong>and</strong> f ∈ S k (M, N, ε), definef|U N d( = d k/2−1 ∑f∣u mod d( ) ) 1 uN,0 dwhere the sum is over any set u of representatives for the integers modulo d. Notethat the N in the notati<strong>on</strong> is a superscript, not a power of N. Also, let( )f|B d = d −k/2 d 0f| ,0 1<strong>and</strong>f|C d = d k/2 f|( )1 0.0 dIn [Li75], C d is denoted W d , which would be c<strong>on</strong>fusing, since in the literature W d isusually used to denote a completely different operator (the Atkin-Lehner operator,which is denoted VdM in [Li75]).Since ( 1 0 N1 ) ∈ Γ(M, N), any f ∈ S k(M, N, ε) has a Fourier expansi<strong>on</strong> in termsof powers of q N = q 1/N . We have(∑ )an qN n |Ud N = ∑ a nd qN,nn≥1(∑an qN)n |B d = ∑ a n qN nd ,n≥1<strong>and</strong> (∑an q n N)|C d = ∑ n≥1a n q n Nd.The sec<strong>on</strong>d two equalities are easy to see; for the first, write everything out <strong>and</strong>use that for n ≥ 1, the sum ∑ u e2πiun/d is 0 or d if d ∤ n, d | n, respectively.


9.1 Atkin-Lehner-Li theory 71The maps B d <strong>and</strong> C d define injective maps between various spaces S k (M, N, ε).To underst<strong>and</strong> B d , use the matrix relati<strong>on</strong>( ) ( ) ( ) ( )d 0 x y x dy d 0=,0 1 z w z/d w 0 1<strong>and</strong> a similar <strong>on</strong>e for C d . If d | N then B d : S k (M, N, ε) → S k (dM, N/d, ε) isan isomorphism, <strong>and</strong> if d | M, then C d : S k (M, N) → S k (M/d, Nd, ε) is also anisomorphism. In particular, taking d = N, we obtain an isomorphismB N : S k (M, N, ε) → S k (MN, 1, ε) = S k (Γ 1 (MN), ε). (9.1.1)Putting these maps together allows us to completely underst<strong>and</strong> the cusp formsS k (Γ(N)) in terms of spaces S k (Γ 1 (N 2 ), ε), for all Dirichlet characters ε that arisefrom characters modulo N. (Recall that Γ(N) is the principal c<strong>on</strong>gruence subgroupΓ(N) = ker(SL 2 (Z) → SL 2 (Z/NZ)). This is because S k (Γ(N)) is isomorphic tothe direct sum of S k (N, N, ε), as ε various over all Dirichlet characters modulo N.For any prime p, the pth <strong>Hecke</strong> operator <strong>on</strong> S k (M, N, ε) is defined byT p = U N p + ε(p)p k−1 B p .Note that T p = UpN when p | N, since then ε(p) = 0. In terms of Fourier expansi<strong>on</strong>s,we have (∑ )an qN n |T p = ∑ (anp + ε(p)p k−1 )a n/p qnN ,n≥1where a n/p = 0 if p ∤ n.The operators we have just defined satisfy several commutativity relati<strong>on</strong>s. Supposep <strong>and</strong> q are prime. Then T p B q = B q T p , T p C q = C q T p , <strong>and</strong> T p UqN = Uq N T p if(p, qMN) = 1. Moreover Ud NB d ′ = B d ′UN d if (d, d′ ) = 1.Remark 9.1.1. Because of these relati<strong>on</strong>s, (9.1.1) describe S k (Γ(N)) as a moduleover the ring generated by T p for p ∤ N.Definiti<strong>on</strong> 9.1.2 (Old Subspace). The old subspace S k (M, N, ε) old is the subspaceof S k (M, N, ε) generated by all f|B d <strong>and</strong> g|C e where f ∈ S k (M ′ , N), g ∈S k (M, N ′ ), <strong>and</strong> M ′ , N ′ are proper factors of M, N, respectively, <strong>and</strong> d | M/M ′ ,e | N/N ′ .Since T p commutes with B d <strong>and</strong> C e , the <strong>Hecke</strong> operators T p all preserve S k (M, N, ε) old ,for p ∤ MN. Also, B N defines an isomorphismS k (M, N, ε) old∼ = Sk (MN, 1, ε) old .Definiti<strong>on</strong> 9.1.3 (Peterss<strong>on</strong> Inner Product). If f, g ∈ S k (Γ(N)), the Peterss<strong>on</strong>inner product of f <strong>and</strong> g is∫1〈f, g〉 =f(z)g(z)y k−2 dx dy,[SL 2 (Z) : Γ(N)]where D is a fundamental domain for Γ(N) <strong>and</strong> z = x + iy.This Peterss<strong>on</strong> pairing is normalized so that if we c<strong>on</strong>sider f <strong>and</strong> g as elementsof Γ(N ′ ) for some multiple N ′ of N, then the resulting pairing is the same (sincethe volume of the fundamental domain shrinks by the index).D


72 9. Newforms <strong>and</strong> Euler ProductsPropositi<strong>on</strong> 9.1.4 (Peterss<strong>on</strong>). If p ∤ N <strong>and</strong> f ∈ S k (Γ 1 (N), ε), then 〈f|T p , g〉 =ε(p)〈f, g|T p 〉.Remark 9.1.5. The propositi<strong>on</strong> implies that the T p , for p ∤ N, are diag<strong>on</strong>alizable.Be careful, because the T p , with p | N, need not be diag<strong>on</strong>alizable.Definiti<strong>on</strong> 9.1.6 (New Subspace). The new subspace S k (M, N, ε) new is the orthog<strong>on</strong>alcomplement of S k (M, N, ε) old in S k (M, N, ε) with respect to the Peterss<strong>on</strong>inner product.Both the old <strong>and</strong> new subspaces of S k (M, N, ε) are preserved by the <strong>Hecke</strong>operators T p with (p, NM) = 1.Remark 9.1.7. Li [Li75] also gives a purely algebraic definiti<strong>on</strong> of the new subspaceas the intersecti<strong>on</strong> of the kernels of various trace maps from S k (M, N, ε), whichare obtained by averaging over coset representatives.Definiti<strong>on</strong> 9.1.8 (Newform). A newform f = ∑ a n q n N ∈ S k(M, N, ε) is an elementof S k (M, N, ε) new that is an eigenform for all T p , for p ∤ NM, <strong>and</strong> is normalizedso that a 1 = 1.Li introduces the crucial “Atkin-Lehner operator” WqM (denoted VqM in [Li75]),which plays a key roll in all the proofs, <strong>and</strong> is defined as follows. For a positiveinteger M <strong>and</strong> prime q, let α = ord q (M) <strong>and</strong> find integers x, y, z such thatq 2α x − yMz = q α . Then WqM is the operator defined by slashing with the matrixα x y( ) qMz q α . Li shows that if f ∈ S k (M, 1, ε), then f|Wq M |Wq M = ε(q α )f, soWq M is an automorphism. Care must be taken, because the operator Wq M need notcommute with T p = Up N , when p | M.After proving many technical but elementary lemmas about the operators B d ,C d , Up N , T p , <strong>and</strong> WqM , Li uses the lemmas to deduce the following theorems. Theproofs are all elementary, but there is little I can say about them, except that youjust have to read them. 1 1Theorem 9.1.9. Suppose f = ∑ a n q n N ∈ S k(M, N, ε) <strong>and</strong> a n = 0 for all n with(n, K) = 1, where K is a fixed positive integer. Then f ∈ S k (M, N, ε) old .From the theorem we see that if f <strong>and</strong> g are newforms in S k (M, N, ε), <strong>and</strong> if forall but finitely many primes p, the T p eigenvalues of f <strong>and</strong> g are the same, thenf −g is an old form, so f −g = 0, hence f = g. Thus the eigenspaces corresp<strong>on</strong>dingto the systems of <strong>Hecke</strong> eigenvalues associated to the T p , with p ∤ MN, each havedimensi<strong>on</strong> 1. This is known as “multiplicity <strong>on</strong>e”.Theorem 9.1.10. Let f = ∑ a n q n N be a newform in S k(M, N, ε), p a prime with(p, MN) = 1, <strong>and</strong> q | MN a prime. Then1. f|T p = a p f, f|U N q = a q f, <strong>and</strong> for all n ≥ 1,a p a n = a np + ε(p)p k−1 a n/p ,a q a n = a nq .1 Remove from book.


9.2 The U p operator 73If L(f, s) = ∑ n≥1 a nn −s is the Dirichlet series associated to f, then L(f, s)has an Euler productL(f, s) = ∏(1 − a q q −s ) ∏ −1 (1 − a p p −s + ε(p)p k−1 p −2s ) −1 .q|MNp∤MN2. (a) If ε is not a character mod MN/q, then |a q | = q (k−1)/2 .(b) If ε is a character mod MN/q, then a q = 0 if q 2 | MN, <strong>and</strong> a 2 q =ε(q)q k−2 if q 2 ∤ MN.9.2 The U p operatorLet N be a positive integer <strong>and</strong> M a divisor of N. For each divisor d of N/M wedefine a mapα d : S k (Γ 1 (M)) → S k (Γ 1 (N)) :f(τ) ↦→ f(dτ).We verify that f(dτ) ∈ S k (Γ 1 (N)) as follows. Recall that for γ = ( a bc d), we write(f|[γ] k )(τ) = det(γ) k−1 (cz + d) −k f(γ(τ)).The transformati<strong>on</strong> c<strong>on</strong>diti<strong>on</strong> for f to be in S k (Γ 1 (N)) is that f|[γ] k (τ) = f(τ). Letf(τ) ∈ S k (Γ 1 (M)) <strong>and</strong> let ι d = ( )d 00 1 . Then f|[ιd ] k (τ) = d k−1 f(dτ) is a modularform <strong>on</strong> Γ 1 (N) since ι −1dΓ 1(M)ι d c<strong>on</strong>tains Γ 1 (N). Moreover, if f is a cusp formthen so is f|[ι d ] k .Propositi<strong>on</strong> 9.2.1. If f ∈ S k (Γ 1 (M)) is n<strong>on</strong>zero, then{α d (f) : d | N }Mis linearly independent.Proof. If the q-expansi<strong>on</strong> of f is ∑ a n q n , then the q-expansi<strong>on</strong> of α d (f) is ∑ a n q dn .The matrix of coefficients of the q-expansi<strong>on</strong>s of α d (f), for d | (N/M), is uppertriangular. Thus the q-expansi<strong>on</strong>s of the α d (f) are linearly independent, hencethe α d (f) are linearly independent, since the map that sends a cusp form to itsq-expansi<strong>on</strong> is linear.When p | N, we denote by U p the <strong>Hecke</strong> operator T p acting <strong>on</strong> the image spaceS k (Γ 1 (N)). For clarity, in this secti<strong>on</strong> we will denote by T p,M , the <strong>Hecke</strong> operatorT p ∈ End(S k (Γ 1 (M))). For f = ∑ a n q n ∈ S k (Γ 1 (N)), we havef|U p = ∑ a np q n .Suppose f = ∑ a n q n ∈ S k (Γ 1 (M)) is a normalized eigenform for all of the <strong>Hecke</strong>operators T n <strong>and</strong> 〈n〉, <strong>and</strong> p is a prime that does not divide M. Thenf|T p,M = a p f <strong>and</strong> f|〈p〉 = ε(p)f.


74 9. Newforms <strong>and</strong> Euler ProductsAssume N = p r M, where r ≥ 1 is an integer. Letf i (τ) = f(p i τ),so f 0 , . . . , f r are the images of f under the maps α p 0, . . . , α p r, respectively, <strong>and</strong>f = f 0 . We havef|T p,M = ∑ n≥1a np q n + ε(p)p k−1 ∑ a n q pn= f 0 |U p + ε(p)p k−1 f 1 ,soAlsof 0 |U p = f|T p,M − ε(p)p k−1 f 1 = a p f 0 − ε(p)p k−1 f 1 .f 1 |U p =(∑an q pn) |U p = ∑ a n q n = f 0 .More generally, for any i ≥ 1, we have f i |U p = f i−1 .The operator U p preserves the two dimensi<strong>on</strong>al vector space spanned by f 0<strong>and</strong> f 1 , <strong>and</strong> the matrix of U p with respect to the basis f 0 , f 1 iswhich has characteristic polynomial()aA =p 1− ε(p)p k−1 ,0X 2 − a p X + p k−1 ε(p). (9.2.1)9.2.1 A C<strong>on</strong>necti<strong>on</strong> with Galois representati<strong>on</strong>sThis leads to a striking c<strong>on</strong>necti<strong>on</strong> with Galois representati<strong>on</strong>s. Let f be a newform<strong>and</strong> let K = K f be the field generated over Q by the Fourier coefficients of f. Letl be a prime <strong>and</strong> λ a prime lying over l. Then Deligne (<strong>and</strong> Serre, when k = 1)c<strong>on</strong>structed a representati<strong>on</strong>ρ λ : Gal(Q/Q) → GL(2, K λ ).If p ∤ Nl, then ρ λ is unramified at p, so if Frob p ∈ Gal(Q/Q) if a Frobenius element,then ρ λ (Frob p ) is well defined, up to c<strong>on</strong>jugati<strong>on</strong>. Moreover, <strong>on</strong>e can show thatdet(ρ λ (Frob p )) = p k−1 ε(p),tr(ρ λ (Frob p )) = a p .<strong>and</strong>(We will discuss the proof of these relati<strong>on</strong>s further in the case k = 2.) Thus thecharacteristic polynomial of ρ λ (Frob p ) ∈ GL 2 (E λ ) iswhich is the same as (9.2.1).X 2 − a p X + p k−1 ε(p),


9.2.2 When is U p semisimple?9.3 The Cusp forms are free of rank <strong>on</strong>e over T C 75Questi<strong>on</strong> 9.2.2. Is U p semisimple <strong>on</strong> the span of f 0 <strong>and</strong> f 1 ?If the eigenvalues of U p are distinct, then the answer is yes. If the eigenvaluesare the same, then X 2 − a p X + p k−1 ε(p) has discriminant 0, so a 2 p = 4p k−1 ε(p),hence√a p = 2p k−12 ε(p).Open Problem 9.2.3. Does there exist an eigenform f = ∑ a n q n ∈ S k (Γ 1 (N))√such that a p = 2p k−12 ε(p)?It is a curious fact that the Ramanujan c<strong>on</strong>jectures, which were proved byDeligne in 1973, imply that |a p | ≤ 2p (k−1)/2 , so the above equality remains taunting.When k = 2, Coleman <strong>and</strong> Edixhoven proved that |a p | < 2p (k−1)/2 .229.2.3 An Example of n<strong>on</strong>-semisimple U pSuppose f = f 0 is a normalized eigenform. Let W be the space spanned by f 0 , f 1<strong>and</strong> let V be the space spanned by f 0 , f 1 , f 2 , f 3 . Then U p acts <strong>on</strong> V/W by f 2 ↦→ 0<strong>and</strong> f 3 ↦→ f 2 . Thus the matrix of the acti<strong>on</strong> of U p <strong>on</strong> V/W is ( 0 0 10 ), which isn<strong>on</strong>zero <strong>and</strong> nilpotent, hence not semisimple. Since W is invariant under U p thisshows that U p is not semisimple <strong>on</strong> V , i.e., U p is not diag<strong>on</strong>alizable.9.3 The Cusp forms are free of rank <strong>on</strong>e over T C9.3.1 Level 1Suppose N = 1, so Γ 1 (N) = SL 2 (Z). Using the Peterss<strong>on</strong> inner product, we seethat all the T n are diag<strong>on</strong>alizable, so S k = S k (Γ 1 (1)) has a basisf 1 , . . . , f dof normalized eigenforms where d = dim S k . This basis is can<strong>on</strong>ical up to ordering.Let T C = T ⊗ C be the ring generated over C by the <strong>Hecke</strong> operator T p . Then,having fixed the basis above, there is a can<strong>on</strong>ical mapT C ↩→ C d : T ↦→ (λ 1 , . . . , λ d ),where f i |T = λ i f i . This map is injective <strong>and</strong> dim T C = d, so the map is anisomorphism of C-vector spaces.The formv = f 1 + · · · + f ngenerates S k as a T-module. Note that v is can<strong>on</strong>ical since it does not depend <strong>on</strong>the ordering of the f i . Since v corresp<strong>on</strong>ds to the vector (1, . . . , 1) <strong>and</strong> T ∼ = C d2 Look in Coleman-edixhoven <strong>and</strong> say more about this. Plus find the Weil reference. Whenk = 2, Weil [?] showed that ρ λ (Frob p) is semisimple, so if the eigenvalues of U p are equal thenρ λ (Frob p) is a scalar. But Edixhoven <strong>and</strong> Coleman [CE98] show that it is not a scalar by lookingat the abelian variety attached to f.


76 9. Newforms <strong>and</strong> Euler Productsacts <strong>on</strong> S k∼ = C d comp<strong>on</strong>entwise, this is just the statement that C d is generatedby (1, . . . , 1) as a C d -module.There is a perfect pairing S k × T C → C given by〈∑f, Tn〉= a 1 (f|T n ) = a n (f),where a n (f) denotes the nth Fourier coefficient of f. Thus we have simultaneously:1. S k is free of rank 1 over T C , <strong>and</strong>2. S k∼ = HomC (T C , C) as T-modules.Combining these two facts yields an isomorphismT C∼ = HomC (T C , C). (9.3.1)This isomorphism sends an element T ∈ T to the homomorphismX ↦→ 〈v|T, X〉 = a 1 (v|T |X).Since the identificati<strong>on</strong> S k = Hom C (T C , C) is can<strong>on</strong>ical <strong>and</strong> since the vector v iscan<strong>on</strong>ical, we see that the isomorphism (9.3.1) is can<strong>on</strong>ical.Recall that M k has as basis the set of products E a 4 E b 6, where 4a + 6b = k, <strong>and</strong>S k is the subspace of forms where the c<strong>on</strong>stant coefficient of their q-expansi<strong>on</strong> is 0.Thus there is a basis of S k c<strong>on</strong>sisting of forms whose q-expansi<strong>on</strong>s have coefficientsin Q. Let S k (Z) = S k ∩ Z[[q]], be the submodule of S k generated by cusp formswith Fourier coefficients in Z, <strong>and</strong> note that S k (Z)⊗Q ∼ = S k (Q). Also, the explicitformula ( ∑ a n q n )|T p = ∑ a np q n + p k−1 ∑ a n q np implies that the <strong>Hecke</strong> algebra Tpreserves S k (Z).Propositi<strong>on</strong> 9.3.1. The Fourier coefficients of each f i are totally real algebraicintegers.Proof. The coefficient a n (f i ) is the eigenvalue of T n acting <strong>on</strong> f i . As observedabove, the <strong>Hecke</strong> operator T n preserves S k (Z), so the matrix [T n ] of T n with respectto a basis for S k (Z) has integer entries. The eigenvalues of T n are algebraic integers,since the characteristic polynomial of [T n ] is m<strong>on</strong>ic <strong>and</strong> has integer coefficients.The eigenvalues are real since the <strong>Hecke</strong> operators are self-adjoint with respectto the Peterss<strong>on</strong> inner product.Remark 9.3.2. A CM field is a quadratic imaginary extensi<strong>on</strong> of a totally real field.For example, when n > 2, the field Q(ζ n ) is a CM field, with totally real subfieldQ(ζ n ) + = Q(ζ n + 1/ζ n ). More generally, <strong>on</strong>e shows that the eigenvalues of anynewform f ∈ S k (Γ 1 (N)) generate a totally real or CM field.Propositi<strong>on</strong> 9.3.3. We have v ∈ S k (Z).Proof. This is because v = ∑ Tr(T n )q n , <strong>and</strong>, as we observed above, there is a basisso that the matrices T n have integer coefficients.Example 9.3.4. When k = 36, we havev = 3q + 139656q 2 − 104875308q 3 + 34841262144q 4 + 892652054010q 5− 4786530564384q 6 + 878422149346056q 7 + · · · .


9.3 The Cusp forms are free of rank <strong>on</strong>e over T C 77The normalized newforms f 1 , f 2 , f 3 aref i = q + aq 2 + (−1/72a 2 + 2697a + 478011548)q 3 + (a 2 − 34359738368)q 4(a 2 − 34359738368)q 4 + (−69/2a 2 + 14141780a + 1225308030462)q 5 + · · · ,for a each of the three roots of X 3 −139656X 2 −59208339456X−1467625047588864.9.3.2 General levelNow we c<strong>on</strong>sider the case for general level N. Recall that there are mapsS k (Γ 1 (M)) → S k (Γ 1 (N)),for all M dividing N <strong>and</strong> all divisor d of N/M.The old subspace of S k (Γ 1 (N)) is the space generated by all images of thesemaps with M|N but M ≠ N. The new subspace is the orthog<strong>on</strong>al complement ofthe old subspace with respect to the Peterss<strong>on</strong> inner product.There is an algebraic definiti<strong>on</strong> of the new subspace. One defines trace mapsS k (Γ 1 (N)) → S k (Γ 1 (M))for all M < N, M | N which are adjoint to the above maps (with respect to thePeterss<strong>on</strong> inner product). Then f is in the new part of S k (Γ 1 (N)) if <strong>and</strong> <strong>on</strong>ly if fis in the kernels of all of the trace maps.It follows from Atkin-Lehner-Li theory that the T n acts semisimply <strong>on</strong> the newsubspace S k (Γ 1 (M)) new for all M ≥ 1, since the comm<strong>on</strong> eigenspaces for all T neach have dimensi<strong>on</strong> 1. Thus S k (Γ 1 (M)) new has a basis of normalized eigenforms.We have a natural map⊕S k (Γ 1 (M)) new ↩→ S k (Γ 1 (N)).M|NThe image in S k (Γ 1 (N)) of an eigenform f for some S k (Γ 1 (M)) new is called anewform of level M f = M. Note that a newform of level less than N is notnecessarily an eigenform for all of the <strong>Hecke</strong> operators acting <strong>on</strong> S k (Γ 1 (N)); inparticular, it can fail to be an eigenform for the T p , for p | N.Letv = ∑ f(q N M f ) ∈ Sk (Γ 1 (N)),fwhere the sum is taken over all newforms f of weight k <strong>and</strong> some level M | N. Thisgeneralizes the v c<strong>on</strong>structed above when N = 1 <strong>and</strong> has many of the same goodproperties. For example, S k (Γ 1 (N)) is free of rank 1 over T with basis element v.Moreover, the coefficients of v lie in Z, but to show this we need to know thatS k (Γ 1 (N)) has a basis whose q-expansi<strong>on</strong>s lie in Q[[q]]. This is true, but we willnot prove it here. One way to proceed is to use the Tate curve to c<strong>on</strong>struct aq-expansi<strong>on</strong> map H 0 (X 1 (N), Ω X1(N)/Q) → Q[[q]], which is compatible with theusual Fourier expansi<strong>on</strong> map. 3 33 Where will we?


78 9. Newforms <strong>and</strong> Euler ProductsExample 9.3.5. The space S 2 (Γ 1 (22)) has dimensi<strong>on</strong> 6. There is a single newformof level 11,f = q − 2q 2 − q 3 + 2q 4 + q 5 + 2q 6 − 2q 7 + · · · .There are four newforms of level 22, the four Gal(Q/Q)-c<strong>on</strong>jugates ofg = q − ζq 2 + (−ζ 3 + ζ − 1)q 3 + ζ 2 q 4 + (2ζ 3 − 2)q 5+ (ζ 3 − 2ζ 2 + 2 ζ − 1)q 6 − 2ζ 2 q 7 + ...where ζ is a primitive 10th root of unity.Warning 9.3.6. Let S = S 2 (Γ 0 (88)), <strong>and</strong> let v = ∑ Tr(T n )q n . Then S has dimensi<strong>on</strong>9, but the <strong>Hecke</strong> span of v <strong>on</strong>ly has dimensi<strong>on</strong> 7. Thus the more “can<strong>on</strong>icallooking” element ∑ Tr(T n )q n is not a generator for S. 4 49.4 Decomposing the anemic <strong>Hecke</strong> algebraWe first observe that it make no difference whether or not we include the Diam<strong>on</strong>dbracket operators in the <strong>Hecke</strong> algebra. Then we note that the Q-algebra generatedby the <strong>Hecke</strong> operators of index coprime to the level is isomorphic to a product offields corresp<strong>on</strong>ding to the Galois c<strong>on</strong>jugacy classes of newforms.Propositi<strong>on</strong> 9.4.1. The operators 〈d〉 <strong>on</strong> S k (Γ 1 (N)) lie in Z[. . . , T n , . . .].Proof. It is enough to show 〈p〉 ∈ Z[. . . , T n , . . .] for primes p, since each 〈d〉 can bewritten in terms of the 〈p〉. Since p ∤ N, we have that 5 5T p 2 = T 2 p − 〈p〉p k−1 ,so 〈p〉p k−1 = T 2 p − T p 2. By Dirichlet’s theorem <strong>on</strong> primes in arithmetic progressi<strong>on</strong>[Lan94, VIII.4], there is another prime q c<strong>on</strong>gruent to p mod N. Since p k−1 <strong>and</strong>q k−1 are relatively prime, there exist integers a <strong>and</strong> b such that ap k−1 + bq k−1 = 1.Then〈p〉 = 〈p〉(ap k−1 + bq k−1 ) = a(T p 2 − T p 2) + b(T q 2 − T q 2) ∈ Z[. . . , T n , . . .].Let S be a space of cusp forms, such as S k (Γ 1 (N)) or S k (Γ 1 (N), ε). Letf 1 , . . . , f d ∈ Sbe representatives for the Galois c<strong>on</strong>jugacy classes of newforms in S of level N fidividing N. For each i, let K i = Q(. . . , a n (f i ), . . .) be the field generated by theFourier coefficients of f i .4 I think this because using my MAGMA program, I computed the image of v under T 1 ,...,T 25<strong>and</strong> the span of the image has dimensi<strong>on</strong> 7. For example, there is an element of S whose q-expansi<strong>on</strong> has valuati<strong>on</strong> 7, but no element of the T-span of v has q-expansi<strong>on</strong> with valuati<strong>on</strong> 7or 9.5 See where?


9.4 Decomposing the anemic <strong>Hecke</strong> algebra 79Definiti<strong>on</strong> 9.4.2 (Anemic <strong>Hecke</strong> Algebra). The anemic <strong>Hecke</strong> algebra is the subalgebraT 0 = Z[. . . , T n , . . . : (n, N) = 1] ⊂ Tof T obtained by adjoining to Z <strong>on</strong>ly those <strong>Hecke</strong> operators T n with n relativelyprime to N.Propositi<strong>on</strong> 9.4.3. We have T 0 ⊗ Q ∼ = ∏ di=1 K i.The map sends T n to (a n (f 1 ), . . . , a n (f d )). The propositi<strong>on</strong> can be proved usingthe discussi<strong>on</strong> above <strong>and</strong> Atkin-Lehner-Li theory, but we will not give a proofhere. 6 6Example 9.4.4. When S = S 2 (Γ 1 (22)), then T 0 ⊗ Q ∼ = Q × Q(ζ 10 ) (see Example9.3.5). When S = S 2 (Γ 0 (37)), then T 0 ⊗ Q ∼ = Q × Q.6 Add for book.


80 9. Newforms <strong>and</strong> Euler Products


10Some Explicit Genus Computati<strong>on</strong>sThis chapter is about computing the genus of certain modular curves, or equivalently,the dimensi<strong>on</strong>s of certain spaces of cusp forms. For X 0 (N), the reader mightalso look at [Shi94, §1.6] <strong>and</strong> for X 1 (N) at [DI95, §9.1]. See also Secti<strong>on</strong> 5.3 forthe particularly easy case of the genus of X(N).10.1 Computing the dimensi<strong>on</strong> of S k (Γ)Let k = 2 unless otherwise noted, <strong>and</strong> let Γ ⊂ SL 2 (Z) be a c<strong>on</strong>gruence subgroup.ThenS 2 (Γ) = H 0 (X Γ , Ω 1 )whereX Γ = (Γ\H) ∪ (Γ\P 1 (Q)).By definiti<strong>on</strong> dim H 0 (X Γ , Ω 1 ) is the genus of X Γ .Exercise 10.1.1. Prove that when Γ = SL 2 (Z) then Γ\P 1 (Q) is a point.Since Γ ⊂ Γ(1) there is a covering X Γ → X Γ(1) which is <strong>on</strong>ly ramified at pointsabove 0, 1728, ∞, where 0 corresp<strong>on</strong>ds to ρ = e 2πi/3 <strong>and</strong> 1728 to i under j. This isillustrated as follows:Γ\HX ΓΓ(1)\H X Γ(1)jP 1 (C)


82 10. Some Explicit Genus Computati<strong>on</strong>sExample 10.1.2. Suppose Γ = Γ 0 (N), then the degree of the covering is the index(SL 2 (Z)/{±1} : Γ 0 (N)/{±1}). A point <strong>on</strong> Y Γ(1) corresp<strong>on</strong>ds to an elliptic curve,whereas a points <strong>on</strong> Y 0 (N) corresp<strong>on</strong>d to a pair c<strong>on</strong>sisting of an elliptic curve <strong>and</strong>a subgroup of order N.10.2 Applicati<strong>on</strong> of Riemann-HurwitzNow we compute the genus of X Γ by applying the Riemann-Hurwitz formula.Intuitively the Euler characteristic should be totally additive, that is, if A <strong>and</strong> Bare disjoint spaces thenχ(A ∪ B) = χ(A) + χ(B).Let X be a compact Riemann surface of genus g, then χ(X) = 2 − 2g. Sinceχ({point}) = 1 we should have thatχ(X − {p 1 , . . . , p n }) = χ(X) − nχ(1) = (2 − 2g) − n.If we have an unramified covering X → Y of degree d then χ(X) = d · χ(Y ).C<strong>on</strong>sider the coveringX Γ − {points over 0, 1728, ∞}X Γ(1) − {0, 1728, ∞}Since X Γ(1) has genus 0, X Γ(1) −{0, 1728, ∞} has Euler characteristic 2−3 = −1. Ifwe let g = χ(X Γ ) then χ(X Γ −{points over 0, 1728, ∞} = 2−2g −n 0 −n 1728 −n ∞ ,where n p denotes the number of points lying over p. Thus −d = 2 − 2g − n 0 −n 1728 − n ∞ whence2g − 2 = d − n 0 − n 1728 − n ∞ .Suppose Γ = Γ 0 (N) with N > 3, then n 0 = d/3 <strong>and</strong> n 1728 = d/2 (I’m not surewhy). The degree d of the covering is equal to the number of unordered orderedbasis of E[N], thusd = # SL 2 (Z/NZ)/2.We still need to compute n ∞ . SL 2 (Z) acts <strong>on</strong> P 1 (Q) if we view P 1 (Q) as allpairs (a, b) of relatively prime integers <strong>and</strong> suppose ∞ corresp<strong>on</strong>ds to (1, 0). Thestabilizer of (1, 0) is the sugroup { ( )a bc d ∈ SL2 (Z) : c = 0} of upper triangularmatrices. Since the points lying over ∞ are all c<strong>on</strong>jugate by the Galois group ofthe covering (which is SL 2 (Z/NZ)/{±1}),We thus havenumber of cusps = order of SL 2(Z/NZ)/{±1}order of stabilizer of ∞ .where d Nis the number of cusps.2g(X(N)) − 2 = d 6 − d N


10.3 Explicit genus computati<strong>on</strong>s10.3 Explicit genus computati<strong>on</strong>s 83Let N > 3 <strong>and</strong> c<strong>on</strong>sider the modular curve X = X(N). There is a natural coveringmap X → X(1) j −→ C. Let d be the degree, then2g − 2 = d − m 0 − m 1728 − m ∞where g is the genus of X <strong>and</strong> m x is the number of points lying over x. Since m 0is approximately d 3 <strong>and</strong> m 1728 is approximately d 2 ,2g − 2 = d 6 − m ∞ ± small correcti<strong>on</strong> factor.10.4 The Genus of X(N)Now we count the number of cusps of X(N), that is, the size of Γ(N)\P 1 (Q).There is a surjective map from SL 2 (Z) to P 1 (Q) given by( ) ( a b a b (1 )↦→.c d c d)0Let U be the kernel, thus U is the stabilizer of ∞ = ( 10), so U = {±( 1 a0 1): a ∈ Z}.Then the cusps of X(N) are the elements ofΓ(N)\(SL 2 (Z)/U) = (Γ(N)\ SL 2 (Z))/U = SL 2 (Z/NZ)/Uwhich has order# SL 2 (Z/NZ)= d 2N N .Substituting this into the above formula givessoWhen N is prime2g − 2 = d 6 − d N =g = 1 +d (N − 6)6Nd (N − 6).12Nd = 1 2 # SL 2(Z/NZ) = 1 2 · (N 2 − 1)(N 2 − N).N − 1Thus when N = 5, d = 60 so g = 0, <strong>and</strong> when N = 7, d = 168 so g = 3.10.5 The Genus of X 0 (N)Suppose N > 3 <strong>and</strong> N is prime. The covering map X 0 (N) → X(1) is of degreeN +1 since a point of X 0 (N) corresp<strong>on</strong>ds to an elliptic curve al<strong>on</strong>g with a subgroupof order N <strong>and</strong> there are N + 1 such subgroups because N is prime.


84 10. Some Explicit Genus Computati<strong>on</strong>sExercise 10.5.1. X 0 (N) has two cusps; they are the orbit of ∞ which is unramified<strong>and</strong> 0 which is ramified of order N.Thus2g − 2 = N + 1 − 2 − n 1728 − n 0 .n 0 is the number of pairs (E, C) (modulo isomorphism) such that E has j-invariant0. So we c<strong>on</strong>sider E = C/Z[ −1+i√ 32] which has endomorphism ring End(E) =Z[µ 6 ]. Now µ 6 /±1 acts <strong>on</strong> the cyclic subgroups C so, letting ω be a primitive cuberoot of unity, we have(E, C) ∼ = (E, ωC) ∼ = (E, ω 2 C).This might lead <strong>on</strong>e to think that m 0 is (N + 1)/3, but it may be bigger if, forexample, C = ωC. Thus we must count those C so that ωC = C or ω 2 C = C,that is, those C which are stable under O = Z[ −1+i√ 32]. So we must compute thenumber of stable O/NO-submodules of order N. This depends <strong>on</strong> the structure ofO/NO:O/NO ={F N ⊕ F NF N 2if ( −3N) = 1 (N splits)if ( −3N) = −1 (N stays inert)Since O/NO = F N 2 is a field it has no submodules of order N, whereas F N ⊕ F Nhas two O/NO-submodules of order N, namely F N ⊕ 0 <strong>and</strong> 0 ⊕ F N . Thus{N+1m 0 =3if N ≡ 2 (mod 3)N−13+ 2 if N ≡ 1 (mod 3)Exercise 10.5.2. It is an exercise in elegance to write this as a single formulainvolving the quadratic symbol.By similar reas<strong>on</strong>ing <strong>on</strong>e shows that{N+1m 1728 =2if N ≡ 3 (mod 4)N−12+ 2 if N ≡ 1 (mod 4)We can now compute the genus of X 0 (N) for any prime N. For example, if N = 37then 2g − 2 = 36 − (2 + 18) − (14) = 2 so g = 2. Similarly, X 0 (13) has genus 0 <strong>and</strong>X 0 (11) has genus 1. In general, X 0 (N) has genus approximately N/12.Serre c<strong>on</strong>structed a nice formula for the above genus. Suppose N > 3 is a prime<strong>and</strong> write N = 12a + b with 0 ≤ b ≤ 11. Then Serre’s formula isb 1 5 7 11g a − 1 a a a + 110.6 <strong>Modular</strong> forms mod pLet N be a positive integer, let p be a prime <strong>and</strong> assume Γ is either Γ 0 (N) orΓ 1 (N).Let M k (Γ, Z) = M k (Γ, C) ∩ Z[[q]], thenM k (Γ, F p ) = M k (Γ, Z) ⊗ Z F p


is the space of modular forms mod p of weight k.Suppose p = N, then <strong>on</strong>e has Serre’s Equality:M p+1 (SL 2 (Z), F p ) = M 2 (Γ 0 (p), F p )10.6 <strong>Modular</strong> forms mod p 85The map from the right h<strong>and</strong> side to the left h<strong>and</strong> side is accomplished via acertain normalized Eisenstein series. Recall that in SL 2 (Z),G k = − B k2k + ∞ ∑n=1( ∑ d|nd k−1 )q n<strong>and</strong>E k = 1 − 2kB k∞ ∑n=1( ∑ d|nd k−1 )q n .One finds ord p (− B k2k ) using Kummer c<strong>on</strong>gruences. In particular, ord p(B p−1 ) = −1,so E p−1 ≡ 1 (mod p). Thus multiplicati<strong>on</strong> by E p−1 raises the level by p − 1 butdoes not change the q-expansi<strong>on</strong> mod p. We thus get a mapThe mapM 2 (Γ 0 (p), F p ) → M p+1 (Γ 0 (p), F p ).M p+1 (Γ 0 (p), F p ) → M p+1 (SL 2 (Z), F p )is the trace map (which is dual to the natural inclusi<strong>on</strong> going the other way) <strong>and</strong>is accomplished by averaging in order to get a form invariant under SL 2 (Z).


86 10. Some Explicit Genus Computati<strong>on</strong>s


11The Field of ModuliIn this chapter we will study the field of definiti<strong>on</strong> of the modular curves X(N),X 0 (N), <strong>and</strong> X 1 (N).The functi<strong>on</strong> field of X(1) = P 1 Q is Q(t). If E is an elliptic curve given by aWeierstrass equati<strong>on</strong> y 2 = 4x 3 − g 2 x − g 3 thenj(E) = j(g 2 , g 3 ) = 1728g3 2g2 3 − .27g2 3The j invariant determines the isomorphism class of E over C. Pick an ellipticcurve E/Q(t) such that j(E) = t. In particular we could pick the elliptic curvewith Weierstrass equati<strong>on</strong>y 2 = 4x 3 −27tt − 1728 x − 27tt − 1728 .Let E/k be an arbitrary elliptic curve <strong>and</strong> N a positive integer prime to char k.Then E[N](k) ∼ = (Z/NZ) 2 . Let k(E[N]) be the field obtained by adjoining the coordinatesof the N-torsi<strong>on</strong> points of E. C<strong>on</strong>sider the tower of fields k ⊃ k(E[N]) ⊃k. There is a Galois representati<strong>on</strong> <strong>on</strong> the N torsi<strong>on</strong> of E:Gal(k/k) ρ E,N−−−→ Aut(E[N]) ∼ = GL 2 (Z/NZ)<strong>and</strong> Gal(k/k(E[N])) = ker(ρ E,N ). Thus the Galois group of the extensi<strong>on</strong> Q(t)(E[N])over Q(t) is c<strong>on</strong>tained in GL 2 (Z/NZ). Let X(N) be the curve corresp<strong>on</strong>ding tothe functi<strong>on</strong> field Q(t)(E[N]) over Q. Since Q∩Q(t)(E[N]) is c<strong>on</strong>tained in Q(µ N ),X(N) is defined over Q(µ N ).Composing ρ E with the natural map GL 2 (Z/NZ) → GL 2 (Z/NZ)/{±1} gives amapρ E : Gal(K/K) → GL 2 (Z/NZ)/{±1}.Propositi<strong>on</strong> 11.0.1. ρ E is surjective if <strong>and</strong> <strong>on</strong>ly if ρ E is surjective.


88 11. The Field of ModuliProof. If ρ E is surjective then either ( )0 −11 0 or its negative lies in the image of ρ.Thus ( )−1 00 −1 lie in the image of ρ. Since ρE is surjective this implies that ρ issurjective. The c<strong>on</strong>verse is trivial.11.1 Digressi<strong>on</strong> <strong>on</strong> moduliX 0 (N)(K) is the set of K-isomorphisms classes of pairs (E, C) where E/K isan elliptic curve <strong>and</strong> C is a cyclic subgroup of order N. X 0 (N)(K) is the set ofisomorphism classes of pairs (E, C) such that for all σ ∈ Gal(K/K), σ(E, C) =(E, C). There is a map{k-isomorphism classes of pairs (E, C)/K} → X 0 (N)(K)which is “notoriously” n<strong>on</strong>-injective. [DR73] prove the map is surjective. WhenN = 1 they observe that the map is surjective, then for N > 1 they show thatcertain obstructi<strong>on</strong>s vanish. A related questi<strong>on</strong> isQuesti<strong>on</strong> 11.1.1. If E/K is isomorphic to all its Galois c<strong>on</strong>jugates, is there E ′ /Kwhich is isomorphic to E over K?11.2 When is ρ E surjective?Propositi<strong>on</strong> 11.2.1. Let E 1 <strong>and</strong> E 2 be elliptic curves defined over K with equalj-invariants, thus E 1∼ = E2 over K. Assume E 1 <strong>and</strong> E 2 do not have complexmultiplicati<strong>on</strong> over K. Then ρ E1 is surjective if <strong>and</strong> <strong>on</strong>ly if ρ E2 is surjective.Proof. Assume ρ E1 is surjective. Since E 1 has no complex multiplicati<strong>on</strong> over K,∼Aut E 1 = {±1}. Choose an isomorphism ϕ : E 1 −→ E2 over K. Then for σ ∈Gal(K/K) we have the diagramE 1ϕE 2∼ =∼ =σ E 1 σϕ σ E 2Thus σϕ = ±ϕ for all σ ∈ Gal(K/K), so ϕ : E 1 [N] → E 2 [N] defines an equivalenceρ ∼E1 = ρE2 . Since ρ E1 is surjective this implies ρ E2is surjective which, by theprevious propositi<strong>on</strong>, implies ρ E2 is surjective.Let K = C(j), with j transcendental over C. Let E/K be an elliptic curve withj-invariant j. Fix a positive integer N <strong>and</strong> letρ E : Gal(K/K) → GL 2 (Z/NZ)be the associated Galois representati<strong>on</strong>. Then det ρ E is the cyclotomic characterwhich is trivial since C c<strong>on</strong>tains the Nth roots of unity. Thus the image of ρ El<strong>and</strong>s inside of SL 2 (Z/NZ). Our next theorem states that a generic elliptic curvehas maximal possible Galois acti<strong>on</strong> <strong>on</strong> its divisi<strong>on</strong> points.


Theorem 11.2.2. ρ E : Gal(K/K) → SL 2 (Z/NZ) is surjective.11.3 Observati<strong>on</strong>s 89Igusa [Igu56] found an algebraic proof of this theorem We will now make somecomments <strong>on</strong> how an analytic proof goes.Proof. Let C(j) = K = F 1 be the field of modular functi<strong>on</strong>s for SL 2 (Z). SupposeN ≥ 3 <strong>and</strong> let F N be the field of mereomorphic functi<strong>on</strong>s for Γ(N). Then F N /F 1is a Galois extensi<strong>on</strong> with Galois group SL 2 (Z/NZ)/{±1}.Let E be an elliptic curve over K with j-invariant j. We will show that Gal(F N /F 1 ) =SL 2 (Z/NZ)/{±1} acts transitively <strong>on</strong> the x-coordinates of the N torsi<strong>on</strong> pointsof E. This will show that ρ E maps surjectively <strong>on</strong>to SL 2 (Z/NZ)/{±1}. Then bypropositi<strong>on</strong> 11.0.1, ρ E maps surjectively <strong>on</strong>to SL 2 (Z/NZ), as claimed.We will now c<strong>on</strong>struct the x-coordinates of E[N] as functi<strong>on</strong>s <strong>on</strong> H which areinvariant under Γ(N). (Thus K(E[N]/{±1}) ⊂ F N .)Let τ ∈ H <strong>and</strong> let L τ = Zτ + Z. C<strong>on</strong>sider ℘(z, L τ ) which gives the x coordinateof C/L τ in it st<strong>and</strong>ard form. Define, for each n<strong>on</strong>zero (r, s) ∈ ((Z/NZ)/{±1}) 2 , afuncti<strong>on</strong>f (r,s) : H → C : τ ↦→ g 2(τ)g 3 (τ) ℘(rτ + sN , L τ ).First notice that for any α = ( a bc d)∈ SL2 (Z),f (r,s) (ατ) = f( )(r,s) a b (τ).c dIndeed, ℘ is homogeneous of degree −2, g 2 is modular of weight 4 <strong>and</strong> g 3 is modularof weight 6, sof (r,s) (ατ) = g 2(ατ)g 3 (ατ) ℘(rατ + sN )= (cτ + d) −2 g 2(τ) + rb + csτ + sd℘(raτ )g 3 (τ) N(cτ + d)= g 2(τ) + sc)τ + rb + sd℘((ra ) = f (r,s)α (τ)g 3 (τ) NIf τ ∈ H with g 2 (τ), g 3 (τ) ≠ 0 then the f (r,s) (τ) are the x-coordinates ofthe n<strong>on</strong>zero N-divisi<strong>on</strong> points of E j(τ) . The various f (r,s) (τ) are distinct. ThusSL 2 (Z/NZ)/{±1} acts transitively <strong>on</strong> the f (r,s) . The c<strong>on</strong>sequence is that theN 2 − 1 n<strong>on</strong>zero divisi<strong>on</strong> points of E j have x-coordinates in F N equal to thef (r,s) ∈ F N .11.3 Observati<strong>on</strong>sPropositi<strong>on</strong> 11.3.1. If E/Q(µ N )(t) is an elliptic curve with j(E) = t, then ρ Ehas image SL 2 (Z/NZ).Proof. Since Q(µ N ) c<strong>on</strong>tains the Nth roots of unity Nth cyclotomic characteris trivial hence the determinant of ρ E is trivial. Thus the image of ρ E lies inSL 2 (Z/NZ). In the other directi<strong>on</strong>, there is a natural inclusi<strong>on</strong>SL 2 (Z/NZ) = Gal(C(t)(E[N])/C(t)) ↩→ Gal(Q(µ N )(t)(E[N])/Q(µ N )(t)).


90 11. The Field of ModuliPropositi<strong>on</strong> 11.3.2. If E/Q(t) is an elliptic curve with j(E) = t, then ρ E hasimage GL 2 (Z/NZ) <strong>and</strong> Q ∩ Q(t)(E[N]) = Q(µ N ).Proof. Since Q(t) c<strong>on</strong>tains no Nth roots of unity, the mod N cyclotomic character,<strong>and</strong> hence detρ E , is surjective <strong>on</strong>to (Z/NZ) ∗ . Since the image of ρ E alreadyc<strong>on</strong>tains SL 2 (Z/NZ) it must equal GL 2 (Z/NZ). For the sec<strong>on</strong>d asserti<strong>on</strong> c<strong>on</strong>siderthe diagramQQ(t)(E[N])SL 2(Z/NZ) ∗Q(µ N )QQ(µ N )(t)Q(t)GL 2/SL 2=(Z/NZ) ∗This gives a way to view X 0 (N) as a projective algebraic curve over Q. LetK = Q(t) <strong>and</strong> let L = K(E[N]) = Q(µ N )(t). ThenH = { ( ∗ ∗0 ∗)} ⊂ GL2 (Z/NZ) = Gal(L/K).The fixed field L H is an extensi<strong>on</strong> of Q(t) of transcendence degree 1 with field ofc<strong>on</strong>stants Q ∩ L H = Q, i.e., a projective algebraic curve.11.4 A descent problemC<strong>on</strong>sider the following exercise which may be approached in an h<strong>on</strong>est or dish<strong>on</strong>estway.Exercise 11.4.1. Suppose L/K is a finite Galois extensi<strong>on</strong> <strong>and</strong> G = Gal(L/K). LetE/L be an elliptic curve, assume Aut L E = {±1}, <strong>and</strong> suppose that for all g ∈ G,there is an isomorphism g E ∼ −→ E over L. Show that there exists E 0 /K such thatE 0∼ = E over L.Cauti<strong>on</strong>! The exercise is false as stated. Both the dish<strong>on</strong>est <strong>and</strong> h<strong>on</strong>est approachesbelow work <strong>on</strong>ly if L is a separable closure of K. Now: can <strong>on</strong>e c<strong>on</strong>structa counterexample?Discussi<strong>on</strong>. First the hard, but “h<strong>on</strong>est” way to look at this problem. For noti<strong>on</strong>s<strong>on</strong> descent see [Ser88]. By descent theory, to give E 0 is the same as to give a family(λ g ) g∈G of maps λ g : g E ∼ −→ E such that λ gh = λ g ◦ g λ h where g λ h = g ◦ λ h ◦ g −1 .Note that λ g ◦ g λ h maps gh E → E. This is the natural c<strong>on</strong>diti<strong>on</strong> to impose, becauseif f : E 0∼−→ E <strong>and</strong> we let λg = f ◦ g (f −1 ) then λ gh = λ g ◦ g λ h .Using our hypothesis choose, for each g ∈ G, an isomorphismλ g : g E ∼ −→ E.Define a map c byc(g, h) = λ g ◦ g λ h ◦ λ −1gh .


11.5 Sec<strong>on</strong>d look at the descent exercise 91Note that c(g, h) ∈ Aut E = {±1} so c defines an element ofH 2 (G, {±1}) ⊂ H 2 (Gal(L/K), {±1}) = Br(K)[2].Here Br(K)[2] denotes the 2-torsi<strong>on</strong> of the Brauer groupBr(K) = H 2 (Gal(L/K), L ∗ ).This probably leads to an h<strong>on</strong>est proof.The dish<strong>on</strong>est approach is to note that g(j(E)) = j(E) for all g ∈ G since allc<strong>on</strong>jugates of E are isomorphic <strong>and</strong> j( g E) = g(j(E)). Thus j(E) ∈ K so we c<strong>and</strong>efine E 0 /K by substituting j(E) into the universal elliptic curve formula [Sil92,III.1.4]. This gives an elliptic curve E 0 defined over K but isomorphic to E overK.11.5 Sec<strong>on</strong>d look at the descent exerciseLast time we talked about the following problem. Suppose L/K is a Galois extensi<strong>on</strong>with char K = 0, <strong>and</strong> let E/L be an elliptic curve. Suppose that for allσ ∈ G = Gal(L/K), σ E ∼ = E over L. C<strong>on</strong>clude that there is an elliptic curve E 0 /Ksuch that E 0∼ = E over L. The c<strong>on</strong>clusi<strong>on</strong> may fail to hold if L is a finite extensi<strong>on</strong>of K, but the exercise is true when L = K. First we give a descent argument whichholds when L = K <strong>and</strong> then give a counterexample to the more general statement.For g, h ∈ G = Gal(L/K) we define an automorphism c(g, h) ∈ Aut E = {±1}.Choose for every g ∈ Gal(L/K) some isomorphismλ g : g E ∼ −→ E.If the λ g were to all satisfy the compatibility criteri<strong>on</strong> λ gh = λ g ◦ g λ h then bydescent theory we could find a K-structure <strong>on</strong> E, that is a model for E definedover K <strong>and</strong> isomorphic to E over L. Define c(g, h) by c(g, h)λ gh = λ g ◦ g λ h so c(g, h)measures how much the λ g fail to satisfy the compatibility criteri<strong>on</strong>. Since c(g, h)is a cocycle it defines an elements of H 2 (G, {±1}). We want to know that thiselement is trivial. When L = K, the map H 2 (G, {±1}) → H 2 (G, L ∗ ) is injective.To see this first c<strong>on</strong>sider the exact sequence0 → {±1} → K ∗ 2 −→ K ∗ → 0where 2 : K ∗→ K ∗ is the squaring map. Taking cohomology yields an exactsequenceH 1 (G, K ∗ ) → H 2 (G, {±1}) → H 2 (G, K ∗ ).By Hilbert’s theorem 90 ([Ser79] Ch. X, Prop. 2), H 1 (G, {±1}) = 0. Thus we havean exact sequence0 → H 2 (G, {±1}) → H 2 (G, K ∗ )[2] → 0.Thus H 2 (G, {±1}) naturally sits inside H 2 (G, L ∗ ).To finish Ribet does something with differentials <strong>and</strong> H 0 ( g E, Ω 1 ) which I d<strong>on</strong>’tunderst<strong>and</strong>.The counterexample in the case when L/K is finite was provided by KevinBuzzard (who said Coates gave it to him). Let L = Q(i), K = Q <strong>and</strong> E be theelliptic curve with Weirstrass equati<strong>on</strong> iy 2 = x 3 + x + 1. Then E is isomorphic toits c<strong>on</strong>jugate over L but <strong>on</strong>e can show directly that E has no model over Q.


92 11. The Field of Moduli11.6 Acti<strong>on</strong> of GL 2Let N > 3 be an integer <strong>and</strong> E/Q(j) an elliptic curve with j-invariant j(E) = j.Then there is a Galois extensi<strong>on</strong>F N = Q(j)(E[N]/{±1})|F 1 = Q(j)with Galois group GL 2 (Z/NZ)/{±1}. Think of Q(j)(E[N]/{±1} as the field obtainedfrom Q(j) by adjoining the x-coordinates of the N-torsi<strong>on</strong> points of E. Notethat this situati<strong>on</strong> differs from the previous situati<strong>on</strong> in that the base field C hasbeen replaced by Q.C<strong>on</strong>siderF = ⋃ F NNwhich corresp<strong>on</strong>ds to a projective system of modular curves . Let A f be the ringof finite adèles, thusA f = ˆQ = Ẑ ⊗ Q ⊂ ∏ Q p .pA f can be thought of as{(x p ) : x p ∈ Z p for almost all p}.The group GL 2 (A f ) acts <strong>on</strong> F. To underst<strong>and</strong> what this acti<strong>on</strong> is we first c<strong>on</strong>siderthe subgroup GL 2 (Ẑ) of GL 2(A f ).It can be shown thatwhere f N,(r,s) is a functi<strong>on</strong>F = Q(f N,(r,s) : (r, s) ∈ (Z/NZ) 2 − {(0, 0)}, N ≥ 1)f N,(r,s) : H → C :τ ↦→ g 2(τ)g 3 (τ) ℘(rτ + sN , L τ ).We define the acti<strong>on</strong> of GL 2 (Ẑ) <strong>on</strong> F as follows. Let g ∈ GL2(Ẑ), then to give theacti<strong>on</strong> of g <strong>on</strong> f N,(r,s) first map g into GL 2 (Z/NZ) via the natural reducti<strong>on</strong> map,then note that GL 2 (Z/NZ) acts <strong>on</strong> f N,(r,s) by( a b)c d· f N,(r,s) = f ( )N,(r,s) a b = fN,(ra+sc,rb+sd) .c dLet E be an elliptic curve. Then the universal Tate module isT (E) = lim ←−E[N] = ∏ pT p (E).There is an isomorphism α : Ẑ 2 −→ ∼ T (E). Via right compositi<strong>on</strong> GL 2 (Ẑ) acts<strong>on</strong> the collecti<strong>on</strong> of all such isomorphism α. So GL 2 (Ẑ) acts naturally <strong>on</strong> pairs(E, α) but the acti<strong>on</strong> does nothing to E. An important point to be grasped when


11.6 Acti<strong>on</strong> of GL 2 93c<strong>on</strong>structi<strong>on</strong> objects like Shimura varieties is that we must “free ourselves” <strong>and</strong>allow GL 2 (Ẑ) to act <strong>on</strong> the E’s as well.Let( a b)g = ∈ GL +c d2 (Q)(thus g has positive determinant). Let τ ∈ H <strong>and</strong> let E = E τ be the elliptic curvedetermined by the lattice L τ = Z + Zτ. Letα τ : L τ = Z + Zτ ∼ −→ Z 2be the isomorphism defined by τ ↦→ (1, 0) <strong>and</strong> 1 ↦→ (0, 1). Now view α = α τ as amapα : Z 2 ∼−→ H1 (E(C), Z).Tensoring with Q then gives another map (also denoted α)α : Q 2∼−→ H1 (E, Q).Then α ◦ g is another isomorphismQ 2α◦g−−→ H 1 (E, Q)which induces an isomorphism Z 2 −→ ∼ L ′ ⊂ H 1 (E, Q) where L ′ is a lattice. Thereexists an elliptic curve E ′ /C <strong>and</strong> a map λ ∈ Hom(E ′ , E) ⊗ Q which induces a map(also denoted λ)λ : H 1 (E ′ , Z) −→ ∼ L ′ ⊂ H 1 (E, Q)<strong>on</strong> homology groups.Now we can define an acti<strong>on</strong> <strong>on</strong> pairs (E, α) by sending (E, α) to (E ′ , α ′ ). Hereα ′ is the map α ′ : Z 2 → H 1 (E ′ , Z) given by the compositi<strong>on</strong>Z 2αg−→ L ′ λ−−→ −1H 1 (E ′ , Z).In more c<strong>on</strong>crete terms the acti<strong>on</strong> iswhere τ ′ = gτ = aτ+bcτ+d .g : (E τ , α τ ) ↦→ (E ′ τ , α ′ τ )


94 11. The Field of Moduli


12<strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dencesOur goal is to view the <strong>Hecke</strong> operators T n <strong>and</strong> 〈d〉 as objects defined over Q thatact in a compatible way <strong>on</strong> modular forms, modular Jacobians, <strong>and</strong> homology. Inorder to do this, we will define the <strong>Hecke</strong> operators as corresp<strong>on</strong>dences.12.1 The Definiti<strong>on</strong>Definiti<strong>on</strong> 12.1.1 (Corresp<strong>on</strong>dence). Let C 1 <strong>and</strong> C 2 be curves. A corresp<strong>on</strong>denceC 1 C 2 is a curve C together with n<strong>on</strong>c<strong>on</strong>stant morphisms α : C → C 1 <strong>and</strong>β : C → C 2 . We represent a corresp<strong>on</strong>dence by a diagramC 1α C βGiven a corresp<strong>on</strong>dence C 1 C 2 the dual corresp<strong>on</strong>dence C 2 C 1 is obtainedby looking at the diagram in a mirrorC 2β C αIn defining <strong>Hecke</strong> operators, we will focus <strong>on</strong> the simple case when the modularcurve is X 0 (N) <strong>and</strong> <strong>Hecke</strong> operator is T p , where p ∤ N. We will view T p as acorresp<strong>on</strong>dence X 0 (N) X 0 (N), so there is a curve C = X 0 (pN) <strong>and</strong> maps α<strong>and</strong> β fitting into a diagramC 2C 1X 0 (pN)α β X 0 (N) X 0 (N).


96 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dencesThe maps α <strong>and</strong> β are degeneracy maps which forget data. To define them, we viewX 0 (N) as classifying isomorphism classes of pairs (E, C), where E is an ellipticcurve <strong>and</strong> C is a cyclic subgroup of order N (we will not worry about what happensat the cusps, since any rati<strong>on</strong>al map of n<strong>on</strong>singular curves extends uniquely to amorphism). Similarly, X 0 (pN) classifies isomorphism classes of pairs (E, G) whereG = C ⊕ D, C is cyclic of order N <strong>and</strong> D is cyclic of order p. Note that since(p, N) = 1, the group G is cyclic of order pN <strong>and</strong> the subgroups C <strong>and</strong> D areuniquely determined by G. The map α forgets the subgroup D of order p, <strong>and</strong> βquotients out by D:α : (E, G) ↦→ (E, C) (12.1.1)β : (E, G) ↦→ (E/D, (C + D)/D) (12.1.2)We translate this into the language of complex analysis by thinking of X 0 (N)<strong>and</strong> X 0 (pN) as quotients of the upper half plane. The first map α corresp<strong>on</strong>ds tothe mapΓ 0 (pN)\h → Γ 0 (N)\hinduced by the inclusi<strong>on</strong> Γ 0 (pN) ↩→ Γ 0 (N). The sec<strong>on</strong>d map β is c<strong>on</strong>structed bycomposing the isomorphism( ) ( ) −1Γ 0 (pN)\h −→∼ p 0p 0Γ0 1 0 (pN) \h (12.1.3)0 1with the map to Γ 0 (N)\h induced by the inclusi<strong>on</strong>( ) ( ) −1p 0p 0Γ0 1 0 (pN) ⊂ Γ0 10 (N).The isomorphism (12.1.3) is induced by z ↦→ ( )p 00 1 z = pz; explicitly, it isΓ 0 (pN)z ↦→ ( )p 00 1 Γ0 (pN) ( )p 0 −1 ( p 0)0 1 0 1 z.(Note that this is well-defined.)The maps α <strong>and</strong> β induce pullback maps <strong>on</strong> differentialsα ∗ , β ∗ : H 0 (X 0 (N), Ω 1 ) → H 0 (X 0 (pN), Ω 1 ).We can identify S 2 (Γ 0 (N)) with H 0 (X 0 (N), Ω 1 ) by sending the cusp form f(z) tothe holomorphic differential f(z)dz. Doing so, we obtain two mapsα ∗ , β ∗ : S 2 (Γ 0 (N)) → S 2 (Γ 0 (pN)).Since α is induced by the identity map <strong>on</strong> the upper half plane, we have α ∗ (f) =f, where we view f = ∑ a n q n as a cusp form with respect to the smaller groupΓ 0 (pN). Also, since β ∗ is induced by z ↦→ pz, we haveThe factor of p is because∞∑β ∗ (f) = p a n q pn .n=1β ∗ (f(z)dz) = f(pz)d(pz) = pf(pz)dz.


12.2 Maps induced by corresp<strong>on</strong>dences 97Let X, Y , <strong>and</strong> C be curves, <strong>and</strong> α <strong>and</strong> β be n<strong>on</strong>c<strong>on</strong>stant holomorphic maps, sowe have a corresp<strong>on</strong>denceXα C β Y.By first pulling back, then pushing forward, we obtain induced maps <strong>on</strong> differentialsH 0 (X, Ω 1 ) α∗−→ H 0 (C, Ω 1 ) −→ β∗H 0 (Y, Ω 1 ).The compositi<strong>on</strong> β ∗ ◦ α ∗ is a map H 0 (X, Ω 1 ) → H 0 (Y, Ω 1 ). If we c<strong>on</strong>sider the dualcorresp<strong>on</strong>dence, which is obtained by switching the roles of X <strong>and</strong> Y , we obtain amap H 0 (Y, Ω 1 ) → H 0 (X, Ω 1 ).Now let α <strong>and</strong> β be as in (12.1.1). Then we can recover the acti<strong>on</strong> of T p <strong>on</strong>modular forms by c<strong>on</strong>sidering the induced mapβ ∗ ◦ α ∗ : H 0 (X 0 (N), Ω 1 ) → H 0 (X 0 (N), Ω 1 )<strong>and</strong> using that S 2 (Γ 0 (N)) ∼ = H 0 (X 0 (N), Ω 1 ).12.2 Maps induced by corresp<strong>on</strong>dencesIn this secti<strong>on</strong> we will see how corresp<strong>on</strong>dences induce maps <strong>on</strong> divisor groups,which in turn induce maps <strong>on</strong> Jacobians.Suppose ϕ : X → Y is a morphism of curves. Let Γ ⊂ X × Y be the graph of ϕ.This gives a corresp<strong>on</strong>denceXα Γ β YWe can rec<strong>on</strong>struct ϕ from the corresp<strong>on</strong>dence by using that ϕ(x) = β(α −1 (x)).[draw picture here]More generally, suppose Γ is a curve <strong>and</strong> that α : Γ → X has degree d ≥ 1. Viewα −1 (x) as a divisor <strong>on</strong> Γ (it is the formal sum of the points lying over x, countedwith appropriate multiplicities). Then β(α −1 (x)) is a divisor <strong>on</strong> Y . We thus obtaina mapDiv n (X) −−−−→ β◦α−1Div dn (Y ),where Div n (X) is the group of divisors of degree n <strong>on</strong> X. In particular, settingd = 0, we obtain a map Div 0 (X) → Div 0 (Y ).We now apply the above c<strong>on</strong>structi<strong>on</strong> to T p . Recall that T p is the corresp<strong>on</strong>denceX 0 (pN)α β X 0 (N) X 0 (N),


98 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>denceswhere α <strong>and</strong> β are as in Secti<strong>on</strong> 12.1 <strong>and</strong> the induced map is(E, C) ↦→α∗∑D∈E[p](E, C ⊕ D) ↦→β∗∑D∈E[p](E/D, (C + D)/D).Thus we have a map Div(X 0 (N)) → Div(X 0 (N)). This str<strong>on</strong>gly resembles thefirst definiti<strong>on</strong> we gave of T p <strong>on</strong> level 1 forms, where T p was a corresp<strong>on</strong>dence oflattices.12.3 Induced maps <strong>on</strong> Jacobians of curvesLet X be a curve of genus g over a field k. Recall that there is an importantassociati<strong>on</strong>{} {}curves X/k −→ Jacobians Jac(X) = J(X) of curvesbetween curves <strong>and</strong> their Jacobians.Definiti<strong>on</strong> 12.3.1 (Jacobian). Let X be a curve of genus g over a field k. Then theJacobian of X is an abelian variety of dimensi<strong>on</strong> g over k whose underlying groupis functorially isomorphic to the group of divisors of degree 0 <strong>on</strong> X modulo linearequivalence. (For a more precise definiti<strong>on</strong>, see Secti<strong>on</strong> ?? (Jacobians secti<strong>on</strong>) 1 .) 1There are many c<strong>on</strong>structi<strong>on</strong>s of the Jacobian of a curve. We first c<strong>on</strong>sider theAlbanese c<strong>on</strong>structi<strong>on</strong>. Recall that over C, any abelian variety is isomorphic toC g /L, where L is a lattice (<strong>and</strong> hence a free Z-module of rank 2g). There is anembeddingThen we realize Jac(X) as a quotientι : H 1 (X, Z) ↩→ H 0 (X, Ω 1 ) ∗∫γ ↦→ •Jac(X) = H 0 (X, Ω 1 ) ∗ /ι(H 1 (X, Z)).In this c<strong>on</strong>structi<strong>on</strong>, Jac(X) is most naturally viewed as covariantly associatedto X, in the sense that if X → Y is a morphism of curves, then the resulting mapH 0 (X, Ω 1 ) ∗ → H 0 (Y, Ω 1 ) ∗ <strong>on</strong> tangent spaces induces a map Jac(X) → Jac(Y ).There are other c<strong>on</strong>structi<strong>on</strong>s in which Jac(X) is c<strong>on</strong>travariantly associatedto X. For example, if we view Jac(X) as Pic 0 (X), <strong>and</strong> X → Y is a morphism, thenpullback of divisor classes induces a map Jac(Y ) = Pic 0 (Y ) → Pic 0 (X) = Jac(X).If F : X Y is a corresp<strong>on</strong>dence, then F induces an a map Jac(X) → Jac(Y )<strong>and</strong> also a map Jac(Y ) → Jac(X). If X = Y , so that X <strong>and</strong> Y are the same, itcan often be c<strong>on</strong>fusing to decide which duality to use. Fortunately, for T p , with pprime to N, it does not matter which choice we make. But it matters a lot if p | Nsince then we have n<strong>on</strong>-commuting c<strong>on</strong>fusable operators <strong>and</strong> this has resulted inmistakes in the literature.γ1 insert this


12.4 More <strong>on</strong> <strong>Hecke</strong> operators 9912.4 More <strong>on</strong> <strong>Hecke</strong> operatorsOur goal is to move things down to Q from C or Q. In doing this we want tounderst<strong>and</strong> T n (or T p ), that is, how they act <strong>on</strong> the associated Jacobians <strong>and</strong>how they can be viewed as corresp<strong>on</strong>dences. In characteristic p the formulas ofEichler-Shimura will play an important role.We c<strong>on</strong>sider T p as a corresp<strong>on</strong>dence <strong>on</strong> X 1 (N) or X 0 (N). To avoid c<strong>on</strong>fusi<strong>on</strong>we will mainly c<strong>on</strong>sider T p <strong>on</strong> X 0 (N) with p ∤ N. Thus assume, unless otherwisestated, that p ∤ N. Remember that T p was defined to be the corresp<strong>on</strong>denceX 0 (pN)α β X 0 (N) X 0 (N)Think of X 0 (pN) as c<strong>on</strong>sisting of pairs (E, D) where D is a cyclic subgroup of E oforder p <strong>and</strong> E is the enhanced elliptic curve c<strong>on</strong>sisting of an elliptic curve E al<strong>on</strong>gwith a cyclic subgroup of order N. The degeneracy map α forgets the subgroup D<strong>and</strong> the degeneracy map β divides by it. By c<strong>on</strong>travariant functoriality we have acommutative diagramH 0 (X 0 (N), Ω 1 )T ∗ p =α∗◦β∗ H 0 (X 0 (N), Ω 1 )S 2 (Γ 0 (N))T p S 2 (Γ 0 (N))Our c<strong>on</strong>venti<strong>on</strong> to define Tp∗ as α ∗ ◦ β ∗ instead of β ∗ ◦ α ∗ was completely psychologicalbecause there is a can<strong>on</strong>ical duality relating the two. We chose the waywe did because of the analogy with the case of a morphism ϕ : Y → X with graphΓ which induces a corresp<strong>on</strong>denceYπ 1 Γ π 2XSince the morphism ϕ induces a map <strong>on</strong> global secti<strong>on</strong>s in the other directi<strong>on</strong>H 0 (X, Ω 1 ) = Γ(X) ϕ∗−→ Γ(Y ) = H 0 (Y, Ω 1 )it is psychologically natural for more general corresp<strong>on</strong>dence such as T p to mapfrom the right to the left.The morphisms α <strong>and</strong> β in the definiti<strong>on</strong> of T p are defined over Q. This can beseen using the general theory of representable functors. Thus since T p is definedover Q most of the algebraic geometric objects we will c<strong>on</strong>struct related to T p willbe defined over Q.12.5 <strong>Hecke</strong> operators acting <strong>on</strong> JacobiansThe Jacobian J(X 0 (N)) = J 0 (N) is an abelian variety defined over Q. Thereare both covariant <strong>and</strong> c<strong>on</strong>travariant ways to c<strong>on</strong>struct J 0 (N). Thus a map α :


100 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dencesX 0 (pN) → X 0 (N) induces mapsJ 0 (pN)α ∗ J 0 (N)p+1J 0 (pN)α ∗ J 0 (N)Note that α ∗ ◦ α ∗ : J 0 (N) → J 0 (N) is just multiplicati<strong>on</strong> by deg(α) = p + 1, sincethere are p + 1 subgroups of order p in E. (At least when p ∤ N, when p|N thereare <strong>on</strong>ly p subgroups.)There are two possible ways to define T p as an endomorphism of J 0 (N). Wecould either define T p as β ∗ ◦ α ∗ or equivalently as α ∗ ◦ β ∗ (assuming still thatp ∤ N).12.5.1 The Albanese MapThere is a way to map the curve X 0 (N) into its Jacobian since the underlyinggroup structure of J 0 (N) is{}divisors of degree 0 <strong>on</strong> X 0 (N)J 0 (N) = {}principal divisorsOnce we have chosen a rati<strong>on</strong>al point, say ∞, <strong>on</strong> X 0 (N) we obtain the Albanesemapθ : X 0 (N) → J 0 (N) : x ↦→ x − ∞which sends a point x to the divisor x − ∞. The map θ gives us a way to pullbackdifferentials <strong>on</strong> J 0 (N). Let Cot J 0 (N) denote the cotangent space of J 0 (N) (or thespace of regular differentials). The diagramCot J 0 (N)ξ ∗ pCot J 0 (N)θ ∗ ≀T ∗ pH 0 (X 0 (N), Ω 1 ) H 0 (X 0 (N), Ω 1 )may be taken to give a definiti<strong>on</strong> of ξ p since there is a unique endomorphismξ p : J 0 (N) → J 0 (N) inducing a map ξ ∗ p which makes the diagram commute.Now suppose Γ is a corresp<strong>on</strong>dence X Y so we have a diagramXα Γ β YFor example, think of Γ as the graph of a morphism ϕ : X → Y . Then Γ shouldinduce a natural mapH 0 (Y, Ω 1 ) −→ H 0 (X, Ω 1 ).≀θ ∗


Taking Jacobians we see that the compositi<strong>on</strong>J(X) α∗−→ J(Γ) −→ β∗J(Y )12.6 The Eichler-Shimura relati<strong>on</strong> 101gives a map β ∗ ◦ α ∗ : J(X) → J(Y ). On cotangent spaces this induces a mapα ∗ ◦ β ∗ : H 0 (Y, Ω 1 ) → H 0 (X, Ω 1 ).Now, after choice of a rati<strong>on</strong>al point, the map X → J(X) induces a mapCot J(X) → H 0 (X, Ω 1 ). This is in fact independent of the choice of rati<strong>on</strong>al pointsince differentials <strong>on</strong> J(X) are invariant under translati<strong>on</strong>.The map J(X) → J(Y ) is preferred in the literature. It is said to be induced bythe Albanese functoriality of the Jacobian. We could have just as easily defined amap from J(Y ) → J(X). To see this letDualizing induces a map ψ ∨ = α ∗ ◦ β ∗ :ψ = β ∗ ◦ α ∗ : J(X) → J(Y ).J(X) ∨∼ =ψ ∨J(Y ) ∨∼ =J(X) J(Y )Here we have used autoduality of Jacobians. This can<strong>on</strong>ical duality is discussed in[MFK94] <strong>and</strong> [Mum70] <strong>and</strong> in Milne’s article in [Sch65].12.5.2 The <strong>Hecke</strong> algebraWe now have ξ p = T p ∈ End J 0 (N) for every prime p. If p|N, then we must decidebetween α ∗ ◦β ∗ <strong>and</strong> β ∗ ◦α ∗ . The usual choice is the <strong>on</strong>e which induces the usual T p<strong>on</strong> cusp forms. If you d<strong>on</strong>’t like your choice you can get out of it with Atkin-Lehneroperators.LetT = Z[. . . , T p , . . .] ⊂ End J 0 (N)then T is the same as T Z ⊂ End(S 2 (Γ 0 (N))). To see this first note that thereis a map T → T Z which is not a prior injective, but which is injective becauseelements of End J 0 (N) are completely determined by their acti<strong>on</strong> <strong>on</strong> Cot J 0 (N).12.6 The Eichler-Shimura relati<strong>on</strong>Suppose p ∤ N is a prime. The <strong>Hecke</strong> operator T p <strong>and</strong> the Frobenius automorphismFrob p induce, by functoriality, elements of End(J 0 (N) Fp ), which we also denoteT p <strong>and</strong> Frob p . The Eichler-Shimura relati<strong>on</strong> asserts that the relati<strong>on</strong>T p = Frob p +p Frob ∨ p (12.6.1)


102 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dencesABFIGURE 12.6.1. The reducti<strong>on</strong> mod p of the Deligne-Rapoport model of X 0(Np)holds in End(J 0 (N) Fp ). In this secti<strong>on</strong> we sketch the main idea behind why (12.6.1)holds. For more details <strong>and</strong> a proof of the analogous statement for J 1 (N), see[C<strong>on</strong>01]. 2 2Since J 0 (N) is an abelian variety defined over Q, it is natural to ask for theprimes p such that J 0 (N) have good reducti<strong>on</strong>. In the 1950s Igusa showed 3 that 3J 0 (N) has good reducti<strong>on</strong> for all p ∤ N. He viewed J 0 (N) as a scheme over Spec(Q),then “spread things out” to make an abelian scheme over Spec(Z[1/N]). He didthis by taking the Jacobian of the normalizati<strong>on</strong> of X 0 (N) (which is defined overZ[1/N]) in P n Z[1/N] .The Eichler-Shimura relati<strong>on</strong> is a formula for T p in characteristic p, or moreprecisely, for the corresp<strong>on</strong>ding endomorphisms in End(J 0 (N) Fp )) for all p forwhich J 0 (N) has good reducti<strong>on</strong> at p. If p ∤ N, then X 0 (N) Fp has many of thesame properties as X 0 (N) Q . In particular, the n<strong>on</strong>cuspidal points <strong>on</strong> X 0 (N) Fpclassify isomorphism classes of enhanced elliptic curves E = (E, C), where E is anelliptic curve over F p <strong>and</strong> C is a cyclic subgroup of E of order N. (Note that twopairs are c<strong>on</strong>sidered isomorphic if they are isomorphic over F p .)Next we ask what happens to the map T p : J 0 (N) → J 0 (N) under reducti<strong>on</strong>modulo p. To this end, c<strong>on</strong>sider the corresp<strong>on</strong>denceX 0 (Np)α β X 0 (N) X 0 (N)that defines T p . The curve X 0 (N) has good reducti<strong>on</strong> at p, but X 0 (Np) typicallydoes not. Deligne <strong>and</strong> Rapaport [DR73] showed that X 0 (Np) has relatively benignreducti<strong>on</strong> at p. Over F p , the reducti<strong>on</strong> X 0 (Np) Fp can be viewed as two copies ofX 0 (N) glued at the supersingular points, as illustrated in Figure 12.6.1.The set of supersingular pointsΣ ⊂ X 0 (N)(F p )is the set of points in X 0 (N) represented by pairs E = (E, C), where E is asupersingular elliptic curve (so E(F p )[p] = 0). There are exactly g+1 supersingularpoints, where g is the genus of X 0 (N). 4 4C<strong>on</strong>sider the corresp<strong>on</strong>dence T p : X 0 (N) X 0 (N) which takes an enhancedelliptic curve E to the sum ∑ E/D of all quotients of E by subgroups D of order p.2 Add more references to original source materials...3 Ken, what’s a reference for this?4 Reference.


12.6 The Eichler-Shimura relati<strong>on</strong> 103This is the corresp<strong>on</strong>denceX 0 (pN)α β X 0 (N) X 0 (N),(12.6.2)where the map α forgets the subgroup of order p, <strong>and</strong> β quotients out by it. Fromthis <strong>on</strong>e gets T p : J 0 (N) → J 0 (N) by functoriality.Remark 12.6.1. There are many ways to think of J 0 (N). The cotangent spaceCot J 0 (N) of J 0 (N) is the space of holomorphic (or translati<strong>on</strong> invariant) differentials<strong>on</strong> J 0 (N), which is isomorphic to S 2 (Γ 0 (N)). This gives a c<strong>on</strong>necti<strong>on</strong> betweenour geometric definiti<strong>on</strong> of T p <strong>and</strong> the definiti<strong>on</strong>, presented earlier, 5 of T p as an 5operator <strong>on</strong> a space of cusp forms.The Eichler-Shimura relati<strong>on</strong> takes place in End(J 0 (N) Fp ). Since X 0 (N) reduces“nicely” in characteristic p, we can apply the Jacobian c<strong>on</strong>structi<strong>on</strong> to X 0 (N) Fp .Lemma 12.6.2. The natural reducti<strong>on</strong> mapis injective.End(J 0 (N)) ↩→ End(J 0 (N) Fp )Proof. Let l ∤ Np be a prime. By [ST68, Thm. 1, Lem. 2], the reducti<strong>on</strong> to characteristicp map induces an isomorphismJ 0 (N)(Q)[l ∞ ] ∼ = J 0 (N)(F p )[l ∞ ].If ϕ ∈ End(J 0 (N)) reduces to the 0 map in End(J 0 (N) Fp ), then J 0 (N)(Q)[l ∞ ]must be c<strong>on</strong>tained in ker(ϕ). Thus ϕ induces the 0 map <strong>on</strong> Tate l (J 0 (N)), soϕ = 0.Let F : X 0 (N) Fp → X 0 (N) Fp be the Frobenius map in characteristic p. Thus,if K = K(X 0 (N)) is the functi<strong>on</strong> field of the n<strong>on</strong>singular curve X 0 (N), thenF : K → K is induced by the pth power map a ↦→ a p .Remark 12.6.3. The Frobenius map corresp<strong>on</strong>ds to the pth powering map <strong>on</strong>points. For example, if X = Spec(F p [t]), <strong>and</strong> z = (Spec(F p ) → X) is a pointdefined by a homomorphism α : F p [t] ↦→ F p , then F (z) is the compositeF p [t]If α(t) = ξ, then F (z)(t) = α(t p ) = ξ p .x↦→x−−−−−−−→ pαF p [t] −−−−→ F p .By both functorialities, F induces maps <strong>on</strong> the Jacobian of X 0 (N) Fp :which we illustrate as follows:Frob p = F ∗ <strong>and</strong> Ver p = Frob ∨ p = F ∗ ,Ver pJ 0 (N) Fp J 0 (N) FpFrob p5 more precise


104 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dencesNote that Ver p ◦ Frob p = Frob p ◦ Ver p = [p] since p is the degree of F (for example,if K = F p (t), then F (K) = F p (t p ) is a subfield of degree p, so the map inducedby F has degree p).Theorem 12.6.4 (Eichler-Shimura Relati<strong>on</strong>). Let N be a positive integer <strong>and</strong>p ∤ N be a prime. Then the following relati<strong>on</strong> holds:T p = Frob p + Ver p ∈ End(J 0 (N) Fp ).Sketch of Proof. We view X 0 (pN) Fp as two copies of X 0 (N) Fp glued al<strong>on</strong>g corresp<strong>on</strong>dingsupersingular points Σ, as in Figure 12.6.1. This diagram <strong>and</strong> the corresp<strong>on</strong>dence(12.6.2) that defines T p translate into the following diagram of schemesover F p :Σ X 0 (N) Fp X 0 (N) Fprs ∼ = X 0 (pN) Fp∼ =αβ X 0 (N) FpX 0 (N) FpThe maps r <strong>and</strong> s are defined as follows. Recall that a point of X 0 (N) Fp is anenhanced elliptic curve E = (E, C) c<strong>on</strong>sisting of an elliptic curve E (not necessarilydefined over F p ) al<strong>on</strong>g with a cyclic subgroup C of order N. We view a point <strong>on</strong>X 0 (Np) as a triple (E, C, E → E ′ ), where (E, C) is as above <strong>and</strong> E → E ′ isan isogeny of degree p. We use an isogeny instead of a cyclic subgroup of order pbecause E(F p )[p] has order either 1 or p, so the data of a cyclic subgroup of order pholds very little informati<strong>on</strong>.The map r sends E to (E, ϕ), where ϕ is the isogeny of degree p,ϕ : E Frob−−−→ E (p) .Here E (p) is the curve obtained from E by hitting all defining equati<strong>on</strong>s by Frobenious,that is, by pth powering the coefficients of the defining equati<strong>on</strong>s for E. Weintroduce E (p) since if E is not defined over F p , then Frobenious does not definean endomorphism of E. Thus r is the mapr :E ↦→ (E, E Frobp−−−→ E (p) ),<strong>and</strong> similarly we define s to be the maps :E ↦→ (E (p) , C, E Verp←−−− E (p) )where Ver p is the dual of Frob p (so Ver p ◦ Frob p = Frob p ◦ Ver p = [p]).We view α as the map sending (E, E → E ′ ) to E, <strong>and</strong> similarly we view β asthe map sending (E, E → E ′ ) to the pair (E ′ , C ′ ), where C ′ is the image of C in


12.7 Applicati<strong>on</strong>s of the Eichler-Shimura relati<strong>on</strong> 105E ′ via E → E ′ . Thusα : (E → E ′ ) ↦→ Eβ : (E ′ → E) ↦→ E ′It now follows immediately that α ◦ r = id <strong>and</strong> β ◦ s = id. Note also that α ◦ s =β ◦ r = F is the map E ↦→ E (p) .Away from the finitely many supersingular points, we may view X 0 (pN) Fp asthe disjoint uni<strong>on</strong> of two copies of X 0 (N) Fp . Thus away from the supersingularpoints, we have the following equality of corresp<strong>on</strong>dences:X 0 (pN) FpX 0 (N)α β = ′ FpX 0 (N)id=α◦r Fp F =β◦r F =α◦s+ id=β◦sX 0 (N) Fp X 0 (N) Fp X 0 (N) Fp X 0 (N) Fp X 0 (N) Fp X 0 (N) Fp ,where F = Frob p , <strong>and</strong> the = ′ means equality away from the supersingular points.Note that we are simply “pulling back” the corresp<strong>on</strong>dence; in the first summ<strong>and</strong>we use the inclusi<strong>on</strong> r, <strong>and</strong> in the sec<strong>on</strong>d we use the inclusi<strong>on</strong> s.This equality of corresp<strong>on</strong>dences implies that the equalityT p = Frob p + Ver pof endomorphisms holds <strong>on</strong> a dense subset of J 0 (N) Fp , hence <strong>on</strong> all J 0 (N) Fp .12.7 Applicati<strong>on</strong>s of the Eichler-Shimura relati<strong>on</strong>12.7.1 The Characteristic polynomial of FrobeniusHow can we apply the relati<strong>on</strong> T p = Frob + Ver in End(J 0 (N) Fp )? Let l ∤ pN bea prime <strong>and</strong> c<strong>on</strong>sider the l-adic Tate module()Tate l (J 0 (N)) = lim←− J 0(N)[l ν ] ⊗ Zl Q lwhich is a vector space of dimensi<strong>on</strong> 2g over Q l , where g is the genus of X 0 (N) orthe dimensi<strong>on</strong> of J 0 (N). Reducti<strong>on</strong> modulo p induces an isomorphismTate l (J 0 (N)) → Tate l (J 0 (N) Fp )(see the proof of Lemma 12.6.2). On Tate l (J 0 (N) Fp ) we have linear operatorsFrob p , Ver p <strong>and</strong> T p which, as we saw in Secti<strong>on</strong> 12.6, satisfyFrob p + Ver p = T p ,<strong>and</strong>Frob p ◦ Ver p = Ver p ◦ Frob p = [p].The endomorphism [p] is invertible <strong>on</strong> Tate l (J 0 (N) Fp ), since p is prime to l, soVer p <strong>and</strong> Frob p are also invertible <strong>and</strong>T p = Frob p +[p] Frob −1p .


106 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dencesMultiplying both sides by Frob p <strong>and</strong> rearranging, we see thatFrob 2 p −T p Frob p +[p] = 0 ∈ End(Tate l (J 0 (N) Fp )).This is a beautiful quadratic relati<strong>on</strong>, so we should be able to get something outof it. We will come back to this shortly, but first we c<strong>on</strong>sider the various objectsacting <strong>on</strong> the l-adic Tate module.The module Tate l (J 0 (N)) is acted up<strong>on</strong> in a natural way by1. The Galois group Gal(Q/Q) of Q, <strong>and</strong>2. End Q (J 0 (N)) ⊗ Zl Q l (which acts by functoriality).These acti<strong>on</strong>s commute with each other since endomorphisms defined over Q arenot affected by the acti<strong>on</strong> of Gal(Q/Q). Reducing modulo p, we also have thefollowing commuting acti<strong>on</strong>s:3. The Galois group Gal(F p /F p ) of F p , <strong>and</strong>4. End Fp (J 0 (N)) ⊗ Zl Q l .Note that a decompositi<strong>on</strong> group group D p ⊂ Gal(Q/Q) acts, after quotientingout by the corresp<strong>on</strong>ding inertia group, in the same way as Gal(F p /F p ) <strong>and</strong> theacti<strong>on</strong> is unramified, so acti<strong>on</strong> 3 is a special case of acti<strong>on</strong> 1.The Frobenius elements ϕ p ∈ Gal(F p /F p ) <strong>and</strong> Frob ∈ End Fp (J 0 (N)) ⊗ Zl Q linduce the same operator <strong>on</strong> Tate l (J 0 (N) Fp ). Note that while ϕ p naturally livesin a quotient of a decompositi<strong>on</strong> group, <strong>on</strong>e often takes a lift to get an element inGal(Q/Q).On Tate l (J 0 (N) Fp ) we have a quadratic relati<strong>on</strong>shipϕ 2 p − T p ϕ p + p = 0.This relati<strong>on</strong> plays a role when <strong>on</strong>e separates out pieces of J 0 (N) in order toc<strong>on</strong>struct Galois representati<strong>on</strong>s attached to newforms of weight 2. LetR = Z[. . . , T p , . . .] ⊂ End J 0 (N),where we <strong>on</strong>ly adjoin those T p with p ∤ N. Think of R as a reduced <strong>Hecke</strong> algebra;in particular, R is a subring of T. ThenR ⊗ Q =r∏E i ,where the E i are totally real number fields. The factors E i are in bijecti<strong>on</strong> withthe Galois c<strong>on</strong>jugacy classes of weight 2 newforms f <strong>on</strong> Γ 0 (M) (for some M|N).The bijecti<strong>on</strong> is the mapi=1f ↦→ Q(coefficients of f) = E iObserve that the map is the same if we replace f by <strong>on</strong>e of its c<strong>on</strong>jugates. Thisdecompositi<strong>on</strong> is a decompositi<strong>on</strong> of a subringR ⊗ Q ⊂ End(J 0 (N)) ⊗ Q def= End(J 0 (N) ⊗ Q).


12.7 Applicati<strong>on</strong>s of the Eichler-Shimura relati<strong>on</strong> 107Thus it induces a direct product decompositi<strong>on</strong> of J 0 (N), so J 0 (N) gets dividedup into subvarieties which corresp<strong>on</strong>d to c<strong>on</strong>jugacy classes of newforms.The relati<strong>on</strong>shipϕ 2 p − T p ϕ p + p = 0 (12.7.1)suggests thattr(ϕ p ) = T p <strong>and</strong> det ϕ p = p. (12.7.2)This is true, but (12.7.2) does not follow formally just from the given quadraticrelati<strong>on</strong>. It can be proved by combining (12.7.1) with the Weil pairing.12.7.2 The Cardinality of J 0 (N)(F p )Propositi<strong>on</strong> 12.7.1. Let p ∤ N be a prime, <strong>and</strong> let f be the characteristic polynomialof T p acting <strong>on</strong> S 2 (Γ 0 (N)). Then#J 0 (N)(F p ) = f(p + 1).666 Add details later, al<strong>on</strong>g with various generalizati<strong>on</strong>s.


108 12. <strong>Hecke</strong> <strong>Operators</strong> as Corresp<strong>on</strong>dences


13Abelian VarietiesThis chapter provides foundati<strong>on</strong>al background about abelian varieties <strong>and</strong> Jacobians,with an aim toward what we will need later when we c<strong>on</strong>struct abelianvarieties attached to modular forms. We will not give complete proofs of very much,but will try to give precise references whenever possible, <strong>and</strong> many examples.We will follow the articles by Rosen [Ros86] <strong>and</strong> Milne [Mil86] <strong>on</strong> abelian varieties.We will try primarily to explain the statements of the main results aboutabelian varieties, <strong>and</strong> prove results when the proofs are not too technical <strong>and</strong>enhance underst<strong>and</strong>ing of the statements.13.1 Abelian varietiesDefiniti<strong>on</strong> 13.1.1 (Variety). A variety X over a field k is a finite-type separatedscheme over k that is geometrically integral.The c<strong>on</strong>diti<strong>on</strong> that X be geometrically integral means that X kis reduced (n<strong>on</strong>ilpotents in the structure sheaf) <strong>and</strong> irreducible.Definiti<strong>on</strong> 13.1.2 (Group variety). A group variety is a group object in thecategory of varieties. More precisely, a group variety X over a field k is a varietyequipped with morphismsm : X × X → X <strong>and</strong> i : X → X<strong>and</strong> a point 1 X ∈ A(k) such that m, i, <strong>and</strong> 1 X satisfy the axioms of a group; inparticular, for every k-algebra R they give X(R) a group structure that dependsin a functorial way <strong>on</strong> R.Definiti<strong>on</strong> 13.1.3 (Abelian Variety). An abelian variety A over a field k is acomplete group variety.Theorem 13.1.4. Suppose A is an abelian variety. Then


110 13. Abelian Varieties1. The group law <strong>on</strong> A is commutative.2. A is projective, i.e., there is an embedding from A into P n for some n.3. If k = C, then A(k) is analytically isomorphic to V/L, where V is a finitedimensi<strong>on</strong>alcomplex vector space <strong>and</strong> L is a lattice in V . (A lattice is a freeZ-module of rank equal to 2 dim V such that RL = V .)Proof. Part 1 is not too difficult, <strong>and</strong> can be proved by showing that every morphismof abelian varieties is the compositi<strong>on</strong> of a homomorphism with a translati<strong>on</strong>,then applying this result to the inversi<strong>on</strong> map (see [Mil86, Cor. 2.4]). Part 2is proved with some effort in [Mil86, §7]. Part 3 is proved in [Mum70, §I.1] usingthe exp<strong>on</strong>ential map from Lie theory from the tangent space at 0 to A.13.2 Complex toriLet A be an abelian variety over C. By Theorem 13.1.4, there is a complex vectorspace V <strong>and</strong> a lattice L in V such that A(C) = V/L, that is to say, A(C) is acomplex torus.More generally, if V is any complex vector space <strong>and</strong> L is a lattice in V , we callthe quotient T = V/L a complex torus. In this secti<strong>on</strong>, we prove some results aboutcomplex tori that will help us to underst<strong>and</strong> the structure of abelian varieties, <strong>and</strong>will also be useful in designing algorithms for computing with abelian varieties.The differential 1-forms <strong>and</strong> first homology of a complex torus are easy to underst<strong>and</strong>in terms of T . If T = V/L is a complex torus, the tangent space to0 ∈ T is can<strong>on</strong>ically isomorphic to V . The C-linear dual V ∗ = Hom C (V, C) isisomorphic to the C-vector space Ω(T ) of holomorphic differential 1-forms <strong>on</strong> T .Since V → T is the universal covering of T , the first homology H 1 (T, Z) of T iscan<strong>on</strong>ically isomorphic to L.13.2.1 HomomorphismsSuppose T 1 = V 1 /L 1 <strong>and</strong> T 2 = V 2 /L 2 are two complex tori. If ϕ : T 1 → T 2 is a(holomorphic) homomorphism, then ϕ induces a C-linear map from the tangentspace of T 1 at 0 to the tangent space of T 2 at 0. The tangent space of T i at 0 iscan<strong>on</strong>ically isomorphic to V i , so ϕ induces a C-linear map V 1 → V 2 . This maps


13.2 Complex tori 111sends L 1 into L 2 , since L i = H 1 (T i , Z). We thus have the following diagram:00(13.2.1)L 1ρ Z (ϕ) V 1ρ C (ϕ) T 1 ϕ L 2L 2T 20 0We obtain two faithful representati<strong>on</strong>s of Hom(T 1 , T 2 ),ρ C : Hom(T 1 , T 2 ) → Hom C (V 1 , V 2 )ρ Z : Hom(T 1 , T 2 ) → Hom Z (L 1 , L 2 ).Suppose ψ ∈ Hom Z (L 1 , L 2 ). Then ψ = ρ Z (ϕ) for some ϕ ∈ Hom(T 1 , T 2 ) if <strong>and</strong><strong>on</strong>ly if there is a complex linear homomorphism f : V 1 → V 2 whose restricti<strong>on</strong> toL 1 is ψ. Note that f = ψ ⊗ R is uniquely determined by ψ, so ψ arises from someϕ precisely when f is C-linear. This is the case if <strong>and</strong> <strong>on</strong>ly if fJ 1 = J 2 f, whereJ n : V n → V n is the R-linear map induced by multiplicati<strong>on</strong> by i = √ −1 ∈ C.Example 13.2.1.1. Suppose L 1 = Z + Zi ⊂ V 1 = C. Then with respect to the basis 1, i, wehave J 1 = ( )0 −11 0 . One finds that Hom(T1 , T 1 ) is the free Z-module of rank 2whose image via ρ Z is generated by J 1 <strong>and</strong> ( 1 0 01 ). As a ring Hom(T 1, T 1 ) isisomorphic to Z[i].2. Suppose L 1 = Z + Zαi ⊂ V 1 ( = C, with ) α 3 = 2. Then with respect tothe basis 1, αi, we have J 1 = 0 −α1/α 0 . Only the scalar integer matricescommute with J 1 .Propositi<strong>on</strong> 13.2.2. Let T 1 <strong>and</strong> T 2 be complex tori. Then Hom(T 1 , T 2 ) is a freeZ-module of rank at most 4 dim T 1 · dim T 2 .Proof. The representati<strong>on</strong> ρ Z is faithful (injective) because ϕ is determined by itsacti<strong>on</strong> <strong>on</strong> L 1 , since L 1 spans V 1 . Thus Hom(T 1 , T 2 ) is isomorphic to a subgroup ofHom Z (L 1 , L 2 ) ∼ = Z d , where d = 2 dim V 1 · 2 dim V 2 .Lemma 13.2.3. Suppose ϕ : T 1 → T 2 is a homomorphism of complex tori. Thenthe image of ϕ is a subtorus of T 2 <strong>and</strong> the c<strong>on</strong>nected comp<strong>on</strong>ent of ker(ϕ) is asubtorus of T 1 that has finite index in ker(ϕ).


112 13. Abelian VarietiesProof. Let W = ker(ρ C (ϕ)). Then the following diagram, which is induced by ϕ,has exact rows <strong>and</strong> columns:0000 L 1 ∩ W L 1L 2L 2 /ϕ(L 1 )00 W V 1V 2V 2 /ϕ(V 1 )00 ker(ϕ) T 1T 2T 2 /ϕ(T 1 )00 0 0Using the snake lemma, we obtain an exact sequence0 → L 1 ∩ W → W → ker(ϕ) → L 2 /ϕ(L 1 ) → V 2 /ϕ(V 1 ) → T 2 /ϕ(T 1 ) → 0.Note that T 2 /ϕ(T 1 ) is compact because it is the c<strong>on</strong>tinuous image of a compactset, so the cokernel of ϕ is a torus (it is given as a quotient of a complex vectorspace by a lattice).The kernel ker(ϕ) ⊂ T 1 is a closed subset of the compact set T 1 , so is compact.Thus L 1 ∩W is a lattice in W . The map L 2 /ϕ(L 1 ) → V 2 /ϕ(V 1 ) has kernel generatedby the saturati<strong>on</strong> of ϕ(L 1 ) in L 2 , so it is finite, so the torus W/(L 1 ∩ W ) has finiteindex in ker(ϕ).Remark 13.2.4. The category of complex tori is not an abelian category becausekernels need not be in the category.13.2.2 IsogeniesDefiniti<strong>on</strong> 13.2.5 (Isogeny). An isogeny ϕ : T 1 → T 2 of complex tori is a surjectivemorphism with finite kernel. The degree deg(ϕ) of ϕ is the order of the kernelof ϕ.Note that deg(ϕ ◦ ϕ ′ ) = deg(ϕ) deg(ϕ ′ ).Lemma 13.2.6. Suppose that ϕ is an isogeny. Then the kernel of ϕ is isomorphicto the cokernel of ρ Z (ϕ).Proof. (This is essentially a special case of Lemma 13.2.3.) Apply the snake lemmato the morphism (13.2.1) of short exact sequences, to obtain a six-term exactsequence0 → K L → K V → K T → C L → C V → C T → 0,where K X <strong>and</strong> C X are the kernel <strong>and</strong> cokernel of X 1 → X 2 , for X = L, V, T ,respectively. Since ϕ is an isogeny, the induced map V 1 → V 2 must be an isomorphism,since otherwise the kernel would c<strong>on</strong>tain a n<strong>on</strong>zero subspace (modulo alattice), which would be infinite. Thus K V = C V = 0. It follows that K T∼ = CL , asclaimed.


13.3 Abelian varieties as complex tori 113One c<strong>on</strong>sequence of the lemma is that if ϕ is an isogeny, thendeg(ϕ) = [L 1 : ρ Z (ϕ)(L 1 )] = | det(ρ Z (ϕ))|.Propositi<strong>on</strong> 13.2.7. Let T be a complex torus of dimensi<strong>on</strong> d, <strong>and</strong> let n be apositive integer. Then multiplicati<strong>on</strong> by n, denoted [n], is an isogeny T → T withkernel T [n] ∼ = (Z/nZ) 2d <strong>and</strong> degree n 2d .Proof. By Lemma 13.2.6, T [n] is isomorphic to L/nL, where T = V/L. SinceL ≈ Z 2d , the propositi<strong>on</strong> follows.We can now prove that isogeny is an equivalence relati<strong>on</strong>.Propositi<strong>on</strong> 13.2.8. Suppose ϕ : T 1 → T 2 is a degree m isogeny of complex toriof dimensi<strong>on</strong> d. Then there is a unique isogeny ˆϕ : T 2 → T 1 of degree m 2d−1 suchthat ˆϕ ◦ ϕ = ϕ ◦ ˆϕ = [m].Proof. Since ker(ϕ) ⊂ ker([m]), the map [m] factors through ϕ, so there is amorphism ˆϕ such that ˆϕ ◦ ϕ = [m]:T ϕ 1 T 2ˆϕ[m]T 1We have(ϕ ◦ ˆϕ − [m]) ◦ ϕ = ϕ ◦ ˆϕ ◦ φ − [m] ◦ ϕ = ϕ ◦ ˆϕ ◦ φ − ϕ ◦ [m] = ϕ ◦ ( ˆϕ ◦ φ − [m]) = 0.This implies that ϕ ◦ ˆϕ = [m], since ϕ is surjective. Uniqueness is clear since thedifference of two such morphisms would vanish <strong>on</strong> the image of ϕ. To see that ˆϕhas degree m 2d−1 , we take degrees <strong>on</strong> both sides of the equati<strong>on</strong> ˆϕ ◦ ϕ = [m].13.2.3 EndomorphismsThe ring End(T ) = Hom(T, T ) is called the endomorphism ring of the complextorus T . The endomorphism algebra of T is End 0 (T ) = End(T ) ⊗ Z Q.Definiti<strong>on</strong> 13.2.9 (Characteristic polynomial). The characteristic polynomial ofϕ ∈ End(T ) is the characteristic polynomial of the ρ Z (ϕ). Thus the characteristicpolynomial is a m<strong>on</strong>ic polynomial of degree 2 dim T .13.3 Abelian varieties as complex toriIn this secti<strong>on</strong> we introduce extra structure <strong>on</strong> a complex torus T = V/L that willenable us to underst<strong>and</strong> whether or not T is isomorphic to A(C), for some abelianvariety A over C. When dim T = 1, the theory of the Weierstrass ℘ functi<strong>on</strong>implies that T is always E(C) for some elliptic curve. In c<strong>on</strong>trast, the generictorus of dimensi<strong>on</strong> > 1 does not arise from an abelian variety.In this secti<strong>on</strong> we introduce the basic structures <strong>on</strong> complex tori that are neededto underst<strong>and</strong> which tori arise from abelian varieties, to c<strong>on</strong>struct the dual of an


114 13. Abelian Varietiesabelian variety, to see that End 0 (A) is a semisimple Q-algebra, <strong>and</strong> to underst<strong>and</strong>the polarizati<strong>on</strong>s <strong>on</strong> an abelian variety. For proofs, including extensive motivati<strong>on</strong>from the <strong>on</strong>e-dimensi<strong>on</strong>al case, read the beautifully written book [SD74] bySwinnert<strong>on</strong>-Dyer, <strong>and</strong> for another survey that str<strong>on</strong>gly influenced the discussi<strong>on</strong>below, see Rosen’s [Ros86].13.3.1 Hermitian <strong>and</strong> Riemann formsLet V be a finite-dimensi<strong>on</strong>al complex vector space.Definiti<strong>on</strong> 13.3.1 (Hermitian form). A Hermitian form is a c<strong>on</strong>jugate-symmetricpairingH : V × V → Cthat is C-linear in the first variable <strong>and</strong> C-antilinear in the sec<strong>on</strong>d. Thus H isR-bilinear, H(iu, v) = iH(u, v) = H(u, iv), <strong>and</strong> H(u, v) = H(v, u).Write H = S + iE, where S, E : V × V → R are real bilinear pairings.Propositi<strong>on</strong> 13.3.2. Let H, S, <strong>and</strong> E be as above.1. We have that S is symmetric, E is antisymmetric, <strong>and</strong>S(u, v) = E(iu, v), S(iu, iv) = S(u, v), E(iu, iv) = E(u, v).2. C<strong>on</strong>versely, if E is a real-valued antisymmetric bilinear pairing <strong>on</strong> V suchthat E(iu, iv) = E(u, v), then H(u, v) = E(iu, v) + iE(u, v) is a Hermitianform <strong>on</strong> V . Thus there is a bijecti<strong>on</strong> between the Hermitian forms <strong>on</strong> V <strong>and</strong>the real, antisymmetric bilinear forms E <strong>on</strong> V such that E(iu, iv) = E(u, v).Proof. To see that S is symmetric, note that 2S = H +H <strong>and</strong> H +H is symmetricbecause H is c<strong>on</strong>jugate symmetric. Likewise, E = (H − H)/(2i), soE(v, u) = 1 2i()H(v, u) − H(v, u) = 1 ()H(u, v) − H(u, v) = −E(u, v),2iwhich implies that E is antisymmetric. To see that S(u, v) = E(iu, v), rewrite bothS(u, v) <strong>and</strong> E(iu, v) in terms of H <strong>and</strong> simplify to get an identity. The other twoidentities follow sinceH(iu, iv) = iH(u, iv) = iiH(u, v) = H(u, v).Suppose E : V × V → R is as in the sec<strong>on</strong>d part of the propositi<strong>on</strong>. ThenH(iu, v) = E(i 2 u, v) + iE(iu, v) = −E(u, v) + iE(iu, v) = iH(u, v),<strong>and</strong> the other verificati<strong>on</strong>s of linearity <strong>and</strong> antilinearity are similar. For c<strong>on</strong>jugatesymmetry, note thatH(v, u) = E(iv, u) + iE(v, u) = −E(u, iv) − iE(u, v)= −E(iu, −v) − iE(u, v) = H(u, v).


13.3 Abelian varieties as complex tori 115Note that the set of Hermitian forms is a group under additi<strong>on</strong>.Definiti<strong>on</strong> 13.3.3 (Riemann form). A Riemann form <strong>on</strong> a complex torus T =V/L is a Hermitian form H <strong>on</strong> V such that the restricti<strong>on</strong> of E = Im(H) to L isinteger valued. If H(u, u) ≥ 0 for all u ∈ V then H is positive semi-definite <strong>and</strong> ifH is positive <strong>and</strong> H(u, u) = 0 if <strong>and</strong> <strong>on</strong>ly if u = 0, then H is n<strong>on</strong>degenerate.Theorem 13.3.4. Let T be a complex torus. Then T is isomorphic to A(C), forsome abelian variety A, if <strong>and</strong> <strong>on</strong>ly if there is a n<strong>on</strong>degenerate Riemann form <strong>on</strong> T .This is a n<strong>on</strong>trivial theorem, which we will not prove here. It is proved in [SD74,Ch.2] by defining an injective map from positive divisors <strong>on</strong> T = V/L to positivesemi-definite Riemann forms, then c<strong>on</strong>structing positive divisors associated totheta functi<strong>on</strong>s <strong>on</strong> V . If H is a n<strong>on</strong>degenerate Riemann form <strong>on</strong> T , <strong>on</strong>e computesthe dimensi<strong>on</strong> of a space of theta functi<strong>on</strong>s that corresp<strong>on</strong>ds to H in terms of thedeterminant of E = Im(H). Since H is n<strong>on</strong>degenerate, this space of theta functi<strong>on</strong>sis n<strong>on</strong>zero, so there is a corresp<strong>on</strong>ding n<strong>on</strong>degenerate positive divisor D. Then abasis forL(3D) = {f : (f) + 3D is positive } ∪ {0}determines an embedding of T in a projective space.Why the divisor 3D instead of D above? For an elliptic curve y 2 = x 3 + ax + b,we could take D to be the point at infinity. Then L(3D) c<strong>on</strong>sists of the functi<strong>on</strong>swith a pole of order at most 3 at infinity, which c<strong>on</strong>tains 1, x, <strong>and</strong> y, which havepoles of order 0, 2, <strong>and</strong> 3, respectively.Remark 13.3.5. (Copied from page 39 of [SD74].) When n = dim V > 1, however,a general lattice L will admit no n<strong>on</strong>zero Riemann forms. For if λ 1 , . . . , λ 2n is abase for L then E as an R-bilinear alternating form is uniquely determined bythe E(λ i , λ j ), which are integers; <strong>and</strong> the c<strong>on</strong>diti<strong>on</strong> E(z, w) = E(iz, iw) induceslinear relati<strong>on</strong>s with real coefficients between E(λ i , λ j ), which for general L haveno n<strong>on</strong>trivial integer soluti<strong>on</strong>s.13.3.2 Complements, quotients, <strong>and</strong> semisimplicity of theendomorphism algebraLemma 13.3.6. If T possesses a n<strong>on</strong>degenerate Riemann form <strong>and</strong> T ′ ⊂ T is asubtorus, then T ′ also possesses a n<strong>on</strong>degenerate Riemann form.Proof. If H is a n<strong>on</strong>degenerate Riemann form <strong>on</strong> a torus T <strong>and</strong> T ′ is a subtorusof T , then the restricti<strong>on</strong> of H to T ′ is a n<strong>on</strong>degenerate Riemann form <strong>on</strong> T ′ (therestricti<strong>on</strong> is still n<strong>on</strong>degenerate because H is positive definite).Lemma 13.3.6 <strong>and</strong> Lemma 13.2.3 together imply that the kernel of a homomorphismof abelian varieties is an extensi<strong>on</strong> of an abelian variety by a finite group.Lemma 13.3.7. If T possesses a n<strong>on</strong>degenerate Riemann form <strong>and</strong> T → T ′ is anisogeny, then T ′ also possesses a n<strong>on</strong>degenerate Riemann form.Proof. Suppose T = V/L <strong>and</strong> T ′ = V ′ /L ′ . Since the isogeny is induced by anisomorphism V → V ′ that sends L into L ′ , we may assume for simplicity that V =V ′ <strong>and</strong> L ⊂ L ′ . If H is a n<strong>on</strong>degenerate Riemann form <strong>on</strong> V/L, then E = Re(H)need not be integer valued <strong>on</strong> L ′ . However, since L has finite index in L ′ , there


116 13. Abelian Varietiesis some integer d so that dE is integer valued <strong>on</strong> L ′ . Then dH is a n<strong>on</strong>degenerateRiemann form <strong>on</strong> V/L ′ .Note that Lemma 13.3.7 implies that the quotient of an abelian variety by afinite subgroup is again an abelian variety.Theorem 13.3.8 (Poincare Reducibility). Let A be an abelian variety <strong>and</strong> supposeA ′ ⊂ A is an abelian subvariety. Then there is an abelian variety A ′′ ⊂ A suchthat A = A ′ + A ′′ <strong>and</strong> A ′ ∩ A ′′ is finite. (Thus A is isogenous to A ′ × A ′′ .)Proof. We have A(C) ≈ V/L <strong>and</strong> there is a n<strong>on</strong>degenerate Riemann form H<strong>on</strong> V/L. The subvariety A ′ is isomorphic to V ′ /L ′ , where V ′ is a subspace of V<strong>and</strong> L ′ = V ′ ∩ L. Let V ′′ be the orthog<strong>on</strong>al complement of V ′ with respect to H,<strong>and</strong> let L ′′ = L∩V ′′ . To see that L ′′ is a lattice in V ′′ , it suffices to show that L ′′ isthe orthog<strong>on</strong>al complement of L ′ in L with respect to E = Im(H), which, becauseE is integer valued, will imply that L ′′ has the correct rank. First, suppose thatv ∈ L ′′ ; then, by definiti<strong>on</strong>, v is in the orthog<strong>on</strong>al complement of L ′ with respectto H, so for any u ∈ L ′ , we have 0 = H(u, v) = S(u, v) + iE(u, v), so E(u, v) = 0.Next, suppose that v ∈ L satisfies E(u, v) = 0 for all u ∈ L ′ . Since V ′ = RL ′ <strong>and</strong> Eis R-bilinear, this implies E(u, v) = 0 for any u ∈ V ′ . In particular, since V ′ is acomplex vector space, if u ∈ L ′ , then S(u, v) = E(iu, v) = 0, so H(u, v) = 0.We have shown that L ′′ is a lattice in V ′′ , so A ′′ = V ′′ /L ′′ is an abelian subvarietyof A. Also L ′ +L ′′ has finite index in L, so there is an isogeny V ′ /L ′ ⊕V ′′ /L ′′ → V/Linduced by the natural inclusi<strong>on</strong>s.Propositi<strong>on</strong> 13.3.9. Suppose A ′ ⊂ A is an inclusi<strong>on</strong> of abelian varieties. Thenthe quotient A/A ′ is an abelian variety.Proof. Suppose A = V/L <strong>and</strong> A ′ = V ′ /L ′ , where V ′ is a subspace of V . LetW = V/V ′ <strong>and</strong> M = L/(L ∩ V ′ ). Then, W/M is isogenous to the complex torusV ′′ /L ′′ of Theorem 13.3.8 via the natural map V ′′ → W . Applying Lemma 13.3.7completes the proof.Definiti<strong>on</strong> 13.3.10. An abelian variety A is simple if it has no n<strong>on</strong>zero properabelian subvarieties.Propositi<strong>on</strong> 13.3.11. The algebra End 0 (A) is semisimple.Proof. Using Theorem 13.3.8 <strong>and</strong> inducti<strong>on</strong>, we can find an isogenyA ≃ A n11 × An2 2 × · · · × Anr rwith each A i simple. Since End 0 (A) = End(A) ⊗ Q is unchanged by isogeny, <strong>and</strong>Hom(A i , A j ) = 0 when i ≠ j, we haveEnd 0 (A) = End 0 (A n11 ) × End 0(A n22 ) × · · · × End 0(A nrr )Each of End 0 (A nii ) is isomorphic to M ni (D i ), where D i = End 0 (A i ). By Schur’sLemma, D i = End 0 (A i ) is a divisi<strong>on</strong> algebra over Q (proof: any n<strong>on</strong>zero endomorphismhas trivial kernel, <strong>and</strong> any injective linear transformati<strong>on</strong> of a Q-vector spaceis invertible), so End 0 (A) is a product of matrix algebras over divisi<strong>on</strong> algebrasover Q, which proves the propositi<strong>on</strong>.


13.3.3 Theta functi<strong>on</strong>sSuppose T = V/L is a complex torus.13.4 A Summary of duality <strong>and</strong> polarizati<strong>on</strong>s 117Definiti<strong>on</strong> 13.3.12 (Theta functi<strong>on</strong>). Let M : V × L → C <strong>and</strong> J : L → C beset-theoretic maps such that for each λ ∈ L the map z ↦→ M(z, λ) is C-linear. Atheta functi<strong>on</strong> of type (M, J) is a functi<strong>on</strong> θ : V → C such that for all z ∈ V <strong>and</strong>λ ∈ L, we haveθ(z + λ) = θ(z) · exp(2πi(M(z, λ) + J(λ))).Suppose that θ(z) is a n<strong>on</strong>zero holomorphic theta functi<strong>on</strong> of type (M, J). TheM(z, λ), for various λ, cannot be unc<strong>on</strong>nected. Let F (z, λ) = 2πi(M(z, λ) + J(λ)).Lemma 13.3.13. For any λ, λ ′ ∈ L, we haveF (z, λ + λ ′ ) = F (z + λ, λ ′ ) + F (z, λ)(mod 2πi).ThusM(z, λ + λ ′ ) = M(z, λ) + M(z, λ ′ ), (13.3.1)<strong>and</strong>J(λ + λ ′ ) − J(λ) − J(λ ′ ) ≡ M(λ, λ ′ )(mod Z).Proof. Page 37 of [SD74].Using (13.3.1) we see that M extends uniquely to a functi<strong>on</strong> ˜M : V × V → Cwhich is C-linear in the first argument <strong>and</strong> R-linear in the sec<strong>on</strong>d. LetE(z, w) = ˜M(z, w) − M(w, z),H(z, w) = E(iz, w) + iE(z, w).Propositi<strong>on</strong> 13.3.14. The pairing H is Riemann form <strong>on</strong> T with real part E.We call H the Riemann form associated to θ.13.4 A Summary of duality <strong>and</strong> polarizati<strong>on</strong>sSuppose A is an abelian variety over an arbitrary field k. In this secti<strong>on</strong> we summarizethe most important properties of the dual abelian variety A ∨ of A. First wereview the language of sheaves <strong>on</strong> a scheme X, <strong>and</strong> define the Picard group of X asthe group of invertible sheaves <strong>on</strong> X. The dual of A is then a variety whose pointscorresp<strong>on</strong>d to elements of the Picard group that are algebraically equivalent to 0.Next, when the ground field is C, we describe how to view A ∨ as a complex torusin terms of a descripti<strong>on</strong> of A as a complex torus. We then define the Nér<strong>on</strong>-Severigroup of A <strong>and</strong> relate it to polarizati<strong>on</strong>s of A, which are certain homomorphismsA → A ∨ . Finally we observe that the dual is functorial.


118 13. Abelian Varieties13.4.1 SheavesWe will use the language of sheaves, as in [Har77], which we now quickly recall. Apre-sheaf of abelian groups F <strong>on</strong> a scheme X is a c<strong>on</strong>travariant functor from thecategory of open sets <strong>on</strong> X (morphisms are inclusi<strong>on</strong>s) to the category of abeliangroups. Thus for every open set U ⊂ X there is an abelian group F(U), <strong>and</strong> ifU ⊂ V , then there is a restricti<strong>on</strong> map F(V ) → F(U). (We also require thatF(∅) = 0, <strong>and</strong> the map F(U) → F(U) is the identity map.) A sheaf is a pre-sheafwhose secti<strong>on</strong>s are determined locally (for details, see [Har77, §II.1]).Every scheme X is equipped with its structure sheaf O X , which has the propertythat if U = Spec(R) is an affine open subset of X, then O X (U) = R. A sheaf ofO X -modules is a sheaf M of abelian groups <strong>on</strong> X such that each abelian group hasthe structure of O X -module, such that the restricti<strong>on</strong> maps are module morphisms.A locally-free sheaf of O X -modules is a sheaf M of O X -modules, such that X canbe covered by open sets U so that M| U is a free O X -module, for each U.13.4.2 The Picard groupAn invertible sheaf is a sheaf L of O X -modules that is locally free of rank 1. If Lis an invertible sheaf, then the sheaf-theoretic Hom, L ∨ = Hom(L, O X ) has theproperty that L ∨ ⊗L = O X . The group Pic(X) of invertible sheaves <strong>on</strong> a scheme Xis called the Picard group of X. See [Har77, §II.6] for more details.Let A be an abelian variety over a field k. An invertible sheaf L <strong>on</strong> A is algebraicallyequivalent to 0 if there is a c<strong>on</strong>nected variety T over k, an invertible sheafM <strong>on</strong> A × k T , <strong>and</strong> t 0 , t 1 ∈ T (k) such that M t0∼ = L <strong>and</strong> Mt1∼ = OA . Let Pic 0 (A)be the subgroup of elements of Pic(A) that are algebraically equivalent to 0.The dual A ∨ of A is a (unique up to isomorphism) abelian variety such that forevery field F that c<strong>on</strong>tains the base field k, we have A ∨ (F ) = Pic 0 (A F ). For theprecise definiti<strong>on</strong> of A ∨ <strong>and</strong> a proof that A ∨ exists, see [Mil86, §9–10].13.4.3 The Dual as a complex torusWhen A is defined over the complex numbers, so A(C) = V/L for some vectorspace V <strong>and</strong> some lattice L, [Ros86, §4] describes a c<strong>on</strong>structi<strong>on</strong> of A ∨ as a complextorus, which we now describe. LetV ∗ = {f ∈ Hom R (V, C) : f(αt) = αf(t), all α ∈ C, t ∈ V }.Then V ∗ is a complex vector space of the same dimensi<strong>on</strong> as V <strong>and</strong> the map〈f, v〉 = Im f(t) is an R-linear pairing V ∗ × V → R. LetL ∗ = {f ∈ V ∗ : 〈f, λ〉 ∈ Z, all λ ∈ L}.Since A is an abelian variety, there is a n<strong>on</strong>degenerate Riemann form H <strong>on</strong> A.The map λ : V → V ∗ defined by λ(v) = H(v, ·) is an isomorphism of complexvector spaces. If v ∈ L, then λ(v) = H(v, ·) is integer valued <strong>on</strong> L, so λ(L) ⊂ L ∗ .Thus λ induces an isogeny of complex tori V/L → V ∗ /L ∗ , so by Lemma 13.3.7 thetorus V ∗ /L ∗ possesses a n<strong>on</strong>degenerate Riemann form (it’s a multiple of H). In[Ros86, §4], Rosen describes an explicit isomorphism between V ∗ /L ∗ <strong>and</strong> A ∨ (C).


13.4.4 The Nér<strong>on</strong>-Severi group <strong>and</strong> polarizati<strong>on</strong>s13.5 Jacobians of curves 119Let A be an abelian variety over a field k. Recall that Pic(A) is the group ofinvertible sheaves <strong>on</strong> A, <strong>and</strong> Pic 0 (A) is the subgroup of invertible sheaves thatare algebraically equivalent to 0. The Nér<strong>on</strong>-Severi group of A is the quotientPic(A)/ Pic 0 (A), so by definiti<strong>on</strong> we have an exact sequence0 → Pic 0 (A) → Pic(A) → NS(A) → 0.Suppose L is an invertible sheaf <strong>on</strong> A. One can show that the map A(k) →Pic 0 (A) defined by a ↦→ t ∗ aL ⊗ L −1 is induced by homomorphism ϕ L : A →A ∨ . (Here t ∗ aL is the pullback of the sheaf L by translati<strong>on</strong> by a.) Moreover, themap L ↦→ ϕ L induces a homomorphism from Pic(A) → Hom(A, A ∨ ) with kernelPic 0 (A). The group Hom(A, A ∨ ) is free of finite rank, so NS(A) is a free abeliangroup of finite rank. Thus Pic 0 (A) is saturated in Pic(A) (i.e., the cokernel of theinclusi<strong>on</strong> Pic 0 (A) → Pic(A) is torsi<strong>on</strong> free).Definiti<strong>on</strong> 13.4.1 (Polarizati<strong>on</strong>). A polarizati<strong>on</strong> <strong>on</strong> A is a homomorphism λ :A → A ∨ such that λ k= ϕ L for some L ∈ Pic(A k). A polarizati<strong>on</strong> is principal if itis an isomorphism.When the base field k is algebraically closed, the polarizati<strong>on</strong>s are in bijecti<strong>on</strong>with the elements of NS(A). For example, when dim A = 1, we have NS(A) = Z,<strong>and</strong> the polarizati<strong>on</strong>s <strong>on</strong> A are multiplicati<strong>on</strong> by n, for each integer n.13.4.5 The Dual is functorialThe associati<strong>on</strong> A ↦→ A ∨ extends to a c<strong>on</strong>travariant functor <strong>on</strong> the category ofabelian varieties. Thus if ϕ : A → B is a homomorphism, there is a natural choiceof homomorphism ϕ ∨ : B ∨ → A ∨ . Also, (A ∨ ) ∨ = A <strong>and</strong> (ϕ ∨ ) ∨ = ϕ.Theorem 13.4.2 below describes the kernel of ϕ ∨ in terms of the kernel of ϕ.If G is a finite group scheme, the Cartier dual of G is Hom(G, G m ). For example,the Cartier dual of Z/mZ is µ m <strong>and</strong> the Cartier dual of µ m is Z/mZ. (If k isalgebraically closed, then the Cartier dual of G is just G again.)Theorem 13.4.2. If ϕ : A → B is a surjective homomorphism of abelian varietieswith kernel G, so we have an exact sequence 0 → G → A → B → 0, then the kernelof ϕ ∨ is the Cartier dual of G, so we have an exact sequence 0 → G ∨ → B ∨ →A ∨ → 0.13.5 Jacobians of curvesWe begin this lecture about Jacobians with an inspiring quote of David Mumford:“The Jacobian has always been a corner-st<strong>on</strong>e in the analysis of algebraiccurves <strong>and</strong> compact Riemann surfaces. [...] Weil’s c<strong>on</strong>structi<strong>on</strong> [ofthe Jacobian] was the basis of his epoch-making proof of the RiemannHypothesis for curves over finite fields, which really put characteristicp algebraic geometry <strong>on</strong> its feet.” – Mumford, Curves <strong>and</strong> TheirJacobians, page 49.


120 13. Abelian Varieties13.5.1 Divisors <strong>on</strong> curves <strong>and</strong> linear equivalenceLet X be a projective n<strong>on</strong>singular algebraic curve over an algebraically field k. Adivisor <strong>on</strong> X is a formal finite Z-linear combinati<strong>on</strong> ∑ mi=1 n iP i of closed points inX. Let Div(X) be the group of all divisors <strong>on</strong> X. The degree of a divisor ∑ mi=1 n iP iis the integer ∑ mi=1 n i. Let Div 0 (X) denote the subgroup of divisors of degree 0.Suppose k is a perfect field (for example, k has characteristic 0 or k is finite),but do not require that k be algebraically closed. Let the group of divisors <strong>on</strong> Xover k be the subgroupDiv(X) = Div(X/k) = H 0 (Gal(k/k), Div(X/k))of elements of Div(X/k) that are fixed by every automorphism of k/k. Likewise,let Div 0 (X/k) be the elements of Div(X/k) of degree 0.A rati<strong>on</strong>al functi<strong>on</strong> <strong>on</strong> an algebraic curve X is a functi<strong>on</strong> X → P 1 , defined bypolynomials, which has <strong>on</strong>ly a finite number of poles. For example, if X is theelliptic curve over k defined by y 2 = x 3 +ax+b, then the field of rati<strong>on</strong>al functi<strong>on</strong>s<strong>on</strong> X is the fracti<strong>on</strong> field of the integral domain k[x, y]/(y 2 − (x 3 + ax + b)). LetK(X) denote the field of all rati<strong>on</strong>al functi<strong>on</strong>s <strong>on</strong> X defined over k.There is a natural homomorphism K(X) ∗ → Div(X) that associates to a rati<strong>on</strong>alfuncti<strong>on</strong> f its divisor(f) = ∑ ord P (f) · Pwhere ord P (f) is the order of vanishing of f at P . Since X is n<strong>on</strong>singular, the localring of X at a point P is isomorphic to k[[t]]. Thus we can write f = t r g(t) forsome unit g(t) ∈ k[[t]]. Then R = ord P (f).Example 13.5.1. If X = P 1 , then the functi<strong>on</strong> f = x has divisor (0) − (∞). If Xis the elliptic curve defined by y 2 = x 3 + ax + b, then(x) = (0, √ b) + (0, − √ b) − 2∞,<strong>and</strong>(y) = (x 1 , 0) + (x 2 , 0) + (x 3 , 0) − 3∞,where x 1 , x 2 , <strong>and</strong> x 3 are the roots of x 3 +ax+b = 0. A uniformizing parameter t atthe point ∞ is x/y. An equati<strong>on</strong> for the elliptic curve in an affine neighborhood of∞ is Z = X 3 + aXZ 2 + bZ 3 (where ∞ = (0, 0) with respect to these coordinates)<strong>and</strong> x/y = X in these new coordinates. By repeatedly substituting Z into thisequati<strong>on</strong> we see that Z can be written in terms of X.It is a st<strong>and</strong>ard fact in the theory of algebraic curves that if f is a n<strong>on</strong>zero rati<strong>on</strong>alfuncti<strong>on</strong>, then (f) ∈ Div 0 (X), i.e., the number of poles of f equals the number ofzeros of f. For example, if X is the Riemann sphere <strong>and</strong> f is a polynomial, thenthe number of zeros of f (counted with multiplicity) equals the degree of f, whichequals the order of the pole of f at infinity.The Picard group Pic(X) of X is the group of divisors <strong>on</strong> X modulo linearequivalence. Since divisors of functi<strong>on</strong>s have degree 0, the subgroup Pic 0 (X) ofdivisors <strong>on</strong> X of degree 0, modulo linear equivalence, is well defined. Moreover, wehave an exact sequence of abelian groups0 → K(X) ∗ → Div 0 (X) → Pic 0 (X) → 0.


13.5 Jacobians of curves 121Thus for any algebraic curve X we have associated to it an abelian groupPic 0 (X). Suppose π : X → Y is a morphism of algebraic curves. If D is a divisor<strong>on</strong> Y , the pullback π ∗ (D) is a divisor <strong>on</strong> X, which is defined as follows.If P ∈ Div(Y/k) is a point, let π ∗ (P ) be the sum ∑ e Q/P Q where π(Q) = P<strong>and</strong> e Q/P is the ramificati<strong>on</strong> degree of Q/P . (Remark: If t is a uniformizer at Pthen e Q/P = ord Q (π ∗ t P ).) One can show that π ∗ : Div(Y ) → Div(X) inducesa homomorphism Pic 0 (Y ) → Pic 0 (X). Furthermore, we obtain the c<strong>on</strong>travariantPicard functor from the category of algebraic curves over a fixed base field tothe category of abelian groups, which sends X to Pic 0 (X) <strong>and</strong> π : X → Y toπ ∗ : Pic 0 (Y ) → Pic 0 (X).Alternatively, instead of defining morphisms by pullback of divisors, we couldc<strong>on</strong>sider the push forward. Suppose π : X → Y is a morphism of algebraic curves<strong>and</strong> D is a divisor <strong>on</strong> X. If P ∈ Div(X/k) is a point, let π ∗ (P ) = π(P ). Then π ∗induces a morphism Pic 0 (X) → Pic 0 (Y ). We again obtain a functor, called thecovariant Albanese functor from the category of algebraic curves to the categoryof abelian groups, which sends X to Pic 0 (X) <strong>and</strong> π : X → Y to π ∗ : Pic 0 (X) →Pic 0 (Y ).13.5.2 Algebraic definiti<strong>on</strong> of the JacobianFirst we describe some universal properties of the Jacobian under the hypothesisthat X(k) ≠ ∅. Thus suppose X is an algebraic curve over a field k <strong>and</strong> thatX(k) ≠ ∅. The Jacobian variety of X is an abelian variety J such that for anextensi<strong>on</strong> k ′ /k, there is a (functorial) isomorphism J(k ′ ) → Pic 0 (X/k ′ ). (I d<strong>on</strong>’tknow whether this c<strong>on</strong>diti<strong>on</strong> uniquely characterizes the Jacobian.)Fix a point P ∈ X(k). Then we obtain a map f : X(k) → Pic 0 (X/k) by sendingQ ∈ X(k) to the divisor class of Q−P . One can show that this map is induced by aninjective morphism of algebraic varieties X ↩→ J. This morphism has the followinguniversal property: if A is an abelian variety <strong>and</strong> g : X → A is a morphism thatsends P to 0 ∈ A, then there is a unique homomorphism ψ : J → A of abelianvarieties such that g = ψ ◦ f:X f J gThis c<strong>on</strong>diti<strong>on</strong> uniquely characterizes J, since if f ′ : X → J ′ <strong>and</strong> J ′ has the universalproperty, then there are unique maps J → J ′ <strong>and</strong> J ′ → J whose compositi<strong>on</strong>in both directi<strong>on</strong>s must be the identity (use the universal property with A = J<strong>and</strong> f = g).If X is an arbitrary curve over an arbitrary field, the Jacobian is an abelianvariety that represents the “sheafificati<strong>on</strong>” of the “relative Picard functor”. Lookin Milne’s article or Bosch-Lüktebohmert-Raynaud Ner<strong>on</strong> Models for more details.Knowing this totally general definiti<strong>on</strong> w<strong>on</strong>’t be important for this course, sincewe will <strong>on</strong>ly c<strong>on</strong>sider Jacobians of modular curves, <strong>and</strong> these curves always havea rati<strong>on</strong>al point, so the above properties will be sufficient.A useful property of Jacobians is that they are can<strong>on</strong>ically principally polarized,by a polarizati<strong>on</strong> that arises from the “θ divisor” <strong>on</strong> J. In particular, there is alwaysan isomorphism J → J ∨ = Pic 0 (J).ψA


122 13. Abelian Varieties13.5.3 The Abel-Jacobi theoremOver the complex numbers, the c<strong>on</strong>structi<strong>on</strong> of the Jacobian is classical. It wasfirst c<strong>on</strong>sidered in the 19th century in order to obtain relati<strong>on</strong>s between integralsof rati<strong>on</strong>al functi<strong>on</strong>s over algebraic curves (see Mumford’s book, Curves <strong>and</strong> TheirJacobians, Ch. III, for a nice discussi<strong>on</strong>).Let X be a Riemann surface, so X is a <strong>on</strong>e-dimensi<strong>on</strong>al complex manifold.Thus there is a system of coordinate charts (U α , t α ), where t α : U α → C is ahomeomorphism of U α <strong>on</strong>to an open subset of C, such that the change of coordinatemaps are analytic isomorphisms. A differential 1-form <strong>on</strong> X is a choice of twoc<strong>on</strong>tinuous functi<strong>on</strong>s f <strong>and</strong> g to each local coordinate z = x + iy <strong>on</strong> U α ⊂ Xsuch that f dx + g dy is invariant under change of coordinates (i.e., if another localcoordinate patch U ′ α intersects U α , then the differential is unchanged by the changeof coordinate map <strong>on</strong> the overlap). If γ : [0, 1] → X is a path <strong>and</strong> ω = f dx + g dyis a 1-form, then∫γω :=∫ 10(f(x(t), y(t)) dx)+ g(x(t), y(t))dy dt ∈ C.dt dtFrom complex analysis <strong>on</strong>e sees that if γ is homologous to γ ′ , then ∫ γ ω = ∫ γ ′ ω.In fact, there is a n<strong>on</strong>degenerate pairingH 0 (X, Ω 1 X) × H 1 (X, Z) → CIf X has genus g, then it is a st<strong>and</strong>ard fact that the complex vector spaceH 0 (X, Ω 1 X ) of holomorphic differentials <strong>on</strong> X is of dimensi<strong>on</strong> g. The integrati<strong>on</strong>pairing defined above induces a homomorphism from integral homology to thedual V of the differentials:Φ : H 1 (X, Z) → V = Hom(H 0 (X, Ω 1 X), C).This homomorphism is called the period mapping.Theorem 13.5.2 (Abel-Jacobi). The image of Φ is a lattice in V .The proof involves repeated clever applicati<strong>on</strong> of the residue theorem.The intersecti<strong>on</strong> pairingH 1 (X, Z) × H 1 (X, Z) → Zdefines a n<strong>on</strong>degenerate alternating pairing <strong>on</strong> L = Φ(H 1 (X, Z)). This pairingsatisfies the c<strong>on</strong>diti<strong>on</strong>s to induce a n<strong>on</strong>degenerate Riemann form <strong>on</strong> V , which givesJ = V/L to structure of abelian variety. The abelian variety J is the Jacobian of X,<strong>and</strong> if P ∈ X, then the functi<strong>on</strong>al ω ↦→ ∫ Qω defines an embedding of X into J.PAlso, since the intersecti<strong>on</strong> pairing is perfect, it induces an isomorphism from J toJ ∨ .Example 13.5.3. For example, suppose X = X 0 (23) is the modular curve attachedto the subgroup Γ 0 (23) of matrices in SL 2 (Z) that are upper triangular modulo 24.Then g = 2, <strong>and</strong> a basis for H 1 (X 0 (23), Z) in terms of modular symbols is{−1/19, 0}, {−1/17, 0}, {−1/15, 0}, {−1/11, 0}.


The matrix for the intersecti<strong>on</strong> pairing <strong>on</strong> this basis is⎛⎞0 −1 −1 −1⎜1 0 −1 −1⎟⎝1 1 0 −1⎠1 1 1 0With respect to a reduced integral basis forH 0 (X, Ω 1 X) ∼ = S 2 (Γ 0 (23)),the lattice Φ(H 1 (X, Z)) of periods is (approximately) spanned by[]13.5 Jacobians of curves 123(0.59153223605591049412844857432 - 1.68745927346801253993135357636*i0.762806324458047168681080323846571478727 - 0.60368764497868211035115379488*i),(-0.59153223605591049412844857432 - 1.68745927346801253993135357636*i-0.762806324458047168681080323846571478727 - 0.60368764497868211035115379488*i),(-1.354338560513957662809528899804 - 1.0837716284893304295801997808568748714097*i-0.59153223605591049412844857401 + 0.480083983510648319229045987467*i),(-1.52561264891609433736216065099 0.342548176804273349105263499648)13.5.4 Every abelian variety is a quotient of a JacobianOver an infinite field, every abelin variety can be obtained as a quotient of aJacobian variety. The modular abelian varieties that we will encounter later are,by definiti<strong>on</strong>, exactly the quotients of the Jacobian J 1 (N) of X 1 (N) for some N.In this secti<strong>on</strong> we see that merely being a quotient of a Jacobian does not endowan abelian variety with any special properties.Theorem 13.5.4 (Matsusaka). Let A be an abelian variety over an algebraicallyclosed field. Then there is a Jacobian J <strong>and</strong> a surjective map J → A.This was originally proved in On a generating curve of an abelian variety, Nat.Sc. Rep. Ochanomizu Univ. 3 (1952), 1–4. Here is the Math Review by P. Samuel:An abelian variety A is said to be generated by a variety V (<strong>and</strong> amapping f of V into A) if A is the group generated by f(V ). It is provedthat every abelian variety A may be generated by a curve defined overthe algebraic closure of def(A). A first lemma shows that, if a varietyV is the carrier of an algebraic system (X(M)) M∈U of curves (X(M)being defined, n<strong>on</strong>-singular <strong>and</strong> disjoint from the singular bunch of Vfor almost all M in the parametrizing variety U) if this system has asimple base point <strong>on</strong> V , <strong>and</strong> if a mapping f of V into an abelian varietyis c<strong>on</strong>stant <strong>on</strong> some X(M 0 ), then f is a c<strong>on</strong>stant; this is proved byspecializing <strong>on</strong> M 0 a generic point M of U <strong>and</strong> by using specializati<strong>on</strong>sof cycles [Matsusaka, Mem. Coll. Sci. Kyoto Univ. Ser. A. Math. 26,167–173 (1951); these Rev. 13, 379]. Another lemma notices that, for anormal projective variety V , a suitable linear family of plane secti<strong>on</strong>sof V may be taken as a family (X(M)). Then the main result followsfrom the complete reducibility theorem. This result is said to be the


124 13. Abelian Varietiesbasic tool for generalizing Chow’s theorem (”the Jacobian variety of acurve defined over k is an abelian projective variety defined over k”).Milne [Mil86, §10] proves the theorem under the weaker hypothesis that the basefield is infinite. We briefly sketch his proof now. If dim A = 1, then A is theJacobian of itself, so we may assume dim A > 1. Embed A into P n , then, usingthe Bertini theorem, cut A ⊂ P n by hyperplane secti<strong>on</strong>s dim(A) − 1 times toobtain a n<strong>on</strong>singular curve C <strong>on</strong> A of the form A ∩ V , where V is a linear subspaceof P n . Using st<strong>and</strong>ard arguments from Hartshorne [Har77], Milne shows (Lemma10.3) that if W is a n<strong>on</strong>singular variety <strong>and</strong> π : W → A is a finite morphism, thenπ −1 (C) is geometrically c<strong>on</strong>nected (the main point is that the pullback of an ampleinvertible sheaf by a finite morphism is ample). (A morphism f : X → Y is finiteif for every open affine subset U = Spec(R) ⊂ Y , the inverse image f −1 (U) ⊂ Xis an affine open subset Spec(B) with B a finitely generated R-module. Finitemorphisms have finite fibers, but not c<strong>on</strong>versely.) We assume this lemma <strong>and</strong>deduce the theorem.Let J be the Jacobian of C; by the universal property of Jacobians there isa unique homomorphism f : J → A coming from the inclusi<strong>on</strong> C ↩→ A. Theimage A 1 = f(J) is an abelian subvariety since images of homomorphisms ofabelian varieties are abelian varieties. By the Poincare reducibility theorem (we<strong>on</strong>ly proved this over C, but it is true in general), there is an abelian subvarietyA 2 ⊂ A such that A 1 + A 2 = A <strong>and</strong> A 1 ∩ A 2 is finite. The isogeny g : A 1 × A 2 → Agiven by g(x, y) = x + y ∈ A is a finite morphism (any isogeny of abelian varietiesis finite, flat, <strong>and</strong> surjective by Secti<strong>on</strong> 8 of [Mil86]). The inverse image g −1 (A 1 )is a uni<strong>on</strong> of #(A 1 ∩ A 2 ) irreducible comp<strong>on</strong>ents; if this intersecti<strong>on</strong> is n<strong>on</strong>trivial,then likewise g −1 (C) is reducible, which is a c<strong>on</strong>tradicti<strong>on</strong>. This does not completethe proof, since it is possible that g is an isomorphism, so we use <strong>on</strong>e additi<strong>on</strong>altrick. Suppose n is a positive integer coprime to the residue characteristic, <strong>and</strong> leth = 1 × [n] : A 1 × A 2 → A 1 × A 2be the identity map <strong>on</strong> the first factor <strong>and</strong> multiplicati<strong>on</strong> by n <strong>on</strong> the sec<strong>on</strong>d.Then h is finite <strong>and</strong> (h ◦ g) −1 2 dim A2(A 1 ) is a uni<strong>on</strong> of n = deg(h) irreduciblecomp<strong>on</strong>ents, hence (h ◦ g) −1 (C) is reducible, a c<strong>on</strong>tradicti<strong>on</strong>.Questi<strong>on</strong> 13.5.5. Is Theorem 13.5.4 false for some abelian variety A over somefinite field k?Questi<strong>on</strong> 13.5.6 (Milne). Using the theorem we can obtain a sequence of Jacobianvarieties J 1 , J 2 , . . . that form a complex· · · → J 2 → J 1 → A → 0.(In each case the image of J i+1 is the c<strong>on</strong>nected comp<strong>on</strong>ent of the kernel of J i →J i−1 .) Is it possible to make this c<strong>on</strong>structi<strong>on</strong> in such a way that the sequenceterminates in 0?Questi<strong>on</strong> 13.5.7 (Yau). Let A be an abelian variety. What can be said aboutthe minimum of the dimensi<strong>on</strong>s of all Jacobians J such that there is a surjectivemorphism J → A?Remark 13.5.8. Brian C<strong>on</strong>rad has explained to the author that if A is an abelianvariety over an infinite field, then A can be embedded in a Jacobian J. This does


13.6 Nér<strong>on</strong> models 125not follow directly from Theorem 13.5.4 above, since if J → A ∨ , then the dualmap A → J need not be injective.13.6 Nér<strong>on</strong> modelsThe main references for Nér<strong>on</strong> models are as follows:1. [AEC2]: Silverman, Advanced Topics in the Arithmetic of Elliptic Curves.Chapter IV of this book c<strong>on</strong>tains an extremely well written <strong>and</strong> motivateddiscussi<strong>on</strong> of Nér<strong>on</strong> models of elliptic curves over Dedekind domains withperfect residue field. In particular, Silverman gives an almost complete c<strong>on</strong>structi<strong>on</strong>of Nér<strong>on</strong> models of elliptic curves. Silverman very clearly reallywants his reader to underst<strong>and</strong> the c<strong>on</strong>structi<strong>on</strong>. Highly recommended.2. [BLR]: Bosch, Lütkebohmert, Raynaud, Nér<strong>on</strong> Models. This is an excellent<strong>and</strong> accessible book that c<strong>on</strong>tains a complete c<strong>on</strong>structi<strong>on</strong> of Nér<strong>on</strong> models<strong>and</strong> some of their generalizati<strong>on</strong>s, a discussi<strong>on</strong> of their functorial properties,<strong>and</strong> a sketch of the c<strong>on</strong>structi<strong>on</strong> of Jacobians of families of curves. The goalof this book was to redo in scheme-theoretic language Nér<strong>on</strong> original paper,which is written in a language that was ill-adapted to the subtleties of Nér<strong>on</strong>models.3. Artin, Nér<strong>on</strong> Models, in Cornell-Silverman. This is the first-ever expositi<strong>on</strong>of Nér<strong>on</strong>’s original paper in the language of schemes.13.6.1 What are Nér<strong>on</strong> models?Suppose E is an elliptic curve over Q. If ∆ is the minimal discriminant of E,then E has good reducti<strong>on</strong> at p for all p ∤ ∆, in the sense that E extends to anabelian scheme E over Z p (i.e., a “smooth” <strong>and</strong> “proper” group scheme). One cannot ask for E to extend to an abelian scheme over Z p for all p | ∆. One can,however, ask whether there is a noti<strong>on</strong> of “good” model for E at these bad primes.To quote [BLR, page 1], “It came as a surprise for arithmeticians <strong>and</strong> algebraicgeometers when A. Nér<strong>on</strong>, relaxing the c<strong>on</strong>diti<strong>on</strong> of properness <strong>and</strong> c<strong>on</strong>centrating<strong>on</strong> the group structure <strong>and</strong> the smoothness, discovered in the years 1961–1963 thatsuch models exist in a can<strong>on</strong>ical way.”Before formally defining Nér<strong>on</strong> models, we describe what it means for a morphismf : X → Y of schemes to be smooth. A morphism f : X → Y is finite typeif for every open affine U = Spec(R) ⊂ Y there is a finite covering of f −1 (U) byopen affines Spec(S), such that each S is a finitely generated R-algebra.Definiti<strong>on</strong> 13.6.1. A morphism f : X → Y is smooth at x ∈ X if it is of finitetype <strong>and</strong> there are open affine neighborhoods Spec(A) ⊂ X of x <strong>and</strong> Spec(R) ⊂ Yof f(x) such thatA ∼ = R[t 1 , . . . , t n+r ]/(f 1 , . . . , f n )for elements f 1 , . . . , f n ∈ R[t 1 , . . . , t n+r ] <strong>and</strong> all n × n minors of the Jacobianmatrix (∂f i /∂t j ) generate the unit ideal of A. The morphism f is étale at x if, inadditi<strong>on</strong>, r = 0. A morphism is smooth of relative dimensi<strong>on</strong> d if it is smooth at xfor every x ∈ X <strong>and</strong> r = d in the isomorphism above.


126 13. Abelian VarietiesSmooth morphisms behave well. For example, if f <strong>and</strong> g are smooth <strong>and</strong> f ◦ gis defined, then f ◦ g is automatically smooth. Also, smooth morphisms are closedunder base extensi<strong>on</strong>: if f : X → Y is a smooth morphism over S, <strong>and</strong> S ′ is ascheme over S, then the induced map X× S S ′ → Y × S S ′ is smooth. (If you’ve neverseen products of schemes, it might be helpful to know that Spec(A) × Spec(B) =Spec(A⊗B). Read [Har77, §II.3] for more informati<strong>on</strong> about fiber products, whichprovide a geometric way to think about tensor products. Also, we often write X S ′as shorth<strong>and</strong> for X × S S ′ .)We are now ready for the definiti<strong>on</strong>. Suppose R is a Dedekind domain with fieldof fracti<strong>on</strong>s K (e.g., R = Z <strong>and</strong> K = Q).Definiti<strong>on</strong> 13.6.2 (Nér<strong>on</strong> model). Let A be an abelian variety over K. The Nér<strong>on</strong>model A of A is a smooth commutative group scheme over R such that for anysmooth morphism S → R the natural map of abelian groupsHom R (S, A) → Hom K (S × R K, A)is a bijecti<strong>on</strong>. This is called the Nér<strong>on</strong> mapping property: In more compact notati<strong>on</strong>,it says that there is an isomorphism A(S) ∼ = A(S K ).Taking S = A in the definiti<strong>on</strong> we see that A is unique, up to a unique isomorphism.It is a deep theorem that Nér<strong>on</strong> models exist. Fortunately, Bosch, Lütkebohmert,<strong>and</strong> Raynaud devoted much time to create a carefully written book [BLR90] thatexplains the c<strong>on</strong>structi<strong>on</strong> in modern language. Also, in the case of elliptic curves,Silverman’s sec<strong>on</strong>d book [Sil94] is extremely helpful.The basic idea of the c<strong>on</strong>structi<strong>on</strong> is to first observe that if we can c<strong>on</strong>structa Nér<strong>on</strong> model at each localizati<strong>on</strong> R p at a n<strong>on</strong>zero prime ideal of R, then eachof these local models can be glued to obtain a global Nér<strong>on</strong> model (this uses thatthere are <strong>on</strong>ly finitely many primes of bad reducti<strong>on</strong>). Thus we may assume thatR is a discrete valuati<strong>on</strong> ring.The next step is to pass to the “strict henselizati<strong>on</strong>” R ′ of R. A local ring R withmaximal ideal ℘ is henselian if “every simple root lifts uniquely”; more precisely,if whenever f(x) ∈ R[x] <strong>and</strong> α ∈ R is such that f(α) ≡ 0 (mod ℘) <strong>and</strong> f ′ (α) ≢ 0(mod ℘), there is a unique element ˜α ∈ R such that ˜α ≡ α (mod ℘) <strong>and</strong> f(˜α) = 0.The strict henselizati<strong>on</strong> of a discrete valuati<strong>on</strong> ring R is an extensi<strong>on</strong> of R thatis henselian <strong>and</strong> for which the residue field of R ′ is the separable closure of theresidue field of R (when the residue field is finite, the separable close is just thealgebraic closure). The strict henselizati<strong>on</strong> is not too much bigger than R, thoughit is typically not finitely generated over R. It is, however, much smaller thanthe completi<strong>on</strong> of R (e.g., Z p is uncountable). The main geometric property ofa strictly henselian ring R with residue field k is that if X is a smooth schemeover R, then the reducti<strong>on</strong> map X(R) → X(k) is surjective.Working over the strict henselizati<strong>on</strong>, we first resolve singularities. Then we usea generalizati<strong>on</strong> of the theorem that Weil used to c<strong>on</strong>struct Jacobians to pass froma birati<strong>on</strong>al group law to an actual group law. We thus obtain the Nér<strong>on</strong> modelover the strict henselizati<strong>on</strong> of R. Finally, we use Grothendieck’s faithfully flatdescent to obtain a Nér<strong>on</strong> model over R.When A is the Jacobian of a curve X, there is an alternative approach thatinvolves the “minimal proper regular model” of X. For example, when A is anelliptic curve, it is the Jacobian of itself, <strong>and</strong> the Nér<strong>on</strong> model can be c<strong>on</strong>structed in


13.6 Nér<strong>on</strong> models 127terms of the minimal proper regular model X of A as follows. In general, the modelX → R is not also smooth. Let X ′ be the smooth locus of X → R, which is obtainedby removing from each closed fiber X Fp = ∑ n i C i all irreducible comp<strong>on</strong>ents withmultiplicity n i ≥ 2 <strong>and</strong> all singular points <strong>on</strong> each C i , <strong>and</strong> all points where at leasttwo C i intersect each other. Then the group structure <strong>on</strong> A extends to a groupstructure <strong>on</strong> X ′ , <strong>and</strong> X ′ equipped with this group structure is the Nér<strong>on</strong> modelof A.Explicit determinati<strong>on</strong> of the possibilities for the minimal proper regular modelof an elliptic curve was carried out by Kodaira, then Nér<strong>on</strong>, <strong>and</strong> finally in avery explicit form by Tate. Tate codified a way to find the model in what’scalled “Tate’s Algorithm” (see Antwerp IV, which is available <strong>on</strong> my web page:http://modular.fas.harvard.edu/scans/antwerp/, <strong>and</strong> look at Silverman, chapterIV, which also has important implementati<strong>on</strong> advice).13.6.2 The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture <strong>and</strong> Nér<strong>on</strong>modelsThroughout this secti<strong>on</strong>, let A be an abelian variety over Q <strong>and</strong> let A be thecorresp<strong>on</strong>ding Nér<strong>on</strong> model over Z. We work over Q for simplicity, but could workover any number field.Let L(A, s) be the Hasse-Weil L-functi<strong>on</strong> of A (see Secti<strong>on</strong> [to be written] 1 ). 1Let r = ord s=1 L(A, s) be the analytic rank of A. The Birch <strong>and</strong> Swinnert<strong>on</strong>-DyerC<strong>on</strong>jecture asserts that A(Q) ≈ Z r ⊕ A(Q) tor <strong>and</strong>L (r) (A, 1)r!= (∏ c p ) · Ω A · Reg A· #X(A)#A(Q) tor · #A ∨ (Q) tor.We have not defined most of the quantities appearing in this formula. In thissecti<strong>on</strong>, we will define the Tamagawa numbers c p , the real volume Ω A , <strong>and</strong> theShafarevich-Tate group X(A) in terms of the Nér<strong>on</strong> model A of A.We first define the Tamagawa numbers c p , which are the orders groups of c<strong>on</strong>nectedcomp<strong>on</strong>ents. Let p be a prime <strong>and</strong> c<strong>on</strong>sider the closed fiber A Fp , whichis a smooth commutative group scheme over F p . Then A Fp is a disjoint uni<strong>on</strong> of<strong>on</strong>e or more c<strong>on</strong>nected comp<strong>on</strong>ents. The c<strong>on</strong>nected comp<strong>on</strong>ent A 0 F pthat c<strong>on</strong>tainsthe identity element is a subgroup of A Fp (Intuiti<strong>on</strong>: the group law is c<strong>on</strong>tinuous<strong>and</strong> the c<strong>on</strong>tinuous image of a c<strong>on</strong>nected set is c<strong>on</strong>nected, so the group structurerestricts to A 0 F p).Definiti<strong>on</strong> 13.6.3 (Comp<strong>on</strong>ent Group). The comp<strong>on</strong>ent group of A at p isΦ A,p = A Fp /A 0 F p.Fact: The comp<strong>on</strong>ent group Φ A,p is a finite flat group scheme over F p , <strong>and</strong> 2 for 2all but finitely many primes p, we have Φ A,p = 0.Definiti<strong>on</strong> 13.6.4 (Tamagawa Numbers). The Tamagawa number of A at a primep isc p = #Φ A,p (F p ).1 Add reference.2 Reference?


128 13. Abelian VarietiesNext we define the real volume Ω A . Choose a basisω 1 , . . . , ω d ∈ H 0 (A, Ω 1 A/Z )for the global differential 1-forms <strong>on</strong> A, where d = dim A. The wedge productw = ω 1 ∧ω 2 ∧· · ·∧ω d is a global d-form <strong>on</strong> A. Then w induces a differential d-form<strong>on</strong> the real Lie group A(R).Definiti<strong>on</strong> 13.6.5 (Real Volume). The real volume of A is∫Ω A =w∣ ∣ ∈ R >0.A(R)Finally, we give a definiti<strong>on</strong> of the Shafarevich-Tate group in terms of the Nér<strong>on</strong>model. Let A 0 be the scheme obtained from the Nér<strong>on</strong> model A over A by removingfrom each closed fiber all n<strong>on</strong>identity comp<strong>on</strong>ents. Then A 0 is again a smoothcommutative group scheme, but it need not have the Nér<strong>on</strong> mapping property.Recall that an étale morphism is a morphism that is smooth of relative dimensi<strong>on</strong>0. A sheaf of abelian groups <strong>on</strong> the étale site Zét is a functor (satisfying certainaxioms) from the category of étale morphism X → Z to the category of abeliangroups. There are enough sheaves <strong>on</strong> Zét so that there is a cohomology theory forsuch sheaves, which is called étale cohomology. In particular if F is a sheaf <strong>on</strong> Zét ,then for every integer q there is an abelian group H q (Zét , F) associated to F thathas the st<strong>and</strong>ard properties of a cohomology functor.The group schemes A 0 <strong>and</strong> A both determine sheaves <strong>on</strong> the étale site, whichwe will also denote by A 0 <strong>and</strong> A.Definiti<strong>on</strong> 13.6.6 (Shafarevich-Tate Group). Suppose A(R) is c<strong>on</strong>nected thatthat A 0 = A. Then the Shafarevich-Tate group of A is H 1 (Zét , A). More generally,suppose <strong>on</strong>ly that A(R) is c<strong>on</strong>nected. Then the Shafarevich-Tate group is theimage of the natural mapf : H 1 (Zét , A 0 ) → H 1 (Zét , A).Even more generally, if A(R) is not c<strong>on</strong>nected, then there is a natural map r :H 1 (Zét , A) → H 1 (Gal(C/R), A(C)) <strong>and</strong> X(A) = im(f) ∩ ker(r).Mazur proves in the appendix to [Maz72] that this definiti<strong>on</strong> is equivalent to theusual Galois cohomology definiti<strong>on</strong>. To do this, he c<strong>on</strong>siders the exact sequence0 → A 0 → A → Φ A → 0, where Φ A is a sheaf versi<strong>on</strong> of ⊕ p Φ A,p . The maininput is Lang’s Theorem, which implies that over a local field, unramified Galoiscohomology is the same as the cohomology of the corresp<strong>on</strong>ding comp<strong>on</strong>ent group. 3 3C<strong>on</strong>jecture 13.6.7 (Shafarevich-Tate). The group H 1 (Zét , A) is finite.When A has rank 0, all comp<strong>on</strong>ent groups Φ A,p are trivial, A(R) is c<strong>on</strong>nected,<strong>and</strong> A(Q) tor <strong>and</strong> A ∨ (Q) tor are trivial, the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecturetakes the simple formL(A, 1)= # H 1 (Zét , A).Ω ALater 4 , when A is modular, we will (almost) interpret L(A, 1)/Ω A as the order of 43 Reference for Lang’s Lemma, etc.4 Where?


13.6 Nér<strong>on</strong> models 129a certain group that involves modular symbols. Thus the BSD c<strong>on</strong>jecture assertsthat two groups have the same order; however, they are not isomorphic, since, e.g.,when dim A = 1 the modular symbols group is always cyclic, but the Shafarevich-Tate group is never cyclic (unless it is trivial).13.6.3 Functorial properties of Ner<strong>on</strong> modelsThe definiti<strong>on</strong> of Nér<strong>on</strong> model is functorial, so <strong>on</strong>e might expect the formati<strong>on</strong> ofNér<strong>on</strong> models to have good functorial properties. Unfortunately, it doesn’t.Propositi<strong>on</strong> 13.6.8. Let A <strong>and</strong> B be abelian varieties. If A <strong>and</strong> B are the Nér<strong>on</strong>models of A <strong>and</strong> B, respectively, then the Nér<strong>on</strong> model of A × B is A × B.Suppose R ⊂ R ′ is a finite extensi<strong>on</strong> of discrete valuati<strong>on</strong> rings with fields offracti<strong>on</strong>s K ⊂ K ′ . Sometimes, given an abelian variety A over a field K, it iseasier to underst<strong>and</strong> properties of the abelian variety, such as reducti<strong>on</strong>, over K ′ .For example, you might have extra informati<strong>on</strong> that implies that A K ′ decomposesas a product of well-understood abelian varieties. It would thus be useful if theNér<strong>on</strong> model of A K ′ were simply the base extensi<strong>on</strong> A R ′ of the Nér<strong>on</strong> model of Aover R. This is, however, frequently not the case.Distinguishing various types of ramificati<strong>on</strong> will be useful in explaining howNér<strong>on</strong> models behave with respect to base change, so we now recall the noti<strong>on</strong>s oftame <strong>and</strong> wild ramificati<strong>on</strong>. If π generates the maximal ideal of R <strong>and</strong> v ′ is thevaluati<strong>on</strong> <strong>on</strong> R ′ , then the extensi<strong>on</strong> is unramified if v ′ (π) = 1. It is tamely ramifiedif v ′ (π) is not divisible by the residue characteristic of R, <strong>and</strong> it is wildly ramifiedif v ′ (π) is divisible by the residue characteristic of R. For example, the extensi<strong>on</strong>Q p (p 1/p ) of Q p is wildly ramified.Example 13.6.9. If R is the ring of integers of a p-adic field, then for every integer nthere is a unique unramified extensi<strong>on</strong> of R of degree n. See [Cp86, §I.7], whereFröhlich uses Hensel’s lemma to show that the unramified extensi<strong>on</strong>s of K =Frac(R) are in bijecti<strong>on</strong> with the finite (separable) extensi<strong>on</strong>s of the residue classfield.The Nér<strong>on</strong> model does not behave well with respect to base change, except insome special cases. For example, suppose A is an abelian variety over the fieldof fracti<strong>on</strong>s K of a discrete valuati<strong>on</strong> ring R. If K ′ is the field of fracti<strong>on</strong>s of afinite unramified extensi<strong>on</strong> R ′ of R, then the Nér<strong>on</strong> model of A K ′ is A R ′, whereA is the Nér<strong>on</strong> model of A over R. Thus the Nér<strong>on</strong> model over an unramifiedextensi<strong>on</strong> is obtained by base extending the Nér<strong>on</strong> model over the base. This isnot too surprising because in the c<strong>on</strong>structi<strong>on</strong> of Nér<strong>on</strong> model we first passed tothe strict henselizati<strong>on</strong> of R, which is a limit of unramified extensi<strong>on</strong>s.C<strong>on</strong>tinuing with the above notati<strong>on</strong>, if K ′ is tamely ramified over K, then ingeneral A R ′ need not be the Nér<strong>on</strong> model of A K ′. Assume that K ′ is Galois over K.In [Edi92a], Bas Edixhoven describes the Nér<strong>on</strong> model of A K in terms of A R ′. Todescribe his main theorem, we introduce the restricti<strong>on</strong> of scalars of a scheme.Definiti<strong>on</strong> 13.6.10 (Restricti<strong>on</strong> of Scalars). Let S ′ → S be a morphism ofschemes <strong>and</strong> let X ′ be a scheme over S ′ . C<strong>on</strong>sider the functorR(T ) = Hom S ′(T × S S ′ , X ′ )


130 13. Abelian Varieties<strong>on</strong> the category of all schemes T over S. If this functor is representable, the representingobject X = Res S′ /S(X ′ ) is called the restricti<strong>on</strong> of scalars of X ′ to S.Edixhoven’s main theorem is that if G is the Galois group of K ′ over K <strong>and</strong>X = Res R ′ /R(A R ′) is the restricti<strong>on</strong> of scalars of A R ′ down to R, then there is anatural map A → X whose image is the closed subscheme X G of fixed elements.We finish this secti<strong>on</strong> with some cautious remarks about exactness properties ofNér<strong>on</strong> models. If 0 → A → B → C → 0 is an exact sequence of abelian varieties,then the functorial definiti<strong>on</strong> of Nér<strong>on</strong> models produces a complex of Nér<strong>on</strong> models0 → A → B → C → 0,where A, B, <strong>and</strong> C are the Nér<strong>on</strong> models of A, B, <strong>and</strong> C, respectively. This complexcan fail to be exact at every point. For an in-depth discussi<strong>on</strong> of c<strong>on</strong>diti<strong>on</strong>s whenwe have exactness, al<strong>on</strong>g with examples that violate exactness, see [BLR90, Ch. 7],which says: “we will see that, except for quite special cases, there will be a defectof exactness, the defect of right exactness being much more serious than the <strong>on</strong>eof left exactness.”To give examples in which right exactness fails, it suffices to give an optimalquotient B → C such that for some p the induced map Φ B,p → Φ C,p <strong>on</strong> comp<strong>on</strong>entgroups is not surjective (recall that optimal means A = ker(B → C) is an abelianvariety). Such quotients, with B <strong>and</strong> C modular, arise naturally in the c<strong>on</strong>text ofRibet’s level optimizati<strong>on</strong>. For example, the elliptic curve E given by y 2 + xy =x 3 + x 2 − 11x is the optimal new quotient of the Jacobian J 0 (33) of X 0 (33). Thecomp<strong>on</strong>ent group of E at 3 has order 6, since E has semistable reducti<strong>on</strong> at 3(since 3 || 33) <strong>and</strong> ord 3 (j(E)) = −6. The image of the comp<strong>on</strong>ent group of J 0 (33)in the comp<strong>on</strong>ent group of E has order 2:> OrderOfImageOfComp<strong>on</strong>entGroupOfJ0N(<strong>Modular</strong>Symbols("33A"),3);2Note that the modular form associated to E is c<strong>on</strong>gruent modulo 3 to the formcorresp<strong>on</strong>ding to J 0 (11), which illustrates the c<strong>on</strong>necti<strong>on</strong> with level optimizati<strong>on</strong>.


14Abelian Varieties Attached to <strong>Modular</strong><strong>Forms</strong>In this chapter we describe how to decompose J 1 (N), up to isogeny, as a product ofabelian subvarieties A f corresp<strong>on</strong>ding to Galois c<strong>on</strong>jugacy classes of cusp forms fof weight 2. This was first accomplished by Shimura (see [Shi94, Theorem 7.14]).We also discuss properties of the Galois representati<strong>on</strong> attached to f. 1 1In this chapter we will work almost exclusively with J 1 (N). However, everythinggoes through exactly as below with J 1 (N) replaced by J 0 (N) <strong>and</strong> S 2 (Γ 1 (N)) replacedby S 2 (Γ 0 (N)). Since, J 1 (N) has dimensi<strong>on</strong> much larger than J 0 (N), so forcomputati<strong>on</strong>al investigati<strong>on</strong>s it is frequently better to work with J 0 (N).See Brian C<strong>on</strong>rad’s appendix to [ribet-stein: <str<strong>on</strong>g>Lectures</str<strong>on</strong>g> <strong>on</strong> Serre’s C<strong>on</strong>jectures]for a much more extensive expositi<strong>on</strong> of the c<strong>on</strong>structi<strong>on</strong> discussed below, which isgeared toward preparing the reader for Deligne’s more general c<strong>on</strong>structi<strong>on</strong> of Galoisrepresentati<strong>on</strong>s associated to newforms of weight k ≥ 2 (for that, see C<strong>on</strong>rad’sbook ...).14.1 Decompositi<strong>on</strong> of the <strong>Hecke</strong> algebraLet N be a positive integer <strong>and</strong> letT = Z[. . . , T n , . . .] ⊂ End(J 1 (N))be the algebra of all <strong>Hecke</strong> operators acting <strong>on</strong> J 1 (N). Recall from Secti<strong>on</strong> 9.4 thatthe anemic <strong>Hecke</strong> algebra is the subalgebraT 0 = Z[. . . , T n , . . . : (n, N) = 1] ⊂ Tof T obtained by adjoining to Z <strong>on</strong>ly those <strong>Hecke</strong> operators T n with n relativelyprime to N.1 Rewrite intro after chapter is d<strong>on</strong>e, <strong>and</strong> point to where each thing is d<strong>on</strong>e.


132 14. Abelian Varieties Attached to <strong>Modular</strong> <strong>Forms</strong>Remark 14.1.1. Viewed as Z-modules, T 0 need not be saturated in T, i.e., T/T 0need not be torsi<strong>on</strong> free. For example, if T is the <strong>Hecke</strong> algebra associated toS 2 (Γ 1 (24)) then T/T 0∼ = Z/2Z. Also, if T is the <strong>Hecke</strong> algebra associated toS 2 (Γ 0 (54)), then T/T 0∼ = Z/3Z × Z.22If f = ∑ a n q n is a newform, then the field K f = Q(a 1 , a 2 , . . .) has finite degreeover Q, since the a n are the eigenvalues of a family of commuting operators withintegral characteristic polynomials. The Galois c<strong>on</strong>jugates of f are the newformsσ(f) = ∑ σ(a n )q n , for σ ∈ Gal(Q/Q). There are [K f : Q] Galois c<strong>on</strong>jugates of f.As in Secti<strong>on</strong> 9.4, we have a can<strong>on</strong>ical decompositi<strong>on</strong>T 0 ⊗ Q ∼ ∏= K f , (14.1.1)where f varies over a set of representatives for the Galois c<strong>on</strong>jugacy classes ofnewforms in S 2 (Γ 1 (N)) of level dividing N. For each f, letπ f = (0, . . . , 0, 1, 0, . . . , 0) ∈ ∏ K fbe projecti<strong>on</strong> <strong>on</strong>to the factor K f of the product (14.1.1). Since T 0 ⊂ T, <strong>and</strong> Thas no additive torsi<strong>on</strong>, we have T 0 ⊗ Q ⊂ T ⊗ Q, so these projectors π f liein T Q = T ⊗ Q. Since T Q is commutative <strong>and</strong> the π f are mutually orthog<strong>on</strong>alidempotents whose sum is (1, 1, . . . , 1), we see that T Q breaks up as a product ofalgebras∏T Q∼ = L f , t ↦→ ∑ tπ f .fff14.1.1 The Dimensi<strong>on</strong> of the algebras L fPropositi<strong>on</strong> 14.1.2. If f, L f <strong>and</strong> K f are as above, then dim Kf L f is the numberof divisors of N/N f where N f is the level of the newform f.Proof. Let V f be the complex vector space spanned by all images of Galois c<strong>on</strong>jugatesof f via all maps α d with d | N/N f . It follows from [Atkin-Lehner-Li theory – multiplicity <strong>on</strong>e] 3 that the images via α d of the Galois c<strong>on</strong>jugates 3of f are linearly independent. (Details: More generally, if f <strong>and</strong> g are newformsof level M, then by Propositi<strong>on</strong> 9.2.1, B(f) = {α d (f) : d | N/N f } is a linearlyindependent set <strong>and</strong> likewise for B(g). Suppose some n<strong>on</strong>zero element f ′of the span of B(f) equals some element g ′ of the span of B(g). Since T p , forp ∤ N, commutes with α d , we have T p (f ′ ) = a p (f)f ′ <strong>and</strong> T p (g ′ ) = a p (g)g ′ , so0 = T p (0) = T p (f ′ − g ′ ) = a p (f)f ′ − a p (g)g ′ . Since f ′ = g ′ , this implies thata p (f) = a p (g). Because a newform is determined by the eigenvalues of T p forp ∤ N, it follows that f = g.) Thus the C-dimensi<strong>on</strong> of V f is the number of divisorsof N/N f times dim Q K f .The factor L f is isomorphic to the image of T Q ⊂ End(S k (Γ 1 (N))) in End(V f ).As in Secti<strong>on</strong> ??, there is a single element v ∈ V f so that V f = T C · v. Thus theimage of T Q in End(V f ) has dimensi<strong>on</strong> dim C V f , <strong>and</strong> the result follows.2 I’m including the MAGMA scripts I used to check this as comments in the latex source, untilI find the right way to justify these computati<strong>on</strong>al remarks. Maybe the remarks should point toan appendix where they are all justified?3 fill in


14.2 Decompositi<strong>on</strong> of J 1(N) 133Let’s examine a particular case of this propositi<strong>on</strong>. Suppose p is a prime <strong>and</strong> f =∑an q n is a newform of level N f coprime to p, <strong>and</strong> let N = p · N f . We will showthatL f = K f [U]/(U 2 − a p U + p), (14.1.2)hence dim Kf L f = 2 which, as expected, is the number of divisors of N/N f = p.The first step is to view L f as the space of operators generated by the <strong>Hecke</strong>operators T n acting <strong>on</strong> the span V of the images f(dz) = f(q d ) for d | (N/N f ) = p.If n ≠ p, then T n acts <strong>on</strong> V as the scalar a n , <strong>and</strong> when n = p, the <strong>Hecke</strong> operator T pacts <strong>on</strong> S k (Γ 1 (p · N f )) as the operator also)denoted U p . By Secti<strong>on</strong> 9.2, we knowthat U p corresp<strong>on</strong>ds to the matrix with respect to the basis f(q), f(q p )(ap 1−p 0of V . Thus U p satisfies the relati<strong>on</strong> U 2 p − a p U + p. Since U p is not a scalar matrix,this minimal polynomial of U p is quadratic, which proves (14.1.2).More generally, see [DDT94, Lem. 4.4] (Diam<strong>on</strong>d-Darm<strong>on</strong>-Taylor) 4 for an ex- 4plicit presentati<strong>on</strong> of L f as a quotientL f∼ = Kf [. . . , U p , . . .]/Iwhere I is an ideal <strong>and</strong> the U p corresp<strong>on</strong>d to the prime divisors of N/N f .14.2 Decompositi<strong>on</strong> of J 1 (N)Let f be a newform in S 2 (Γ 1 (N)) of level a divisor M of N, so f ∈ S 2 (Γ 1 (M)) newis a normalized eigenform for all the <strong>Hecke</strong> operators of level M. We associateto f an abelian subvariety A f of J 1 (N), of dimensi<strong>on</strong> [L f : Q], as follows. Recallthat π f is the fth projector in T 0 ⊗ Q = ∏ g K g. We can not define A f to be theimage of J 1 (N) under π f , since π f is <strong>on</strong>ly, a priori, an element of End(J 1 (N))⊗Q.Fortunately, there exists a positive integer n such that nπ f ∈ End(J 1 (N)), <strong>and</strong> weletA f = nπ f (J 1 (N)).This is independent of the choice of n, since the choices for n are all multiples ofthe “denominator” n 0 of π f , <strong>and</strong> if A is any abelian variety <strong>and</strong> n is a positiveinteger, then nA = A.The natural map ∏ f A f → J 1 (N), which is induced by summing the inclusi<strong>on</strong>maps, is an isogeny. Also A f is simple if f is of level N, <strong>and</strong> otherwise A f isisogenous to a power of A ′ f ⊂ J 1(N f ). Thus we obtain an isogeny decompositi<strong>on</strong>of J 1 (N) as a product of Q-simple abelian varieties.Remark 14.2.1. The abelian varieties A f frequently decompose further over Q,i.e., they are not absolutely simple, <strong>and</strong> it is an interesting problem to determinean isogeny decompositi<strong>on</strong> of J 1 (N) Qas a product of simple abelian varieties. It isstill not known precisely how to do this computati<strong>on</strong>ally for any particular N. 5 5This decompositi<strong>on</strong> can be viewed in another way over the complex numbers.As a complex torus, J 1 (N)(C) has the following model:J 1 (N)(C) = Hom(S 2 (Γ 1 (N)), C)/H 1 (X 1 (N), Z).4 Remove parens.5 Add more/pointers/etc.


134 14. Abelian Varieties Attached to <strong>Modular</strong> <strong>Forms</strong>The acti<strong>on</strong> of the <strong>Hecke</strong> algebra T <strong>on</strong> J 1 (N)(C) is compatible with its acti<strong>on</strong> <strong>on</strong>the cotangent space S 2 (Γ 1 (N)). This c<strong>on</strong>structi<strong>on</strong> presents J 1 (N)(C) naturallyas V/L with V a complex vector space <strong>and</strong> L a lattice in V . The anemic <strong>Hecke</strong>algebra T 0 then decomposes V as a direct sum V = ⊕ f V f . The <strong>Hecke</strong> operatorsact <strong>on</strong> V f <strong>and</strong> L in a compatible way, so T 0 decomposes L ⊗ Q in a compatibleway. Thus L f = V f ∩ L is a lattice in V f , so we may A f (C) view as the complextorus V f /L f .Lemma 14.2.2. Let f ∈ S 2 (Γ 1 (N)) be a newform of level dividing N <strong>and</strong> A f =nπ f (J 1 (N)) be the corresp<strong>on</strong>ding abelian subvariety of J 1 (N). Then the <strong>Hecke</strong>algebra T ⊂ End(J 1 (N)) leaves A f invariant.Proof. The <strong>Hecke</strong> algebra T is commutative, so if t ∈ T, thentA f = tnπ f (J 1 (N)) = nπ f (tJ 1 (N)) ⊂ nπ f (J 1 (N)) = A f .Remark 14.2.3. Viewing A f (C) as V f /L f is extremely useful computati<strong>on</strong>ally,since L can be computed using modular symbols, <strong>and</strong> L f can be cut out usingthe <strong>Hecke</strong> operators. For example, if f <strong>and</strong> g are n<strong>on</strong>c<strong>on</strong>jugate newforms of leveldividing N, we can explicitly compute the group structure of A f ∩ A g ⊂ J 1 (N) bydoing a computati<strong>on</strong> with modular symbols in L. More precisely, we haveA f ∩ A g∼ = (L/(Lf + L g )) tor .Note that A f depends <strong>on</strong> viewing f as an element of S 2 (Γ 1 (N)) for some N.Thus it would be more accurate to denote A f by A f,N , where N is any multiple ofthe level of f, <strong>and</strong> to reserve the notati<strong>on</strong> A f for the case N = 1. Then dim A f,Nis dim A f times the number of divisors of N/N f .14.2.1 Aside: intersecti<strong>on</strong>s <strong>and</strong> c<strong>on</strong>gruencesSuppose f <strong>and</strong> g are not Galois c<strong>on</strong>jugate. Then the intersecti<strong>on</strong> Ψ = A f ∩ A gis finite, since V f ∩ V g = 0, <strong>and</strong> the integer #Ψ is of interest. This cardinalityis related to c<strong>on</strong>gruence between f <strong>and</strong> g, but the exact relati<strong>on</strong> is unclear. Forexample, <strong>on</strong>e might expect that p | #Ψ if <strong>and</strong> <strong>on</strong>ly if there is a prime ℘ of thecompositum K f .K g of residue characteristic p such that a q (f) ≡ a q (g) (mod ℘)for all q ∤ N. If p | #Ψ, then such a prime ℘ exists (take ℘ to be induced bya maximal ideal in the support of the n<strong>on</strong>zero T-module Ψ[p]). The c<strong>on</strong>verse isfrequently true, but is sometimes false. For example, if N is the prime 431 <strong>and</strong>f = q − q 2 + q 3 − q 4 + q 5 − q 6 − 2q 7 + · · ·g = q − q 2 + 3q 3 − q 4 − 3q 5 − 3q 6 + 2q 7 + · · · ,then f ≡ g (mod 2), but A f ∩ A g = 0. This example implies that “multiplicity<strong>on</strong>e fails” for level 431 <strong>and</strong> p = 2, so the <strong>Hecke</strong> algebra associated to J 0 (431) isnot Gorenstein (see [Lloyd Kilford paper] 6 for more details). 66 fix reference


14.3 Galois representati<strong>on</strong>s attached to A f 13514.3 Galois representati<strong>on</strong>s attached to A fIt is important to emphasize the case when f is a newform of level N, sincethen A f is Q-simple 7 <strong>and</strong> there is a compatible family of 2-dimensi<strong>on</strong>al l-adic 7representati<strong>on</strong>s attached to f, which arise from torsi<strong>on</strong> points <strong>on</strong> A f .Propositi<strong>on</strong> 14.1.2 implies that L f = K f . Fix such an f, let A = A f , let K = K f ,<strong>and</strong> letd = dim A = dim Q K = [K : Q].Let l be a prime <strong>and</strong> c<strong>on</strong>sider the Q l -adic Tate module Tate l (A) of A:Tate l (A) = Q l ⊗ lim A[l ν ]. ←−Note that as a Q l -vector space Tate l (A) ∼ = Q 2dl, since A[n] ∼ = (Z/nZ) 2d , as groups.There is a natural acti<strong>on</strong> of the ring K ⊗ Q Q l <strong>on</strong> Tate l (A). By algebraic numbertheoryK ⊗ Q Q l = ∏ K λ ,λ|lν>0where λ runs through the primes of the ring O K of integers of K lying over l <strong>and</strong>K λ denotes the completi<strong>on</strong> of K with respect to the absolute value induced by λ.Thus Tate l (A) decomposes as a productTate l (A) = ∏ λ|lTate λ (A)where Tate λ (A) is a K λ vector space.Lemma 14.3.1. Let the notati<strong>on</strong> be as above. Then for all λ lying over l,dim Kλ Tate λ (A) = 2.Proof. Write A = V/L, with V = V f a complex vector space <strong>and</strong> L a lattice. ThenTate λ (A) ∼ = L ⊗ Q l as K λ -modules (not as Gal(Q/Q)-modules!), since A[l n ] ∼ =L/l n L, <strong>and</strong> lim L/l n L ∼ ←−n= Z l ⊗L. Also, L⊗Q is a vector space over K, which musthave dimensi<strong>on</strong> 2, since L ⊗ Q has dimensi<strong>on</strong> 2d = 2 dim A <strong>and</strong> K has degree d.ThusTate λ (A) ∼ = L ⊗ K λ ≈ (K ⊕ K) ⊗ K K λ∼ = Kλ ⊕ K λhas dimensi<strong>on</strong> 2 over K λ .Now c<strong>on</strong>sider Tate λ (A), which is a K λ -vector space of dimensi<strong>on</strong> 2. The <strong>Hecke</strong>operators are defined over Q, so Gal(Q/Q) acts <strong>on</strong> Tate l (A) in a way compatiblewith the acti<strong>on</strong> of K ⊗ Q Q l . We thus obtain a homomorphismρ l = ρ f,l : Gal(Q/Q) → Aut K⊗Ql Tate l (A) ≈ GL 2 (K ⊗ Q l ) ∼ ∏= GL 2 (K λ ).Thus ρ l is the direct sum of l-adic Galois representati<strong>on</strong>s ρ λ whereρ λ : Gal(Q/Q) → End Kλ (Tate λ (A))λ7 add pointer to where this is proved.


136 14. Abelian Varieties Attached to <strong>Modular</strong> <strong>Forms</strong>gives the acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> Tate λ (A).If p ∤ lN, then ρ λ is unramified at p (see [ST68, Thm. 1]). In this case it makessense to c<strong>on</strong>sider ρ λ (ϕ p ), where ϕ p ∈ Gal(Q/Q) is a Frobenius element at p. Thenρ λ (ϕ p ) has a well-defined trace <strong>and</strong> determinant, or equivalently, a well-definedcharacteristic polynomial Φ(X) ∈ K λ [X].Theorem 14.3.2. Let f ∈ S 2 (Γ 1 (N), ε) be a newform of level N with Dirichletcharacter ε. Suppose p ∤ lN, <strong>and</strong> let ϕ p ∈ Gal(Q/Q) be a Frobenius element at p.Let Φ(X) be the characteristic polynomial of ρ λ (ϕ p ). ThenΦ(X) = X 2 − a p X + p · ε(p),where a p is the pth coefficient of the modular form f (thus a p is the image of T pin E f <strong>and</strong> ε(p) is the image of 〈p〉).Let ϕ = ϕ p . By the Cayley-Hamilt<strong>on</strong> theoremρ λ (ϕ) 2 − tr(ρ λ (ϕ))ρ λ (ϕ) + det(ρ λ (ϕ)) = 0.Using the Eichler-Shimura c<strong>on</strong>gruence relati<strong>on</strong> (see 8 ) we will show that tr(ρ λ (ϕ)) = 8a p , but we defer the proof of this until ... 9 . 9We will prove that det(ρ λ (ϕ)) = p in the special case when ε = 1. This willfollow from the equalitydet(ρ λ ) = χ l , (14.3.1)where χ l is the lth cyclotomic characterχ l : Gal(Q/Q) → Z ∗ l ⊂ K ∗ λ,which gives the acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> µ l ∞. We have χ l (ϕ) = p because ϕ inducesinduces pth powering map <strong>on</strong> µ l ∞.It remains to establish (14.3.1). The simplest case is when A is an elliptic curve.In [Sil92, ] 10 , Silverman shows that det(ρ l ) = χ l using the Weil pairing. We will 10c<strong>on</strong>sider the Weil pairing in more generality in the next secti<strong>on</strong>, <strong>and</strong> use it toestablish (14.3.1).14.3.1 The Weil pairingLet T l (A) = lim ←−n≥1A[l n ], so Tate l (A) = Q l ⊗ T l (A). The Weil pairing is a n<strong>on</strong>degenerateperfect pairinge l : T l (A) × T l (A ∨ ) → Z l (1).(See e.g., [Mil86, §16] for a summary of some of its main properties.)Remark 14.3.3. Identify Z/l n Z with µ l n by 1 ↦→ e −2πi/ln , <strong>and</strong> extend to a mapZ l → Z l (1). If J = Jac(X) is a Jacobian, then the Weil pairing <strong>on</strong> J is inducedby the can<strong>on</strong>ical isomorphismT l (J) ∼ = H 1 (X, Z l ) = H 1 (X, Z) ⊗ Z l ,8 Next week!9 when?10 get ref


14.3 Galois representati<strong>on</strong>s attached to A f 137<strong>and</strong> the cup product pairingH 1 (X, Z l ) ⊗ Zl H 1 (X, Z l )∪−−→ Z l .For more details see the discussi<strong>on</strong> <strong>on</strong> pages 210–211 of C<strong>on</strong>rad’s appendix to[RS01], <strong>and</strong> the references therein. In particular, note that H 1 (X, Z l ) is isomorphicto H 1 (X, Z l ), because H 1 (X, Z l ) is self-dual because of the intersecti<strong>on</strong> pairing. Itis easy to see that H 1 (X, Z l ) ∼ = T l (J) since by Abel-Jacobi J ∼ = T 0 (J)/ H 1 (X, Z),where T 0 (J) is the tangent space at J at 0 (see Lemma 14.3.1). 11 11Here Z l (1) ∼ = lim µ ←− l n is isomorphic to Z l as a ring, but has the acti<strong>on</strong> ofGal(Q/Q) induced by the acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> lim µ ←− l n. Given σ ∈ Gal(Q/Q),there is an element χ l (σ) ∈ Z ∗ lsuch that σ(ζ) = ζχl(σ) , for every l n th root ofunity ζ. If we view Z l (1) as just Z l with an acti<strong>on</strong> of Gal(Q/Q), then the acti<strong>on</strong>of σ ∈ Gal(Q/Q) <strong>on</strong> Z l (1) is left multiplicati<strong>on</strong> by χ l (σ) ∈ Z ∗ l .Definiti<strong>on</strong> 14.3.4 (Cyclotomic Character). The homomorphismis called the l-adic cyclotomic character.χ l : Gal(Q/Q) → Z ∗ lIf ϕ : A → A ∨ is a polarizati<strong>on</strong> (so it is an isogeny defined by translati<strong>on</strong> of anample invertible sheaf), we define a pairinge ϕ l : T l(A) × T l (A) → Z l (1) (14.3.2)by e ϕ l (a, b) = e l(a, ϕ(b)). The pairing (14.3.2) is a skew-symmetric, n<strong>on</strong>degenerate,bilinear pairing that is Gal(Q/Q)-equivariant, in the sense that if σ ∈ Gal(Q/Q),thene ϕ l (σ(a), σ(b)) = σ · eϕ l (a, b) = χ l(σ)e ϕ l(a, b).We now apply the Weil pairing in the special case A = A f ⊂ J 1 (N). Abelianvarieties attached to modular forms are equipped with a can<strong>on</strong>ical polarizati<strong>on</strong>called the modular polarizati<strong>on</strong>. The can<strong>on</strong>ical principal polarizati<strong>on</strong> of J 1 (N) isan isomorphism J 1 (N) ∼ −→ J 1 (N) ∨ , so we obtain the modular polarizati<strong>on</strong> ϕ =ϕ A : A → A ∨ of A, as illustrated in the following diagram:J 1 (N)Aautoduality ∼ =polarizati<strong>on</strong> ϕ A J 1 (N) ∨ A ∨C<strong>on</strong>sider (14.3.2) with ϕ = ϕ A the modular polarizati<strong>on</strong>. Tensoring over Q<strong>and</strong> restricting to Tate λ (A), we obtain a n<strong>on</strong>degenerate skew-symmetric bilinearpairinge : Tate λ (A) × Tate λ (A) → Q l (1). (14.3.3)The n<strong>on</strong>degeneracy follows from the n<strong>on</strong>degeneracy of e ϕ le ϕ l (Tate λ(A), Tate λ ′(A)) = 0<strong>and</strong> the observati<strong>on</strong> that11 Remove or exp<strong>and</strong>?


138 14. Abelian Varieties Attached to <strong>Modular</strong> <strong>Forms</strong>when λ ≠ λ ′ . This uses the Galois equivariance of e φ lcarries over to Galois equivarianceof e, in the following sense. If σ ∈ Gal(Q/Q) <strong>and</strong> x, y ∈ Tate λ (A), thene(σx, σy) = σe(x, y) = χ l (σ)e(x, y).Note that σ acts <strong>on</strong> Q l (1) as multiplicati<strong>on</strong> by χ l (σ).14.3.2 The DeterminantThere are two proofs of the theorem, a fancy proof <strong>and</strong> a c<strong>on</strong>crete proof. We firstpresent the fancy proof. The pairing e of (14.3.3) is a skew-symmetric <strong>and</strong> bilinearform so it determines a Gal(Q/Q)-equivarient homomorphism2∧K λTate λ (A) → Q l (1). (14.3.4)It is not a priori true that we can take the wedge product over K λ instead ofQ l , but we can because e(tx, y) = e(x, ty) for any t ∈ K λ . This is where we usethat A is attached to a newform with trivial character, since when the characteris n<strong>on</strong>trivial, the relati<strong>on</strong> between e(T p x, y) <strong>and</strong> e(x, T p y) will involve 〈p〉. LetD = ∧2 Tate λ (A) <strong>and</strong> note that dim Kλ D = 1, since Tate λ (A) has dimensi<strong>on</strong> 2over K λ .There is a can<strong>on</strong>ical isomorphismHom Ql (D, Q l (1)) ∼ = Hom Kλ (D, K λ (1)),<strong>and</strong> the map of (14.3.4) maps to an isomorphism D ∼ = K λ (1) of Gal(Q/Q)-modules.Since the representati<strong>on</strong> of Gal(Q/Q) <strong>on</strong> D is the determinant, <strong>and</strong> the representati<strong>on</strong><strong>on</strong> K λ (1) is the cyclotomic character χ l , it follows that det ρ λ = χ l .Next we c<strong>on</strong>sider a c<strong>on</strong>crete proof. If σ ∈ Gal(Q/Q), then we must show thatdet(σ) = χ l (σ). Choose a basis x, y ∈ Tate λ (A) of Tate λ (A) as a 2 dimensi<strong>on</strong>alK λ vector space. We have σ(x) = ax + cy <strong>and</strong> σ(y) = bx + dy, for a, b, c, d ∈ K λ .Thenχ l (σ)e(x, y) = 〈σx, σy)= e(ax + cy, bx + dy)= e(ax, bx) + e(ax, dy) + e(cy, bx) + e(cy, dy)= e(ax, dy) + e(cy, bx)= e(adx, y) − e(bcx, y)= e((ad − bc)x, y)= (ad − bc)e(x, y)To see that e(ax, bx) = 0, note thate(ax, bx) = e(abx, x) = −e(x, abx) = −e(ax, bx).Finally, since e is n<strong>on</strong>degenerate, there exists x, y such that e(x, y) ≠ 0, so χ l (σ) =ad − bc = det(σ).


14.4 Remarks about the modular polarizati<strong>on</strong> 13914.4 Remarks about the modular polarizati<strong>on</strong>Let A <strong>and</strong> ϕ be as in Secti<strong>on</strong> 14.3.1. The degree deg(ϕ) of the modular polarizati<strong>on</strong>of A is an interesting arithmetic invariant of A. If B ⊂ J 1 (N) is the sum of allmodular abelian varieties A g attached to newforms g ∈ S 2 (Γ 1 (N)), with g not aGalois c<strong>on</strong>jugate of f <strong>and</strong> of level dividing N, then ker(ϕ) ∼ = A ∩ B, as illustratedin the following diagram:ker(ϕ B ) ∼ =∼ =ker(ϕ A i)A ∩ BB A J 1 (N)ϕA ∨ B ∨Note that ker(ϕ B ) is also isomorphic to A ∩ B, as indicated in the diagram.In c<strong>on</strong>necti<strong>on</strong> with Secti<strong>on</strong> ??, the quantity ker(ϕ A ) = A∩B is closely related toc<strong>on</strong>gruences between f <strong>and</strong> eigenforms orthog<strong>on</strong>al to the Galois c<strong>on</strong>jugates of f.When A has dimensi<strong>on</strong> 1, we may alternatively view A as a quotient of X 1 (N)via the mapX 1 (N) → J 1 (N) → A ∨ ∼ = A.Then ϕ A : A → A is pullback of divisors to X 1 (N) followed by push forward,which is multiplicati<strong>on</strong> by the degree. Thus ϕ A = [n], where n is the degree of themorphism X 1 (N) → A of algebraic curves. The modular degree isdeg(X 1 (N) → A) = √ deg(ϕ A ).More generally, if A has dimensi<strong>on</strong> greater than 1, then deg(ϕ A ) has order a perfectsquare (for references, see [Mil86, Thm. 13.3]), <strong>and</strong> we define the modular degreeto be √ deg(ϕ A ).Let f be a newform of level N. In the spirit of Secti<strong>on</strong> 14.2.1 we use c<strong>on</strong>gruencesto define a number related to the modular degree, called the c<strong>on</strong>gruence number.For a subspace V ⊂ S 2 (Γ 1 (N)), let V (Z) = V ∩Z[[q]] be the elements with integralq-expansi<strong>on</strong> at ∞ <strong>and</strong> V ⊥ denotes the orthog<strong>on</strong>al complement of V with respectto the Peterss<strong>on</strong> inner product. The c<strong>on</strong>gruence number of f isr f = # S 2(Γ 1 (N))(Z)V f (Z) + V ⊥f (Z),where V f is the complex vector space spanned by the Galois c<strong>on</strong>jugates of f. Wethus have two positive associated to f, the c<strong>on</strong>gruence number r f <strong>and</strong> the modulardegree m f of of A f .Theorem 14.4.1. m f | r f


140 14. Abelian Varieties Attached to <strong>Modular</strong> <strong>Forms</strong>Ribet menti<strong>on</strong>s this in the case of elliptic curves in [ZAGIER, 1985] [Zag85a],but the statement is given incorrectly in that paper (the paper says that r f | m f ,which is wr<strong>on</strong>g). The proof for dimensi<strong>on</strong> greater than <strong>on</strong>e is in [AGASHE-STEIN,Manin c<strong>on</strong>stant...]. Ribet also subsequently proved that if p 2 ∤ N, then ord p (m f ) =ord p (r f ).We can make the same definiti<strong>on</strong>s with J 1 (N) replaced by J 0 (N), so if f ∈S 2 (Γ 0 (N)) is a newform, A f ⊂ J 0 (N), <strong>and</strong> the c<strong>on</strong>gruence number measures c<strong>on</strong>gruencesbetween f <strong>and</strong> other forms in S 2 (Γ 0 (N)). In [FM99, Ques. 4.4], they askwhether it is always the case that m f = r f when A f is an elliptic curve, <strong>and</strong> m f<strong>and</strong> r f are defined relative to Γ 0 (N). I 12 implemented an algorithm in MAGMA 12to compute r f , <strong>and</strong> found the first few counterexamples, which occur whenN = 54, 64, 72, 80, 88, 92, 96, 99, 108, 120, 124, 126, 128, 135, 144.For example, the elliptic curve A labeled 54B1 in [Cre97] has r A = 6 <strong>and</strong> m A = 2.To see directly that 3 | r A , observe that if f is the newform corresp<strong>on</strong>ding to E<strong>and</strong> g is the newform corresp<strong>on</strong>ding to X 0 (27), then g(q) + g(q 2 ) is c<strong>on</strong>gruent to fmodulo 3. This is c<strong>on</strong>sistent with Ribet’s theorem that if p | r A /m A then p 2 | N.There seems to be no absolute bound <strong>on</strong> the p that occur.It would be interesting to determine the answer to the analogue of the questi<strong>on</strong>of Frey-Mueller for Γ 1 (N). For example, if A ⊂ J 1 (54) is the curve isogeneous to54B1, then m A = 18 is divisible by 3. However, I do not know 13 r A in this case, 13because I haven’t written a program to compute it for Γ 1 (N).12 change...13 fix


15<strong>Modular</strong>ity of Abelian Varieties15.1 <strong>Modular</strong>ity over QDefiniti<strong>on</strong> 15.1.1 (<strong>Modular</strong> Abelian Variety). Let A be an abelian variety over Q.Then A is modular if there exists a positive integer N <strong>and</strong> a surjective mapJ 1 (N) → A defined over Q.The following theorem is the culminati<strong>on</strong> of a huge amount of work, whichstarted with Wiles’s successful attack [Wil95] <strong>on</strong> Fermat’s Last Theorem, <strong>and</strong>finished with [BCDT01].Theorem 15.1.2 (Breuil, C<strong>on</strong>rad, Diam<strong>on</strong>d, Taylor, Wiles). Let E be an ellipticcurve over Q. Then E is modular.We will say nothing about the proof here. 1 1If A is an abelian variety over Q, let End Q (A) denote the ring of endomorphismsof A that are defined over Q.Definiti<strong>on</strong> 15.1.3 (GL 2 -type). An abelian variety A over Q is of GL 2 -type if theendomorphism algebra Q ⊗ End Q (A) c<strong>on</strong>tains a number field of degree equal tothe dimensi<strong>on</strong> of A.For example, every elliptic curve E over Q is trivially of GL 2 -type, since Q ⊂Q ⊗ End Q (E).Propositi<strong>on</strong> 15.1.4. If A is an abelian variety over Q, <strong>and</strong> K ⊂ Q ⊗ End Q (A)is a field, then [K : Q] divides dim A.Proof. As discussed in [Rib92, §2], K acts faithfully <strong>on</strong> the tangent space Tan 0 (A/Q)over Q to A at 0, which is a Q vector space of dimensi<strong>on</strong> dim(A). Thus Tan 0 (A/Q)is a vector space over K, hence has Q-dimensi<strong>on</strong> a multiple of [K : Q].1 Also pointer to later in book.


142 15. <strong>Modular</strong>ity of Abelian VarietiesPropositi<strong>on</strong> 15.1.4 implies, in particular, that if E is an elliptic curve over Q,then End Q (E) = Q. Recall 2 that E has CM or is a complex multiplicati<strong>on</strong> elliptic 2curve if End Q(E) ≠ Z). Propositi<strong>on</strong> 15.1.4 implies that if E is a CM elliptic curve,the extra endomorphisms are never defined over Q.Propositi<strong>on</strong> 15.1.5. Suppose A = A f ⊂ J 1 (N) is an abelian variety attached toa newform of level N. Then A is of GL 2 -type.Proof. The endomorphism ring of A f c<strong>on</strong>tains O f = Z[. . . , a n (f), . . .], hence thefield K f = Q(. . . , a n (f), . . .) is c<strong>on</strong>tained in Q ⊗ End Q (A). Since A f = nπJ 1 (N),where π is a projector <strong>on</strong>to the factor K f of the anemic <strong>Hecke</strong> algebra T 0 ⊗ Z Q,we have dim A f = [K f : Q]. (One way to see this is to recall that the tangentspace T = Hom(S 2 (Γ 1 (N)), C) to J 1 (N) at 0 is free of rank 1 over T 0 ⊗ Z C.)C<strong>on</strong>jecture 15.1.6 (Ribet). Every abelian variety over Q of GL 2 -type is modular.Supposeρ : Gal(Q/Q) → GL 2 (F p )is an odd irreducible c<strong>on</strong>tinuous Galois representati<strong>on</strong>, where odd means thatdet(ρ(c)) = −1,where c is complex c<strong>on</strong>jugati<strong>on</strong>. We say that ρ is modular if there is a newformf ∈ S k (Γ 1 (N)), <strong>and</strong> a prime ideal ℘ ⊂ O f such that for all l ∤ Np, we haveTr(ρ(Frob l )) ≡ a l(mod ℘),Det(ρ(Frob l )) ≡ l k−1 · ε(l)(mod ℘).Here χ p is the p-adic cyclotomic character, <strong>and</strong> ε is the (Nebentypus) character ofthe newform f.C<strong>on</strong>jecture 15.1.7 (Serre). Every odd irreducible c<strong>on</strong>tinuous representati<strong>on</strong>ρ : Gal(Q/Q) → GL 2 (F p )is modular. Moreover, there is a formula for the “optimal” weight k(ρ) <strong>and</strong> levelN(ρ) of a newform that gives rise to ρ.In [Ser87], Serre describes the formula for the weight <strong>and</strong> level. Also, it is nowknown due to work of Ribet, Edixhoven, Coleman, Voloch, Gross, <strong>and</strong> others thatif ρ is modular, then ρ arises from a form of the c<strong>on</strong>jectured weight <strong>and</strong> level,except in some cases when p = 2. (For more details see the survey paper [RS01].)However, the full C<strong>on</strong>jecture 15.1.7 is known in very few cases.Remark 15.1.8. There is interesting recent work of Richard Taylor which c<strong>on</strong>nectsC<strong>on</strong>jecture 15.1.7 with the open questi<strong>on</strong> of whether every variety of a certain typehas a point over a solvable extensi<strong>on</strong> of Q. The questi<strong>on</strong> of the existence of solvablepoints (“solvability of varieties in radicals”) seems very difficult. For example, wed<strong>on</strong>’t even know the answer for genus <strong>on</strong>e curves, or have a good reas<strong>on</strong> to makea c<strong>on</strong>jecture either way (as far as I know 3 ). There’s a book of Mike Fried that 3discusses this solvability questi<strong>on</strong>. 4 42 from where3 fix4 Find the exact reference in that book <strong>and</strong> the exact book.


15.1 <strong>Modular</strong>ity over Q 143Serre’s c<strong>on</strong>jecture is very str<strong>on</strong>g. For example, it would imply modularity of allabelian varieties over Q that could possibly be modular, <strong>and</strong> the proof of thisimplicati<strong>on</strong> does not rely <strong>on</strong> Theorem 15.1.2.Theorem 15.1.9 (Ribet). Serre’s c<strong>on</strong>jectures <strong>on</strong> modularity of all odd irreduciblemod p Galois representati<strong>on</strong>s implies C<strong>on</strong>jecture 15.1.6.To give the reader a sense of the c<strong>on</strong>necti<strong>on</strong> between Serre’s c<strong>on</strong>jecture <strong>and</strong>modularity, we sketch some of the key ideas of the proof of Theorem 15.1.9; formore details the reader may c<strong>on</strong>sult Secti<strong>on</strong>s 1–4 of [Rib92].Without loss, we may assume that A is Q-simple. As explained in the not trivial[Rib92, Thm. 2.1], this hypothesis implies thatK = Q ⊗ Z End Q (A)is a number field of degree dim(A). The Tate modulesTate l (A) = Q l ⊗ lim A[l n ]←−n≥1are free of rank two over K ⊗ Q l , so the acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> Tate l (A) definesa representati<strong>on</strong>ρ A,l : Gal(Q/Q) → GL 2 (K ⊗ Q l ).Remarks 15.1.10. That these representati<strong>on</strong>s take values in GL 2 is why such A aresaid to be “of GL 2 -type”. Also, note that the above applies to A = A f ⊂ J 1 (N),<strong>and</strong> the l-adic representati<strong>on</strong>s attached to f are just the factors of ρ A,l comingfrom the fact that K ⊗ Q l∼ =∏λ|l K λ.The deepest input to Ribet’s proof is Faltings’s isogeny theorem, which Faltingsproved in order to prove Mordell’s c<strong>on</strong>jecture (there are <strong>on</strong>ly a finite number ofL-rati<strong>on</strong>al points <strong>on</strong> any curve over L of genus at least 2).If B is an abelian variety over Q, let∏1L(B, s) =det (1 − p −s · Frob p | Tate l (A)) = ∏ L p (B, s),pall primes pwhere l is a prime of good reducti<strong>on</strong> (it makes no difference which <strong>on</strong>e).Theorem 15.1.11 (Faltings). Let A <strong>and</strong> B be abelian varieties. Then A is isogenousto B if <strong>and</strong> <strong>on</strong>ly if L p (A, s) = L p (B, s) for almost all p.Using an analysis of Galois representati<strong>on</strong>s <strong>and</strong> properties of c<strong>on</strong>ductors <strong>and</strong>applying results of Faltings, Ribet finds an infinite set Λ of primes of K such thatall ρ A,λ are irredudible <strong>and</strong> there <strong>on</strong>ly finitely many Serre invariants N(ρ A,λ ) <strong>and</strong>k(ρ A,λ ). For each of these λ, by C<strong>on</strong>jecture 15.1.7 there is a newform f λ of levelN(ρ A,λ )) <strong>and</strong> weight k(ρ A,λ ) that gives rise to the mod l representati<strong>on</strong> ρ A,λ .Since Λ is infinite, but there are <strong>on</strong>ly finitely many Serre invariants N(ρ A,λ )),k(ρ A,λ ), there must be a single newform f <strong>and</strong> an infinite subset Λ ′ of Λ so thatfor every λ ∈ Λ ′ the newform f gives rise to ρ A,λ .Let B = A f ⊂ J 1 (N) be the abelian variety attached to f. Fix any prime p ofgood reducti<strong>on</strong>. There are infinitely many primes λ ∈ Λ ′ such that ρ A,λ∼ = ρB,˜λforsome ˜λ, <strong>and</strong> for these λ,det ( 1 − p −s · Frob p |A[λ] ) = det()1 − p −s · Frob p |B[˜λ] .


144 15. <strong>Modular</strong>ity of Abelian VarietiesThis means that the degree two polynomials in p −s (over the appropriate fields,e.g., K ⊗ Q l for A)det ( 1 − p −s · Frob p | Tate l (A) )<strong>and</strong>det ( 1 − p −s · Frob p | Tate l (B) )are c<strong>on</strong>gruent modulo infinitely many primes. Therefore they are equal. By Theorem15.1.11, it follows that A is isogenous to B = A f , so A is modular.15.2 <strong>Modular</strong>ity of elliptic curves over QDefiniti<strong>on</strong> 15.2.1 (<strong>Modular</strong> Elliptic Curve). An elliptic curve E over Q is modularif there is a surjective morphism X 1 (N) → E for some N.Definiti<strong>on</strong> 15.2.2 (Q-curve). An elliptic curve E over Q-bar is a Q-curve if forevery σ ∈ Gal(Q/Q) there is an isogeny E σ → E (over Q).Theorem 15.2.3 (Ribet). Let E be an elliptic curve over Q. If E is modular,then E is a Q-curve, or E has CM.This theorem is proved in [Rib92, §5].C<strong>on</strong>jecture 15.2.4 (Ribet). Let E be an elliptic curve over Q. If E is a Q-curve,then E is modular.In [Rib92, §6], Ribet proves that C<strong>on</strong>jecture 15.1.7 implies C<strong>on</strong>jecture 15.2.4. Hedoes this by showing that if a Q-curve E does not have CM then there is a Q-simpleabelian variety A over Q of GL 2 -type such that E is a simple factor of A over Q.This is accomplished finding a model for E over a Galois extensi<strong>on</strong> K of Q, restrictingscalars down to Q to obtain an abelian variety B = Res K/Q (E), <strong>and</strong> usingGalois cohomology computati<strong>on</strong>s (mainly in H 2 ’s) to find the required A of GL 2 -type inside B. Then Theorem 15.1.9 <strong>and</strong> our assumpti<strong>on</strong> that C<strong>on</strong>jecture 15.1.7 istrue together immediately imply that A is modular.Ellenberg <strong>and</strong> Skinner [ES00] have recently used methods similar to those usedby Wiles to prove str<strong>on</strong>g theorems toward C<strong>on</strong>jecture 15.2.4. See also Ellenberg’ssurvey [Ell02], which discusses earlier modularity results of Hasegawa, Hashimoto,Hida, Momose, <strong>and</strong> Shimura, <strong>and</strong> gives an example to show that there are infinitelymany Q-curves whose modularity is not known.Theorem 15.2.5 (Ellenberg, Skinner). Let E be a Q-curve over a number field Kwith semistable reducti<strong>on</strong> at all primes of K lying over 3, <strong>and</strong> suppose that K isunramified at 3. Then E is modular.15.3 <strong>Modular</strong>ity of abelian varieties over QHida discusses modularity of abelian varieties over Q in [Hid00]. Let A be anabelian variety over Q. For any subalgebra E ⊂ Q⊗End(A/Q), let Q⊗End E (A/Q)be the subalgebra of endomorphism that commute with E.


15.3 <strong>Modular</strong>ity of abelian varieties over Q 145Definiti<strong>on</strong> 15.3.1 (Real Multiplicati<strong>on</strong> Abelian Variety). An abelian variety Aover Q is a real multiplicati<strong>on</strong> abelian variety if there is a totally real field K with[K : Q] = dim(A) such thatK ⊂ Q ⊗ End(A/Q) <strong>and</strong> K = Q ⊗ End K (A/Q).If E is a CM elliptic curve, then E is not a real multiplicati<strong>on</strong> abelian variety,because the extra CM endomorphisms in End(E/Q) commute with K = Q, soK ≠ Q ⊗ End K (A/Q).In analogy with Q-curves, Hida makes the following definiti<strong>on</strong>.Definiti<strong>on</strong> 15.3.2 (Q-RMAV). Let A be an abelian variety over Q with realmultiplicati<strong>on</strong> by a field K. Then A is a Q-real multiplicati<strong>on</strong> abelian variety(abbreviated Q-RMAV) if for every σ ∈ Gal(Q/Q) there is a K-linear isogenyA σ → A. Here K acts <strong>on</strong> A σ via the can<strong>on</strong>ical isomorphism End(A) → End(A σ ),which exists since applying σ is nothing but relabeling everything.I haven’t had sufficient time to absorb Hida’s paper to see just what he provesabout such abelian varieties, so I’m not quite sure how to formulate the correctmodularity c<strong>on</strong>jectures <strong>and</strong> theorems in this generality.Elizabeth Pyle’s Ph.D. thesis [Pyl] under Ribet about “building blocks” is alsovery relevant to this secti<strong>on</strong>.


146 15. <strong>Modular</strong>ity of Abelian Varieties


16L-functi<strong>on</strong>s16.1 L-functi<strong>on</strong>s attached to modular formsLet f = ∑ n≥1 a nq n ∈ S k (Γ 1 (N)) be a cusp form.Definiti<strong>on</strong> 16.1.1 (L-series). The L-series of f isL(f, s) = ∑ n≥1a nn s .Definiti<strong>on</strong> 16.1.2 (Λ-functi<strong>on</strong>). The completed Λ functi<strong>on</strong> of f isΛ(f, s) = N s/2 (2π) −s Γ(s)L(f, s),whereΓ(s) =∫ ∞0e −t t s dttis the Γ functi<strong>on</strong> (so Γ(n) = (n − 1)! for positive integers n).We can view Λ(f, s) as a (Mellin) transform of f, in the following sense:Propositi<strong>on</strong> 16.1.3. We haveΛ(f, s) = N s/2 ∫ ∞<strong>and</strong> this integral c<strong>on</strong>verges for Re(s) > k 2 + 1.0f(iy)y s dyy ,


148 16. L-functi<strong>on</strong>sProof. We have∫ ∞0f(iy)y s dyy = ∫ ∞0∞∑n=1a n e −2πny y s dyy∞∑∫ ∞= a n e −t (2πn) −s t s dtn=1 0t∞∑= (2π) −s a nΓ(s)n s .n=1(t = 2πny)To go from the first line to the sec<strong>on</strong>d line, we reverse the summati<strong>on</strong> <strong>and</strong> integrati<strong>on</strong><strong>and</strong> perform the change of variables t = 2πny. (We omit discussi<strong>on</strong> ofc<strong>on</strong>vergence. 1 ) 116.1.1 Analytic c<strong>on</strong>tinuati<strong>on</strong> <strong>and</strong> functi<strong>on</strong>al equati<strong>on</strong>sWe define ) the Atkin-Lehner operator W N <strong>on</strong> S k (Γ 1 (N)) as follows. If w N =, then [w2N ] k acts as (−N) k−2 , so if( 0 −1N 0W N (f) = N 1− k 2 · f|[wN ] k ,then WN 2 = (−1)k . (Note that W N is an involuti<strong>on</strong> when k is even.) It is easyto check directly that if γ ∈ Γ 1 (N), then w N γw −1N ∈ Γ 1(N), so W N preservesS k (Γ 1 (N)). Note that in general W N does not commute with the <strong>Hecke</strong> operatorsT p , for p | N.The following theorem is mainly due to <strong>Hecke</strong> (<strong>and</strong> maybe other people, at leastin this generality). For a very general versi<strong>on</strong> of this theorem, see [Li75]. 2 2Theorem 16.1.4. Suppose f ∈ S k (Γ 1 (N), χ) is a cusp form with character χ.Then Λ(f, s) extends to an entire (holomorphic <strong>on</strong> all of C) functi<strong>on</strong> which satisfiesthe functi<strong>on</strong>al equati<strong>on</strong>Λ(f, s) = i k Λ(W N (f), k − s).Since N s/2 (2π) −s Γ(s) is everywhere n<strong>on</strong>zero, Theorem 16.1.4 implies that L(f, s)also extends to an entire functi<strong>on</strong>.It follows from Definiti<strong>on</strong> 16.1.2 that Λ(cf, s) = cΛ(f, s) for any c ∈ C. Thusif f is a W N -eigenform, so that W N (f) = wf for some w ∈ C, then the functi<strong>on</strong>alequati<strong>on</strong> becomesΛ(f, s) = i k wΛ(f, k − s).If k = 2, then W N is an involuti<strong>on</strong>, so w = ±1, <strong>and</strong> the sign in the functi<strong>on</strong>alequati<strong>on</strong> is ε(f) = i k w = −w, which is the negative of the sign of the Atkin-Lehner involuti<strong>on</strong> W N <strong>on</strong> f. It is straightforward to show that ε(f) = 1 if <strong>and</strong><strong>on</strong>ly if ord s=1 L(f, s) is even. Parity observati<strong>on</strong>s such as this are extremely usefulwhen trying to underst<strong>and</strong> the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture.1 change for book2 Add refs. – see page 58 of Diam<strong>on</strong>d-Im.


16.1 L-functi<strong>on</strong>s attached to modular forms 149Sketch of proof of Theorem 16.1.4 when N = 1. We follow [Kna92, §VIII.5] closely.Note that since w 1 = ( )0 1−1 0 ∈ SL2 (Z), the c<strong>on</strong>diti<strong>on</strong> W 1 (f) = f is satisfied forany f ∈ S k (1). This translates into the equalityf(− 1 )= z k f(z). (16.1.1)zWrite z = x + iy with x <strong>and</strong> y real. Then (16.1.1) al<strong>on</strong>g the positive imaginaryaxis (so z = iy with y positive real) is( ) if = i k y k f(iy). (16.1.2)yFrom Propositi<strong>on</strong> 16.1.3 we haveΛ(f, s) =∫ ∞<strong>and</strong> this integral c<strong>on</strong>verges for Re(s) > k 2 + 1.Again using growth estimates, <strong>on</strong>e shows that∫ ∞10f(iy)y s−1 dy, (16.1.3)f(iy)y s−1 dyc<strong>on</strong>verges for all s ∈ C, <strong>and</strong> defines an entire functi<strong>on</strong>. Breaking the path in (16.1.3)at 1, we have for Re(s) > k 2 + 1 thatΛ(f, s) =∫ 10f(iy)y s−1 dy +∫ ∞1f(iy)y s−1 dy.Apply the change of variables t = 1/y to the first term <strong>and</strong> use (16.1.2) to getThus∫ 10f(iy)y s−1 dy ===∫ 1∞∫ ∞1∫ ∞∫ ∞Λ(f, s) = i k f(it)t k−s−1 dt +11−f(i/t)t 1−s 1 t 2 dtf(i/t)t −1−s dti k t k f(it)t −1−s dt∫ ∞= i k f(it)t k−1−s dt.1∫ ∞1f(iy)y s−1 dy.The first term is just a translati<strong>on</strong> of the sec<strong>on</strong>d, so the first term extends to anentire functi<strong>on</strong> as well. Thus Λ(f, s) extends to an entire functi<strong>on</strong>.The proof of the general case for Γ 0 (N) is almost the same, except the path isbroken at 1/ √ N, since i/ √ N is a fixed point for w N .


150 16. L-functi<strong>on</strong>s16.1.2 A C<strong>on</strong>jecture about n<strong>on</strong>vanishing of L(f, k/2)Suppose f ∈ S k (1) is an eigenform. If k ≡ 2 (mod 4), then L(f, k/2) = 0 forreas<strong>on</strong>s related to the discussi<strong>on</strong> after the statement of Theorem 16.1.4. On theother h<strong>and</strong>, if k ≡ 0 (mod 4), then ord s=k/2 L(f, k/2) is even, so L(f, k/2) may ormay not vanish.C<strong>on</strong>jecture 16.1.5. Suppose k ≡ 0 (mod 4). Then L(f, k/2) ≠ 0.According to [CF99], C<strong>on</strong>jecture 16.1.5 is true for weight k if there is some nsuch that the characteristic polynomial of T n <strong>on</strong> S k (1) is irreducible. Thus Maeda’sc<strong>on</strong>jecture implies C<strong>on</strong>jecture 16.1.5. Put another way, if you find an f of level 1<strong>and</strong> weight k ≡ 0 (mod 4) such that L(f, k/2) = 0, then Maeda’s c<strong>on</strong>jecture isfalse for weight k.Oddly enough, 3 I pers<strong>on</strong>ally find C<strong>on</strong>jecture 16.1.5 less c<strong>on</strong>vincing that Maeda’s 3c<strong>on</strong>jecture, despite it being a weaker c<strong>on</strong>jecture.16.1.3 Euler productsEuler products make very clear how L-functi<strong>on</strong>s of eigenforms encode deep arithmeticinformati<strong>on</strong> about representati<strong>on</strong>s of Gal(Q/Q). Given a “compatible family”of l-adic representati<strong>on</strong>s ρ of Gal(Q/Q), <strong>on</strong>e can define an Euler productL(ρ, s), but in general it is very hard to say anything about the analytic propertiesof L(ρ, s). However, as we saw above, when ρ is attached to a modular form, weknow that L(ρ, s) is entire.Theorem 16.1.6. Let f = ∑ a n q n be a newform in S k (Γ 1 (N), ε), <strong>and</strong> let L(f, s) =∑n≥1 a nn −s be the associated Dirichlet series. Then L(f, s) has an Euler productL(f, s) = ∏ p|N11 − a p p −s · ∏11 − a p p −s + ε(p)p k−1 p −2s .p∤NNote that it is not really necessary to separate out the factors with p | N as wehave d<strong>on</strong>e, since ε(p) = 0 whenever p | N. Also, note that the denominators are ofthe form F (p −s ), whereF (X) = 1 − a p X + ε(p)p k−1 X 2is the reverse of the characteristic polynomial of Frob p acting <strong>on</strong> any of the l-adicrepresentati<strong>on</strong>s attached to f, with p ≠ l.Recall that if p is a prime, then for every r ≥ 2 the <strong>Hecke</strong> operators satisfy therelati<strong>on</strong>shipT p r = T p r−1T p − p k−1 ε(p)T p r−2. (16.1.4)Lemma 16.1.7. For every prime p we have the formal equality∑T p rX r 1=1 − T p X + ε(p)p k−1 X 2 . (16.1.5)r≥03 remove from book


16.1 L-functi<strong>on</strong>s attached to modular forms 151[[THERE IS A COMMENTED OUT POSTSCRIPT PICTURE – look at source<strong>and</strong> redo it in Sage]]FIGURE 16.1.1. Graph of L(E, s) for s real, for curves of ranks 0 to 3.Proof. Multiply both sides of (16.1.5) by 1 − T p X + ε(p)p k−1 X 2 to obtain theequati<strong>on</strong>∑(ε(p)p k−1 T p r)X r+2 = 1.r≥0T p rX r − ∑ r≥0(T p rT p )X r+1 + ∑ r≥0This equati<strong>on</strong> is true if <strong>and</strong> <strong>on</strong>ly if the lemma is true. Equality follows by checkingthe first few terms <strong>and</strong> shifting the index down by 1 for the sec<strong>on</strong>d sum <strong>and</strong> downby 2 for the third sum, then using (16.1.4).Note that ε(p) = 0 when p | N, so when p | N∑T p rX r 1=1 − T p X .r≥0Since the eigenvalues a n of f also satisfy (16.1.4), we obtain each factor of theEuler product of Theorem 16.1.6 by substituting the a n for the T n <strong>and</strong> p −s for Xinto (16.1.4). For (n, m) = 1, we have a nm = a n a m , so∑n≥1a nn s = ∏ p⎛⎝ ∑ r≥0⎞a p r⎠p rs ,which gives the full Euler product for L(f, s) = ∑ a n n −s .16.1.4 Visualizing L-functi<strong>on</strong>A. Shwayder did his Harvard junior project with me 4 <strong>on</strong> visualizing L-functi<strong>on</strong>s of 4elliptic curves (or equivalently, of newforms f = ∑ a n q n ∈ S 2 (Γ 0 (N)) with a n ∈ Zfor all n. The graphs in Figures 16.1.1–?? of L(E, s), for s real, <strong>and</strong> |L(E, s)|, for scomplex, are from his paper.4 refine


152 16. L-functi<strong>on</strong>s


17The Birch <strong>and</strong> Swinnert<strong>on</strong>-DyerC<strong>on</strong>jectureThis chapter is about the c<strong>on</strong>jecture of Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer <strong>on</strong> the arithmeticof abelian varieties. We focus primarily <strong>on</strong> abelian varieties attached tomodular forms.In the 1960s, Sir Peter Swinnert<strong>on</strong>-Dyer worked with the EDSAC computer labat Cambridge University, <strong>and</strong> developed an operating system that ran <strong>on</strong> thatcomputer (so he told me <strong>on</strong>ce). He <strong>and</strong> Bryan Birch programmed EDSAC tocompute various quantities associated to elliptic curves. They then formulated thec<strong>on</strong>jectures in this chapter in the case of dimensi<strong>on</strong> 1 (see [Bir65, Bir71, SD67]).Tate formulated the c<strong>on</strong>jectures in a functorial way for abelian varieties of arbitrarydimensi<strong>on</strong> over global fields in [Tat66], <strong>and</strong> proved that if the c<strong>on</strong>jecture is true foran abelian variety A, then it is also true for each abelian variety isogenous to A.Suitably interpreted, the c<strong>on</strong>jectures may by viewed as generalizing the analyticclass number formula, <strong>and</strong> Bloch <strong>and</strong> Kato generalized the c<strong>on</strong>jectures toGrothendieck motives in [BK90].17.1 The Rank c<strong>on</strong>jectureLet A be an abelian variety over a number field K.Definiti<strong>on</strong> 17.1.1 (Mordell-Weil Group). The Mordell-Weil group of A is theabelian group AK) of all K-rati<strong>on</strong>al points <strong>on</strong> A.Theorem 17.1.2 (Mordell-Weil). The Mordell-Weil group A(K) of A is finitelygenerated.The proof is n<strong>on</strong>trivial <strong>and</strong> combines two ideas. First, <strong>on</strong>e proves the “weakMordell-Weil theorem”: for any integer m the quotient A(K)/mA(K) is finite.This is proved by combining Galois cohomology techniques with st<strong>and</strong>ard finitenesstheorems from algebraic number theory. The sec<strong>on</strong>d idea is to introduce the Nér<strong>on</strong>-


154 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jectureTate can<strong>on</strong>ical height h : A(K) → R ≥0 <strong>and</strong> use properties of h to deduce, fromfiniteness of A(K)/mA(K), that A(K) itself is finitely generated.Definiti<strong>on</strong> 17.1.3 (Rank). By the structure theorem A(K) ∼ = Z r ⊕ G tor , where ris a n<strong>on</strong>negative integer <strong>and</strong> G tor is the torsi<strong>on</strong> subgroup of G. The rank of A is r.Let f ∈ S 2 (Γ 1 (N)) be a newform of level N, <strong>and</strong> let A = A f ⊂ J 1 (N) bethe corresp<strong>on</strong>ding abelian variety. Let f 1 , . . . , f d denote the Gal(Q/Q)-c<strong>on</strong>jugatesof f, so if f = ∑ a n q n , then f i = ∑ σ(a n )q n , for some σ ∈ Gal(Q/Q).Definiti<strong>on</strong> 17.1.4 (L-functi<strong>on</strong> of A). We define the L-functi<strong>on</strong> of A = A f (orany abelian variety isogenous to A) to beL(A, s) =d∏L(f i , s).i=1By Theorem 16.1.4, each L(f i , s) is an entire functi<strong>on</strong> <strong>on</strong> C, so L(A, s) is entire.In Secti<strong>on</strong> 17.4 we will discuss an intrinsic way to define L(A, s) that does notrequire that A be attached to a modular form. However, in general we do notknow that L(A, s) is entire.C<strong>on</strong>jecture 17.1.5 (Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer). The rank of A(Q) is equal toord s=1 L(A, s).One motivati<strong>on</strong> for C<strong>on</strong>jecture 17.1.5 is the following formal observati<strong>on</strong>. Assumefor simplicity of notati<strong>on</strong> that dim A = 1. By Theorem 16.1.6, the L-functi<strong>on</strong>L(A, s) = L(f, s) has an Euler product representati<strong>on</strong>L(A, s) = ∏ p|N11 − a p p −s · ∏ 11 − a p p −s + p · p −2s ,which is valid for Re(s) sufficiently large. (Note that ε = 1, since A is a modular ellipticcurve, hence a quotient of X 0 (N).) There is no loss in c<strong>on</strong>sidering the productL ∗ (A, s) over <strong>on</strong>ly the good primes p ∤ N, since ord s=1 L(A, s) = ord s=1 L ∗ (A, s)(because ∏ p|N11−a pp −sp∤Nis n<strong>on</strong>zero at s = 1). We then have formally thatL ∗ (A, 1) = ∏ p∤N= ∏ p∤N= ∏ p∤N11 − a p p −1 + p −1pp − a p + 1p#A(F p )The intuiti<strong>on</strong> is that if the rank of A is large, i.e., A(Q) is large, then each groupA(F p ) will also be large since it has many points coming from reducing the elementsof A(Q) modulo p. It seems likely that if the groups #A(F p ) are unusuallylarge, then L ∗ (A, 1) = 0, <strong>and</strong> computati<strong>on</strong>al evidence suggests the more preciseC<strong>on</strong>jecture 17.1.5.Example 17.1.6. Let A 0 be the elliptic curve y 2 + y = x 3 − x 2 , which has rank 0<strong>and</strong> c<strong>on</strong>ductor 11, let A 1 be the elliptic curve y 2 +y = x 3 −x, which has rank 1 <strong>and</strong>


17.2 Refined rank zero c<strong>on</strong>jecture 155c<strong>on</strong>ductor 37, let A 2 be the elliptic curve y 2 + y = x 3 + x 2 − 2x, which has rank 2<strong>and</strong> c<strong>on</strong>ductor 389, <strong>and</strong> finally let A 3 be the elliptic curve y 2 + y = x 3 − 7x + 6,which has rank 3 <strong>and</strong> c<strong>on</strong>ductor 5077. By an exhaustive search, these are known tobe the smallest-c<strong>on</strong>ductor elliptic curves of each rank. C<strong>on</strong>jecture 17.1.5 is knownto be true for them, the most difficult being A 3 , which relies <strong>on</strong> the results of[GZ86].The following diagram illustrates |#A i (F p )| for p < 100, for each of these curves.The height of the red line (first) above the prime p is |#A 0 (F p )|, the green line(sec<strong>on</strong>d) gives the value for A 1 , the blue line (third) for A 2 , <strong>and</strong> the black line(fourth) for A 3 . The intuiti<strong>on</strong> described above suggests that the clumps shouldlook like triangles, with the first line shorter than the sec<strong>on</strong>d, the sec<strong>on</strong>d shorterthan the third, <strong>and</strong> the third shorter than the fourth—however, this is visibly notthe case. The large Mordell-Weil group over Q does not increase the size of everyE(F p ) as much as we might at first suspect. N<strong>on</strong>etheless, the first line is no l<strong>on</strong>gerthan the last line for every p except p = 41, 79, 83, 97.[[THERE IS A COMMENTED OUT POSTSCRIPT PICTURE – look at source<strong>and</strong> redo it in Sage]]Remark 17.1.7. Suppose that L(A, 1) ≠ 0. Then assuming the Riemann hypothesisfor L(A, s) (i.e., that L(A, s) ≠ 0 for Re(s) > 1), Goldfeld [Gol82] proved that theEuler product for L(A, s), formally evaluated at 1, c<strong>on</strong>verges but does not c<strong>on</strong>vergeto L(A, 1). Instead, it c<strong>on</strong>verges (very slowly) to L(A, 1)/ √ 2. For further details<strong>and</strong> insight into this strange behavior, see [C<strong>on</strong>03].Remark 17.1.8. The Clay Math Institute has offered a <strong>on</strong>e milli<strong>on</strong> dollar prize fora proof of C<strong>on</strong>jecture 17.1.5 for elliptic curves over Q. See [Wil00].Theorem 17.1.9 (Kolyvagin-Logachev). Suppose f ∈ S 2 (Γ 0 (N)) is a newformsuch that ord s=1 L(f, s) ≤ 1. Then C<strong>on</strong>jecture 17.1.5 is true for A f .Theorem 17.1.10 (Kato). Suppose f ∈ S 2 (Γ 1 (N)) <strong>and</strong> L(f, 1) ≠ 0. Then C<strong>on</strong>jecture17.1.5 is true for A f .17.2 Refined rank zero c<strong>on</strong>jectureLet f ∈ S 2 (Γ 1 (N)) be a newform of level N, <strong>and</strong> let A f ⊂ J 1 (N) be the corresp<strong>on</strong>dingabelian variety.The following c<strong>on</strong>jecture refines C<strong>on</strong>jecture 17.1.5 in the case L(A, 1) ≠ 0. Werecall some of the notati<strong>on</strong> below, where we give a formula for L(A, 1)/Ω A , whichcan be computed up to an vinteger, which we call the Manin index. Note that thedefiniti<strong>on</strong>s, results, <strong>and</strong> proofs in this secti<strong>on</strong> are all true exactly as stated withX 1 (N) replaced by X 0 (N), which is relevant if <strong>on</strong>e wants to do computati<strong>on</strong>s.C<strong>on</strong>jecture 17.2.1 (Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer). Suppose L(A, 1) ≠ 0. ThenL(A, 1)= #X(A) · ∏p|N c pΩ A #A(Q) tor · #A ∨ .(Q) torBy Theorem 17.1.10, the group X(A) is finite, so the right h<strong>and</strong> side makessense. The right h<strong>and</strong> side is a rati<strong>on</strong>al number, so if C<strong>on</strong>jecture 17.2.1 is true,then the quotient L(A, 1)/Ω A should also be a rati<strong>on</strong>al number. In fact, this is


156 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture[[THERE IS A COMMENTED OUT POSTSCRIPT PICTURE – look at source<strong>and</strong> redo it in Sage]]FIGURE 17.2.1. Graphs of real soluti<strong>on</strong>s to y 2 z = x 3 − xz 2 <strong>on</strong> three affine patchestrue, as we will prove below (see Theorem 17.2.11). Below we will discuss aspectsof the proof of rati<strong>on</strong>ality in the case that A is an elliptic curve, <strong>and</strong> at the end ofthis secti<strong>on</strong> we give a proof of the general case.In to more easily underst<strong>and</strong>ing L(A, 1)/Ω A , it will be easiest to work withA = A ∨ f , where A∨ fis the dual of A f . We view A naturally as a quotient ofJ 1 (N) as follows. Dualizing the map A f ↩→ J 1 (N) we obtain a surjective mapJ 1 (N) → A ∨ f . Passing to the dual doesn’t affect whether or not L(A, 1)/Ω A israti<strong>on</strong>al, since changing A by an isogeny does not change L(A, 1), <strong>and</strong> <strong>on</strong>ly changesΩ A by multiplicati<strong>on</strong> by a n<strong>on</strong>zero rati<strong>on</strong>al number.17.2.1 The Number of real comp<strong>on</strong>entsDefiniti<strong>on</strong> 17.2.2 (Real Comp<strong>on</strong>ents). Let c ∞ be the number of c<strong>on</strong>nected comp<strong>on</strong>entsof A(R).If A is an elliptic curve, then c ∞ = 1 or 2, depending <strong>on</strong> whether the graph ofthe affine part of A(R) in the plane R 2 is c<strong>on</strong>nected. For example, Figure 17.2.1shows the real points of the elliptic curve defined by y 2 = x 3 − x in the three affinepatches that cover P 2 . The completed curve has two real comp<strong>on</strong>ents.In general, there is a simple formula for c ∞ in terms of the acti<strong>on</strong> of complexc<strong>on</strong>jugati<strong>on</strong> <strong>on</strong> H 1 (A(R), Z), which can be computed using modular symbols. Theformula islog 2 (c ∞ ) = dim F2 A(R)[2] − dim(A).17.2.2 The Manin indexThe map J 1 (N) → A induces a map J → A <strong>on</strong> Nér<strong>on</strong> models. Pullback ofdifferentials defines a mapH 0 (A, Ω 1 A/Z ) → H0 (J , Ω 1 J /Z). (17.2.1)One can show 1 that there is a q-expansi<strong>on</strong> map 1H 0 (J , Ω 1 J /Z) → Z[[q]] (17.2.2)which agrees with the usual q-expansi<strong>on</strong> map after tensoring with C. (For us X 1 (N)is the curve that parameterizes pairs (E, µ N ↩→ E), so that there is a q-expansi<strong>on</strong>map with values in Z[[q]].)Let ϕ A be the compositi<strong>on</strong> of (17.2.1) with (17.2.2).Definiti<strong>on</strong> 17.2.3 (Manin Index). The Manin index c A of A is the index ofϕ A (H 0 (A, Ω 1 A/Z)) in its saturati<strong>on</strong>. I.e., it is the order of the quotient group()Z[[q]]ϕ A (H 0 (A, Ω 1 A/Z )) .tor1 reference or further discussi<strong>on</strong>


Open Problem 17.2.4. Find an algorithm to compute c A .17.2 Refined rank zero c<strong>on</strong>jecture 157Manin c<strong>on</strong>jectured that c A = 1 when dim A = 1, <strong>and</strong> I think c A = 1 in general.C<strong>on</strong>jecture 17.2.5 (Agashe, <strong>Stein</strong>). c A = 1.This c<strong>on</strong>jecture is false if A is not required to be attached to a newform, evenif A f ⊂ J 1 (N) new . For example, Adam Joyce, a student of Kevin Buzzard, foundan A ⊂ J 1 (431) (<strong>and</strong> also A ′ ⊂ J 0 (431)) whose Manin c<strong>on</strong>stant is 2. Here A isisogenous over Q to a product of two elliptic curves. 2 Also, the Manin index for 2J 0 (33) (viewed as a quotient of J 0 (33)) is divisible by 3, because there is a cuspform in S 2 (Γ 0 (33)) that has integer Fourier expansi<strong>on</strong> at ∞, but not at <strong>on</strong>e of theother cusps.Theorem 17.2.6. If f ∈ S 2 (Γ 0 (N)) then the Manin index c of A ∨ fcan <strong>on</strong>lydivisible by 2 or primes whose square divides N. Moreover, if 4 ∤ N, then ord 2 (c) ≤dim(A f ).The proof involves applying n<strong>on</strong>trivial theorems of Raynaud about exactnessof sequences of differentials, then using a trick with the Atkin-Lehner involuti<strong>on</strong>,which was introduced by Mazur in [Maz78], <strong>and</strong> finally <strong>on</strong>e applies the“q-expansi<strong>on</strong> principle” in characteristic p to deduce the result (see [?]). Also,Edixhoven claims he can prove that if A f is an elliptic curve then c A is <strong>on</strong>ly divisibleby 2, 3, 5, or 7. His argument use his semistable models for X 0 (p 2 ), but myunderst<strong>and</strong>ing is that the details are not all written up. 3 317.2.3 The Real volume Ω ADefiniti<strong>on</strong> 17.2.7 (Real Volume). The real volume Ω A of A(R) is the volumeof A(R) with respect to a measure obtained by wedging together a basis forH 0 (A, Ω 1 ).If A is an elliptic curve with minimal Weierstrass equati<strong>on</strong>then <strong>on</strong>e can show thaty 2 + a 1 xy + a 3 y = x 3 + a 2 x 2 + a 4 x + a 6 ,ω =is a basis for H 0 (A, Ω 1 ). Thus∫Ω A =dx2y + a 1 x + a 3(17.2.3)A(R)dx2y + a 1 x + a 3.There is a fast algorithm for computing Ω A , for A an elliptic curve, which relies <strong>on</strong>the quickly-c<strong>on</strong>vergent Gauss arithmetic-geometric mean (see [Cre97, §3.7]). Forexample, if A is the curve defined by y 2 = x 3 − x (this is a minimal model), thenΩ A ∼ 2 × 2.622057554292119810464839589.For a general abelian variety A, it is an open problem to compute Ω A . However, wecan compute Ω A /c A , where c A is the Manin index of A, by explicitly computing Aas a complex torus using the period mapping Φ, which we define in the next secti<strong>on</strong>.2 Add better reference.3 Refine.


158 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture17.2.4 The Period mappingLetΦ : H 1 (X 1 (N), Z) → Hom C (Cf 1 + · · · + Cf d , C)be the period mapping <strong>on</strong> integral homology induced by integrating homologyclasses <strong>on</strong> X 0 (N) against the C-vector space spanned by the Gal(Q/Q)-c<strong>on</strong>jugatesf i of f. Extend Φ to H 1 (X 1 (N), Q) by Q-linearity. We normalize Φ so thatΦ({0, ∞})(f) = L(f, 1). More explicitly, for α, β ∈ P 1 (Q), we have∫ βΦ({α, β})(f) = −2πi f(z)dz.αThe motivati<strong>on</strong> for this normalizati<strong>on</strong> is thatL(f, 1) = −2πi∫ i∞0f(z)dz, (17.2.4)which we see immediately from the Mellin transform definiti<strong>on</strong> of L(f, s):∫ i∞L(f, s) = (2π) s Γ(s) −1 (−iz) s f(z) dzz .17.2.5 The Manin-Drinfeld theoremRecall the Manin-Drinfeld theorem, which we proved l<strong>on</strong>g ago, asserts that {0, ∞} ∈H 1 (X 0 (N), Q). We proved this by explicitly computing (p + 1 − T p )({0, ∞}), forp ∤ N, noting that the result is in H 1 (X 0 (N), Z), <strong>and</strong> inverting p + 1 − T p . Thusthere is an integer n such that n{0, ∞} ∈ H 1 (X 0 (N), Z).Suppose that A = A ∨ f is an elliptic curve quotient of J 0(N). Rewriting (17.2.4)in terms of Φ, we have Φ({0, ∞}) = L(f, 1). Let ω be a minimal differential <strong>on</strong> A,as in (17.2.3), so ω = −c A · 2πif(z)dz, where c A is the Manin index of A, <strong>and</strong> theequality is after pulling ω back to H 0 (X 0 (N), Ω) ∼ = S 2 (Γ 0 (N)). Note that whenwe defined c A , there was no factor of 2πi, since we compared ω with f(q) dqq , <strong>and</strong>q = e 2πiz , so dq/q = 2πidz.17.2.6 The Period latticeThe period lattice of A with respect to a n<strong>on</strong>zero differential g <strong>on</strong> A is{∫}L g = g : γ ∈ H 1 (A, Z) ,γ<strong>and</strong> we have A(C) ∼ = C/L g . This is the Abel-Jacobi theorem, <strong>and</strong> the significanceof g is that we are choosing a basis for the <strong>on</strong>e-dimensi<strong>on</strong>al C-vector spaceHom(H 0 (A, Ω), C), in order to embed the image of H 1 (A, Z) in C.The integral ∫ g is “visible” in terms of the complex torus representati<strong>on</strong>A(R)of A(C) = C/L g . More precisely, if L g is not rectangular, then A(R) may beidentified ∫ with the part of the real line in a fundamental domain for L g , <strong>and</strong>A(R) g is the length of this segment of the real line. If L g is rectangular, thenit is that line al<strong>on</strong>g with another line above it that is midway to the top of thefundamental domain.0


The real volume, which appears in C<strong>on</strong>jecture 17.2.1, is∫∫Ω A = ω = −c A · 2πi f.A(R)17.2 Refined rank zero c<strong>on</strong>jecture 159A(R)Thus Ω A is the least positive real number in L ω = −c A · 2πiL f , when the periodlattice is not rectangular, <strong>and</strong> twice the least positive real number when it is.17.2.7 The Special value L(A, 1)Propositi<strong>on</strong> 17.2.8. We have L(f, 1) ∈ R.Proof. With the right setup, this would follow immediately from the fact thatz ↦→ −z fixes the homology class {0, ∞}. However, we d<strong>on</strong>’t have such a setup, sowe give a direct proof.Just as in the proof of the functi<strong>on</strong>al equati<strong>on</strong> for Λ(f, s), use that f is aneigenvector for the Atkin-Lehner operator W N <strong>and</strong> (17.2.4) to write L(f, 1) as thesum of two integrals from i/ √ N to i∞. Then use the calculati<strong>on</strong>∫ i∞2πii/ √ N∞∑∞∑∫ i∞a n e 2πinz dz = −2πi a nn=1= −2πi= 2πin=1∞∑n=1∞∑n=1i/ √ Ne 2πinz dza n12πin e−2πn/√ Na n12πin e2πn/√ Nto see that L(f, 1) = L(f, 1).Remark 17.2.9. The BSD c<strong>on</strong>jecture implies that L(f, 1) ≥ 0, but this is unknown(it follows from GRH for L(f, s)).17.2.8 Rati<strong>on</strong>ality of L(A, 1)/Ω APropositi<strong>on</strong> 17.2.10. Suppose A = A f is an elliptic curve. Then L(A, 1)/Ω A ∈Q. More precisely, if n is the smallest multiple of {0, ∞} that lies in H 1 (X 0 (N), Z)<strong>and</strong> c A is the Manin c<strong>on</strong>stant of A, then 2n · c A · L(A, 1)/Ω A ∈ Z.Proof. By the Manin-Drinfeld theorem n{0, ∞} ∈ H 1 (X 0 (N), Z), s<strong>on</strong> · L(f, 1) = −n · 2πi ·∫ i∞Combining this with Propositi<strong>on</strong> 17.2.8, we see that0f(z)dz ∈ −2πi · L f = 1c AL ω .n · c A · L(f, 1) ∈ L + ω ,where L + ω is the submodule fixed by complex c<strong>on</strong>jugati<strong>on</strong> (i.e., L + ω = L∩R). Whenthe period lattice is not rectangular, Ω A generates L + ω , <strong>and</strong> when it is rectangular,12 Ω A generates. Thus n · c A · L(f, 1) is an integer multiple of 1 2 Ω A, which provesthe propositi<strong>on</strong>.


160 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecturePropositi<strong>on</strong> 17.2.10 can be more precise <strong>and</strong> generalized to abelian varietiesA = A ∨ fattached to newforms. One can also replace n by the order of the imageof (0) − (∞) in A(Q).Theorem 17.2.11 (Agashe, <strong>Stein</strong>). Suppose f ∈ S 2 (Γ 1 (N)) is a newform <strong>and</strong> letA = A ∨ fbe the abelian variety attached to f. Then we have the following equalityof rati<strong>on</strong>al numbers:|L(A, 1)| 1= · [Φ(H 1 (X 1 (N), Z)) + : Φ(T{0, ∞})].Ω A c ∞ · c ANote that L(A, 1) ∈ R, so |L(A, 1)| = ±L(A, 1), <strong>and</strong> <strong>on</strong>e expects, of course, thatL(A, 1) ≥ 0.For V <strong>and</strong> W lattices in an R-vector space M, the lattice index [V : W ] is bydefiniti<strong>on</strong> the absolute value of the determinant of a change of basis taking a basisfor V to a basis for W , or 0 if W has rank smaller than the dimensi<strong>on</strong> of M.Proof. Let ˜Ω A be the measure of A(R) with respect to a basis for S 2 (Γ 1 (N), Z)[I f ],where I f is the annihilator in T of f. Note that ˜Ω A · c A = Ω A , where c A is theManin index. Unwinding the definiti<strong>on</strong>s, we find that˜Ω A = c ∞ · [Hom(S 2 (Γ 1 (N), Z)[I f ], Z) : Φ(H 1 (X 0 (N), Z)) + ].For any ring R the pairing 4 4T R × S 2 (Γ 1 (N), R) → Rgiven by 〈T n , f〉 = a 1 (T n f) is perfect, so (T/I f ) ⊗ R ∼ = Hom(S 2 (Γ 1 (N), R)[I f ], R).Using this pairing, we may view Φ as a mapso thatΦ : H 1 (X 1 (N), Q) → (T/I f ) ⊗ C,˜Ω A = c ∞ · [T/I f : Φ(H 1 (X 0 (N), Z)) + ].Note that (T/I f ) ⊗ C is isomorphic as a ring to a product of copies of C, with<strong>on</strong>e copy corresp<strong>on</strong>ding to each Galois c<strong>on</strong>jugate f i of f. Let π i ∈ (T/I f ) ⊗ C bethe projector <strong>on</strong>to the subspace of (T/I f ) ⊗ C corresp<strong>on</strong>ding to f i . ThenΦ({0, ∞}) · π i = L(f i , 1) · π i .Since the π i form a basis for the complex vector space (T/I f ) ⊗ C, if we viewΦ({0, ∞}) as the operator “left-multiplicati<strong>on</strong> by Φ({0, ∞})”, thendet(Φ({0, ∞})) = ∏ iL(f i , 1) = L(A, 1),Letting H = H 1 (X 0 (N), Z), we have[Φ(H) + : Φ(T{0, ∞})] = [Φ(H) + : (T/I f ) · Φ({0, ∞})]= [Φ(H) + : T/I f ] · [T/I f : T/I f · Φ({0, ∞})]= c ∞˜Ω A· | det(Φ({0, ∞}))|= c ∞c AΩ A· |L(A, 1)|,4 reference!


17.3 General refined c<strong>on</strong>jecture 161which proves the theorem.Remark 17.2.12. Theorem 17.2.11 is false, in general, when A is a quotient ofJ 1 (N) not attached to a single Gal(Q/Q)-orbit of newforms. It could be modifiedto h<strong>and</strong>le this more general case, but the generalizati<strong>on</strong> seems not to has beenwritten down.17.3 General refined c<strong>on</strong>jectureC<strong>on</strong>jecture 17.3.1 (Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer). Let r = ord s=1 L(A, s). Thenr is the rank of A(Q), the group X(A) is finite, <strong>and</strong>L (r) (A, 1)r!= #X(A) · Ω A · Reg A · ∏p|N c p#A(Q) tor · #A ∨ (Q) tor.17.4 The C<strong>on</strong>jecture for n<strong>on</strong>-modular abelian varietiesC<strong>on</strong>jecture 17.3.1 can be extended to general abelian varieties over global fields.Here we discuss <strong>on</strong>ly the case of a general abelian variety A over Q. We follow thediscussi<strong>on</strong> in [Lan91, 95-94] (Lang, Number Theory III) 5 , which describes Gross’s 5formulati<strong>on</strong> of the c<strong>on</strong>jecture for abelian varieties over number fields, <strong>and</strong> to whichwe refer the reader for more details.For each prime number l, the l-adic Tate module associated to A isTa l (A) = lim ←−nA(Q)[l n ].Since A(Q)[l n ] ∼ = (Z/l n Z) 2 dim(A) , we see that Ta l (A) is free of rank 2 dim(A) as aZ l -module. Also, since the group structure <strong>on</strong> A is defined over Q, Ta l (A) comesequipped with an acti<strong>on</strong> of Gal(Q/Q):ρ A,l : Gal(Q/Q) → Aut(Ta l (A)) ≈ GL 2d (Z l ).Suppose p is a prime <strong>and</strong> let l ≠ p be another prime. Fix any embeddingQ ↩→ Q p , <strong>and</strong> notice that restricti<strong>on</strong> defines a homorphism r : Gal(Q p /Q p ) →Gal(Q/Q). Let G p ⊂ Gal(Q/Q) be the image of r. The inertia group I p ⊂ G p isthe kernel of the natural surjective reducti<strong>on</strong> map, <strong>and</strong> we have an exact sequence0 → I p → Gal(Q p /Q p ) → Gal(F p /F p ) → 0.The Galois group Gal(F p /F p ) is isomorphic to Ẑ with can<strong>on</strong>ical generator x ↦→ xp .Lifting this generator, we obtain an element Frob p ∈ Gal(Q p /Q p ), which is welldefinedup to an element of I p . Viewed as an element of G p ⊂ Gal(Q/Q), theelement Frob p is well-defined up I p <strong>and</strong> our choice of embedding Q ↩→ Q p . One5 Remove paren.


162 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecturecan show that this implies that Frob p ∈ Gal(Q/Q) is well-defined up to I p <strong>and</strong>c<strong>on</strong>jugati<strong>on</strong> by an element of Gal(Q/Q).For a G p -module M, letM Ip = {x ∈ M : σ(x) = x all σ ∈ I p }.Because I p acts trivially <strong>on</strong> M Ip , the acti<strong>on</strong> of the element Frob p ∈ Gal(Q/Q)<strong>on</strong> M Ip is well-defined up to c<strong>on</strong>jugati<strong>on</strong> (I p acts trivially, so the “up to I p ”obstructi<strong>on</strong> vanishes). Thus the characteristic polynomial of Frob p <strong>on</strong> M Ip is welldefined,which is why L p (A, s) is well-defined. The local L-factor of L(A, s) at pis1L p (A, s) =det ( I − p −s Frob −1).p | Hom Zl (Ta l (A), Z l ) IpDefiniti<strong>on</strong> 17.4.1. L(A, s) = ∏ all pL p (A, s)For all but finitely many primes Ta l (A) Ip = Ta l (A). For example, if A = A f isattached to a newform f = ∑ a n q n of level N <strong>and</strong> p ∤ l·N, then Ta l (A) Ip = Ta l (A).In this case, the Eichler-Shimura relati<strong>on</strong> implies that L p (A, s) equals ∏ L p (f i , s),where the f i = ∑ a n,i q n are the Galois c<strong>on</strong>jugates of f <strong>and</strong> L p (f i , s) = (1 − a p,i ·p −s + p 1−2s ) −1 . The point is that Eichler-Shimura can be used to show that thecharacteristic polynomial of Frob p is ∏ dim(A)i=1(X 2 − a p,i X + p 1−2s ).Theorem 17.4.2. L(A f , s) = ∏ di=1 L(f i, s).17.5 Visibility of Shafarevich-Tate groupsLet K be a number field. Suppose0 → A → B → C → 0is an exact sequence of abelian varieties over K. (Thus each of A, B, <strong>and</strong> C is acomplete group variety over K, whose group is automatically abelian.) Then thereis a corresp<strong>on</strong>ding l<strong>on</strong>g exact sequence of cohomology for the group Gal(Q/K):0 → A(K) → B(K) → C(K) δ −→ H 1 (K, A) → H 1 (K, B) → H 1 (K, C) → · · ·The study of the Mordell-Weil group C(K) = H 0 (K, C) is popular in arithmeticgeometry. For example, the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture (BSD c<strong>on</strong>jecture),which is <strong>on</strong>e of the milli<strong>on</strong> dollar Clay Math Problems, asserts that thedimensi<strong>on</strong> of C(K) ⊗ Q equals the ordering vanishing of L(C, s) at s = 1.The group H 1 (K, A) is also of interest in c<strong>on</strong>necti<strong>on</strong> with the BSD c<strong>on</strong>jecture,because it c<strong>on</strong>tains the Shafarevich-Tate group()X(A) = X(A/K) = KerH 1 (K, A) → ⊕ vH 1 (K v , A)⊂ H 1 (K, A),where the sum is over all places v of K (e.g., when K = Q, the fields K v are Q pfor all prime numbers p <strong>and</strong> Q ∞ = R).


17.5 Visibility of Shafarevich-Tate groups 163The group A(K) is fundamentally different than H 1 (K, C). The Mordell-Weilgroup A(K) is finitely generated, whereas the first Galois cohomology H 1 (K, C) isfar from being finitely generated—in fact, every element has finite order <strong>and</strong> thereare infinitely many elements of any given order.This talk is about “dimensi<strong>on</strong> shifting”, i.e., relating informati<strong>on</strong> about H 0 (K, C)to informati<strong>on</strong> about H 1 (K, A).17.5.1 Definiti<strong>on</strong>sElements of H 0 (K, C) are simply points, i.e., elements of C(K), so they are relativelyeasy to “visualize”. In c<strong>on</strong>trast, elements of H 1 (K, A) are Galois cohomologyclasses, i.e., equivalence classes of set-theoretic (c<strong>on</strong>tinuous) maps f : Gal(Q/K) →A(Q) such that f(στ) = f(σ) + σf(τ). Two maps are equivalent if their differenceis a map of the form σ ↦→ σ(P ) − P for some fixed P ∈ A(Q). From this point ofview H 1 is more mysterious than H 0 .There is an alternative way to view elements of H 1 (K, A). The WC group of Ais the group of isomorphism classes of principal homogeneous spaces for A, wherea principal homogeneous space is a variety X <strong>and</strong> a map A × X → X that satisfiesthe same axioms as those for a simply transitive group acti<strong>on</strong>. Thus X is a twist asvariety of A, but X(K) = ∅, unless X ≈ A. Also, the n<strong>on</strong>trivial elements of X(A)corresp<strong>on</strong>d to the classes in WC that have a K v -rati<strong>on</strong>al point for all places v, butno K-rati<strong>on</strong>al point.Mazur introduced the following definiti<strong>on</strong> in order to help unify diverse c<strong>on</strong>structi<strong>on</strong>sof principal homogeneous spaces:Definiti<strong>on</strong> 17.5.1 (Visible). The visible subgroup of H 1 (K, A) in B isVis B H 1 (K, A) = Ker(H 1 (K, A) → H 1 (K, B))= Coker(B(K) → C(K)).Remark 17.5.2. Note that Vis B H 1 (K, A) does depend <strong>on</strong> the embedding of Ainto B. For example, suppose B = B 1 × A. Then there could be n<strong>on</strong>zero visibleelements if A is embedding into the first factor, but there will be no n<strong>on</strong>zerovisible elements if A is embedded into the sec<strong>on</strong>d factor. Here we are using thatH 1 (K, B 1 × A) = H 1 (K, B 1 ) ⊕ H 1 (K, A).The c<strong>on</strong>necti<strong>on</strong> with the WC group of A is as follows. Suppose0 → A f −→ B g −→ C → 0is an exact sequence of abelian varieties <strong>and</strong> that c ∈ H 1 (K, A) is visible in B.Thus there exists x ∈ C(K) such that δ(x) = c, where δ : C(K) → H 1 (K, A) isthe c<strong>on</strong>necting homomorphism. Then X = π −1 (x) ⊂ B is a translate of A in B, sothe group law <strong>on</strong> B gives X the structure of principal homogeneous space for A,<strong>and</strong> <strong>on</strong>e can show that the class of X in the WC group of A corresp<strong>on</strong>ds to c.Lemma 17.5.3. The group Vis B H 1 (K, A) is finite.Proof. Since Vis B H 1 (K, A) is a homomorphic image of the finitely generated groupC(K), it is also finitely generated. On the other h<strong>and</strong>, it is a subgroup of H 1 (K, A),so it is a torsi<strong>on</strong> group. The lemma follows since a finitely generated torsi<strong>on</strong> abeliangroup is finite.


164 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture17.5.2 Every element of H 1 (K, A) is visible somewherePropositi<strong>on</strong> 17.5.4. Let c ∈ H 1 (K, A). Then there exists an abelian variety B =B c <strong>and</strong> an embedding A ↩→ B such that c is visible in B.Proof. By definiti<strong>on</strong> of Galois cohomology, there is a finite extensi<strong>on</strong> L of K suchthat res L (c) = 0. Thus c maps to 0 in H 1 (L, A L ). By a slight generalizati<strong>on</strong> of theShapiro Lemma from group cohomology (which can be proved by dimensi<strong>on</strong> shifting;see, e.g., 6 Atiyah-Wall in Cassels-Frohlich), there is a can<strong>on</strong>ical isomorphism 6H 1 (L, A L ) ∼ = H 1 (K, Res L/K (A L )) = H 1 (K, B),where B = Res L/K (A L ) is the Weil restricti<strong>on</strong> of scalars of A L back down to K.The restricti<strong>on</strong> of scalars B is an abelian variety of dimensi<strong>on</strong> [L : K] · dim A thatis characterized by the existence of functorial isomorphismsMor K (S, B) ∼ = Mor L (S L , A L ),for any K-scheme S, i.e., B(S) = A L (S L ). In particular, setting S = A we findthat the identity map A L → A L corresp<strong>on</strong>ds to an injecti<strong>on</strong> A ↩→ B. Moreover,c ↦→ res L (c) = 0 ∈ H 1 (K, B).Remark 17.5.5. The abelian variety B in Propositi<strong>on</strong> 17.5.4 is a twist of a powerof A.17.5.3 Visibility in the c<strong>on</strong>text of modularityUsually we focus <strong>on</strong> visibility of elements in X(A). There are a number of otherresults about visibility in various special cases, <strong>and</strong> large tables of examples in thec<strong>on</strong>text of elliptic curves <strong>and</strong> modular abelian varieties. There are also interestingmodularity questi<strong>on</strong>s <strong>and</strong> c<strong>on</strong>jectures in this c<strong>on</strong>text.Motivated by the desire to underst<strong>and</strong> the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecturemore explicitly, I developed 7 (with significant input from Agashe, Crem<strong>on</strong>a, Mazur, 7<strong>and</strong> Merel) computati<strong>on</strong>al techniques for unc<strong>on</strong>diti<strong>on</strong>ally c<strong>on</strong>structing Shafarevich-Tate groups of modular abelian varieties A ⊂ J 0 (N) (or J 1 (N)). For example, ifA ⊂ J 0 (389) is the 20-dimensi<strong>on</strong>al simple factor, thenZ/5Z × Z/5Z ⊂ X(A),as predicted by the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture. See [CM00] for exampleswhen dim A = 1. We will spend the rest of this secti<strong>on</strong> discussing the examplesof [AS05, AS02] in more detail.Tables 17.5.1–17.5.4 illustrate the main computati<strong>on</strong>al results of [AS05]. Thesetables were made by gathering data about certain arithmetic invariants of the19608 abelian varieties A f of level ≤ 2333. Of these, exactly 10360 have satisfyL(A f , 1) ≠ 0, <strong>and</strong> for these with L(A f , 1) ≠ 0, we compute a divisor <strong>and</strong> multipleof the c<strong>on</strong>jectural order of X(A f ). We find that there are at least 168 such thatthe Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture implies that X(A f ) is divisible by an6 Fix7 change.


17.5 Visibility of Shafarevich-Tate groups 165odd prime, <strong>and</strong> we prove for 37 of these that the odd part of the c<strong>on</strong>jectural orderof X(A f ) really divides #X(A f ) by c<strong>on</strong>structing n<strong>on</strong>trivial elements of X(A f )using visibility.The meaning of the tables is as follows. The first column lists a level N <strong>and</strong> anisogeny class, which uniquely specifies an abelian variety A = A f ⊂ J 0 (N). Thenth isogeny class is given by the nth letter of the alphabet. We will not discuss theordering further, except to note that usually, the dimensi<strong>on</strong> of A, which is given inthe sec<strong>on</strong>d column, is enough to determine A. When L(A, 1) ≠ 0, C<strong>on</strong>jecture 17.2.1predicts that#X(A) ? =L(A, 1)Ω A· #A(Q) tor · #A ∨ (Q)∏torp|N c .pWe view the quotient L(A, 1)/Ω A , which is a rati<strong>on</strong>al number, as a single quantity.We can compute multiples <strong>and</strong> divisors of every quantity appearing in theright h<strong>and</strong> side of this equati<strong>on</strong>, <strong>and</strong> this yields columns three <strong>and</strong> four, which area divisor S l <strong>and</strong> a multiple S u of the c<strong>on</strong>jectural order of X(A) (when S u = S l , weput an equals sign in the S u column). Column five, which is labeled odd deg(ϕ A ),c<strong>on</strong>tains the odd part of the degree of the polarizati<strong>on</strong>ϕ A : (A ↩→ J 0 (N) ∼ = J 0 (N) ∨ → A ∨ ). (17.5.1)The sec<strong>on</strong>d set of columns, columns six <strong>and</strong> seven, c<strong>on</strong>tain an abelian varietyB = B g ⊂ J 0 (N) such that #(A ∩ B) is divisible by an odd prime divisor of S l<strong>and</strong> L(B, 1) = 0. When dim(B) = 1, we have verified that B is an elliptic curveof rank 2. The eighth column A ∩ B c<strong>on</strong>tains the group structure of A ∩ B, wheree.g., [2 2 302 2 ] is shorth<strong>and</strong> notati<strong>on</strong> for (Z/2Z) 2 ⊕ (Z/302Z) 2 . The final column,labeled Vis, c<strong>on</strong>tains a divisor of the order of Vis A+B (X(A)).The following propositi<strong>on</strong> explains the significance of the odd deg(ϕ A ) column.Propositi<strong>on</strong> 17.5.6. If p ∤ deg(ϕ A ), then p ∤ Vis J0(N)(H 1 (Q, A)).Proof. There exists a complementary morphism ˆϕ A , such that ϕ A ◦ ˆϕ A = ˆϕ A ◦ϕ A =[n], where n is the degree of ϕ A . If c ∈ H 1 (Q, A) maps to 0 in H 1 (Q, J 0 (N)), thenit also maps to 0 under the following compositi<strong>on</strong>H 1 (Q, A) → H 1 (Q, J 0 (N)) → H 1 (Q, A ∨ ) ˆϕ A−−→ H 1 (Q, A).Since this compositi<strong>on</strong> is [n], it follows that c ∈ H 1 (Q, A)[n], which proves thepropositi<strong>on</strong>.Remark 17.5.7. Since the degree of ϕ A does not change if we extend scalars toa number field K, the subgroup of H 1 (K, A) visible in J 0 (N) K , still has orderdivisible <strong>on</strong>ly by primes that divide deg(ϕ A ).The following theorem explains the significance of the B column, <strong>and</strong> how it wasused to deduce the Vis column.Theorem 17.5.8. Suppose A <strong>and</strong> B are abelian subvarieties of an abelian varietyC over Q <strong>and</strong> that A(Q) ∩ B(Q) is finite. Assume also that A(Q) is finite.Let N be an integer divisible by the residue characteristics of primes of bad reducti<strong>on</strong>for C (e.g., N could be the c<strong>on</strong>ductor of C). Suppose p is a prime suchthatp ∤ 2 · N · #((A + B)/B)(Q) tor · #B(Q) tor · ∏c A,l · c B,l ,l


166 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecturewhere c A,l = #Φ A,l (F l ) is the Tamagawa number of A at l (<strong>and</strong> similarly for B).Suppose furthermore that B(Q)[p] ⊂ A(Q) as subgroups of C(Q). Then there is anatural injecti<strong>on</strong>B(Q)/pB(Q) ↩→ Vis C (X(A)).A complete proof of a generalizati<strong>on</strong> of this theorem can be found in [AS02].Sketch of Proof. Without loss of generality, we may assume C = A + B. Ourhypotheses yield a diagram0 B[p] B p B00 A C B ′ 0,where B ′ = C/A. Taking Gal(Q/Q)-cohomology, we obtain the following diagram:0 B(Q)pB(Q)B(Q)/pB(Q)00 C(Q)/A(Q) B ′ (Q) Vis C (H 1 (Q, A)) 0.The snake lemma <strong>and</strong> our hypothesis that p ∤ #(C/B)(Q) tor imply that the rightmostvertical map is an injecti<strong>on</strong>i : B(Q)/pB(Q) ↩→ Vis C (H 1 (Q, A)), (17.5.2)since C(A)/(A(Q) + B(Q)) is a sub-quotient of (C ′ /B)(Q).We show that the image of (17.5.2) lies in X(A) using a local analysis at eachprime, which we now sketch. At the archimedian prime, no work is needed sincep ≠ 2. At n<strong>on</strong>-archimedian primes l, <strong>on</strong>e uses facts about Nér<strong>on</strong> models (when l =p) <strong>and</strong> our hypothesis that p does not divide the Tamagawa numbers of B (whenl ≠ p) to show that if x ∈ B(Q)/pB(Q), then the corresp<strong>on</strong>ding cohomology classres l (i(x)) ∈ H 1 (Q l , A) splits over the maximal unramified extensi<strong>on</strong>. However,H 1 (Q url /Q l , A) ∼ = H 1 (F l /F l , Φ A,l (F l )),<strong>and</strong> the right h<strong>and</strong> cohomology group has order c A,l , which is coprime to p.Thus res l (i(x)) = 0, which completes the sketch of the proof.17.5.4 Future directi<strong>on</strong>sThe data in Tables 17.5.1-17.5.4 could be investigated further.It should be possible to replace the hypothesis that B[p] ⊂ A, with the weakerhypothesis that B[m] ⊂ A, where m is a maximal ideal of the <strong>Hecke</strong> algebra T. Forexample, this improvement would help <strong>on</strong>e to show that 5 2 divides the order of theShafarevich-Tate group of 1041E. Note that for this example, we <strong>on</strong>ly know thatL(B, 1) = 0, not that B(Q) has positive rank (as predicted by C<strong>on</strong>jecture 17.1.5),which is another obstructi<strong>on</strong>.


17.6 Kolyvagin’s Euler system of Heegner points 167One can c<strong>on</strong>sider visibility at a higher level. For example, there are elementsof order 3 in the Shafarevich-Tate group of 551H that are not visible in J 0 (551),but these elements are visible in J 0 (2 · 551), according to the computati<strong>on</strong>s in[Ste04] (Studying the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture for <strong>Modular</strong> AbelianVarieties Using MAGMA). 8 8C<strong>on</strong>jecture 17.5.9 (<strong>Stein</strong>). Suppose c ∈ X(A f ), where A f ⊂ J 0 (N). Then thereexists M such that c is visible in J 0 (NM). In other words, every element of X(A f )is “modular”.17.6 Kolyvagin’s Euler system of Heegner pointsIn this secti<strong>on</strong> we will briefly sketch some of the key ideas behind Kolyvagin’sproof of Theorem 17.1.9. We will follow [Rub89] very closely. Two other excellentreferences are [Gro91] <strong>and</strong> [McC91]. Kolyvagin’s original papers 9 <strong>on</strong> this theorem 9are not so easy to read because they are all translati<strong>on</strong>s from Russian, but n<strong>on</strong>eof the three papers cited above give a complete proof of his theorem.We <strong>on</strong>ly sketch a proof of the following special case of Kolyvagin’s theorem.Theorem 17.6.1. Suppose E is an elliptic curve over Q such that L(E, 1) ≠ 0.Then E(Q) is finite <strong>and</strong> X(E)[p] = 0 for almost all primes p.The strategy of the proof is as follows. Applying Galois cohomology to themultiplicati<strong>on</strong> by p sequencewe obtain a short exact sequence0 → E[p] → E p −→ E → 0,0 → E(Q)/pE(Q) → H 1 (Q, E[p]) → H 1 (Q, E)[p] → 0.The inverse image of X(E)[p] in H 1 (Q, E[p]) is called the Selmer group of E at p,<strong>and</strong> we will denote it by Sel (p) (E). We have an exact sequence0 → E(Q)/pE(Q) → Sel (p) (E) → X(E)[p] → 0.Because E(Q) is finitely generated, to prove Theorem 17.6.1 it suffices to provethat Sel (p) (E) = 0 for all but finitely many primes p. We do this by using complexmultiplicati<strong>on</strong> elliptic curves to c<strong>on</strong>struct Galois cohomology classes with preciselocal behavior.It is usually easier to say something about the Galois cohomology of a moduleover a local field than over Q, so we introduce some more notati<strong>on</strong> to help formalizethis. For any prime p <strong>and</strong> place v we have a commutative diagram0 E(Q)/pE(Q) H 1 (Q, E[p])H 1 (Q, E)[p]00 E(Q v )/pE(Q v ) H 1 (Q v , E[p]) H 1 (Q v , E)[p] 0res vres v8 remove9 Add references.


168 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jectureTABLE 17.5.1. Visibility of N<strong>on</strong>trivial Odd Parts of Shafarevich-Tate GroupsA dim S l S u odd deg(ϕ A ) B dim A ∩ B Vis389E∗ 20 5 2 = 5 389A 1 [20 2 ] 5 2433D∗ 16 7 2 = 7·111 433A 1 [14 2 ] 7 2446F∗ 8 11 2 = 11·359353 446B 1 [11 2 ] 11 2551H 18 3 2 = 169 NONE563E∗ 31 13 2 = 13 563A 1 [26 2 ] 13 2571D∗ 2 3 2 = 3 2·127 571B 1 [3 2 ] 3 2655D∗ 13 3 4 = 3 2·9799079 655A 1 [36 2 ] 3 4681B 1 3 2 = 3·125 681C 1 [3 2 ] −707G∗ 15 13 2 = 13·800077 707A 1 [13 2 ] 13 2709C∗ 30 11 2 = 11 709A 1 [22 2 ] 11 2718F∗ 7 7 2 = 7·5371523 718B 1 [7 2 ] 7 2767F 23 3 2 = 1 NONE794G 12 11 2 = 11·34986189 794A 1 [11 2 ] −817E 15 7 2 = 7·79 817A 1 [7 2 ] −959D 24 3 2 = 583673 NONE997H∗ 42 3 4 = 3 2 997B 1 [12 2 ] 3 2997C 1 [24 2 ] 3 21001F 3 3 2 = 3 2·1269 1001C 1 [3 2 ] −91A 1 [3 2 ] −1001L 7 7 2 = 7·2029789 1001C 1 [7 2 ] −1041E 4 5 2 = 5 2·13589 1041B 2 [5 2 ] −1041J 13 5 4 = 5 3·21120929983 1041B 2 [5 4 ] −1058D 1 5 2 = 5·483 1058C 1 [5 2 ] −1061D 46 151 2 = 151·10919 1061B 2 [2 2 302 2 ] −1070M 7 3·5 2 3 2·52 3·5·1720261 1070A 1 [15 2 ] −1077J 15 3 4 = 3 2·1227767047943 1077A 1 [9 2 ] −1091C 62 7 2 = 1 NONE1094F∗ 13 11 2 = 11 2·172446773 1094A 1 [11 2 ] 11 21102K 4 3 2 = 3 2·31009 1102A 1 [3 2 ] −1126F∗ 11 11 2 = 11·13990352759 1126A 1 [11 2 ] 11 21137C 14 3 4 = 3 2·64082807 1137A 1 [9 2 ] −1141I 22 7 2 = 7·528921 1141A 1 [14 2 ] −1147H 23 5 2 = 5·729 1147A 1 [10 2 ] −1171D∗ 53 11 2 = 11·81 1171A 1 [44 2 ] 11 21246B 1 5 2 = 5·81 1246C 1 [5 2 ] −1247D 32 3 2 = 3 2·2399 43A 1 [36 2 ] −1283C 62 5 2 = 5·2419 NONE1337E 33 3 2 = 71 NONE1339G 30 3 2 = 5776049 NONE1355E 28 3 3 2 3 2·2224523985405 NONE1363F 25 31 2 = 31·34889 1363B 2 [2 2 62 2 ] −1429B 64 5 2 = 1 NONE1443G 5 7 2 = 7 2·18525 1443C 1 [7 1 14 1 ] −1446N 7 3 2 = 3·17459029 1446A 1 [12 2 ] −


17.6 Kolyvagin’s Euler system of Heegner points 169TABLE 17.5.2. Visibility of N<strong>on</strong>trivial Odd Parts of Shafarevich-Tate GroupsA dim S l S u odd deg(ϕ A ) B dim A ∩ B Vis1466H∗ 23 13 2 = 13·25631993723 1466B 1 [26 2 ] 13 21477C∗ 24 13 2 = 13·57037637 1477A 1 [13 2 ] 13 21481C 71 13 2 = 70825 NONE1483D∗ 67 3 2·52 = 3·5 1483A 1 [60 2 ] 3 2·521513F 31 3 3 4 3·759709 NONE1529D 36 5 2 = 535641763 NONE1531D 73 3 3 2 3 1531A 1 [48 2 ] −1534J 6 3 3 2 3 2·635931 1534B 1 [6 2 ] −1551G 13 3 2 = 3·110659885 141A 1 [15 2 ] −1559B 90 11 2 = 1 NONE1567D 69 7 2·412 = 7·41 1567B 3 [4 4 1148 2 ] −1570J∗ 6 11 2 = 11·228651397 1570B 1 [11 2 ] 11 21577E 36 3 3 2 3 2·15 83A 1 [6 2 ] −1589D 35 3 2 = 6005292627343 NONE1591F∗ 35 31 2 = 31·2401 1591A 1 [31 2 ] 31 21594J 17 3 2 = 3·259338050025131 1594A 1 [12 2 ] −1613D∗ 75 5 2 = 5·19 1613A 1 [20 2 ] 5 21615J 13 3 4 = 3 2·13317421 1615A 1 [9 1 18 1 ] −1621C∗ 70 17 2 = 17 1621A 1 [34 2 ] 17 21627C∗ 73 3 4 = 3 2 1627A 1 [36 2 ] 3 41631C 37 5 2 = 6354841131 NONE1633D 27 3 6·72 = 3 5·7·31375 1633A 3 [6 4 42 2 ] −1634K 12 3 2 = 3·3311565989 817A 1 [3 2 ] −1639G∗ 34 17 2 = 17·82355 1639B 1 [34 2 ] 17 21641J∗ 24 23 2 = 23·1491344147471 1641B 1 [23 2 ] 23 21642D∗ 14 7 2 = 7·123398360851 1642A 1 [7 2 ] 7 21662K 7 11 2 = 11·16610917393 1662A 1 [11 2 ] −1664K 1 5 2 = 5·7 1664N 1 [5 2 ] −1679C 45 11 2 = 6489 NONE1689E 28 3 2 = 3·172707180029157365 563A 1 [3 2 ] −1693C 72 1301 2 = 1301 1693A 3 [2 4 2602 2 ] −1717H∗ 34 13 2 = 13·345 1717B 1 [26 2 ] 13 21727E 39 3 2 = 118242943 NONE1739F 43 659 2 = 659·151291281 1739C 2 [2 2 1318 2 ] −1745K 33 5 2 = 5·1971380677489 1745D 1 [20 2 ] −1751C 45 5 2 = 5·707 103A 2 [505 2 ] −1781D 44 3 2 = 61541 NONE1793G∗ 36 23 2 = 23·8846589 1793B 1 [23 2 ] 23 21799D 44 5 2 = 201449 NONE1811D 98 31 2 = 1 NONE1829E 44 13 2 = 3595 NONE1843F 40 3 2 = 8389 NONE1847B 98 3 6 = 1 NONE1871C 98 19 2 = 14699 NONE


170 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jectureTABLE 17.5.3. Visibility of N<strong>on</strong>trivial Odd Parts of Shafarevich-Tate GroupsA dim S l S u odd deg(ϕ A ) B dim A ∩ B Vis1877B 86 7 2 = 1 NONE1887J 12 5 2 = 5·10825598693 1887A 1 [20 2 ] −1891H 40 7 4 = 7 2·44082137 1891C 2 [4 2 196 2 ] −1907D∗ 90 7 2 = 7·165 1907A 1 [56 2 ] 7 21909D∗ 38 3 4 = 3 2·9317 1909A 1 [18 2 ] 3 41913B∗ 1 3 2 = 3·103 1913A 1 [3 2 ] 3 21913E 84 5 4·612 = 5 2·61·103 1913A 1 [10 2 ] −1913C 2 [2 2 610 2 ] −1919D 52 23 2 = 675 NONE1927E 45 3 2 3 4 52667 NONE1933C 83 3 2·7 3 2·72 3·7 1933A 1 [42 2 ] 3 21943E 46 13 2 = 62931125 NONE1945E∗ 34 3 2 = 3·571255479184807 389A 1 [3 2 ] 3 21957E∗ 37 7 2·112 = 7·11·3481 1957A 1 [22 2 ] 11 21957B 1 [14 2 ] 7 21979C 104 19 2 = 55 NONE1991C 49 7 2 = 1634403663 NONE1994D 26 3 3 2 3 2·46197281414642501 997B 1 [3 2 ] −1997C 93 17 2 = 1 NONE2001L 11 3 2 = 3 2·44513447 NONE2006E 1 3 2 = 3·805 2006D 1 [3 2 ] −2014L 12 3 2 = 3 2·126381129003 106A 1 [9 2 ] −2021E 50 5 6 = 5 2·729 2021A 1 [100 2 ] 5 42027C∗ 94 29 2 = 29 2027A 1 [58 2 ] 29 22029C 90 5 2·2692 = 5·269 2029A 2 [2 2 2690 2 ] −2031H∗ 36 11 2 = 11·1014875952355 2031C 1 [44 2 ] 11 22035K 16 11 2 = 11·218702421 2035C 1 [11 1 22 1 ] −2038F 25 5 5 2 5 2·92198576587 2038A 1 [20 2 ] −1019B 1 [5 2 ] −2039F 99 3 4·52 = 13741381043009 NONE2041C 43 3 4 = 61889617 NONE2045I 39 3 4 = 3 3·3123399893 2045C 1 [18 2 ] −409A 13 [9370199679 2 ] −2049D 31 3 2 = 29174705448000469937 NONE2051D 45 7 2 = 7·674652424406369 2051A 1 [56 2 ] −2059E 45 5·7 2 5 2·72 5 2·7·167359757 2059A 1 [70 2 ] −2063C 106 13 2 = 8479 NONE2071F 48 13 2 = 36348745 NONE2099B 106 3 2 = 1 NONE2101F 46 5 2 = 5·11521429 191A 2 [155 2 ] −2103E 37 3 2·112 = 3 2·11·874412923071571792611 2103B 1 [33 2 ] 11 22111B 112 211 2 = 1 NONE2113B 91 7 2 = 1 NONE2117E∗ 45 19 2 = 19·1078389 2117A 1 [38 2 ] 19 2


17.6 Kolyvagin’s Euler system of Heegner points 171TABLE 17.5.4. Visibility of N<strong>on</strong>trivial Odd Parts of Shafarevich-Tate GroupsA dim S l S u odd deg(ϕ A ) B dim A ∩ B Vis2119C 48 7 2 = 89746579 NONE2127D 34 3 2 = 3·18740561792121901 709A 1 [3 2 ] −2129B 102 3 2 = 1 NONE2130Y 4 7 2 = 7·83927 2130B 1 [14 2 ] −2131B 101 17 2 = 1 NONE2134J 11 3 2 = 1710248025389 NONE2146J 10 7 2 = 7·1672443 2146A 1 [7 2 ] −2159E 57 13 2 = 31154538351 NONE2159D 56 3 4 = 233801 NONE2161C 98 23 2 = 1 NONE2162H 14 3 3 2 3·6578391763 NONE2171E 54 13 2 = 271 NONE2173H 44 199 2 = 199·3581 2173D 2 [398 2 ] −2173F 43 19 2 3 2·192 3 2·19·229341 2173A 1 [38 2 ] 19 22174F 31 5 2 = 5·21555702093188316107 NONE2181E 27 7 2 = 7·7217996450474835 2181A 1 [28 2 ] −2193K 17 3 2 = 3·15096035814223 129A 1 [21 2 ] −2199C 36 7 2 = 7 2·13033437060276603 NONE2213C 101 3 4 = 19 NONE2215F 46 13 2 = 13·1182141633 2215A 1 [52 2 ] −2224R 11 79 2 = 79 2224G 2 [79 2 ] −2227E 51 11 2 = 259 NONE2231D 60 47 2 = 91109 NONE2239B 110 11 4 = 1 NONE2251E∗ 99 37 2 = 37 2251A 1 [74 2 ] 37 22253C∗ 27 13 2 = 13·14987929400988647 2253A 1 [26 2 ] 13 22255J 23 7 2 = 15666366543129 NONE2257H 46 3 6·292 = 3 3·29·175 2257A 1 [9 2 ] −2257D 2 [2 2 174 2 ] −2264J 22 73 2 = 73 2264B 2 [146 2 ] −2265U 14 7 2 = 7 2·73023816368925 2265B 1 [7 2 ] −2271I∗ 43 23 2 = 23·392918345997771783 2271C 1 [46 2 ] 23 22273C 105 7 2 = 7 2 NONE2279D 61 13 2 = 96991 NONE2279C 58 5 2 = 1777847 NONE2285E 45 151 2 = 151·138908751161 2285A 2 [2 2 302 2 ] −2287B 109 71 2 = 1 NONE2291C 52 3 2 = 427943 NONE2293C 96 479 2 = 479 2293A 2 [2 2 958 2 ] −2294F 15 3 2 = 3·6289390462793 1147A 1 [3 2 ] −2311B 110 5 2 = 1 NONE2315I 51 3 2 = 3·4475437589723 463A 16 [13426312769169 2 ] −2333C 101 83341 2 = 83341 2333A 4 [2 6 166682 2 ] −


172 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jectureObserve thatSel (p) (E) = ⋂ vres −1v ( image E(Q v )).For s ∈ Sel (p) (E), let s v denote the inverse image of res v (s) in E(Q v )/pE(Q v ).Our first propositi<strong>on</strong> asserts that if we can c<strong>on</strong>struct a cohomology class withcertain properties, then that cohomology class forces Sel (p) (E) to be locally trivial.Propositi<strong>on</strong> 17.6.2. Suppose l is a prime such that E(Q l )[p] ∼ = Z/pZ <strong>and</strong> supposethere is a cohomology class c l ∈ H 1 (Q, E)[p] such that res v (c l ) ≠ 0 if <strong>and</strong><strong>on</strong>ly if v = l. Then res l (Sel (p) (E)) = 0 ⊂ H 1 (Q l , E[p]).Sketch of proof. Suppose s ∈ Sel (p) (E). Using the local Tate pairing we obtain, foreach v, a n<strong>on</strong>degenerate pairing〈 , 〉 v : E(Q v )/pE(Q v ) × H 1 (Q v , E)[p] → Z/pZ.Unwinding the definiti<strong>on</strong>s, <strong>and</strong> using the fact that the sum of the local invariantsof an element of Br(Q) is trivial, we see that our hypothesis that all but <strong>on</strong>erestricti<strong>on</strong> of c l is trivial implies that 〈s l , res l (c l )〉 = 0. Since res l (c l ) ≠ 0, thisimplies that s l = 0.Let τ ∈ Gal(Q/Q) denote a fixed choice of complex c<strong>on</strong>jugati<strong>on</strong>, which is definedby fixing an embedding of Q into C. For any module M <strong>on</strong> which τ acts, letM + = {x ∈ M : τ(x) = x} <strong>and</strong> M − = {x ∈ M : τ(x) = −x}.If E is defined by y 2 = x 3 + ax + b <strong>and</strong> K = Q( √ d), then the twist of E by K isthe elliptic curve dy 2 = x 3 + ax + b.Theorem 17.6.3. There exists infinitely many quadratic imaginary fields K inwhich all primes dividing the c<strong>on</strong>ductor N of E split, <strong>and</strong> such that L ′ (E K , 1) ≠ 0,where E K is the twist of E over K.Fix a K as in the theorem such that OK ∗ = {±1}, i.e., so that the discriminant Dof K is not −3 or −4.Lemma 17.6.4. Suppose l ∤ pDN <strong>and</strong> Frob l (K(E[p])/Q) = [τ]. Then l is inertin K, we have a l ≡ l + 1 ≡ 0 (mod p), <strong>and</strong> the four groups E(Q l )[p], E(F l )[p],E(K l )[p] − , <strong>and</strong> E(F l 2)[p] − are all cyclic of order p.Proof. The first asserti<strong>on</strong> is true because τ| K has order 2. The characteristic polynomialof Frob l (K(E[p])/Q) <strong>on</strong> E[p] is x 2 − a l x + l, <strong>and</strong> the characteristic polynomialof τ <strong>on</strong> E[p] is x 2 − 1, which proves the sec<strong>on</strong>d asserti<strong>on</strong>. For the third, wehave(Z/pZ) 2 ∼ = E(Q)[p] ∼ = E(Kl )[p] ∼ = E(Q l )[p] ⊕ E(K l )[p] − ,<strong>and</strong> each summ<strong>and</strong> <strong>on</strong> the right must be n<strong>on</strong>zero since τ ≠ 1 <strong>on</strong> E[p].For the rest of the argument, we assume that p is odd <strong>and</strong> does not divide theorder of the class group of K. We will now use the theory of complex multiplicati<strong>on</strong>elliptic curves <strong>and</strong> class field theory to c<strong>on</strong>struct cohomology classes c l which wewill use in Propositi<strong>on</strong> 17.6.2.


17.6 Kolyvagin’s Euler system of Heegner points 173Since every prime dividing the c<strong>on</strong>ductor N of E splits in K, there is an idealN of K such that O K /N ∼ = Z/NZ. Fix such an ideal for the rest of the argument.LetN −1 = {x ∈ K : xN ⊂ O K }be the inverse of N in the group of fracti<strong>on</strong>al ideals of O K . The pair(C/O K , N −1 /O K )then defines a point <strong>on</strong> x ∈ X 0 (N). By complex multiplicati<strong>on</strong> theory, the pointx is defined over the Hilbert Class Field H of K, which is the maximal unramifiedabelian extensi<strong>on</strong> of K. Also, class field theory implies that Gal(H/K) isisomorphic to the ideal class group of K.Let π : X 0 (N) → E be a (minimal) modular parametrizati<strong>on</strong>. Then y K =Tr H/K π(x H ) ∈ E(K), <strong>and</strong> we cally = y K − τ(y K ) ∈ E(K) −the Heegner point associated to K. Our hypotheses <strong>on</strong> K imply, by the theoremof Gross <strong>and</strong> Zagier, that y has infinite order, a fact which will be crucial in ourc<strong>on</strong>structi<strong>on</strong> of cohomology classes c l . If we were to choose a K such that y weretorsi<strong>on</strong> (of order coprime to p), then our classes c l would be locally trivial, hencegive c<strong>and</strong>idate elements of X(E) (see [Gro91, §11, pg. 254] <strong>and</strong> [McC91] for more<strong>on</strong> this c<strong>on</strong>necti<strong>on</strong>).Suppose l ∤ N is inert in K (i.e., does not split). LetO l = Z + lO K ,which is the order of c<strong>on</strong>ductor l in O K . Notice that O K /O l∼ = Z/lZ as groups.Since K ⊂ C, it make sense to view O l as a lattice in C. The pair(C/O l , (N ∩ O l ) −1 /O l )then defines a CM point x l ∈ X 0 (N). This point w<strong>on</strong>’t be defined over the Hilbertclass field H, though, but instead over an abelian extensi<strong>on</strong> K[l] of K that is acyclic extensi<strong>on</strong> of H of degree l + 1 totally ramified over H at l <strong>and</strong> unramifiedeverywhere else. Lety l = π(x l ) ∈ E(K[l]).Propositi<strong>on</strong> 17.6.5. We have Tr K[l]/H (y l ) = a l y H , where y H = π(x H ). Also,for any prime λ of K[l] lying above l, we haveas elements of E(F l 2).ỹ l = Frob l (ỹ H ),We will not prove this propositi<strong>on</strong>, except to note that it follows from the explicitdescripti<strong>on</strong>s of the acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> CM points <strong>on</strong> X 0 (N) <strong>and</strong> of the <strong>Hecke</strong>corresp<strong>on</strong>dences.We are almost ready to c<strong>on</strong>struct the cohomology classes c l . Suppose l ∤ pDN isany prime such that Frob l (K(E[p])/Q) = [τ]. By Lemma 17.6.4, [K[l] : H] = l+1is divisible by p, so there is a unique extensi<strong>on</strong> H ′ of H of degree p c<strong>on</strong>tained inK[l]. Let ϕ be a lift of Frob l (H/Q) to Gal(H ′ /Q), <strong>and</strong> letz ′ = Tr K[l]/H ′(y l − ϕ(y l )) − a lp (y H − ϕ(y H )) ∈ E(H ′ ).


174 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jectureLemma 17.6.6. Tr H ′ /H(z ′ ) = 0.Proof. Using properties of the trace <strong>and</strong> Propositi<strong>on</strong> 17.6.5, we haveTr H′ /H(z ′ ) = Tr K[l]/H (y l ) − Tr K[l]/H (ϕ(y l )) − a l (y H ) − a l ϕ(y H )= a l y H − a l ϕ(y H ) − a l (y H ) − a l ϕ(y H )= 0.For the rest of the proof we <strong>on</strong>ly c<strong>on</strong>sider primes p such thatH 1 (Q unrv /Q v , E(Q unrv ))[p] = 0for all v, where Q unrv is the maximal unramified extensi<strong>on</strong> of Q v . (This <strong>on</strong>ly excludesfinitely many primes, by [Milne, Arithmetic Duality Theorems, Prop. I.3.8].)Propositi<strong>on</strong> 17.6.7. There is an element c l ∈ H 1 (Q, E)[p] such that res v (c l ) = 0for all v ≠ l, <strong>and</strong> res l (c l ) ≠ 0 if <strong>and</strong> <strong>on</strong>ly if y ∉ pE(K l ).Proof. Recall that we assumed that p ∤ [H : K]. Thus there is a unique extensi<strong>on</strong> K ′of K of degree p in K[l], which is totally ramified at l <strong>and</strong> unramified everywhereelse, such that H ′ = HK ′ . Letz = Tr H ′ /K ′(z′ ) ∈ E(K ′ ).By Lemma 17.6.6, Tr K ′ /K(z) = 0.The extensi<strong>on</strong> Gal(K ′ /K) is cyclic (since it has degree p), so it is generated bya single element, say σ. The cohomology of a cyclic group is easy to underst<strong>and</strong>(see Atiyah-Wall 10 ). In particular, we have a can<strong>on</strong>ical isomorphism 10H 1 (K ′ /K, E(K ′ )) ∼ = ker(Tr K ′ /K : E(K ′ ) → E(K))(σ − 1)E(K ′ .)Define c ′ l to be the element of H1 (K ′ /K, E(K ′ )) that corresp<strong>on</strong>ds to z under thisisomorphism. A more careful analysis shows that c ′ lis the restricti<strong>on</strong> of an elementc l in H 1 (Q, E)[p].If v ≠ l thenres v (c l ) ∈ H 1 (Q unrv /Q v , E(Q unrv ))[p] = 0,as asserted. The asserti<strong>on</strong> about res l (c l ) is obtained with some work by reducingmodulo l <strong>and</strong> applying Lemma 17.6.4 <strong>and</strong> Propositi<strong>on</strong> 17.6.5.Remark 17.6.8. McCallum has observed that c l is represented by the 1-cocycleσ ↦→ − (σ − 1)y l,lwhere ((σ−1)y l )/l is the unique point P ∈ E(K[l]) such that lP = (σ−1)y l . (Thepoint P is unique because no n<strong>on</strong>trivial p-torsi<strong>on</strong> <strong>on</strong> E is defined over E(K[l]).)10 exp<strong>and</strong>


17.6 Kolyvagin’s Euler system of Heegner points 175Corollary 17.6.9. If y ∉ pE(K l ), then res l (Sel (p) (E)) = 0.Proof. Combine Propositi<strong>on</strong>s 17.6.2 <strong>and</strong> 17.6.7.If t ∈ H 1 (K, E[p]), denote by ˆt the image of t under restricti<strong>on</strong>:H 1 (K, E[p]) → Hom(Gal(Q/K(E[p])), E[p]). (17.6.1)It is useful to c<strong>on</strong>sider ˆt, since homomorphisms satisfy a local-to-global principle.A homomorphism ϕ : Gal(Q/F ) → M is 0 if <strong>and</strong> <strong>on</strong>ly if it is 0 when restricted toGal(F λ /F λ ) for all primes λ of F (this fact follows from the Chebotarev densitytheorem).For the rest of the proof we now assume that p is large enough that there are noQ-rati<strong>on</strong>al cyclic subgrups of E of order p, <strong>and</strong> H 1 (K(E[p])/K, E[p]) = 0. (Thatwe can do this follows from a theorem of Serre when E does not have CM, or CMtheory when E does have CM.)Lemma 17.6.10. Suppose t ∈ H 1 (K, E[p]) ± <strong>and</strong> the image of ˆt is cyclic. Thent = 0.Proof. Since τ acts <strong>on</strong> ˆt by ±, the image of ˆt is rati<strong>on</strong>al over Q. Thus the image ofτ is trivial. The kernel of the restricti<strong>on</strong> map (17.6.1) is also trivial by assumpti<strong>on</strong>,so t = 0.We are now ready to prove Theorem 17.6.1. Choose p large enough so thaty ∉ pE(K), in additi<strong>on</strong> to all other “sufficiently large” c<strong>on</strong>straints that we put<strong>on</strong> p above.Fix s ∈ Sel (p) (E) <strong>and</strong> write ŝ for the restricti<strong>on</strong> of s to a homomorphismGal(Q/K(E[p])) → E[p]. Our goal is to prove that s = 0. Write ŷ for the restricti<strong>on</strong>to Gal(Q/K(E[p])) of the image of y under the injecti<strong>on</strong>E(K) − /pE(K) − → H 1 (K, E[p]) − . (17.6.2)Fix a finite extensi<strong>on</strong> F of K(E[p]) that is Galois over Q, so that both homomorphismŝ <strong>and</strong> ŷ factor through G = Gal(F/K(E[p])).Suppose γ ∈ G, <strong>and</strong> use the Chebotarev Density Theorem to find a primel ∤ pDN such that Frob l (F/Q) = [γτ]. Then<strong>and</strong>Frob l (K(E[p])/Q) = [τ],Frob l (F/K(E[p])) = [γτ] order(τ) = [(γτ) 2 ].By Lemma 17.6.10, s l = 0 implies that ŝ((γτ) 2 ) = 0 for all γ ∈ G. Likewise,y ∈ pE(K l ) implies that ŷ((γτ) 2 ) = 0 for all γ ∈ G. Since s ∈ H 1 (Q, E) we haveτ(ŝ) = ŝ <strong>and</strong> by (17.6.2) we have τ(ŷ) = −ŷ, soŝ((γτ) 2 ) = ŝ(γ) + ŝ(τγτ) = (1 + τ)ŝ(γ) (17.6.3)ŷ((γτ) 2 ) = ŷ(γ) + ŷ(τγτ) = (1 − τ)ŷ(γ). (17.6.4)By Corollary 17.6.9, for every γ ∈ G, <strong>on</strong>e of ŝ((γτ) 2 ) or ŷ((γτ) 2 ) is 0, so by (17.6.3)<strong>and</strong> (17.6.4) at least <strong>on</strong>e of the following holds:ŝ(γ) ∈ E[p] −


176 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jectureorŷ(γ) ∈ E[p] + .ThusG = ŝ −1 (E[p] − ) ⋃ ŷ −1 (E[p] + ).Since a group cannot be the uni<strong>on</strong> of two proper subgroups, either ŝ(G) ⊂ E[p] −or ŷ(G) ⊂ E[p] + . By Lemma 17.6.10, either s = 0 or y ∈ pE(K). But we assumedthat y ∉ pE(K), so s = 0, as claimed.17.6.1 A Heegner point when N = 11Let E be the elliptic curve X 0 (11), which has c<strong>on</strong>ductor 11 <strong>and</strong> Weierstrass equati<strong>on</strong>y 2 + y = x 3 − x 2 − 10x − 20. The following uses a n<strong>on</strong>st<strong>and</strong>ard MAGMApackage...> E := EllipticCurve(Crem<strong>on</strong>aDatabase(),"11A");> E;Elliptic Curve defined by y^2 + y = x^3 - x^2 - 10*x - 20 over Rati<strong>on</strong>al Field>[]> A := [-D : D in [1..20] | HeegnerHypothesis(E,-D)]; A;[ -7, -8, -19 ]> P := [* *]; for D in A do time p,q := HeegnerPoint(E,D); Append(~P,); end for;> P; // pairs the (heegner point, its trace to Q)[* ,, *]> P[1];> Sqrt(-7)/2;2.64575131106459059050161575359*i> Sqrt(-7)/2;1.322875655532295295250807876795*i> 2*Sqrt(-7);5.291502622129181181003231507183*i> // Thus ((1+sqrt(-7))/2 , -2+2*sqrt(-7)) is the Heegner point> // corresp<strong>on</strong>ding to Q(sqrt(-7)).


17.6 Kolyvagin’s Euler system of Heegner points 17717.6.2 Kolyvagin’s Euler system for curves of rank at least 2111111 Summary of my work to show n<strong>on</strong>triviality in some cases. Make sure to also menti<strong>on</strong> workof Rubin, Mazur, Howard, Weinstein, etc., in this secti<strong>on</strong>. One interesting idea might be to addmy whole book <strong>on</strong> BSD into this book...


178 17. The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture


18The Gorenstein Property for <strong>Hecke</strong>Algebras18.1 Mod l representati<strong>on</strong>s associated to modular formsSuppose f = ∑ a n q n is a newform of exact level N <strong>and</strong> weight 2 for the c<strong>on</strong>gruencesubgroup Γ 0 (N). Let E = Q(. . . , a n , . . .) <strong>and</strong> let λ be a place of E lying over theprime l of Q. The acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> the l-adic Tate module of the associatedabelian variety A f gives rise to a representati<strong>on</strong>ρ λ : Gal(Q/Q) → GL 2 (E λ )that satisfies det(ρ λ ) = χ l <strong>and</strong> tr(ρ λ (Frob p )) = a p for p ∤ lN. Using the followinglemma, it is possible to reduce ρ λ module λ.Lemma 18.1.1. Let O be the ring of integers of E λ . Then ρ λ is equivalent to arepresentati<strong>on</strong> that takes values in GL 2 (O).Proof. View GL 2 (E λ ) as the group of automorphisms of a 2-dimensi<strong>on</strong>al E λ -vectorspace V . A lattice L ⊂ V is a free O-module of rank 2 such that L ⊗ E λ∼ = V .It suffices to find an O-lattice L in V that is invariant under the acti<strong>on</strong> via ρ λof Gal(Q/Q). For then the matrices of Gal(Q/Q) with respect to a basis of Vc<strong>on</strong>sisting of vectors from L will have coefficients in O. Choose any lattice L 0 ⊂ V .Since L 0 is discrete <strong>and</strong> Gal(Q/Q) is compact, the set of lattices ρ λ (g)L 0 withg ∈ Gal(Q/Q) is finite. Let L = ∑ ρ λ (g)L 0 be the sum of the finitely manyc<strong>on</strong>jugates of L 0 ; then L is Galois invariant. The sum is a lattice because it isfinitely generated <strong>and</strong> torsi<strong>on</strong> free, <strong>and</strong> O is a principal ideal domain.Choose an O as in the lemma, <strong>and</strong> tentatively write ρ λ = ρ λ mod λ:Gal(Q/Q)ρ λ GL 2 (O)false ρ λGL 2 (O/λ)


180 18. The Gorenstein Property for <strong>Hecke</strong> AlgebrasThe drawback to this definiti<strong>on</strong> is that ρ λ is not intrinsic; the definiti<strong>on</strong> depends<strong>on</strong> making a choice of O. Instead we define ρ λ to be the semisimplificati<strong>on</strong> of thereducti<strong>on</strong> of ρ λ modulo λ. The semisimplificati<strong>on</strong> of a representati<strong>on</strong> is the directsum of the Jordan-Hölder factors in a filtrati<strong>on</strong> of a vector space affording therepresentati<strong>on</strong>. We have det(ρ λ ) ≡ χ l (mod l), where χ l is the mod l cyclotomiccharacter, <strong>and</strong> tr(ρ λ (Frob p )) = a p mod λ. Thus the characteristic polynomials inthe semisimplificati<strong>on</strong> ρ λ are independent of our choice of reducti<strong>on</strong> of ρ λ . Thefollowing theorem implies that ρ λ depends <strong>on</strong>ly <strong>on</strong> f <strong>and</strong> λ, <strong>and</strong> not <strong>on</strong> the choiceof the reducti<strong>on</strong>.Theorem 18.1.2 (Brauer-Nesbitt). Suppose ρ 1 , ρ 2 : G → GL(V ) are two finitedimensi<strong>on</strong>al semisimple representati<strong>on</strong>s of a group G over a finite field k. Assumefurthermore that for every g ∈ G the characteristic polynomial of ρ 1 (g) is the sameas the characteristic polynomial of ρ 2 (g). Then ρ 1 <strong>and</strong> ρ 2 are equivalent.Proof. For a proof, see [CR62, §30, p. 215].The <strong>Hecke</strong> operators T n act as endomorphisms of S 2 (Γ 0 (N)). LetT := Z[. . . , T n , . . .] ⊂ End(S 2 (Γ 0 (N))),<strong>and</strong> recall that T is a commutative ring; as a Z-module T has rank equal todim C S 2 (Γ 0 (N)). Let m be a maximal ideal of T <strong>and</strong> set k := T/m ≈ F l ν .Propositi<strong>on</strong> 18.1.3. There is a unique semisimple representati<strong>on</strong>ρ m : Gal(Q/Q) → GL 2 (k)such that ρ m is unramified outside lN <strong>and</strong>tr(ρ m (Frob p )) = T p mod mdet(ρ m (Frob p )) = p mod m.Proof. It is enough to prove the asserti<strong>on</strong> with T replaced by the subalgebraT 0 = Z[{. . . , T n , . . . : (n, N) = 1}].Indeed, the maximal ideal m of T pulls back to a maximal ideal m 0 of T 0 , <strong>and</strong>k 0 = T 0 /m 0 ⊂ k. Nowt∏T 0 ⊂ T 0 ⊗ Q =with the E i number fields. Let O Ei be the ring of integers of E i <strong>and</strong> let O = ∏ O Ei .By the going up theorem there is a maximal ideal λ ⊂ ∏ O Ei lying over m 0 :λ i=1E i∏OEi O/λm 0 T 0 k 0Using the above c<strong>on</strong>structi<strong>on</strong>, we make a representati<strong>on</strong>ρ λ : Gal(Q/Q) → GL 2 (O/λ)


18.1 Mod l representati<strong>on</strong>s associated to modular forms 181that satisfies det(ρ λ ) ≡ χ l (mod λ) <strong>and</strong> tr(ρ λ ) = T p (mod λ). Because of howwe have set things up, T p plays the role of a p . Thus this representati<strong>on</strong> has therequired properties, but it takes values in GL 2 (O/λ) instead of k 0 .Since the characteristic polynomial of every ρ λ (g) for g ∈ Gal(Q/Q) has coefficientsin the subfield k 0 ⊂ O/λ there is a model for ρ λ over k 0 . This is a classicalresult of I. Schur. Brauer groups of finite fields are trivial (see e.g., [Ser79, X.7,Ex. a]), so the argument of [Ser77, §12.2] completes the proof of the propositi<strong>on</strong>.Alternatively, when the residue characteristic l of k 0 is odd, the following moredirect proof can be used. Complex c<strong>on</strong>jugati<strong>on</strong> acts through ρ λ as a matrix withdistinct F l -rati<strong>on</strong>al eigenvalues; another well known theorem of Schur [Sch06, IX a](cf. [Wal85, Lemme I.1]) then implies that ρ λ can be c<strong>on</strong>jugated into a representati<strong>on</strong>with values in GL(2, k 0 ).Let us look at this c<strong>on</strong>structi<strong>on</strong> in another way. WriteT 0 ⊗ Q = E 1 × · · · × E t<strong>and</strong> recall that each number field E i corresp<strong>on</strong>ds to a newform of level M | N; <strong>on</strong>ecan obtain E i by adjoining the coefficients of some newform of level M | N to Q.Likewise, the Jacobian J 0 (N) is isogenous to a product A 1 ×· · ·×A t . C<strong>on</strong>sider <strong>on</strong>eof the factors, say E 1 , <strong>and</strong> to fix ideas suppose that it corresp<strong>on</strong>ds to a newformof exact level N. Since Tate l A 1 is free of rank 2 over E 1 ⊗ Q Q l , we obtain a2-dimensi<strong>on</strong>al representati<strong>on</strong> ρ λ . Reducing mod λ <strong>and</strong> semisimplifying gives therepresentati<strong>on</strong> c<strong>on</strong>structed in the above propositi<strong>on</strong>. But it is also possible that<strong>on</strong>e of the fields E i corresp<strong>on</strong>ds to a newform f of level M properly dividing N.In this case, we repeat the whole c<strong>on</strong>structi<strong>on</strong> with J 0 (M) to get a 2-dimensi<strong>on</strong>alrepresentati<strong>on</strong>.C<strong>on</strong>sider <strong>on</strong>e of the abelian varieties A 1 as above, which we view as an abeliansubvariety of J 0 (N). Then T acts <strong>on</strong> A 1 ; let T denote the image of T in End A 1 .Although T sits naturally in O 1 , which is the ring of integers of a field, T might not2 dim A1l,be integrally closed. C<strong>on</strong>sider the usual Z l -adic Tate module Tate l (A 1 ) ≈ Zwhere dim A 1 = [E 1 : Q]. In the 1940s Weil proved that T ⊗ Z l acts faithfully <strong>on</strong>the Tate module. By the theory of semilocal rings (see, e.g., [Eis95, Cor. 7.6]), wehaveT ⊗ Z l = ∏ T m ,m|lwhere the product is over all maximal ideals of T of residue characteristic l. Theidempotents e m , in this decompositi<strong>on</strong>, decompose Tate l as a product ∏ m Tate m(A 1 ).It would be nice if Tate l were free of rank 2 over T ⊗ Z l but this is not known tobe true in general, although it has been verified in many special cases. For this tobe true we must have that, for all m | l, that Tate m (A 1 ) is free of rank 2 over T m .Next we put things in a finite c<strong>on</strong>text instead of a projective limit c<strong>on</strong>text. LetJ = J 0 (N), then by Albanese or Picard functoriality T ⊂ End J. Let m ⊂ T be amaximal ideal. LetJ[m] = {t ∈ J(Q) : xt = 0 for all x ∈ m}.Note that J[m] ⊂ J[l] where l is the rati<strong>on</strong>al prime lying in m. Now J[l] is an F lvector space of rank 2g where g is the genus of X 0 (N). Although it is true thatJ[l] is a T/l-module, it is not c<strong>on</strong>venient to work with T/l since it might not


182 18. The Gorenstein Property for <strong>Hecke</strong> Algebrasbe a product of fields because of unpleasant ramificati<strong>on</strong>. It is more c<strong>on</strong>venient towork with J[m] since T/m is a field. Thus, via this optic, J[m] is a k[G]-modulewhere k = T/m <strong>and</strong> G = Gal(Q/Q). The naive hope is that J[m] is a model forρ m , at least when ρ m is irreducible. This does not quite work, but we do have thefollowing theorem.Theorem 18.1.4. If l ∤ 2N then J[m] is a model for ρ m .18.2 The Gorenstein propertyC<strong>on</strong>sider the <strong>Hecke</strong> algebraT := Z[. . . , T n , . . .] ⊂ End J 0 (N),<strong>and</strong> let m ⊂ T be a maximal ideal of residue characteristic l. We have c<strong>on</strong>structeda semisimple representati<strong>on</strong>ρ m : Gal(Q/Q) → GL 2 (T/m).It is unramified outside lN, <strong>and</strong> for any prime p ∤ lN we havetr(ρ m (Frob p )) = T p mod mdet(ρ m (Frob p )) = p mod m.We will usually be interested in the case when ρ m is irreducible. Let T m =lim←− T/mi T denote the completi<strong>on</strong> of T at m. Note that T ⊗ Z Z l = ∏ m|l T m.Our goal is to prove that if m ∤ 2N <strong>and</strong> ρ m is irreducible, then T m is Gorenstein.Let O be a complete discrete valuati<strong>on</strong> ring. Let T be a local O-algebra which asa module is finite <strong>and</strong> free over O. Then T is a Gorenstein O-algebra if there isan isomorphism of T -modules T −→ ∼ Hom O (T, O). Thus T m is Gorenstein if thereis an isomorphism Hom Zl (T m , Z l ) ≈ T m of T m -modules. Intuitively, this meansthat T m is “autodual”.Theorem 18.2.1. Let J = J 0 (N) <strong>and</strong> let m be a maximal ideal of the <strong>Hecke</strong>algebra. Assume that ρ m is irreducible <strong>and</strong> that m ∤ 2N. Then dim T/m J[m] = 2<strong>and</strong> J[m], as a Galois module, is a 2-dimensi<strong>on</strong>al representati<strong>on</strong> giving rise to ρ m .An easy argument shows that the theorem implies T m is Gorenstein.We first c<strong>on</strong>sider the structure of W = J[m]. Suppose the two dimensi<strong>on</strong>alrepresentati<strong>on</strong>ρ m : Gal(Q/Q) → Aut T/m Vc<strong>on</strong>structed before is irreducible. C<strong>on</strong>sider the semisimplificati<strong>on</strong> W ß of W , thusW ß is the direct sum of its Jordan-Hölder factors as a Gal(Q/Q)-module. Mazurproved the following theorem.Theorem 18.2.2. There is some integer t ≥ 0 so thatW ß ∼ = V × · · · × V = V t .If ρ m is in fact absolutely irreducible then it is a result of Bost<strong>on</strong>, Lenstra, <strong>and</strong>Ribet [BpR91] that W ∼ = V × · · · × V . A representati<strong>on</strong> is absolutely irreducible if


18.2 The Gorenstein property 183it is irreducible over the algebraic closure. It can be shown that if l ≠ 2 <strong>and</strong> ρ m isirreducible then ρ m must be absolutely irreducible.The c<strong>on</strong>structi<strong>on</strong> of W is nice <strong>and</strong> gentle whereas the c<strong>on</strong>structi<strong>on</strong> of V is accomplishedvia brute force.Proof. (Mazur) We want to compare V with W . Let d = dim W . LetW ∗ = Hom T/m (W, T/m(1))where T/m(1) = T/m ⊗ Z µ l . We need to show thatW ß ⊕ W ∗ß ∼ = V × · · · × V = Vdas representati<strong>on</strong>s of Gal(Q/Q). Note that each side is a semisimple module ofdimensi<strong>on</strong> 2d. To obtain the isomorphism we show that the two representati<strong>on</strong>shave the same characteristic polynomials so they are isomorphic.We want to show that the characteristic polynomial of Frob p is the same forboth W ß ⊕ W ß <strong>and</strong> V × · · · × V . The characteristic polynomial of Frob p <strong>on</strong> V isX 2 − T p X + p = (X − r)(X − pr −1 ) where r lies in a suitable algebraic closure. Itfollows that the characteristic polynomial of Frob p <strong>on</strong> V d is (x − r) d (x − pr −1 ) d .On W the characteristic polynomial of Frob p is (X − α 1 ) · · · (X − α d ) where α iis either r or pr −1 . This is because Eichler-Shimura implies Frob p must satisfyFrob 2 p −T p Frob p +p = 0. [[I d<strong>on</strong>’t see this implicati<strong>on</strong>.]] On W ∗ the characteristicpolynomial of Frob p is (X − pα1 −1 ) · · · (X − pα−1d). [[This is somehow tied up withthe definiti<strong>on</strong> of W ∗ <strong>and</strong> I can’t quite underst<strong>and</strong> it.]] Thus <strong>on</strong> W ⊕ W ∗ , thecharacteristic polynomial of Frob p isd∏(X − α i )(X − pα −1i ) =i=1d∏(X − r)(X − pr −1 ) = (X − r) d (X − pr −1 ) d .i=1Therefore the characteristic polynomial of Frob p <strong>on</strong> W ⊕ W ∗ is the same as thecharacteristic polynomial of Frob p <strong>on</strong> V × · · · × V . The point is that although theα i could all a priori be r or pr −1 , by adding in W ∗ everything pairs off correctly.[[I d<strong>on</strong>’t underst<strong>and</strong> why we <strong>on</strong>ly have to check that the two representati<strong>on</strong>s agree<strong>on</strong> Frob p . There are lots of other elements in Gal(Q/Q), right?]]We next show that J[m] ≠ 0. This does not follow from the theorem provedabove because it does not rule out the possibility that t = 0 <strong>and</strong> hence W ß ∼ = 0.Suppose J[m] = 0, then we will show that J[m i ] = 0 for all i ≥ 1. We c<strong>on</strong>siderthe l-divisible group J m = ∪ i J[m i ]. To get a better feel for what is going <strong>on</strong>,temporarily forget about m <strong>and</strong> just c<strong>on</strong>sider the Tate module corresp<strong>on</strong>ding to l.It is st<strong>and</strong>ard to c<strong>on</strong>sider the Tate moduleTate l J = lim J[l i ] ∼ ←−= Z 2 dim JIt is completely equivalent to c<strong>on</strong>siderl .J l := ∪ ∞ i=1J[l i ] ∼ −→ (Q l /Z l ) 2 dim J .Note that since Q l /Z l is not a ring the last isomorphism must be viewed as an isomorphismof abelian groups. In [Maz77] Mazur called Tate l J ∼ = Hom(Q l /Z l , J l )the covariant Tate module. CallTate ∗ l J := Hom(J l , Q l /Z l ) ∼ = Hom Zl (Tate l J, Z l )


184 18. The Gorenstein Property for <strong>Hecke</strong> Algebrasthe c<strong>on</strong>travariant Tate module. [[Why are the last two isomorphic?]] The covariant<strong>and</strong> c<strong>on</strong>travariant Tate modules are related by a Weil pairing J[l i ] × J[l i ] → µ l i.Taking projective limits we obtain a pairingThis gives a map〈·, ·〉 : Tate l J × Tate l J → Z l (1) = lim ←−µ l i.Tate l J → Hom(Tate l J, Z l (1)) = (Tate ∗ l J)(1)where (Tate ∗ l J)(1) = (Tate ∗ l J) ⊗ Z l (1). Z l (1) is a Z l -module where∑ai l i · ζ = ζ ∑ a il i[[This should probably be said l<strong>on</strong>g ago.]] This pairing is not a pairing of T-modules, since if t ∈ T then 〈tx, y〉 = 〈x, t ∨ y〉. It is more c<strong>on</strong>venient to use anadapted pairing defined as follows. Let w = w V ∈ End J 0 (N) be the Atkin-Lehnerinvoluti<strong>on</strong> so that t ∨ = wtw. Define a new T-compatible pairing by [x, y] :=〈x, wy〉. Then[tx, y] = 〈tx, wy〉 = 〈x, t ∨ wy〉 = 〈x, wty〉 = [x, ty].The pairing [·, ·] defines an isomorphism of T ⊗ Z l -modulesTate l J ∼ −→ Hom Zl (Tate l J, Z l (1)).Since Z l (1) is a free module of rank 1 over Z l a suitable choice of basis gives anisomorphism of T ⊗ Z l -modules(Tate ∗ l J)(1) ∼ = Hom Zl (Tate l J, Z l (1)) ∼ = Hom Zl (Tate l J, Z l ) = Tate ∗ l J.Thus Tate l J ∼ −→ Tate ∗ l J.Proof. (That J[m] ≠ 0.) The point is that the c<strong>on</strong>travariant Tate module Hom(J l , Q l /Z l )is the P<strong>on</strong>trjagin dual of T l . How does this relate to Tate m J? Since T ⊗ Z l∼ =∏m|l T m, Tate l J ∼ = ∏ m|l Tate m J so we can define Tate ∗ m J := Hom Zl (Tate m J, Z l ).Weil proved that Tate m J ∼ = T ate ∗ mJ is n<strong>on</strong>zero. View Tate ∗ m J as being dual toJ m in the sense of P<strong>on</strong>trjagin duality <strong>and</strong> so (Tate ∗ m J)/(m Tate ∗ m J) is dual toJ[m]. If J[m] = 0 then this quotient is 0, so Nakayama’s lemma would imply thatTate ∗ m J = 0. This would c<strong>on</strong>tradicts Weil’s asserti<strong>on</strong>. Therefore J[m] ≠ 0.18.3 Proof of the Gorenstein propertyWe are c<strong>on</strong>sidering the situati<strong>on</strong> with respect to J 0 (N) although we could c<strong>on</strong>siderJ 1 (N). Let T ⊂ End J 0 (N) be the <strong>Hecke</strong> algebra <strong>and</strong> let m ⊂ T be a maximal ideal.Let l be the characteristic of the residue class field T/m. Let T m = lim T/m i T.←−Then T ⊗ Z Z l = ∏ m|l T m. [[I want to put a good reference for this Atiyah-Macd<strong>on</strong>ald like fact here.]] Each T m acts <strong>on</strong> Tate l J 0 (N) so we obtain a productdecompositi<strong>on</strong>Tate l J 0 (N) = ∏ Tate m J 0 (N).m|lWe have the following two facts:


18.3 Proof of the Gorenstein property 1851. Tate m J 0 (N) ≠ 02. Tate l J 0 (N) is T ⊗ Z l -autodual <strong>and</strong> each Tate m J 0 (N) is Z l -autodual.[[autoduality for which dual? I think it is the linear dual since this is used.]]Let W = J[m], then the acti<strong>on</strong> of Gal(Q/Q) <strong>on</strong> W gives a representati<strong>on</strong> ofGal(Q/Q) over the field T/m. We compared W with a certain two dimensi<strong>on</strong>alrepresentati<strong>on</strong> ρ m : Gal(Q/Q) → V over T/m. Assume unless otherwise statedthat V is irreducible as a Gal(Q/Q)-module. Let Tate l = Tate l J 0 (N) <strong>and</strong> Tate m =Tate m J 0 (N). A formal argument due to Mazur showed thatW ß ∼ = V × · · · × V = V ⊕t .We have not yet determined t but we would like to show that t = 1. The P<strong>on</strong>trjagindual of a module M is the module M ∧ := Hom(M, Q/Z) where M is viewedas an abelian group (if M is topological, <strong>on</strong>ly take those homomorphisms whosekernel is compact). The linear dual of a module M over a ring R is the moduleM ∗ = Hom R (M, R).Exercise 18.3.1. Note that (Q l /Z l ) ∧ = Z l <strong>and</strong> Z ∧ l = Q l/Z l .Soluti<strong>on</strong>.. We can think of Q l /Z l as{∑−1n=−ka n l n : k > 0 <strong>and</strong> 0 ≤ a i < l}.Let (b i ) ∈ Z l so b i ∈ Z/l i Z <strong>and</strong> b i+1 ≡ b i (mod l i ). Define a map Q l /Z l → Q/Zby 1/l i ↦→ b i /l i . To check that this is well-defined it suffices to check that 1/l imaps to the same place as l · 1/l i+1 . Now 1/l i ↦→ b i /l i <strong>and</strong>So we just need to check thatl · 1/l i+1 ↦→ l · b i+1 /l i+1 = b i+1 /l i .b i+1 /l i ≡ b i /l i(mod Z).This is just the asserti<strong>on</strong> that (b i+1 − b i )/l i ∈ Z which is true since b i+1 ≡ b i(mod l i ).Propositi<strong>on</strong> 18.3.2. Let the notati<strong>on</strong> be as above, then t > 0.Proof. The idea is to use Nakayama’s lemma to show that if t = 0 <strong>and</strong> henceW = 0 then Tate m = 0 which is clearly false. But the relati<strong>on</strong> between W <strong>and</strong>Tate m is rather c<strong>on</strong>voluted. In fact J[l ∞ ] is the P<strong>on</strong>trjagin dual of Tate ∗ l , that is,<strong>and</strong>J[l ∞ ] ∧ = Tate ∗ l = Hom Zl (Tate l , Z l )(Tate ∗ l ) ∧ = Hom(Tate ∗ l , Q l /Z l ) = J[l ∞ ].[[First: Why are they dual? Sec<strong>on</strong>d: Why are we homing into Q l /Z l instead ofQ/Z?]] Looking at the m-adic part shows thatJ[m ∞ ] = Hom(Tate ∗ m, Q l /Z l )


186 18. The Gorenstein Property for <strong>Hecke</strong> Algebras<strong>and</strong> henceJ[m] = Hom(Tate ∗ m /m Tate ∗ m, Z/lZ).Thus if J[m] = 0 then Nakayama’s lemma implies Tate ∗ m = 0. By autoduality thisimplies Tate m = 0.We have two goals. The first is to show that t = 1, i.e., that J[m] is 2-dimensi<strong>on</strong>alover T/m. The sec<strong>on</strong>d is to prove that T m is Gorenstein, i.e., that T m∼ = HomZl (T m , Z l ).This is <strong>on</strong>e of the main theorems in the subject. We are assuming throughout thatρ m is irreducible <strong>and</strong> l ∤ 2N. Loosely speaking the c<strong>on</strong>diti<strong>on</strong> that l ̸ |2N means thatJ[m] has good reducti<strong>on</strong> at l <strong>and</strong> that J[m] can be understood just by underst<strong>and</strong>ingJ[m] in characteristic l. We want to prove that T m is Gorenstein because thisproperty plays an essential role in proving that T m is a local complete intersecti<strong>on</strong>.Example 18.3.3. LetT = {(a, b, c, d) ∈ Z 4 p : a ≡ b ≡ c ≡ d(mod p)}.Then T is a local ring that is not Gorenstein.For now we temporarily postp<strong>on</strong>e the proof of the first goal <strong>and</strong> instead showthat the first goal implies the sec<strong>on</strong>d.Theorem 18.3.4. Suppose J[m] is two dimensi<strong>on</strong>al over T/m (thus t = 1). ThenT m is Gorenstein.Proof. We have seen before thatJ[m] = Hom Z/lZ (Tate ∗ m /m Tate ∗ m, Z/lZ)= Hom T/m (Tate ∗ m /m Tate ∗ m, T/m).Thus the dual of Tate ∗ m /m Tate ∗ m is two dimensi<strong>on</strong>al over T/m <strong>and</strong> hence Tate ∗ m /m Tate ∗ mitself is two dimensi<strong>on</strong>al over T/m. By Nakayama’s lemma <strong>and</strong> autoduality ofTate m this implies Tate m is generated by 2 elements over T m . There is a surjecti<strong>on</strong>T m × T m → Tate m .In fact it is true that rank Zl Tate m = 2 rank Zl T m . We temporarily postp<strong>on</strong>e theproof of this claim. Assuming this claim <strong>and</strong> using that a surjecti<strong>on</strong> between Z l -modules of the same rank is an isomorphism implies that Tate m∼ = Tm × T m .Now T m is a direct summ<strong>and</strong> of the free Z l module Tate m so T m is projective. Aprojective module over a local ring is free. Thus T m is free of rank 1 <strong>and</strong> henceautodual (Gorenstein). [[This argument is an alternative to Mazur’s – it seems tooeasy... maybe I am missing something.]]We return to the claim thatThis is equivalent to the asserti<strong>on</strong> thatrank Zl Tate m = 2 rank Zl T m .dim Ql Tate m ⊗ Zl Q l = 2 dim Ql T m ⊗ Zl Q l .The module Tate l J 0 (N) is the projective limit of the l-power torsi<strong>on</strong> <strong>on</strong> the JacobianJ(C) = Hom C(S 2 (Γ 0 (N), C), C).H 1 (X 0 (N), Z)


18.4 Finite flat group schemes 187Let L = H 1 (X 0 (N), Z) be the lattice. Then L is a T-module <strong>and</strong> Tate l = L ⊗ Z Z l(since L/l i L ∼ = ( 1 l i L)/L). Tensoring with R givesL ⊗ Z R = Hom C (S 2 (Γ 0 (N), C), C)= Hom R (S 2 (Γ 0 (N), R), C)= Hom R (S 2 (Γ 0 (N), R), R) ⊗ R C = (T ⊗ Z R) ⊗ R CThus L⊗ Z R is free of rank 2 over T⊗ Z R <strong>and</strong> L⊗ Z C is free of rank 2 over T⊗ Z C.Next choose an embedding Q l ↩→ C. Now Tate l is a module over T⊗Z l = ∏ m|l T mso we have a decompositi<strong>on</strong> Tate l = ∏ m|l Tate m. SinceTate l ⊗ Zl Q l = ∏ mTate m ⊗ Zl Q lwe can tensor with C to see thatTate l ⊗ Zl C = ∏ mTate m ⊗ Zl C.But Tate l ⊗ Zl C = L ⊗ Z C is free of rank 2 over T ⊗ C. Therefore the product∏m Tate m ⊗ Zl C is free of rank 2 over T ⊗ C. SinceT ⊗ C = (T ⊗ Z Z l ) ⊗ Zl C = ∏ m(T m ⊗ Zl C)we c<strong>on</strong>clude that for each m, Tate m ⊗ Zl C is free of rank 2 over T m ⊗ Zl C. Thisimpliesdim C Tate m ⊗ Zl C = 2 dim C T m ⊗ Zl Cwhich completes the proof.18.3.1 Vague commentsOgus commented that this same proof shows that T ⊗ Z C is Gorenstein. Then hesaid that something called “faithfully flat descent” could then show that T ⊗ Z Qis Gorenstein.We have given the classical argument of Mazur that T m is Gorenstein, but westill haven’t shown that J[m] has dimensi<strong>on</strong> 2. This will be accomplished next timeusing Dieud<strong>on</strong>né modules.18.4 Finite flat group schemesLet S be a scheme. Then a group scheme over S is a group object in the categoryof S-schemes. Thus a group scheme over S is a scheme G/S equipped with S-morphisms m : G × G → G, inv : G → G <strong>and</strong> a secti<strong>on</strong> 1 G : S → G satisfying theusual group axioms.Suppose G is a group scheme over the finite field F q . If R is an F q -algebra thenG(R) = Mor(Spec R, G) is a group. It is the group of R-valued points of G.We c<strong>on</strong>sider several st<strong>and</strong>ard examples of group schemes.


188 18. The Gorenstein Property for <strong>Hecke</strong> AlgebrasExample 18.4.1. The multiplicative group scheme G m is G m = Spec Z[x, 1 x ] withmorphisms. [[give maps, etc.]] The additive group scheme is Spec Z[x]...Example 18.4.2. The group scheme µ p is the kernel of the morphism G m → G minduced by x ↦→ x p . Thus µ p = Spec Z[x]/(x p − 1) <strong>and</strong> so for any F q -algebra Rwe have that µ p (R) = {r ∈ R : r p = 1}. The group scheme α p is the kernel of themorphism G a → G a induced by [[what!! what is alphap?? it should be the additivegroup scheme of order p, no?]].Let A be a finite algebra over F p <strong>and</strong> suppose G = Spec A affine group scheme(over F p ). Then the order of G is defined to be the dimensi<strong>on</strong> of A as an F p vectorspace.Example 18.4.3. Let E/F p be an elliptic curve. Then G = E[p] is a group schemeof order p 2 [[why is this true?]]. This is w<strong>on</strong>derful because this is the order thatE[p] should have in analogy with the characteristic 0 situati<strong>on</strong>. When we just lookat points we have{1 supersingular#G(F p ) =.p ordinary18.5 Reformulati<strong>on</strong> of V = W problemLet J = J 0 (N) be the Jacobian of X 0 (N). Then J is defined over Q <strong>and</strong> hasgood reducti<strong>on</strong> at all primes not dividing N. Assume l is a prime not dividing N.J[l] extends to a finite flat group scheme over Z[ 1 N]. This is a n<strong>on</strong>trivial result ofGrothendieck (SGA 7I, LNM 288). Since l ∤ N, J[l] gives rise to a group schemeover F l .We have “forcefully” c<strong>on</strong>structed a Galois representati<strong>on</strong> ρ m : G → V of dimensi<strong>on</strong>2 over T/m. Our goal is to show that this is isomorphic to the naturallydefined Galois representati<strong>on</strong> W = J[m]. So far we know that0 ⊂ V ⊂ W ⊂ J[l].Our assumpti<strong>on</strong>s are that l ∤ N, V is irreducible, <strong>and</strong> l ≠ 2.Let J be J thought of as a scheme over Z l . Grothendieck showed that J is thespectrum of a free finite Z l -module. Raynaud (1974) showed that if l ≠ 2 thenessentially everything about J[l] can be seen in terms of J[l](Q l ). He goes <strong>on</strong> toc<strong>on</strong>struct group scheme V <strong>and</strong> W over Z l such thatV ⊂ W ⊂ J[l].Our goal is to prove that the inclusi<strong>on</strong> V ↩→ W of Galois modules is an isomorphism.Raynaud showed noted that the category of finite flat group schemesover Z l is an abelian category so the cokernel Q = W /V is defined. Furthermore,V = W if <strong>and</strong> <strong>on</strong>ly if Q = 0. Since Q is flat QFlhas the same dimensi<strong>on</strong> over F las QQlhas over Q l . Passing to characteristic l yields an exact sequence0 → V Fl→ W Fl→ QFl→ 0.Thus V ↩→ W is an isomorphism if <strong>and</strong> <strong>on</strong>ly if V Fl↩→ W Flis an isomorphism.Since V , W , <strong>and</strong> Q have an acti<strong>on</strong> of k = T/m that are k-vector space schemes.This leads us to Dieud<strong>on</strong>né theory.


18.6 Dieud<strong>on</strong>né theory 18918.6 Dieud<strong>on</strong>né theoryLet G/k be a finite k-vector space scheme where k is a finite field of order q.Suppose G has order q n so G is locally the spectrum of a rank n algebra over k.The Dieud<strong>on</strong>né functor c<strong>on</strong>travariantly associates to G an n dimensi<strong>on</strong>al k-vectorspace D(G). Let Frob : G → G be the morphism induced by the pth power map<strong>on</strong> the underlying rings <strong>and</strong> let Ver be the dual of Frob. Let ϕ = D(Frob) <strong>and</strong>ν = D(Ver), then it is a property of the functor D that ϕ ◦ ν = ν ◦ ϕ = 0. Thefunctor D is a fully faithful functor.Example 18.6.1. Let k = F p . If G is either µ p , α p , or Z/pZ then D(G) is a <strong>on</strong>edimensi<strong>on</strong>alvector space over k. In the case of α p , ϕ = ν = 0. For µ p , ϕ = 0 <strong>and</strong>ν ≠ 1 <strong>and</strong> for Z/pZ, ϕ ≠ 1 <strong>and</strong> ν = 0. [[The latter two could be reversed!]]Let G ∨ = Hom(G, µ p ) denote the Cartier dual of the scheme G. Then(ϕ <strong>and</strong> ν are switched.)D(G ∨ ) = Hom k (D(G), k)Example 18.6.2. Let A/F l be an abelian variety <strong>and</strong> let G = A[l]. Then G isan F l -vector space scheme of order l 2g . Thus D(G) is a 2g-dimensi<strong>on</strong>al F l -vectorspace <strong>and</strong> furthermore D(G) = HDR 1 (A/F l). The Hodge filtrati<strong>on</strong> <strong>on</strong> HDR 1 of theabelian variety A gives rise to a diagramHom(H 0 (A ∨ , Ω 1 ), F l ) Tan(A ∨ )H 0 (A, Ω 1 ) D(G) H 1 DR (A/F l) H 1 (A, O)D(A[l]) D(A[Ver])There is an exact sequence0 → W Fl → J Fl [l] −→ m J Fl [l]so because D is an exact functor the sequenceD(J Fl [l]) −→ m D(J Fl [l]) → D(W Fl ) → 0is exact. Following F<strong>on</strong>taine we c<strong>on</strong>siderD(W Fl [Ver]) = H 1 (J, O)/mH 1 (J, O).18.7 The proof: part II[[We all just returned from the Washingt<strong>on</strong> D.C. c<strong>on</strong>ference <strong>and</strong> will now resumethe proof.]]Let J 0 (N) be the Jacobian of X 0 (N). Let m ⊂ T be a maximal ideal <strong>and</strong>suppose m|l. Assume that l ≠ 2 <strong>and</strong> l ∤ N. The assumpti<strong>on</strong> that l ≠ 2 is necessary


190 18. The Gorenstein Property for <strong>Hecke</strong> Algebrasfor Raynaud’s theory <strong>and</strong> we assume that l ∤ N so that our group schemes willhave good reducti<strong>on</strong>. We attach to m a 2 dimensi<strong>on</strong>al semisimple representati<strong>on</strong>ρ m : Gal(Q/Q) → V <strong>and</strong> <strong>on</strong>ly c<strong>on</strong>sider the case that ρ m is irreducible.The m-torsi<strong>on</strong> of the Jacobian, W = J 0 (Q)[m], is naturally a Galois module. Wehave shown that W ≠ 0. By [BpR91] W ∼ = V × · · · × V (the number of fracti<strong>on</strong>sis not determined). We proved that W s.s. ∼ = V × · · · × V . Choose an inclusi<strong>on</strong>V ↩→ W <strong>and</strong> let Q = W/V be the cokernel.Theorem 18.7.1. Q = 0 so dim T/m W = 2To prove the theorem we introduce the “machine” of finite flat group schemesover Z l . For example, W extends to a finite flat group scheme W Zl which is definedto be the Zariski closure of W in J Zl [l]. Passing to group schemes yields an exactsequence0 → V Zl → W Zl → Q Zl → 0.Reducing mod l then yields an exact sequence of F l -group schemes0 → V Fl → W Fl → Q Fl → 0.The point is that Q = 0 if <strong>and</strong> <strong>on</strong>ly if Q Zl = 0 if <strong>and</strong> <strong>on</strong>ly if Q Fl = 0.Next we introduced the exact c<strong>on</strong>travariant Dieud<strong>on</strong>né functorD : ( Groups Schemes /F l ) −→ ( Linear Algebra ).D sends a group scheme G to a T/m vector space equipped with two endomorphismsϕ = Frob <strong>and</strong> ν = Ver. Applying D gives an exact sequence of T/m-vectorspaces0 → D(Q) → D(W ) → D(V ) → 0where everything is now viewed over F l .Lemma 18.7.2. D(W [Ver]) = (H 0 (X 0 (N) Fl , Ω 1 )[m]) ∗Proof. We have the diagramD(J Fl [l]) = HDR 1 (J F l)↓↓D(J Fl [Ver]) = H 1 (J Fl , O JFl )↓↓D(W [Ver]) = H 1 (J Fl , O J )/mH 1 (J Fl , O J )Furthermore we have the identificati<strong>on</strong>sH 1 (J Fl , O J ) = Tan(J ∨ F l) = Cot(J ∨ F l) ∗= H 0 (J ∨ , Ω 1 ) ∗ = H 0 (X 0 (N), Ω 1 ) ∗For the last identificati<strong>on</strong> we must have J ∨ = Alb(X 0 (N)). FinallyD(W [Ver]) = H 1 (J, O J )/mH 1 (J, O J )= (H 0 (X 0 (N) Fl , Ω 1 )[m]) ∗ .


18.7 The proof: part II 191Lemma 18.7.3. H 0 (X 0 (N) Fl , Ω 1 )[m] has T/m dimensi<strong>on</strong> ≤ 1.Proof. Let S = H 0 (X 0 (N) Fl , Ω 1 )[m]. Then S ↩→ F l [[q]]. We defined the <strong>Hecke</strong>operators T n <strong>on</strong> S via the identificati<strong>on</strong> S ∼ = H 1 (J, O J ) so that they act <strong>on</strong> S ⊂F l [[q]] in the st<strong>and</strong>ard way. Let T(S) be the subalgebra of End(S) generated bythe images of the T n in End(S). (T(S) is not a subring of T.) There is a perfectpairingT(S) × S −→ F l(T, f) ↦→ a 1 (f|T )Thus dim Fl T(S) = dim Fl S <strong>and</strong> so dim T(S) S ≤ 1. Since m acts trivially <strong>on</strong> Sthere is a surjecti<strong>on</strong> T/m → T(S). Thusas desired.dim T/m S ≤ dim T(S) S ≤ 1An applicati<strong>on</strong> of the above lemma shows that D(W [Ver]) has T/m dimensi<strong>on</strong>≤ 1.Lemma 18.7.4. D(W [Ver]) ∼ = D(V [Ver])Proof. C<strong>on</strong>sider the following diagram.0000 D(Q) D(W )VerVer0 D(Q) D(W )D(V )VerD(V )000 D(Q)/ Ver D(Q)?∼ =?D(W )/ Ver D(W ) D(V )/ Ver D(V ) 00 0 0D(V ) has dimensi<strong>on</strong> 2 so since Ver ◦ Frob = Frob ◦ Ver = 0 <strong>and</strong> Ver, Frob areboth n<strong>on</strong>zero they must each have rank 1 (in the sense of undergraduate linearalgebra). Since D is exact, D(V [Ver]) = D(V )/ Ver D(V ) <strong>and</strong> D(W [Ver]) =D(W )/ Ver D(W ). By the previous lemma dim D(W )/ Ver D(W ) = 1. Thus D(W )/ Ver D(W ) →D(V )/ Ver D(V ) is a map of 1 dimensi<strong>on</strong>al vector spaces so to show that it is anisomorphism we just need to show that it is surjective. This follows from the commutativityof the squareD(W )D(V )0D(W )/ Ver D(W ) D(V )/ Ver D(V )0


192 18. The Gorenstein Property for <strong>Hecke</strong> AlgebrassoSuppose for the moment that we admit [BpR91]. ThenW = V × · · · × V = V ⊕tD(W [Ver]) ∼ = D(V [Ver]) ⊕t<strong>and</strong> hence t = 1.Alternatively we can avoid the use of [BpR91]. Suppose Q ≠ 0. Then there isan injecti<strong>on</strong> V ↩→ Q. [[I can’t see this without using B-L-R. It isn’t obvious to mefrom 0 → V → W → Q → 0.]] Thus over F l , V [Ver] ↩→ Q[Ver]. Since V [Ver] ≠ 0this implies Q[Ver] ≠ 0. Thus D(Q)/ Ver D(Q) = D(Q[Ver]) ≠ 0. But the bottomrow of the above diagram implies D(Q)/ Ver D(Q) = 0 so Q = 0.18.8 Key result of Bost<strong>on</strong>-Lenstra-RibetLet G be a group (i.e., G = Gal(Q/Q)), let k be a field (i.e., k = T/m), <strong>and</strong> let Vbe a two dimensi<strong>on</strong>al k-representati<strong>on</strong> of G given byρ : k[G] → End k (V ) = M 2 (k).The key hypothesis is that V is absolutely irreducible, i.e., that ρ is surjective. Foreach g ∈ G c<strong>on</strong>siderp g = g 2 − g tr ρ(g) + det ρ(g) ∈ k[G].By the Cayley-Hamilt<strong>on</strong> theorem ρ(p g ) = 0. Let J be the two-sided ideal of k[G]generated by all p g such that g ∈ G. Since J ⊂ ker ρ , ρ induces a mapσ : k[G]/J → End k (V ).Theorem 18.8.1 (Bost<strong>on</strong>-Lenstra-Ribet). If σ is surjective then σ is an isomorphism.In particular if V is absolutely irreducible then σ is surjective. The theorem canbe false when dim V > 2.Suppose W is a sec<strong>on</strong>d representati<strong>on</strong> of G given by µ : k[G] → End(W ) <strong>and</strong>that µ(J) = {0} ⊂ End(W ). Then W is a module over k[G]/J = End(V ). ButEnd(V ) is a semisimple ring so any End(V ) module is a direct sum of simpleEnd(V ) modules. The <strong>on</strong>ly simple End(V ) module is V . Thus W ∼ = V ⊕n for somen.


19Local Properties of ρ λLet f be a newform of weight 2 <strong>on</strong> Γ 0 (N). To f we have associated an abelianvariety A = A f furnished with an acti<strong>on</strong> of E = Q(. . . , a n (f), . . .). Let O E be thering of integers of E <strong>and</strong> λ ⊂ O E a prime. Then we obtain a λ-adic representati<strong>on</strong>ρ λ : Gal(Q/Q) → GL 2 (E λ )<strong>on</strong> the Tate module Tate l A = lim ←−A[λ i ]. We will study the local properties of ρ λat various primes p.19.1 Definiti<strong>on</strong>sTo view ρ λ locally at p we restrict to the decompositi<strong>on</strong> group D p = Gal(Q/Q)at p. Recall the definiti<strong>on</strong> of D p . Let K be a finite extensi<strong>on</strong> of Q <strong>and</strong> let w be aprime of K lying over p. Then the decompositi<strong>on</strong> group at w is defined to beD w = {σ ∈ Gal(K/Q) : σw = w}.Propositi<strong>on</strong> 19.1.1. D w∼ = Gal(Kw /Q p )Proof. Define a map Gal(K w /Q p ) → D w by σ ↦→ σ |K . Since σ |K fixes Q thisrestricti<strong>on</strong> is an element of Gal(K/Q). Since wO Kw is the unique maximal ideal ofO Kw <strong>and</strong> σ induces an automorphism of O Kw , it follows that σ(wO Kw ) = wO Kw .Thus σ |K (w) = w so σ |K ∈ D w . The map σ ↦→ σ |K is bijective because K is densein K w .LetD p = lim D ←− w = lim Gal(K ←− w /Q p ) = Gal(Q p /Q p ).w|p w|pFor each w|p let the inertia group I w be the kernel of the map from D w intoGal(O K /w, Z/pZ). Let I p = lim I w . ←−w|p


194 19. Local Properties of ρ λ19.2 Local properties at primes p ∤ NNext we study local properties of ρ λ at primes p ∤ N. Thus suppose p ∤ N <strong>and</strong>p ≠ l = char(O E /λ). Let D p = Gal(Q p /Q p ). Then1) ρ λ |D p is unramified (i.e., ρ λ (I p ) = {1}) thus ρ λ |D p factors through D p /I pso ρ λ (Frob p ) is defined.2) tr(ρ λ (Frob p )) = a p (f)2+) We can describe ρ λ |D p up to isomorphism. It is the unique semisimple representati<strong>on</strong>satisfying 1) <strong>and</strong> 2).19.3 Weil-Deligne GroupsNotice that everything is sort of independent of λ. Using Weil-Deligne groups wecan summarize all of these λ-adic representati<strong>on</strong>s in terms of data which makes λdisappear.We have an exact sequence1 → I p → Gal(Q p /Q p ) → Gal(F p /F p ) → 0.Since Gal(F p /F p ) = Ẑ there is an injecti<strong>on</strong> Z ↩→ Gal(F p/F p ). Define the Weilgroup W (Q p /Q p ) ⊂ Gal(Q p /Q p ) to be the set of elements of Gal(Q p /Q p ) mappingto Z ⊂ Gal(F p /F p ). W fits into the exact sequence1 → I p → W (Q p /Q p ) → Z → 1.There is a st<strong>and</strong>ard way in which the newform f gives rise to a representati<strong>on</strong>of W . Factor the polynomial x 2 − a p (f)x + p as a product (x − r)(x − r ′ ) withr, r ′ ∈ C. Define maps α, β byα : Z → C ∗ : 1 ↦→ rβ : Z → C ∗ : 1 ↦→ r ′Combining α <strong>and</strong> β <strong>and</strong> the map W (Q p /Q p ) → Z yields a mapα ⊕ β : W (Q p /Q p ) → GL 2 (C)σ ↦→( α(σ) 0).0 β(σ)Moreover α ⊕ β gives rise via some c<strong>on</strong>structi<strong>on</strong> to all the λ-adic representati<strong>on</strong>sρ λ .19.4 Local properties at primes p | NSuppose p|N but p ≠ l. Carayol was able to generalize 1) in his thesis which buildsup<strong>on</strong> the work of Langl<strong>and</strong>s <strong>and</strong> Deligne in the directi<strong>on</strong> of Deligne-Rapaport <strong>and</strong>Katz-Mazur. The idea is that the abelian variety A has a c<strong>on</strong>ductor which is apositive integer divisible by those primes of bad reducti<strong>on</strong>. The c<strong>on</strong>ductor of Asatisfies c<strong>on</strong>d(A) = M g where g = dim A <strong>and</strong> M is the reduced c<strong>on</strong>ductor of A.


Theorem 19.4.1 (Carayol). M = N.19.5 Definiti<strong>on</strong> of the reduced c<strong>on</strong>ductor 195How can we generalize 2) or 2+)? For each p dividing N there is a representati<strong>on</strong>σ p of WD(Q p /Q p ) over C such that σ p gives rise to ρ λ |D p for all λ ∤ p. HereWD(Q p /Q p ) is the Weil-Deligne group which is Deligne’s generalizati<strong>on</strong> of theWeil group. WD(Q p /Q p ) is an extensi<strong>on</strong> of W (Q p /Q p ). What is σ p supposed tobe? The point is that σ p is determined by f. Thus every f gives rise to a family(σ p ) p prime . To really think about σ p we must think about modular forms in anadelic c<strong>on</strong>text instead of viewing them as holomorphic functi<strong>on</strong>s <strong>on</strong> the complexupper halfplane.If p 2 |N <strong>and</strong> f = ∑ a n q n is a newform of level p 2 then it is a classical fact thata p = 0. But the study of ρ λ |D p is rich <strong>and</strong> “corresp<strong>on</strong>ds to a rather innocuouslooking crystal”.19.5 Definiti<strong>on</strong> of the reduced c<strong>on</strong>ductorWe now define the reduced c<strong>on</strong>ductor. Let λ be a prime of E <strong>and</strong> p a prime of Qsuch that λ ∤ p. We want to define some integer e(p) so that p e(p) is the p-part ofthe reduced c<strong>on</strong>ductor. We will not define e(p) but what we will do is define aninteger e(p, λ) which is the p part of the c<strong>on</strong>ductor. e(p, λ) is independent of λ butthis will not be proved here.C<strong>on</strong>siderρ λ |D p : Gal(Q p /Q p ) → Aut Eλ V.Let V I ⊂ V be the inertia invariants of V , i.e.,V I = {v ∈ V : ρ λ (σ)(v) = v for all σ ∈ I}.Since ρ λ is unramified at p if <strong>and</strong> <strong>on</strong>ly if ρ λ (I p ) = {1} we comment that ρ λ isunramified at p if <strong>and</strong> <strong>on</strong>ly if V I = V . Lete(p, λ) = dim V/V I + δ(p, λ)where δ(p, λ) is the Swan c<strong>on</strong>ductor. By working with finite representati<strong>on</strong>s wedefine δ(p, λ) as follows. Choose a D p -stable lattice L in V by first choosing anarbitrary <strong>on</strong>e then taking the sum of its finitely many c<strong>on</strong>jugates. Let V = L/λLwhich is a 2 dimensi<strong>on</strong>al vector space over k = O E /λ. Let G be the quotientof Gal(Q p /Q p ) by the kernel of the map Gal(Q p /Q p ) → Aut K V . Thus G =Gal(K/Q p ) for some finite extensi<strong>on</strong> K/Q p . The extensi<strong>on</strong> K is finite over Q psince G ⊂ Aut k V <strong>and</strong> Aut k V is a 2 × 2 matrix ring over a finite field. Thecorresp<strong>on</strong>ding diagram isGal(Q p /Q p )Aut k VG = Gal(K/Q p )C<strong>on</strong>sider in G the sequence of “higher ramificati<strong>on</strong> groups”G = G −1 ⊃ G 0 ⊃ G 1 ⊃ · · · .


196 19. Local Properties of ρ λHere G 0 is the inertia group of K/Q p <strong>and</strong> G 1 is the p-sylow subgroup of G 0[[the usages of “the” in this sentence makes me nervous.]] Let G i = {g ∈ G 0 :ord(gπ − π) ≥ i + 1} where π is some kind of uniformizing parameter [[I missedthis – what is π?]] Letδ(p, λ) =∞∑i=11(G 0 : G i ) dim(V /V Gi ).It is a theorem that δ(p, λ) is an integer <strong>and</strong> does not depend <strong>on</strong> λ.If to start with we <strong>on</strong>ly had V <strong>and</strong> not V we could defineThenc<strong>on</strong>d(V ) =∞∑i=01(G 0 : G i ) dim(V /V Gi ).e(p, λ) = c<strong>on</strong>d(V ) + (dim V I − dim V I ).A reference for much of this material is Serre’s Local Factors of L-functi<strong>on</strong>s ofλ-adic Representati<strong>on</strong>s.


20Adelic Representati<strong>on</strong>sOur goal is to study local properties of the λ-adic representati<strong>on</strong>s ρ λ arising from aweight 2 newform of level N <strong>on</strong> Γ 0 (N). There is a theorem of Carayol which statesroughly that if p ≠ l then ρ λ |D p is predictable from “the comp<strong>on</strong>ent at p of f”. Tounderst<strong>and</strong> this theorem we must underst<strong>and</strong> what is meant by “the comp<strong>on</strong>entat p of f”. If p 2 ∤ N this comp<strong>on</strong>ent is easy to determine but if p 2 |N it is harder.One reas<strong>on</strong> is that when p 2 |N then a p (f) = 0. [[this should be easy to see so thereshould be an argument here.]] If a p (f) = 0 <strong>and</strong> ρ λ : Gal(Q/Q) → Aut(V λ ) thenV Ipλ= 0. This means that there is no ramificati<strong>on</strong> going <strong>on</strong> at p. See Casselman,“On representati<strong>on</strong>s of GL(2) <strong>and</strong> the arithmetic of modular curves”, Antwerp II.20.1 Adelic representati<strong>on</strong>s associated to modular formsLet R be a subring of A 2 = R 2 × (Ẑ ⊗ Z Q) 2 , suppose that R ∼ = Q 2 <strong>and</strong> thatR ⊗ Q A ∼ = A 2 . Let L = R ∩ (R 2 × Ẑ2 ). Then the natural map L ⊗ Ẑ → Z2 isan isomorphism. [[Is the isomorphism implied by the definiti<strong>on</strong> of L or is it partof the requirement for L to actually form an adelic lattice?]] L is called an adeliclattice.The space of modular forms S 2 (Γ 0 (N)) is isomorphic to a certain space of functi<strong>on</strong>s<strong>on</strong> G(A) = GL(2, A). See Borel-Jacquet [[Corvallis?]] or Diam<strong>on</strong>d-Taylor,Inventi<strong>on</strong>es Mathematica, 115 (1994) [[what is title?]] We will describe this isomorphism.Write A = R × A f = A ∞ × A ∞ where A f = ∏ Q p (restricted product) isthe ring of finite adeles. A ∞ denotes the adeles with respect to the place ∞ soA ∞ = R, <strong>and</strong> A ∞ denotes the adeles away from the place ∞ so A ∞ = A f .S 2 (Γ 0 (N)) is isomorphic to the set of functi<strong>on</strong>s ϕ : G(A) → C which satisfy0) ϕ is left invariant by G(Q), i.e., ϕ(x) = ϕ(gx) for all g ∈ G(Q) <strong>and</strong> allx ∈ G(A),


198 20. Adelic Representati<strong>on</strong>s1) ϕ(xu ∞ ) = ϕ(x) for all x ∈ G(A) <strong>and</strong> all u ∞ ∈ U ∞ ,2) ϕ(xu ∞ ) = ϕ(x)j(u ∞ , i) −k det(u ∞ ) for all x ∈ G(A) <strong>and</strong> u ∞ ∈ U ∞ ,3) holomorphy, cuspidal, <strong>and</strong> growth c<strong>on</strong>diti<strong>on</strong>s.U ∞ is the adelic versi<strong>on</strong> of Γ 0 (N). Thus U ∞ is the compact open subgroup(U ∞ a b)= { ∈ GLc d 2 (Ẑ) : c ≡ 0 (mod N)} ⊂ G(A f ).(For Γ 1 (N) the c<strong>on</strong>diti<strong>on</strong> is that c ≡ 0 (mod N) <strong>and</strong> d ≡ 1 (mod N) but a is notrestricted.)Next we describe U ∞ ⊂ GL 2 (R). GL 2 (R) operates <strong>on</strong> h ± = C−R by z ↦→ az+bcz+d .Let U ∞ be the stabilizer of i.The third c<strong>on</strong>diti<strong>on</strong> involves the automorphy factor j defined by( a b)j( , z) = cz + d.c dTo explain the holomorphy c<strong>on</strong>diti<strong>on</strong> 3) we define, for any g ∈ G(A f ) a mapα g,ϕ : h ± → Chi ↦→ ϕ(gh)j(h, i) k (det(h)) −1 .Here h ∈ G(A f ) so hi ∈ h ± . There may be several different h ∈ G(A f ) which givethe same hi ∈ h ± so it must be checked that α g,ϕ is well-defined. Suppose hi = h ′ i,then h −1 h ′ i = i. Thus h −1 h ′ ∈ U ∞ , so by 2),ThusSubstituting gh for g yieldsϕ(gh −1 h ′ ) = ϕ(g)j(h −1 h ′ , i) −k det(h −1 h ′ ).ϕ(gh −1 h ′ ) det(h ′ ) −1 = ϕ(g)j(h −1 h ′ , i) −k det(h) −1 .ϕ(gh ′ )(det(h ′ )) −1 = ϕ(gh)j(h −1 h ′ , i) −k (det(h)) −1 .[[The the automorphy factor work out right. Why?]] The holomorphy c<strong>on</strong>diti<strong>on</strong> isthat the family of maps α g,ϕ are all holomorphic.The cuspidal c<strong>on</strong>diti<strong>on</strong> is that for all g ∈ G(A), the integral∫ϕ( ( )1 u0 1 , g)du = 0u∈A/Qvanishes. Since A/Q is compact it has a Haar measure defined modulo k ∗ whichinduces du. Although the integral is not well-defined the vanishing or n<strong>on</strong>-vanishingof the integral is.We can now describe the isomorphism between S k (Γ 0 (N)) <strong>and</strong> the space of suchfuncti<strong>on</strong>s ϕ <strong>on</strong> G(A).{ space of ϕ satisfying 0-3} → S k (Γ 0 (N))ϕ ↦→ f


20.1 Adelic representati<strong>on</strong>s associated to modular forms 199where f is the restricti<strong>on</strong> to h = h + of the functi<strong>on</strong>hi ↦→ ϕ(h)j(h, i) k (det h) −1 .Now we can associate to a newform f a representati<strong>on</strong> of G(A). We can weakenc<strong>on</strong>diti<strong>on</strong> 1) to get1-) ϕ(xu ∞ ) = ϕ(x) for all x ∈ G(A) <strong>and</strong> all u ∞ ∈ U ∞ where U ∞ is somecompact open subgroup of G(A f ).[[can the U ∞ vary for each x ∈ G(A) or are they fixed throughout?]] Let S be thespace of all functi<strong>on</strong>s satisfying all c<strong>on</strong>diti<strong>on</strong>s except 1-) replaces 1). This spacehas a left acti<strong>on</strong> of G(A):(g ∗ ϕ)(x) = ϕ(xg)If f is a newform corresp<strong>on</strong>ding to some ϕ via the above isomorphism then viathis acti<strong>on</strong> f gives rise to an infinite dimensi<strong>on</strong>al representati<strong>on</strong> π of G(A). In factwe obtain, for each prime p, a representati<strong>on</strong> π p of GL(2, Q p ). The representati<strong>on</strong>space is ∑ g∈GL C · g ∗ ϕ. Our immediate goal is to underst<strong>and</strong> π 2(Q p) p for as manyp as possible. [[spherical representati<strong>on</strong>s have something to do with this. are theπ p spherical reps?]]We are studying local properties of the λ-adic representati<strong>on</strong>s ρ λ associated to anewform f of weight 2, level N <strong>and</strong> character ε : (Z/NZ) ∗ → C ∗ (with c<strong>on</strong>d(ε)|N).Let l ∈ Z be the prime over which λ lies. We look at ρ λ locally at p, p ≠ l.As we saw last time f gives rise to an irreducible representati<strong>on</strong> of GL(2, A).An irreducible representati<strong>on</strong> of GL(2, A) gives rise to a family of representati<strong>on</strong>s(π v ) where v is a prime or ∞ <strong>and</strong> π v is an irreducible representati<strong>on</strong> of GL(2, Q v ).This is because Q v ⊂ A so GL(2, Q v ) ⊂ GL(2, A).Carayol proved that if p ≠ l then ρ λ | Dp depends, up to isomorphism, <strong>on</strong>ly <strong>on</strong>π p . The most difficult case in the proof of this theorem is when p 2 ∤ N. The toolsneeded to obtain a proof were already available in the work of Langl<strong>and</strong>s [1972].To get an idea of what is going <strong>on</strong> we will first c<strong>on</strong>sider the case when p ∤ N.The characteristic polynomial of Frobenious (at least psychologically) under therepresentati<strong>on</strong> ρ λ | Dp isx 2 − a p x + pε(p) = (x − r)(x − s).Because of Weil’s proof of the Riemann hypothesis for abelian varieties [over finitefields?] <strong>on</strong>e knows that |r| = |s| = √ p. Since ρ λ arises from the acti<strong>on</strong> of Galois<strong>on</strong> an abelian variety which has good reducti<strong>on</strong> at p (since p ∤ N) it follows thatρ λ | Dp is unramified. [[Is this in Serre-Tate, 1968?]] We also know that ρ λ (Frob p )has characteristic polynomialx 2 − a p x + pε(p) ∈ E[x].In this situati<strong>on</strong> <strong>on</strong>e also knows that ρ λ (Frob p ) is semisimple [[proof: Ribet noddedat Coleman who smiled at nodded back.]] Thus( r 0)ρ λ ∼ .0 sIn this situati<strong>on</strong> what is the representati<strong>on</strong> π p of GL(2, Q p )? There are two charactersα, β : Q ∗ p → C ∗(called “Grössencharacters of type (a,0)”) such that


200 20. Adelic Representati<strong>on</strong>s1. α <strong>and</strong> β are unramified in the sense thatα| Z ∗ p= β| Z ∗ p= 1.This is a reas<strong>on</strong>able c<strong>on</strong>diti<strong>on</strong> since under some sort of local class field theoryQ ∗ p embeds as a dense subgroup of Gal(Q p /Q p ) <strong>and</strong> under this embeddingthe inertia subgroup I ⊂ Gal(Q p /Q p ) corresp<strong>on</strong>ds to Z ∗ p. [[This could bewr<strong>on</strong>g. Also, is I ∩ Q ∗ p = Z ∗ p?]]2. α(p −1 ) = r, β(p −1 ) = s.In the 1950’s Weil found a way under which α <strong>and</strong> β corresp<strong>on</strong>d to c<strong>on</strong>tinuouscharacters α λ , β λ <strong>on</strong> Gal(Q p /Q p ) with values in E ∗ λ such that1. α λ <strong>and</strong> β λ are unramified.2. α λ (Frob p ) = r <strong>and</strong> β λ (Frob p ) = s.One has that ρ λ = α λ ⊕ β λ . See [ST68]. [[Why see this?]]Define a character Θ <strong>on</strong> the Borel subgroupbyThenB =( ∗ ∗)⊂ GL(2, Q0 ∗p )( x y)Θ = α(x)β(z) ∈ C ∗ .0 zπ p = Ind GL(2,Qp)B:= C[GL(2, Q p )] ⊗ C[B] C.We call this induced representati<strong>on</strong> π p the unramified principal series representati<strong>on</strong>associated to α <strong>and</strong> β <strong>and</strong> write π p = PS(α, β). People say π p is sphericalin the sense that there is a vector in the representati<strong>on</strong> space invariant under themaximal compact subgroup of GL(2, Q p ). [[Ribet was slightly unsure about thecorrect definiti<strong>on</strong> of spherical.]] [[For some mysterious reas<strong>on</strong>]] since π p = PS(α, β)it follows that α, β <strong>and</strong> hence ρ λ |D p is completely determined by π p . [[This is thepoint <strong>and</strong> i d<strong>on</strong>’t see this.]]Next we c<strong>on</strong>sider the more difficult case when p||N (p divides N exactly). Thereare two cases to c<strong>on</strong>sidera ε is ramified at p (p| c<strong>on</strong>d(ε))b ε is unramified at p (p ∤ c<strong>on</strong>d(ε)).20.2 More local properties of the ρ λ .Let f be a newform of level N. Then f corresp<strong>on</strong>ds to a representati<strong>on</strong> π = ⊗π v . Ifλ|l <strong>and</strong> l ≠ p then ρ λ |D p corresp<strong>on</strong>ds to π p under the Langl<strong>and</strong>s corresp<strong>on</strong>dence.The details of this corresp<strong>on</strong>dence were figured out by Philip Kutzko, but Carayolcompleted it in the excepti<strong>on</strong>al case (to be defined later). There are three cases toc<strong>on</strong>sider.


20.2 More local properties of the ρ λ . 201a) p ∤ Nb) p||Nc) p 2 | NWe c<strong>on</strong>sidered the first two cases last time. The third case is different because thesame sort of analysis as we applied to the first two cases no l<strong>on</strong>ger works in thesense that we no l<strong>on</strong>ger know what π p looks like.20.2.1 Possibilities for π pCase 1 (principal series) In this case, π p = PS(α, β). Here α, β : Q ∗ p → C ∗are unramified characters. Q ∗ p corresp<strong>on</strong>ds to a dense subgroup of the abelianGalois group of Q p under the corresp<strong>on</strong>dence elucidated in Serre’s “Local ClassfieldTheory” (in Cassels <strong>and</strong> Frohlich).Q ∗ pGal(Q p /Q p ) A∩BZ Ẑ = Gal(F p /F p )Under this corresp<strong>on</strong>dence α <strong>and</strong> β corresp<strong>on</strong>d to Galois representati<strong>on</strong>s α λ <strong>and</strong>β λ <strong>and</strong> ρ λ |D p ∼ α λ ⊕ β λ .Case 2 (special) In this case, π p is the special automorphic representati<strong>on</strong>corresp<strong>on</strong>ding to the Galois representati<strong>on</strong> κ ⊗ st where st is the <strong>Stein</strong>berg representati<strong>on</strong>which arises somehow from a split-multiplicative reducti<strong>on</strong> elliptic curve<strong>and</strong> κ is a Dirichlet character. In this case( χl ∗)ρ λ |D p = κ ⊗ .0 1Case 3 (cuspidal) Case 3 occurs when π p does not fall into either of theprevious cases. Such a π p is called cuspidal or super-cuspidal. Some of these π pcome from the following recipe. Fix an algebraic closure Q p of Q p . Let K be aquadratic extensi<strong>on</strong> of Q p . Let ψ : K ∗ → C ∗ be a Grössencharacter. Then ψ givesrise to a characterψ λ : Gal(Q p /K) → E ∗ λwhich induces ρ λ |D p . That is,ρ λ |D p = Ind Gal(Q p /Qp)Gal(Q p /K) ψ λ : Gal(Q p /Q p ) → GL(2, E λ ).This representati<strong>on</strong> is irreducible if <strong>and</strong> <strong>on</strong>ly if ψ is not invariant under the can<strong>on</strong>icalc<strong>on</strong>jugati<strong>on</strong> of K/Q p . The pair ψ, K gives rise (via the c<strong>on</strong>structi<strong>on</strong> of Jacquet-Langl<strong>and</strong>s) to a representati<strong>on</strong> of π ψ,K of GL(2, Q p ). Those representati<strong>on</strong>s whichdo not come from this recipe <strong>and</strong> which do not fall into case 1 or case 2 above arecalled extraordinary. They can <strong>on</strong>ly occur when p ≤ 2.When can the various cases occur?Case 1) occurs, e.g, if p ∤ N, <strong>and</strong> also if p||N <strong>and</strong> ε (the character of f) is ramifiedat p.Case 2) occurs if p||N <strong>and</strong> ε is unramified at p.


202 20. Adelic Representati<strong>on</strong>s20.2.2 The case l = pWe c<strong>on</strong>sider ρ λ |D p where λ|p <strong>and</strong> p = l. Write f = ∑ a n q n . The case when λ ∤ a pis called the ordinary case. This case is very similar to the case for an ordinaryelliptic curve. In other words,ρ λ |D p =( α ∗).0 β[[Ribet menti<strong>on</strong>ed that there is a paper <strong>on</strong> this by Mazur <strong>and</strong> Wiles in the AmericanJournal of Mathematics. Look the reference up.]] An important point is that β isunramified so it makes sense to c<strong>on</strong>sider β(Frob p ) = a p ∈ Eλ ∗ . Since αβ is thedeterminant, αβ = χ k−1p ε = χ p ε (after setting k = 2 to fix ideas), this gives somedescripti<strong>on</strong> of what is going <strong>on</strong>. The obvious questi<strong>on</strong> to ask is whether or not * isn<strong>on</strong>trivial. That is, is ρ λ |D p semisimple or not?When f has weight 2, then f gives rise to an abelian variety A = A f . Then ρ λis defined by looking at the acti<strong>on</strong> of Galois <strong>on</strong> the λ-adic divisi<strong>on</strong> points <strong>on</strong> A. Ifn<strong>on</strong>e of the λ lying over p divide a p then A is ordinary at p. A str<strong>on</strong>ger statementis that the p-divisible group A[p ∞ ] has good ordinary reducti<strong>on</strong>.One simple case is when f has CM. By this we mean that there is a characterκ ≠ 1 of order 2 such that a n = κ(n)a n for all n prime to c<strong>on</strong>d(κ). [[This is not atypo, I do not mean a n = κ(n)a n . Coleman said that a n = κ(n)a n is just a funnyway to say that half of the a n are 0.]] It is easy to prove that f has CM if <strong>and</strong> <strong>on</strong>ly ifthe ρ λ become abelian <strong>on</strong> some open subgroup of Gal(Q/Q) of finite index. Ribetexplains this in his article in [Rib77]. If f has CM then since the representati<strong>on</strong>ρ λ is almost abelian <strong>on</strong>e can show that * is trivial. Ribet said he does not knowwhether the c<strong>on</strong>verse is true. Note that f has CM if <strong>and</strong> <strong>on</strong>ly if A f /Q has CM. Ifall λ|p are ordinary (i.e., they do not divide a p ) <strong>and</strong> if * is trivial for every ρ λ |D pit is easy to show using [Ser98] that A has CM.Next we will say something more about representati<strong>on</strong>s which appear to beordinary. C<strong>on</strong>sider the situati<strong>on</strong> in which f has weight 2 <strong>and</strong> p exactly divides thelevel N of f. Suppose furthermore that the character ε of f is unramified at p.Then π p is a special representati<strong>on</strong>. The λ-adic representati<strong>on</strong>s for l ≠ p are (upto characters of finite order) like representati<strong>on</strong>s attached to some Tate curve. Thesituati<strong>on</strong> is similar when l = p sinceρ λ |D p =( ) α ∗.0 βAs in the case of a Tate curve α/β = χ p . Up to a quadratic character we know thesituati<strong>on</strong> since αβ = χ p ε. Also β is still unramified <strong>and</strong> β(Frob p ) = a p is a unit.We know that a 2 p = ε(p) is a root of unity. When k = 2 the case of a sphericalrepresentati<strong>on</strong> mimics what happens for ordinary reducti<strong>on</strong>. The upper right h<strong>and</strong>entry * is never trivial in this case [[because of something to do with extensi<strong>on</strong>s<strong>and</strong> Kummer theory]].Ribet said he knows nothing about the situati<strong>on</strong> when k > 2. If p||N <strong>and</strong> ε isunramified at p then π p is special so ρ λ , l ≠ p are again of the form ( α ∗)0 β . So westill know everything up to a quadratic character. But if l = p some charactersare Hodge-Tate [[what does that mean?]] so by a theorem of Faltings they cannot be bizarre powers of the cyclotomic character. The multiplicative case like theordinary case is very special to the case k = 2. Wiles uses this heavily in his proof.


20.2 More local properties of the ρ λ . 203If k is arbitrary then a 2 p = ε(p)p k 2 −1 so a p is not a unit for k > 2 so there isno invariant line. So the representati<strong>on</strong> is probably irreducible in the case k > 2.[[Echos of “yeah”, “strange” <strong>and</strong> “very strange” are heard throughout the room.]]20.2.3 Tate curvesSuppose E/Q p is an elliptic curve with multiplicative reducti<strong>on</strong> at p <strong>and</strong> thatj ∈ Q p is the j-invariant of E. Using a formula which can be found in [Sil94, V]<strong>on</strong>e finds q = q(j) with |q| < 1. The Tate curve is E(Q p ) = Q p /q Z . The p torsi<strong>on</strong><strong>on</strong> the Tate curve is {ζp a (q 1/p ) b : 0 ≤ a, b ≤ p − 1}. Galois acts by ζ p ↦→ ζpa <strong>and</strong>q 1/p ↦→ ζp a q 1/p . Thus the associated Galois representati<strong>on</strong> is ( χ p ∗)0 1 .


204 20. Adelic Representati<strong>on</strong>s


21Serre’s C<strong>on</strong>jectureThis is versi<strong>on</strong> 0.2 of this secti<strong>on</strong>. I am still unsatisfied with theorganizati<strong>on</strong>, level of detail, <strong>and</strong> coherence of the presentati<strong>on</strong>. Alot of work remains to be d<strong>on</strong>e. Some of Ken’s lectures in Utah willbe integrated into this chapter as well.Let l be a prime number. In this chapter we study certain mod l Galois representati<strong>on</strong>s,by which we mean c<strong>on</strong>tinuous homomorphisms ρGal(Q/Q)ρ GL(2, F)where F is a finite field of characteristic l. <strong>Modular</strong> forms give rise to a large classof such representati<strong>on</strong>s.newforms fmod l representati<strong>on</strong>s ρThe motiving questi<strong>on</strong> is:Given a mod l Galois representati<strong>on</strong> ρ, which newforms f if any, ofvarious weight <strong>and</strong> level, give rise to ρ?We will assume that ρ is irreducible. It is nevertheless sometimes fruitful to c<strong>on</strong>siderthe reducible case (see [SW97] <strong>and</strong> forthcoming work of C. Skinner <strong>and</strong> A. Wiles).Serre [Ser87] has given a very precise c<strong>on</strong>jectural answer to our motivating questi<strong>on</strong>.The result, after much work by many mathematicians, is that certain ofSerre’s c<strong>on</strong>jectures are valid in the sense that if ρ arises from a modular form atall, then it arises from <strong>on</strong>e having a level <strong>and</strong> weight as predicted by Serre. Themain trends in the subject are “raising” <strong>and</strong> “lowering.”Our motivating questi<strong>on</strong> can also be viewed through the opposite optic. Whatis the most bizarre kind of modular form that gives rise to ρ? A close study of theramificati<strong>on</strong> behavior of ρ allows <strong>on</strong>e to at least obtain some sort of c<strong>on</strong>trol overthe possible weights <strong>and</strong> levels. This viewpoint appears in [Wil95].


206 21. Serre’s C<strong>on</strong>jectureIn this chapter whenever we speak of a Galois representati<strong>on</strong> the homomorphismis assumed to be c<strong>on</strong>tinuous. The reader is assumed to be familiar with local fields,some representati<strong>on</strong> theory, <strong>and</strong> some facts about newforms <strong>and</strong> their characters.21.1 The Family of λ-adic representati<strong>on</strong>s attached to anewformFirst we briefly review the representati<strong>on</strong>s attached to a given newform. Letf = ∑ n≥1a n q n ∈ S k (N, ε)be a newform of level N, weight k, <strong>and</strong> character ε. Set K = Q(. . . , a n , . . .) <strong>and</strong> letO be the ring of integers of K. If λ is a n<strong>on</strong>zero prime ideal of O we always let l bethe prime of Z over which λ lies, so λ ∩ Z = (l). Let K λ denote the completi<strong>on</strong> ofK at λ, thus K λ is a finite extensi<strong>on</strong> of Q l . The newform f gives rise to a system(ρ f,λ ) of λ-adic representati<strong>on</strong>s<strong>on</strong>e for each λ.ρ f,λ : Gal(Q/Q) −→ GL(2, K λ )Theorem 21.1.1 (Carayol, Deligne, Serre). Let f be as above <strong>and</strong> l a prime.There exists a Galois representati<strong>on</strong>ρ f,l : Gal(Q/Q) −→ GL(2, K ⊗ Q l )with the following property: If p ∤ lN is a prime, then ρ f,l is unramified at p, <strong>and</strong>the image under ρ f,l of any Frobenius element for p is a matrix with trace a p <strong>and</strong>determinant ε(p)p k−1 .The actual c<strong>on</strong>structi<strong>on</strong> of ρ f,l w<strong>on</strong>’t be used in what follows. Since K ⊗ Q l isa product ∏ λ K λ of the various completi<strong>on</strong>s of K at the primes λ of K lying overl, we have a decompositi<strong>on</strong>GL(2, K ⊗ Q l ) = ∏ λ|lGL(2, K λ )<strong>and</strong> projecti<strong>on</strong> <strong>on</strong>to GL(2, K λ ) gives ρ f,λ .By Lemma 18.1.1 ρ f,λ is equivalent to a representati<strong>on</strong> taking values in GL(2, O)where O is the ring of integers of K. Since λ is a prime of O reducti<strong>on</strong> modulo λdefines a map GL(2, O) → GL(2, F) where F = O/λ is the residue class field of λ,<strong>and</strong> we obtain a mod l Galois representati<strong>on</strong>ρ f,λ : Gal(Q/Q) −→ GL(2, F).21.2 Serre’s C<strong>on</strong>jecture ASerre [Ser87] c<strong>on</strong>jectured that certain mod l representati<strong>on</strong>s arise from modularforms, <strong>and</strong> then gave a precise recipe for which type of modular form would give


21.2 Serre’s C<strong>on</strong>jecture A 207rise to the representati<strong>on</strong>. We refer to the first part of his c<strong>on</strong>jecture as “C<strong>on</strong>jectureA” <strong>and</strong> to the sec<strong>on</strong>d as “C<strong>on</strong>jecture B”.Let ρ : Gal(Q/Q) → GL(2, F) be a Galois representati<strong>on</strong> with F a finite field.We say that ρ arises from a modular form or that ρ is modular if there is somenewform f = ∑ a n q n <strong>and</strong> some prime ideal λ of K = Q(. . . , a n , . . .) such that ρis isomorphic to ρ f,λ over F.O/λ FRecall that a Galois representati<strong>on</strong> ρ is odd if det(ρ(c)) = −1 where c ∈ Gal(Q/Q)is a complex c<strong>on</strong>jugati<strong>on</strong>. Galois representati<strong>on</strong> arising from modular forms arealways odd. [More detail?]C<strong>on</strong>jecture 21.2.1 (Serre’s C<strong>on</strong>jecture A). SupposeFρ : Gal(Q/Q) → GL(2, F)is an odd irreducible (c<strong>on</strong>tinuous) Galois representati<strong>on</strong> with F a finite field. Thenρ arises from a modular form.21.2.1 The Field of definiti<strong>on</strong> of ρOne difficulty is that ρ sometimes takes values in a slightly smaller field thanO/λ. We illustrate this by way of an example. Let f be <strong>on</strong>e of the two c<strong>on</strong>jugatenewforms of level 23, weight 2, <strong>and</strong> trivial character. Thenf = q + αq 2 + (−2α − 1)q 3 + (−α − 1)q 4 + 2αq 5 + · · ·with α 2 + α − 1 = 0. The coefficients of f lie in O = Z[α] = Z[ 1+√ 52]. Take λ to bethe unique prime of O lying over 2, then O/λ ∼ = F 4 <strong>and</strong> so ρ f,λ is a homomorphismto GL(2, F 4 ).Propositi<strong>on</strong> 21.2.2. If p ≠ 2 then a p ∈ Z[ √ 5].Proof. We have f = f 1 + αf 2 withf 1 = q − q 3 − q 4 + · · ·f 2 = q 2 − 2q 3 − q 4 + 2q 5 + · · ·Because S 2 (Γ 0 (23)) has dimensi<strong>on</strong> two, it is spanned by f 1 <strong>and</strong> f 2 . Let η(q) =∏q 1 24n≥1 (1 − qn ). By Propositi<strong>on</strong> ??, g = (η(q)η(q 23 )) 2 ∈ S 2 (Γ 0 (23)). An explicitcalculati<strong>on</strong> shows that g = q 2 − 2q 3 + · · · so we must have g = f 2 . Next observethat g is a power series in q 2 , modulo 2:g = q 2 ∏ (1 − q n ) 2 (1 − q 23n ) 2≡ q 2 ∏ (1 − q 2n )(1 − q 46n ) (mod 2)≡ q 2 ∏ (1 + q 2n + q 46n + q 48n ) (mod 2)Thus the coefficient in f 2 of q p with p ≠ 2 prime is even <strong>and</strong> the propositi<strong>on</strong>follows.


208 21. Serre’s C<strong>on</strong>jectureThus those a p with p ≠ 2 map modulo λ to F 2 ⊂ F 4 , <strong>and</strong> hence the traces<strong>and</strong> determinants of Frobenius, at primes where Frobenius is defined, take valuesin F 2 . This is enough to imply that ρ f,λ is isomorphic over F to a representati<strong>on</strong>ρ : Gal(Q/Q) → GL(2, F 2 ). Geometrically, GL(2, F 2 ) ∼ = S 3 <strong>and</strong> the representati<strong>on</strong>ρ is <strong>on</strong>e in which Galois acts via S 3 <strong>on</strong> three points of X 0 (23).Thus in general, if we start with ρ <strong>and</strong> wish to see that ρ satisfies C<strong>on</strong>jecture A,we may need to pass to an algebraic closure of F. In our example, starting withρ : Gal(Q/Q) → GL 2 (F 2 ) we say that “ρ is modular” because ρ ∼ = ρ λ,f , but keepin mind that this particular isomorphism <strong>on</strong>ly takes place over F 4 .At this point K. Buzzard comments, “Maybe there is some better modular formso that all of the a p actually lie in F 2 <strong>and</strong> the associated representati<strong>on</strong> is isomorphicto ρ. That would be a str<strong>on</strong>ger c<strong>on</strong>jecture.” Ribet resp<strong>on</strong>ds that he has neverthought about that questi<strong>on</strong>.21.3 Serre’s C<strong>on</strong>jecture BSerre’s sec<strong>on</strong>d c<strong>on</strong>jecture asserts that ρ arises from a modular form in a particularspace. Supposeρ : Gal(Q/Q) → GL(2, F)is an odd irreducible Galois representati<strong>on</strong> with F a finite field.C<strong>on</strong>jecture 21.3.1 (Serre’s C<strong>on</strong>jecture B). Suppose ρ arises from some modularform. Then ρ arises from a modular form of level N(ρ), weight k(ρ) <strong>and</strong> characterε(ρ). The exact recipe for N(ρ) <strong>and</strong> k(ρ) will be given later.C<strong>on</strong>jecture B has largely been proven.Theorem 21.3.2. Suppose l is odd. If the mod l representati<strong>on</strong> ρ is irreducible<strong>and</strong> modular, then ρ arises from a newform f of level N(ρ) <strong>and</strong> weight k(ρ).21.4 The LevelThe level N(ρ) is a c<strong>on</strong>ductor of ρ, essentially the Artin c<strong>on</strong>ductor except that weomit the factor corresp<strong>on</strong>ding to l. The level is a productN(ρ) = ∏ p≠lp e(p) .We now define the e(p). Let V be the representati<strong>on</strong> space of ρ, so V is a twodimensi<strong>on</strong>al F-vector space <strong>and</strong> we view ρ as a homomorphismρ : Gal(Q/Q) → Aut(V ),or equivalently view V as an F[Gal(Q/Q)]-module. Let K = Q ker(ρ) be the fieldcut out by ρ, it is a finite Galois extensi<strong>on</strong> of Q with Galois group which we callG. Choose a prime ℘ of K lying over p <strong>and</strong> letD = {σ ∈ G : σ(℘) ⊂ ℘}


21.4 The Level 209denote the decompositi<strong>on</strong> group at ℘. Let O be the ring of integers of K <strong>and</strong> recallthat the higher ramificati<strong>on</strong> groups G −1 ⊃ G 0 ⊃ G 1 ⊃ G 2 ⊃ · · · areG i = {σ ∈ D : σ acts trivially <strong>on</strong> O/℘ i+1 }.Thus G −1 = D <strong>and</strong> G 0 is the inertia group. Each G i is a normal subgroup of Dbecause G i is the kernel of D → Aut(O/℘ i+1 ). LetV Gi = {v ∈ V : σ(v) = v for all σ ∈ G i }.Lemma 21.4.1. The subspace V Gi is invariant under D.Proof. Let g ∈ D, h ∈ G i <strong>and</strong> v ∈ V Gi . Since G i is normal in D, there existsh ′ ∈ G i so that g −1 hg = h ′ . Then h(gv) = gh ′ v = gv so gv ∈ V Gi .By [Frö67, §9] that there is an integer i so that G i = 0. We can now definee(p) =∞∑i=01[G 0 : G i ] dim(V/V Gi ).Thus e(p) depends <strong>on</strong>ly <strong>on</strong> ρ|I p where I p = G 0 is an inertia group at p. In particular,if ρ is unramified at p then all G i for i ≥ 0 vanish <strong>and</strong> e(p) = 0. Separatingout the term dim(V/V Ip ) corresp<strong>on</strong>ding to i = 0 allows us to writee(p) = dim(V/V Ip ) + δ(p) ≤ 2 + δ(p)since dim V = 2. The term δ(p) is called the Swan c<strong>on</strong>ductor. By [Ser77, 19.3] δ(p)is an integer, hence e(p) is an integer. We call ρ tamely ramified at p if all G i fori > 0 vanish, in which case the Swan c<strong>on</strong>ductor is 0.Suppose ρ ∼ = ρ f,λ (over F) for some newform f of level N. There is a relati<strong>on</strong>shipbetween e(p) <strong>and</strong> something involving <strong>on</strong>ly f. Let V λ be the representati<strong>on</strong> spaceof ρ f,λ , so V λ is a vector space over an extensi<strong>on</strong> of Q l . It turns out thatord p (N) = dim(V λ /V Ipλ) + δ(p).Thusord p (N) = e(p) + error termwhere the error term is dim(V Ip ) − dim(V Ipλ) ≤ 2. The point is that more canbecome invariant up<strong>on</strong> reducing modulo λ. Thus if f gives rise to ρ then the powerof p in the level N of f is c<strong>on</strong>strained as it is given by a certain formula <strong>on</strong>lydepending <strong>on</strong> e(p) <strong>and</strong> an error term which has magnitude at most 2.As we will see, the weight k(ρ) depends <strong>on</strong>ly <strong>on</strong> ρ|I l . Thus k(ρ) can be viewedas an analogue of e(l).21.4.1 Remark <strong>on</strong> the case N(ρ) = 1One c<strong>on</strong>sequence of C<strong>on</strong>jecture B is that every modular mod l representati<strong>on</strong> ρmust come from a newform f of level prime to l. Suppose that f is a modularform of level l. C<strong>on</strong>sider the corresp<strong>on</strong>ding mod l representati<strong>on</strong>. It is unramifiedoutside l so N(l) = 1. The c<strong>on</strong>jecture then implies that this mod l representati<strong>on</strong>


210 21. Serre’s C<strong>on</strong>jecturecomes from a level 1 modular form, i.e., a modular form <strong>on</strong> SL 2 (Z), of possiblyhigher weight. This is a classical result.For the rest of this subsecti<strong>on</strong> we assume that l ≥ 11. There is a relati<strong>on</strong>shipbetween mod l forms <strong>on</strong> SL 2 (Z) of weight l+1 <strong>and</strong> mod l forms <strong>on</strong> Γ 0 (l) of weight2. The dimensi<strong>on</strong>s of each of these spaces are the same over F l or over C, i.e.,We now describe a mapThe weight k Eisenstein seriesdim Fl S 2 (Γ 0 (l); F l ) = dim C S 2 (Γ 0 (l); C).dim Fl S l+1 (SL 2 (Z); F l ) = dim C S l+1 (SL 2 (Z); C).F : S 2 (Γ 0 (l); F l ) −→ S l+1 (SL 2 (Z); F l ).E k = 1 − 2kB k∞ ∑n=1σ k−1 (n)q nis a modular form for SL 2 (Z). Here the Bernoulli numbers B k are defined byt∞e t − 1 = ∑ t kB kk! ,k=0so B 0 = 1, B 1 = − 1 2 , B 2 = 1 12 , B 3 = − 1720 , . . ..Propositi<strong>on</strong> 21.4.2. If l ≥ 5, thenE l−1 ≡ 1(mod l).Proof. We must check that l | 2(l−1)B l−1, or equivalently that l |1B l−1. Set n = l − 1,p = l <strong>and</strong> apply [Lan95, X.2.2].Suppose f ∈ S 2 (Γ 0 (l); F l ) is the reducti<strong>on</strong> of f ∈ S 2 (Γ 0 (l); F l ). Multiplicati<strong>on</strong>by E l−1 gives a mod l form f · E l−1 of weight l + 1 <strong>on</strong> Γ 0 (l). We haveF (f) = tr(f · E l−1 ) ∈ S l+1 (SL 2 (Z); F l )where tr is induced by X 0 (l) → X 0 (1). The map F was discovered by Serre [DK73].That F has the properties necessary to deduce the prime level case of Serre’sC<strong>on</strong>jecture B was proved by explicit computati<strong>on</strong> in the Berkeley Ph.D. thesis ofC. Queen [Que77]. Katz believes that it is easy to prove the formula from the rightmagical moduli point of view, but neither author has seen this. The first authorgave a c<strong>on</strong>crete proof for l > 2 in [Rib94]. It would be nice if some<strong>on</strong>e wouldc<strong>on</strong>struct a clear proof for the case l = 2.21.4.2 Remark <strong>on</strong> the proof of C<strong>on</strong>jecture BWe now give a very brief outline of the proof of C<strong>on</strong>jecture B when l > 2. Startwith a representati<strong>on</strong> ρ which comes from some (possibly terrible) newform f oflevel N(f) <strong>and</strong> weight k(f).


21.5 The Weight 211Step 1. Using a c<strong>on</strong>crete argument replace f by another newform giving rise to ρso that l ∤ N(f).Step 2. Compare N(ρ) <strong>and</strong> N(f). These are two prime to l numbers. What ifp | N(f)N(ρ)? Carayol separated this into several cases. In each case replace f by abetter form so that the ratio has a smaller power of p in it. This is level lowering.Eventually we get to the case N(ρ) = N(f).Step 3. Using a paper of Edixhoven [Edi92b] <strong>on</strong>e shows that f can be replacedwith a another form of the same level so that the weight k is equal to k(ρ).21.5 The Weight21.5.1 The Weight modulo l − 1We first give some background which motivates Serre’s numerical recipe for theweight. We start with a newform f of low weight k <strong>and</strong> c<strong>on</strong>sider the behavior ofthe representati<strong>on</strong> ρ = ρ f,λ |I l , where I l is an inertia group at l. Assume that weare in the following situati<strong>on</strong>:• N = N(f) is prime to l,• 2 ≤ k ≤ l + 1.There are other cases to c<strong>on</strong>sider but we c<strong>on</strong>sider this <strong>on</strong>e first. Let ε be thecharacter of f <strong>and</strong> recall that for p ∤ lNdet(ρ f,l (Frob p )) = ε(p)p k−1 .The mod l cyclotomic character χ l : Gal(Q/Q) → F ∗ lsending Frob p to p ∈ F ∗ l . Thusis the homomorphismdet(ρ) = εχ k−1l.Since ε is a Dirichlet character mod N <strong>and</strong> l ∤ N, ε|I l = 1 <strong>and</strong> sodet(ρ|I l ) = χ k−1l.We see immediately that det(ρ|I l ) determines k modulo l − 1. Since 2 ≡ l + 1(mod l − 1), this still doesn’t distinguish between 2 <strong>and</strong> l + 1.21.5.2 Tameness at lLetρ : Gal(Q/Q) −→ GL(2, F l ν )be a modular mod l Galois representati<strong>on</strong>. Let D = D l ⊂ Gal(Q/Q) be a decompositi<strong>on</strong>group at l. Let σ be the semisimplificati<strong>on</strong> of ρ|D. Thus σ is eithera direct sum of two characters or σ = ρ|D depending <strong>on</strong> whether or not ρ|D isirreducible. By [Frö67, 8.1] the ramificati<strong>on</strong> group P = G 1 is the unique Sylowl-subgroup of I l = G 0 .


212 21. Serre’s C<strong>on</strong>jectureLemma 21.5.1. The semisimplificati<strong>on</strong> σ of ρ is tame, i.e., σ|P = 0 where P isthe Sylow l-subgroup of the inertia group I l .Proof. Since I l is normal in D <strong>and</strong> any automorphism of I l sends P to someSylow l-subgroup <strong>and</strong> P is the <strong>on</strong>ly such, it follows that P is normal in D. LetW = F l ν × F l ν be the representati<strong>on</strong> space of σ. ThenW P = {w ∈ W : σ(τ)w = w for all τ ∈ P }is a subspace of W invariant under the acti<strong>on</strong> of D. For this let α ∈ D <strong>and</strong> supposew ∈ W P . Since P is normal in D, α −1 τα = τ ′ for some τ ′ ∈ P . Thereforeσ(α) −1 σ(τ)σ(α)w = σ(τ ′ )w = wso σ(τ)σ(α)w = σ(α)w hence σ(α)w ∈ W P .But W P ≠ 0. To see this write W as a disjoint uni<strong>on</strong> of its orbits under theacti<strong>on</strong> of P . Since P is an l-Sylow group <strong>and</strong> W is finite we see that the size ofeach orbit is either 1 or a positive power of l. Now {0} is a singlet<strong>on</strong> orbit, Whas l-power order, <strong>and</strong> all n<strong>on</strong>-singlet<strong>on</strong> orbits have order a positive power of l sothere must be at least l − 1 other singlet<strong>on</strong> orbits. Each of these other singlet<strong>on</strong>orbits gives a n<strong>on</strong>zero element of W P .If W P = W then P acts trivially so we are d<strong>on</strong>e. If W P ≠ W , then since W P isn<strong>on</strong>zero it is a <strong>on</strong>e dimensi<strong>on</strong>al subspace invariant under D, so by semisimplicityσ is diag<strong>on</strong>al. Let τ ∈ P , then τ has order l n for some n. Write( ) α 0σ(τ) =,0 βthen α ln = β ln = 1. Since α, β ∈ F l ν they have order dividing |F ∗ l ν | = lν − 1. Butgcd(l ν − 1, l n ) = 1 from which it follows that α = β = 1. Thus P = {1} <strong>and</strong> againP acts trivially, as claimed.21.5.3 Fundamental characters of the tame extensi<strong>on</strong>The lemma implies σ|I l factors through the tame quotient I t = I l /P . We now describecertain characters of I t more explicitly. Denote by Q tmlthe maximal tamelyramified extensi<strong>on</strong> of Q l , it is the fixed field of P . Let K = Q urlbe the maximalunramified extensi<strong>on</strong>, i.e., the fixed field of the inertia group I l . By Galois theoryI t = Gal(Q tml /Q url ).For each positive integer n coprime to l there is a tower of Galois extensi<strong>on</strong>sQ tmlI tK( n√ l)K = Q urlµ n (K)Q l


21.5 The Weight 213Since Q(ζ n )/Q is unramified at l the nth roots of 1 are c<strong>on</strong>tained in K so byKummer theoryGal(K( n√ l)/K) ∼ = µ n (K)where µ n (K) denotes the group of nth roots of unity in K. Thus for each n primeto l we obtain a map I t → µ n (K). They are compatible so up<strong>on</strong> passing to thelimit we obtain a mapI t −→ lim µ ←− n (K) = ∏ lim←− µ r a(K) = ∏ Z r (1).r≠lr≠lIn fact [Frö67, 8, Corollary 3] the above maps are isomorphisms. Viewed morecleverly mod l we obtain a mapThus for each i we have a mapI t −→ lim ←−µ n (F l ) = lim ←−F ∗ l i.I t → F ∗ l i.which is called the fundamental character of level i. The c<strong>on</strong>structi<strong>on</strong> of this character<strong>on</strong> I t is somewhat unnatural because we had to choose an embedding F l i ↩→ F l .Instead Serre begins with a “disembodied” field F of order l i . There are i differentmaps F l i → F corresp<strong>on</strong>ding to the i automorphisms of F l i. Restricting thesemaps to F ∗ l<strong>and</strong> composing with I i t → F ∗ lgives the i fundamental charactersiof level i. The unique fundamental character of level 1 is the mod l cyclotomiccharacter χ l .21.5.4 The Pair of characters associated to ρRecall that we have a representati<strong>on</strong> ρ whose semisimplificati<strong>on</strong> gives a representati<strong>on</strong>which we denote by σ:σ : I t −→ GL(2, F l ν ).Since σ(I t ) is a finite abelian group <strong>and</strong> the elements of I t have order prime to l,this representati<strong>on</strong> is semisimple <strong>and</strong> can be diag<strong>on</strong>alized up<strong>on</strong> passing to F l . Infact, since the characteristic polynomials all have degree two, σ can be diag<strong>on</strong>alizedover F l 2ν . Thus σ corresp<strong>on</strong>ds to a pair of charactersα, β : I t −→ F ∗ l 2ν .These characters have some stability properties since σ is the restricti<strong>on</strong> of ahomomorphism from the full decompositi<strong>on</strong> group. C<strong>on</strong>sider the tower of fieldsK( n√ l)K = Q urlQ l


214 21. Serre’s C<strong>on</strong>jectureLet G = Gal(K( n√ l)/Q l ). Recall that Gal(K( n√ l)/K) ∼ = µ n (K) <strong>and</strong> Gal(K/Q l )is topologically generated by Frob l . If h ∈ µ n (K) <strong>and</strong> g ∈ G restricts to Frob lthen we have the c<strong>on</strong>jugati<strong>on</strong> formula:ghg −1 = h l .Applying this to our representati<strong>on</strong> σ with h ∈ I t we find thatsoσ(ghg −1 ) = σ(h l ) = σ(h) lσ(g)σ(h)σ(g −1 ) = σ(ghg −1 ) = σ(h) l .The point is that the representati<strong>on</strong> h ↦→ σ(h) l is equivalent to h ↦→ σ(h) viac<strong>on</strong>jugati<strong>on</strong> by σ(g). We c<strong>on</strong>clude that the pair of characters {α, β} is stableunder l-th powering, i.e., as a set{α, β} = {α l , β l }.What does this mean? There are two possibilities:• Level 1: α l = α <strong>and</strong> β l = β.• Level 2: α l = β <strong>and</strong> β l = α, but α ≠ β.Note that in the level 1 case, α <strong>and</strong> β take values in F ∗ l ν .21.5.5 Recipe for the weightWe will play a carnival game, “guess your weight.” First we c<strong>on</strong>sider the level 2 case.Our strategy is to express α <strong>and</strong> β in terms of the two fundamental characters oflevel 2. We then observe how the characters associated to a newform are expressedin terms of the two fundamental characters.The remainder of this chapter is devoted to motivating the following definiti<strong>on</strong>.Let ρ <strong>and</strong> σ be as above. Let ψ, ψ ′ be the two fundamental characters of level2 <strong>and</strong> χ the fundamental character of level 1 (the cyclotomic character). Serre’srecipe for k(ρ) is as follows.1. Suppose that α <strong>and</strong> β are of level 2. We have( ) α 0ρ|I l = .0 βAfter interchanging α <strong>and</strong> β if necessary, we have (uniquely) α = ψ r+lq =ψ r (ψ ′ ) q <strong>and</strong> β = (ψ ′ ) r ψ q with 0 ≤ r < q ≤ l − 1. We set k(ρ) = 1 + lr + q.2. Suppose that α <strong>and</strong> β are of level 1. We have( ) χr∗ρ|I l =0 χ q .(a) If ∗ = 0, normalize <strong>and</strong> reorder r, q so that 0 ≤ r ≤ q ≤ l − 2. We setk(ρ) = 1 + lr + q.(b) If ∗ ≠ 0, normalize so that 0 ≤ q ≤ l − 2 <strong>and</strong> 1 ≤ r ≤ l − 1. We seta = min(r, q), b = max(r, q). If χ r−q = χ <strong>and</strong> ρ ⊗ χ −q is not finite at lthen we set k(ρ) = 1 + la + b + l − 1; otherwise we set k(ρ) = 1 + la + b.


21.5.6 The World’s first view of fundamental characters21.5 The Weight 215First some background. Suppose E/Q is an elliptic curve <strong>and</strong> l is a prime (2 isallowed). Assume E has good supersingular reducti<strong>on</strong> at l, so Ẽ(F l)[l] = {0}.Then there is a Galois representati<strong>on</strong>ρ : Gal(Q/Q) −→ Aut(E[l])which, a priori, may or may not be irreducible. As above, ρ gives rise to twocharactersα, β : I t → F ∗ l 2.Serre (see [V´72]) proved that α, β are the two fundamental characters of level 2<strong>and</strong> that I t acts irreducibly over F l . [[Is this right?]] He also observed that thereis a mapI t → F ∗ l 2 ⊂ GL(2, F l)where F ∗ lsits inside GL(2, F 2l ) via the acti<strong>on</strong> of the multiplicative group of a field<strong>on</strong> itself after choice of a basis, <strong>and</strong> the map to F ∗ lis through <strong>on</strong>e of the two2fundamental characters of level 2.21.5.7 F<strong>on</strong>taine’s theoremSerre next asked F<strong>on</strong>taine to identify the characters α <strong>and</strong> β attached to more generalmodular representati<strong>on</strong>s. F<strong>on</strong>taine’s answer was published in [Edi92b]. Supposef = ∑ a n q n is a newform of weight k such that 2 ≤ k ≤ l <strong>and</strong> the level N(f)of f is prime to l. Let ρ = ρ f,λ where λ is a prime of Q(. . . , a n , . . .) lying over l.Assume we are in the supersingular case, i.e.,a l ≡ 0 mod λ.Semisimplifying as before gives a pair of charactersα, β : I t −→ F ∗ l 2ν .Let ψ, ψ ′ : I t → F ∗ lbe the two fundamental characters of level 2 so ψ = (ψ ′ ) l <strong>and</strong>2ψ ′ = ψ l .Theorem 21.5.2. With the above notati<strong>on</strong> <strong>and</strong> hypothesis, the characters α, βarising by semisimplifying <strong>and</strong> restricting ρ f,λ satisfy{α, β} = {ψ k−1 , (ψ ′ ) k−1 }.21.5.8 Guessing the weight (level 2 case)We now try to guess the weight in the level 2 case. We begin with a representati<strong>on</strong>ρ whose semisimplificati<strong>on</strong> is a representati<strong>on</strong> σ, which in turn gives rise to a pairof level 2 characters{α, β} = {ψ a , (ψ ′ ) a = ψ la }.Since ψ takes values in F ∗ l 2 , we may think of a as a number modulo l 2 − 1. Notealso that the pair is unchanged up<strong>on</strong> replacing a by la. The c<strong>on</strong>diti<strong>on</strong> that we arenot in level 1 is that a is not divisible by l + 1 sinceψψ ′ = ψ l+1 : I t → F ∗ l


216 21. Serre’s C<strong>on</strong>jectureis the unique fundamental character of level 1, i.e., the mod l cyclotomic character.Normalize a so that 0 ≤ a < l 2 − 1 <strong>and</strong> write a = ql + r. What are the possiblevalues for q <strong>and</strong> r? By the Euclidean algorithm 0 ≤ r ≤ l − 1 <strong>and</strong> 0 ≤ q ≤ l − 1. Ifr = q then a is a multiple of l + 1, so r ≠ q. If r > q, multiply the above relati<strong>on</strong>by l to obtain al = ql 2 + rl. But we are working mod l 2 − 1 so this becomesal = q + rl (mod l 2 − 1). Thus if we replace a by al then the roles of q <strong>and</strong> r areswapped in the Euclidean divisi<strong>on</strong>. Thus we can assume that 0 ≤ r < q ≤ l − 1.Nowα = ψ a = ψ ql+r = (ψ ′ ) q ψ r = (ψψ ′ ) r (ψ ′ ) q−r ,so{α, β} = {(ψψ ′ ) r (ψ ′ ) q−r , (ψψ ′ ) r ψ q−r }.Since ψψ ′ = χ is the mod l cyclotomic character, we can view {α, β} as a pair ofcharacters (ψ ′ ) q−r <strong>and</strong> ψ q−r which has been multiplied, as a pair, by χ r .What weight do we guess if r = 0? In this case{α, β} = {(ψ ′ ) k−1 , ψ k−1 }where k = 1 + q. So in analogy with Theorem 21.5.2 we guess thatk(ρ) = 1 + q(r = 0, supersingular case).What do we guess in general? Suppose f = ∑ a n q n is a modular form thoughtof mod l which gives rise to ρ, <strong>and</strong> that l does not divide the level of f. We mightas well ask what modular form gives rise to ρ ⊗ χ. In [?] we learn thatθf = ∑ na n q n (mod l)is a mod l eigenform, <strong>and</strong> it evidently gives rise to ρ ⊗ χ. Furthermore, if k is theweight of f, then θf has weight k + l + 1. Sincewe guess that{α, β} = {ψ q−r , (ψ ′ ) q−r } · χ rk(ρ) = q − r + 1 + (l + 1)r = 1 + lr + q(supersingular case)But be careful! the minimal weight k does not have to go up by l − 1, thoughit usually does. This is described by the theory of θ-cycles which we will reviewshortly.21.5.9 θ-cyclesThe theory of the θ operator was first developed by Serre <strong>and</strong> Swinnert<strong>on</strong>–Dyer<strong>and</strong> then later jazzed up by Katz in [?]. There is a noti<strong>on</strong> of modular forms modl <strong>and</strong> of q-expansi<strong>on</strong> which gives a mapα : ⊕ k≥0M k (Γ 1 (N); F l ) −→ F l [[q]].This map is not injective. The kernel is the ideal generated by A − 1 where A isthe Hasse invariant.


21.5 The Weight 217Suppose f ∈ F l [[q]] is in the image of α. If f ≠ 0 let w(f) denote the smallest kso that f comes from some M k . If f does not coming from any single M k do notdefine w(f). Define an operator θ <strong>on</strong> F l [[q]] byθ( ∑ a n q n ) = q d dq (∑ a n q n ) = ∑ na n q n .Serre <strong>and</strong> Swinnert<strong>on</strong>-Dyer showed that θ preserves the image of α.Theorem 21.5.3. Suppose f ≠ 0 is a mod l modular form as above. If l ∤ w(f)then w(θf) = w(f) + l + 1.Associated to f we have a sequence of n<strong>on</strong>negative integersw(f), w(θf), w(θ 2 f), . . .Fermat’s little theorem implies that this sequence is periodic because θ l f = θf<strong>and</strong> so w(θ l f) = w(θf). We thus call the cyclic sequence w(f), w(θf), w(θ 2 f), . . .the θ-cycle of f. Tate asked:What are the possible θ-cycles?This questi<strong>on</strong> was answered in [Joc82] <strong>and</strong> [Edi92b]. We now discuss the answerin a special case.Let f be an eigenform such that 2 ≤ k = w(f) ≤ l <strong>and</strong> f is supersingular, i.e.,a l (f) = 0. Since f is an eigenform the a n are multiplicative (a nm = a n a m for(n, m) = 1) <strong>and</strong> if ε denotes the character of f thena l i = a l i−1 · a l − ε(l)l k−1 a l−2= a l i−1 · a l = 0since we are working in characteristic l <strong>and</strong> k ≥ 2. Thus a n (f) = 0 whenever l | n<strong>and</strong> so θ l−1 f = f hence w(θ l−1 f) = w(f).If we apply θ successively to f what happens? Before proceeding we remark thatit can be proved that there is at most <strong>on</strong>e drop in the sequencek = w(f), w(θf), w(θ 2 f), . . . , w(θ l−2 f).First suppose k = 2. The θ-cycle must be2, 2 + (l + 1), 2 + 2(l + 1), . . . , 2 + (l − 2)(l + 1).This is because, by Theorem 21.5.3, applying θ raises the weight by l + 1 so l<strong>on</strong>gas the weight is not a multiple of l. Only the last term in the above sequence isdivisible by l. There are l − 1 terms so this is the full θ-cycle.Next suppose k = l. The θ-cycle isl, 3, 3 + (l + 1), . . . , 3 + (l + 1)(l − 3).The last term is divisible by l, no earlier term after the first is, <strong>and</strong> there are l − 1terms so this is the full θ-cycle. We know that the sec<strong>on</strong>d term must be 3 since itis the <strong>on</strong>ly number so that the θ-cycle works out right, i.e., so that the (l − 1)st


218 21. Serre’s C<strong>on</strong>jectureterm is divisible by l but no earlier term except the first is. For example, if wewould have tried 2 instead of 3 we would have obtained the sequencel, 2, 2 + (l + 1), . . . , 2 + (l + 1)(l − 3), 2 + (l + 1)(l − 2).This sequence has <strong>on</strong>e too many terms.Now we c<strong>on</strong>sider the remaining values of k: 2 < k < l. The θ-cycle isk, k + (l + 1), . . . , k + (l + 1)(l − k), k ′ , k ′ + l + 1, . . . , k ′ + (l + 1)(k − 3).The first l − k + 1 terms of the sequence are obtained by adding l + 1 successivelyuntil obtaining a term k + (l + 1)(l − k) which is divisible by l. Applying θ to aform of weight k + (l + 1)(l − k) causes the weight to drop to some k ′ . How canwe guess k ′ ? It must be such that k ′ + (l + 1)(k − 3) is divisible by l. Thus thecorrect answer isk ′ = l + 3 − k.21.5.10 Edixhoven’s paperSuppose that ρ is an irreducible mod l representati<strong>on</strong> so that the level of the charactersα, β associated to the semisimplificati<strong>on</strong> of ρ are level 2. To avoid problemsin a certain excepti<strong>on</strong>al case assume l ≥ 3. Edixhoven [Edi92b] proved that ifρ arises from an eigenform in S k (Γ 1 (N)) with (N, l) = 1, then ρ arises from aneigenform in S k(ρ) (Γ 1 (N)). What are the elements of the proof?1. The behavior of ρ|I l when ρ arises from an f with 2 ≤ w(f) ≤ l + 1.2. The fact that every modular representati<strong>on</strong> ρ has the form ρ f,λ ⊗ χ i wherei ∈ Z/(l − 1)Z, <strong>and</strong> 2 ≤ w(f) ≤ l + 1.Serre knew the sec<strong>on</strong>d element but never published a proof. How did Serre talkabout his result before Edixhoven’s paper? The eigenform f correp<strong>on</strong>ds to a systemof eigenvalues in F l of the <strong>Hecke</strong> operators T r , r ∤ lN. Eigenforms of weight atmost l + 1 give, up to twist, all systems of <strong>Hecke</strong> eigenvalues. A possible proof ofthis uses the c<strong>on</strong>structi<strong>on</strong> of ρ f in terms of certain étale cohomology groups.21.6 The CharacterLetρ : G Q = Gal(Q/Q) −→ GL(2, F l ν )be a Galois representati<strong>on</strong>. Assume that ρ is irreducible, modular ρ ∼ = ρ λ,f , <strong>and</strong>l > 2. The degree of a character ϕ : (Z/NZ) ∗ → C ∗ is the cardinality |ϕ((Z/NZ) ∗ )|of its image.Theorem 21.6.1. Under the above hypothesis, ρ comes from a modular form fof weight k(ρ), level N(ρ), <strong>and</strong> under a certain extra assumpti<strong>on</strong>, character ε ofdegree prime to l.Extra Assumpti<strong>on</strong>: Not all of the following are true.1. l = 3,


21.6 The Character 2192. ρ| Gal(Q/Q( √ −3)) is abelian, <strong>and</strong>3. det(ρ) is not a power of the mod 3 cyclotomic character χ.Example 21.6.2. If ρ comes from the Galois representati<strong>on</strong> <strong>on</strong> the 3-torsi<strong>on</strong> of anelliptic curve, then det(ρ) = χ is the mod 3 cyclotomic character, so the extraassumpti<strong>on</strong> does hold <strong>and</strong> the theorem applies.Let ρ be a representati<strong>on</strong> as above. Then it is likely thatdet ρ : G Q → F l νis ramified at l. Let χ be the mod l cyclotomic character.Propositi<strong>on</strong> 21.6.3. Let ϕ : G Q → F ∗ lbe a c<strong>on</strong>tinuous homomorphism. Thenνϕ = θχ i for some i <strong>and</strong> some θ : G Q → F ∗ lν which is unramified at l.Proof. Since ϕ is c<strong>on</strong>tinuous <strong>and</strong> F ∗ lis finite, the subfield K of Q fixed by ker(ϕ) isνa finite Galois extensi<strong>on</strong> of Q. Since the image of ϕ is abelian, the Galois group of Kover Q is abelian. By the Kr<strong>on</strong>ecker-Weber theorem [Lan94, X.3] there is a cyclotomicextensi<strong>on</strong> Q(ζ n ) = Q(exp 2πi/n ) which c<strong>on</strong>tains K. Since Gal(Q(ζ n )/Q) ∼ =(Z/nZ) ∗ , the character ϕ gives a homomorphism ϕ ′ : (Z/nZ) ∗ → F ∗ l . Writeνn = ml j with m coprime to l. Then(Z/nZ) ∗ ∼ = (Z/mZ × Z/l j Z) ∗ ∼ = (Z/mZ) ∗ × (Z/l j Z) ∗which decomposes ϕ ′ as a product θ ′ × ψ ′ whereθ ′ : (Z/mZ) ∗ → F ∗ l νψ ′ : (Z/l j Z) ∗ → F ∗ l ν .QQ(ζ n ) Q(ζ l j )Q(ζ m ) K(Z/l j Z) ∗ (Z/mZ) ∗Q⊂F ∗ l νThe character θ is obtained by lifting θ ′ . It is unramified at l because it factorsthrough Gal(Q(ζ m )/Q) <strong>and</strong> l ∤ m. The cardinality of (Z/l j Z) ∗ is (l − 1)l j−1whereas the cardinality of F ∗ l is (l − ν 1)(lν−1 + · · · + 1) so the image of ψ ′ lies inF ∗ l . Thus ψ′ lifts to a power χ i of the cyclotomic character.Using the propositi<strong>on</strong> write det(ρ) = θχ i with θ unramified at l. As in the proofof the propositi<strong>on</strong> we can write θ as a Dirichlet character (Z/mZ) ∗ → F ∗ lν . It can


220 21. Serre’s C<strong>on</strong>jecturebe shown using properties of c<strong>on</strong>ductors that m can be chosen so that m|N(ρ).Thus we view θ as a Dirichlet characterθ : (Z/N(ρ)Z) ∗ → F ∗ l ν .Let H = ker θ ⊂ (Z/N(ρ)Z) ∗ . Define a c<strong>on</strong>gruence subgroup Γ H (N) by{( )}a bΓ H (N) = ∈ Γc d 0 (N) : a, d ∈ H .We have the following theorem.Theorem 21.6.4. Suppose l > 2 <strong>and</strong> ρ satisfies the above assumpti<strong>on</strong>s includingthe extra assumpti<strong>on</strong>. Then ρ arises from a form inS k(ρ) ( Γ H (N(ρ)) ).In particular, the theorem applies if ρ comes from the l-torsi<strong>on</strong> representati<strong>on</strong><strong>on</strong> an elliptic curve, since then det = χ so the extra assumpti<strong>on</strong> is satisfied.21.6.1 A CounterexampleOne might ask if the extra assumpti<strong>on</strong> in Theorem 21.6.4 is really necessary. Atfirst Serre suspected it was not. But he was surprised to discover the followingexample which shows that the extra assumpti<strong>on</strong> can not be completely eliminated.The space S 2 (Γ 1 (13)) is 2 dimensi<strong>on</strong>al, spanned by the eigenformf = q + αq 2 + (−2α − 4)q 3 + (−3α − 7)q 4 + (2α + 3)q 5 + · · ·<strong>and</strong> the Gal(Q( √ −3)/Q)-c<strong>on</strong>jugate of f, where α 2 + 3α + 3 = 0. The characterε : (Z/13Z) ∗ → C ∗ of f has degree 6. Let λ = ( √ 3) <strong>and</strong> letρ f,λ : Gal(Q/Q) → GL(2, F 3 )be the associated Galois representati<strong>on</strong>. Then det(ρ f,λ ) = χθ where χ is the mod3 cyclotomic character <strong>and</strong> θ ≡ ε (mod 3). In particular θ has order 2. ThusH = ker(θ) ⊂ (Z/13Z) ∗ is exactly the index two subgroup of squares in (Z/13Z) ∗ .The c<strong>on</strong>clusi<strong>on</strong> of the theorem can not hold since S 2 (Γ H (13)) = 0. This is becauseany form would have to have a character whose order is at most two since it mustbe trivial <strong>on</strong> H, but S 2 (Γ H (13)) ⊂ S 2 (Γ 1 (13)) <strong>and</strong> S 2 (Γ 1 (13)) is spanned by f <strong>and</strong>its Galois c<strong>on</strong>jugate, both of which have character of order 6. In this example• l = 3,• ρ λ,f | Gal(Q/Q( √ −3)) is abelian, <strong>and</strong>• det ρ is not a power of χ.A good way to see the sec<strong>on</strong>d asserti<strong>on</strong> is to c<strong>on</strong>sider the following formula:f ⊗ ε −1 = f(up to an Euler factor at 13) in the sense thatρ f,λ ⊗ ε −1 = ρ f.


21.7 The Weight revisited: level 1 case 221Now reduce mod 3 to obtainρ f,λ ⊗ ε −1 ∼ = ρf,λsince f ≡ f (mod √ −3) (since 3 is ramified). Thus ρ f,λ is isomorphic to a twistof itself by a complex character so ρ f,λ is reducible <strong>and</strong> abelian over the fieldcorresp<strong>on</strong>ding to its kernel. In fact, by the same argument, ρ f,λ is also abelianwhen restricted to the Galois groups of Q( √ 13) <strong>and</strong> Q( √ 39).[[Give more details <strong>and</strong> describe David J<strong>on</strong>es’s thesis.]]21.7 The Weight revisited: level 1 caseWe are interested in the recipe for k(ρ) in the level 1 case. If we semisimplify <strong>and</strong>restrict to inertia we obtain a direct sum of two representati<strong>on</strong>s. In the level 1 caseboth representati<strong>on</strong>s are powers of the cyclotomic character. There are thus twopossibilities for ρ|I:ρ|I =( ) χα∗0 χ βor ρ|I =( ) χα00 χ β .In the sec<strong>on</strong>d case we guess k(ρ) by looking at the exp<strong>on</strong>ents <strong>and</strong> normalizing asbest we can. Since α <strong>and</strong> β are <strong>on</strong>ly defined mod l − 1 we may, after relabeling ifnecessary, assume that 0 ≤ α ≤ β ≤ l − 2. Factoring out χ α we obtain( ) 1 0χ α ⊗0 χ β−α .Next (secretly) recall that if f is ordinary of weight k then f gives rise to therepresentati<strong>on</strong> (χ k−1 ∗)0 1(here ∗ can be trivial). If f is of weight β − α + 1, applying the θ operator α timesgives the desired representati<strong>on</strong>. Thus the recipe for the weight isw(ρ) = (l + 1)α + β − α + 1 = β + lα + 1.There is <strong>on</strong>e caveat: Serre was uncomfortable with weight 1 forms, so if α = β = 0he defines k(ρ) = l instead of k(ρ) = 1.Ogus asks what is wr<strong>on</strong>g with weight 1, <strong>and</strong> Ribet replies that Serre didn’t knowa satisfactory way in which to define modular forms in weight 1. Merel then addsthat Serre was frustrated because he could not do explicit computati<strong>on</strong>s in weight1.21.7.1 Compani<strong>on</strong> formsSuppose f is ordinary (a l ∉ λ) of weight k, 2 ≤ k ≤ l + 1, <strong>and</strong> let ρ f,λ be theassociated representati<strong>on</strong>. Then( )χk−1∗ρ f,λ |I l =.0 1


222 21. Serre’s C<strong>on</strong>jectureTo introduce the idea of a compani<strong>on</strong> form suppose that somehow by chance ∗ = 0.Twisting ρ by χ 1−k gives( ) 1 0ρ ⊗ χ 1−k |I l = ρ ⊗ χ l−k |I l =0 χ l−k .What is the minimum weight of a newform giving rise to such a representati<strong>on</strong>?Because the two characters 1 <strong>and</strong> χ take values in F ∗ lwe are in the level 1 situati<strong>on</strong>.The representati<strong>on</strong> is semisimple, reducible, <strong>and</strong> α = 0, β = l − k, so the naturalweight is l+1−k. Thus C<strong>on</strong>jecture B predicts that there should exist another formg of weight l + 1 − k such that ρ g∼ = ρf ⊗ χ l−k (over F l ). Such a form g is calleda compani<strong>on</strong> of f. In characteristic l, we have g = θ l−k f. This c<strong>on</strong>jecture was forthe most part proved by Gross [Gro90] when k ≠ 2, l, <strong>and</strong> by Coleman-Voloch[CV92] when k = l. [[It would be nice to say something here about the subtletiesinvolved in going from this mod l form g to the compani<strong>on</strong> form g produced byGross-Coleman-Voloch. Roughly, how are the two objects linked?]]21.7.2 The Weight: the remaining level 1 caseLetρ : Gal(Q/Q) −→ GL(2, F l ν )be a Galois representati<strong>on</strong> with l > 2. Assume that ρ is irreducible <strong>and</strong> modular.Then ρ comes from a modular form in S k(ρ) (Γ 1 (Nl)), with 2 ≤ k(ρ) ≤ l 2 − 1. Westill must define k(ρ) in the remaining level 1 case in which( ) χα∗ρ|I l =0 χ β .If α ≠ β + 1 thenk(ρ) = 1 + la + bwhere a = min(α, β) <strong>and</strong> b = max(α, β). Now assume α = β + 1. Then( )ρ|I l = χ β χ ∗⊗ .0 1Define a representati<strong>on</strong> of Gal(Q/Q) by( )σ = ρ ⊗ χ −β ∼ χ ∗ = .0 1We now give a motivated recipe for k(σ). Granted that “finite at l” is defined inthe next secti<strong>on</strong>, the recipe is{2 if σ is finite at l,k(σ) =l + 1 otherwise.This is enough to determine k(ρ) givingk(ρ) = k(χ β ⊗ σ) = (l + 1)β + k(σ).


21.7.3 FinitenessWe c<strong>on</strong>tinue with the notati<strong>on</strong> of the previous secti<strong>on</strong>. Letdenote a decompositi<strong>on</strong> group at l.21.7 The Weight revisited: level 1 case 223D = D l = Gal(Q l /Q l ) ↩→ Gal(Q/Q)Definiti<strong>on</strong> 21.7.1. We say that σ|D is finite if it is equivalent to a representati<strong>on</strong>of the form G(Q l ), where G is a finite flat F l ν -vector space scheme over Z l .This definiti<strong>on</strong> may not be terribly enlightening, so we c<strong>on</strong>sider the followingspecial case. Suppose E/Q is an elliptic curve with semistable (=good or multiplicative)reducti<strong>on</strong> at l. Then E[l] defines a representati<strong>on</strong>σ : Gal(Q/Q) → Aut E[l] ∼ = GL(2, F l ).Let ∆ E be the minimal discriminant of E.Propositi<strong>on</strong> 21.7.2. With notati<strong>on</strong> as above, σ is finite at l if <strong>and</strong> <strong>on</strong>ly iford l ∆ E ≡ 0(mod l).If p ≠ l (<strong>and</strong> E has semistable reducti<strong>on</strong> at p) then σ is unramified at p if <strong>and</strong><strong>on</strong>ly iford p ∆ E ≡ 0 (mod l).We give some hint as to how the proof goes when p ≡ 1 (mod l). Let E be anelliptic curve with multiplicative reducti<strong>on</strong> at p. Set V = E[l] = F l ⊕ F l . We havea representati<strong>on</strong>σ : D = Gal(Q l /Q l ) → Aut V.The theory of Tate curves gives an exact sequence of D-modules0 → X → V α −→ Y → 0.Each of these terms is a D module <strong>and</strong> X <strong>and</strong> Y are 1-dimensi<strong>on</strong>al as F l -vectorspaces. The acti<strong>on</strong> of D <strong>on</strong> Y is given by an unramified character ε of degreedividing 2. The acti<strong>on</strong> of D <strong>on</strong> X is given by χε.Next we define an element of H 1 (D, Hom Fl (Y, X)). A splitting s : Y → V isan F l -linear map (not necessarily a map of D-modules) such that αs = 1. Choosesuch a splitting. For each d ∈ D c<strong>on</strong>sider the twisting d s : Y → V defined byd s(y) = ds(d −1 y). Sinceα(ds(d −1 y)) = d(α(s(d −1 y))) = d(d −1 y) = yif follows that d s is again a splitting. Thus d s − s : Y → V followed by α : V → Y isthe zero map. Since d s − s is a linear map, d s − s ∈ Hom Fl (Y, X). The map d ↦→ d sdefines a 1-cocycle which gives an element of H 1 (D, Hom Fl (Y, X)). There is anisomorphism of D-modules Hom Fl (Y, X) ∼ = µ l so we have isomorphismsH 1 (D, Hom Fl (Y, X)) ∼ = H 1 (D, µ l ) ∼ = Q ∗ p/(Q ∗ p) l .The last isomorphism follows from Kummer theory since Q p is assumed to c<strong>on</strong>tainthe l-th roots of unity (our assumpti<strong>on</strong> that p ≡ 1 (mod l)). Thus σ defines an


224 21. Serre’s C<strong>on</strong>jectureelement of Q ∗ p/(Q ∗ p) l . Serre proved that τ is finite if <strong>and</strong> <strong>on</strong>ly if the corresp<strong>on</strong>dingelement of Q ∗ p/(Q ∗ p) l is in the image of Z ∗ p.If E/Q p is an elliptic curve with multiplicative reducti<strong>on</strong> then there exists aTate parameter q ∈ Q ∗ p with Val p (q) > 0 such thatE ∼ = E q := G m /q Zover the unique quadratic unramified extensi<strong>on</strong> of Q p . The kernel of multiplicati<strong>on</strong>by l gives rise to an exact sequence as above, which is obtained by applying thesnake lemma c<strong>on</strong>necting E[l] to Y in the following diagram:0 q Z lq ZYX G mlG m 0E[l] El EFurthermore, the element of Q ∗ p/(Q ∗ p) l defined by the representati<strong>on</strong> commingfrom E[l] is just the image of q. One has∞∏∆ E = q (1 − q n ) 24 .n=1Note that the product factor is a unit in Q p , so Val p ∆ E = Val p q.


22Fermat’s Last Theorem22.1 The applicati<strong>on</strong> to Fermat“As part of this parcel, I can sketch the applicati<strong>on</strong> to Fermat.”Suppose that semistable elliptic curves over Q are modular. Then FLT is true.Why? “As I have explained so many times. . . ” Suppose l > 5 <strong>and</strong>a l + b l + c l = 0with abc ≠ 0, all relatively prime, <strong>and</strong> such that A = a l ≡ −1 (mod 4), B = b l iseven <strong>and</strong> C = c l ≡ 1 (mod 4). Then we c<strong>on</strong>sider the elliptic curveThe minimal discriminant isE : y 2 = x(x − A)(x − B).∆ E = (ABC)22 8as discussed in Serre’s [Ser87] <strong>and</strong> [DK95]. The c<strong>on</strong>ductor N E is equal to the productof the primes dividing ABC (so in particular N E is square-free). Furthermore,E is semistable – the <strong>on</strong>ly hard place to check is at 2. Diam<strong>on</strong>d-Kramer checksthis explicitly.Here is how to get Fermat’s theorem. View E[l] as a G = Gal(Q/Q)-module.The idea is to show that this representati<strong>on</strong> must come from a modular form ofweight 2 <strong>and</strong> level 2. This will be a c<strong>on</strong>tradicti<strong>on</strong> since there are no modular formsof weight 2 <strong>and</strong> level 2. But to apply the level <strong>and</strong> weight theorem we need toknow that E[l] is irreducible. The proof of this is due to Mazur.Let ρ : G → Aut E[l] be the Galois representati<strong>on</strong> <strong>on</strong> the l torsi<strong>on</strong> of E. SinceE is semistable for p ≠ l,(ρ|I p∼ 1 ∗)= .0 1


226 22. Fermat’s Last TheoremAssume ρ is reducible. Then ρ has an invariant subspace soρ ∼ =( α ∗).0 βThen from the form of ρ|I p we see that the characters α <strong>and</strong> β could be ramified<strong>on</strong>ly at l. Thus α = χ i <strong>and</strong> β = χ 1−i where χ is the mod l cyclotomic character.The exp<strong>on</strong>ents are i <strong>and</strong> 1 − i since αβ is the determinant which is χ. [[Why is χsupposed to be the <strong>on</strong>ly possible unramified character? probably since whateverthe character is, it is a product of χ times something else, <strong>and</strong> the other factor isramified.]]What happens to ρ at l, i.e., what is ρ|I l ? There are <strong>on</strong>ly two possibilities. Eitherρ|I l is the direct sum of the two fundamental characters or it is the sum of thetrivial character with χ. The sec<strong>on</strong>d possibility must be the <strong>on</strong>e which occurs. [[Ido not underst<strong>and</strong> why... something about “characters globally are determined bylocal informati<strong>on</strong>.”]] So either i = 0 or i = 1. If i = 0,ρ =( ) 1 ∗.0 χThis means that there is an element of E[l] whose subspace is left invariant underthe acti<strong>on</strong> of Galois. Thus E has a point of order l rati<strong>on</strong>al over Q. If i = 1 thenρ =( ) χ ∗.0 1Therefore µ l ↩→ E[l]. Divide to obtain E ′ = E/µ l . The representati<strong>on</strong> <strong>on</strong> E[l]/µ lis c<strong>on</strong>stant ( (this ) is basic linear algebra) so the resulting representati<strong>on</strong> <strong>on</strong> E ′ [l] hasthe form so E ′ has a rati<strong>on</strong>al point of order l.1 ∗0 χNow E[2] is a trivial Galois module since it c<strong>on</strong>tains 3 obvious rati<strong>on</strong>al points,namely (0, 0), (A, 0), <strong>and</strong> (B, 0). Thus the group structure <strong>on</strong> the curve E (or E ′ )(which we c<strong>on</strong>structed from a counterexample to Fermat) c<strong>on</strong>tainsZ/2Z ⊕ Z/2Z ⊕ Z/lZ.In Mazur’s paper [[“rati<strong>on</strong>al isogenies of prime degree”]] he proves that l ≤ 3.Since we assumed that l > 5, this is a c<strong>on</strong>tradicti<strong>on</strong>. Notice that we have not justproved FLT. We have dem<strong>on</strong>strated the irreducibility of the Galois representati<strong>on</strong><strong>on</strong> the l torsi<strong>on</strong> of the elliptic curve E arising from a hypothetical counterexampleto FLT.We now have a representati<strong>on</strong>ρ : Gal(Q/Q) → GL(2, F l ) = Aut E[l]which is irreducible <strong>and</strong> modular of weight 2 <strong>and</strong> level N = N E (the c<strong>on</strong>ductorof E). Because ρ is irreducible we c<strong>on</strong>clude that ρ is modular of weight k(ρ) <strong>and</strong>level N(ρ). Furthermoreso k(ρ) = 2.ord l ∆ E = ord l (ABC) 2 = 2l ord l abc ≡ 0 (mod l)


22.2 <strong>Modular</strong> elliptic curves 227We can also prove that N(ρ) = 2. Clearly N(ρ)|N E . This is because N(ρ)computed locally at p ≠ l divides the power of p in the c<strong>on</strong>ductor of the l-adicrepresentati<strong>on</strong> for E at p. [[I do not underst<strong>and</strong> this.]] Since ρ is <strong>on</strong>ly ramified at 2or maybe l, N(ρ) must be a power of 2. For p ≠ l, ρ is ramified at p if <strong>and</strong> <strong>on</strong>ly iford p ∆ E ≢ 0 (mod l) which does happen when p = 2. Since N E is square free thisimplies that N(ρ) = 2. But S 2 (Γ 1 (2)) = 0, which is the ultimate c<strong>on</strong>tradicti<strong>on</strong>!But how do we know semistable elliptic curves over Q are modular?22.2 <strong>Modular</strong> elliptic curvesTheorem 22.2.1 (Theorem A). Every semistable elliptic curve over Q is modular.There are several ways to define a modular elliptic curve. Let E be an ellipticcurve. Then E is modular if there is a prime l > 2 such that the associated l-adicGalois representati<strong>on</strong>ρ E,l ∞ : G = Gal(Q/Q) → GL(2, Z l )defined by the l-power divisi<strong>on</strong> points <strong>on</strong> E is modular (i.e., it is a ρ f,λ ). Let E bean elliptic curve of c<strong>on</strong>ductor N E . For each prime p not dividing N E <strong>on</strong>e associatesa number a p related [[in a simple way!]] to the number of points <strong>on</strong> the reducti<strong>on</strong>of E modulo p. Then E is modular if there exists a cusp form f = ∑ b n q n whichis an eigenformfor the <strong>Hecke</strong> operators such that a p = b p for almost all p. In the end <strong>on</strong>e deducesthat f can be chosen to have weight 2, trivial character, <strong>and</strong> level N E . [Shimura]An elliptic curve E is modular if there is n<strong>on</strong>c<strong>on</strong>stant map X 0 (N) → E for someN.Theorem 22.2.2 (Theorem B. Wiles, Taylor-Wiles). Suppose E is a semistableelliptic curve over Q <strong>and</strong> suppose l is an odd prime such that E[l] is irreducible<strong>and</strong> modular, then ρ E,l ∞ is modular <strong>and</strong> hence E is modular.Now we sketch a proof that theorem B implies theorem A. First take l = 3. IfE[3] is irreducible then by work of Langl<strong>and</strong>s-Tunnel we win. The idea is to takeρ : G → GL(2, F 3 ) ↩→ GL(2, Z[ √ −2]) ⊂ GL(2, C).The point is that there are two maps Z[ √ −2] → F 3 given by reducti<strong>on</strong> modulo eachof the primes lying over 3. Choosing <strong>on</strong>e gives a map GL(2, Z[ √ −2]) → GL(2, F 3 )which, for some amazing reas<strong>on</strong> [[which is?]], has a secti<strong>on</strong>. Then ρ : G → GL(2, C)is a c<strong>on</strong>tinuous representati<strong>on</strong> with odd determinant which must still be irreducible.By Langl<strong>and</strong>s-Tunnel we know that ρ is modular <strong>and</strong> in fact ρ comes from a weight1 cusp form f of level a power of 3 times powers of most primes dividing c<strong>on</strong>d(E).Reducing f modulo some prime of Z[ √ −2] lying over 3 we obtain a mod 3 modularform which corresp<strong>on</strong>ds to ρ : G → E[3]. The proof of all this uses the immensebase-change business in Langl<strong>and</strong>s’ book. [[Ribet next says: “have to get 3’s outof the level! This jacks up the weight, <strong>and</strong> the level is still not square free. Thenhave to adjust the weight again.” I do not know what the point of this is.]]Kevin Buzzard asked a questi<strong>on</strong> relating to how <strong>on</strong>e knows the hypothesis neededfor the theorem <strong>on</strong> weights <strong>and</strong> levels applies in our situati<strong>on</strong>. To answer this


228 22. Fermat’s Last Theoremsuppose l = 3 or 5. Form the associated representati<strong>on</strong> ρ : G → GL(2, F l ) comingfrom E[l] <strong>and</strong> assume it is irreducible, modular <strong>and</strong> semistable.A mod l Galois representati<strong>on</strong> is semistable if for all p ≠ l, the inertia group I pacts unipotently <strong>and</strong> the c<strong>on</strong>jectured weight is 2 or l + 1.Note that det ρ = χ.Lemma 22.2.3. Under the above assumpti<strong>on</strong>s l divides the order of the image ofρ.Proof. If not, then ρ is finite at all primes p, since for primes p ≠ l inertia actstrivially [[some other argument for l]]. Inertia acts trivially because if the order ofthe image of ρ is prime to l then ρ acts diag<strong>on</strong>ally. For if not then since ρ|I p is(unipotent (hence all eigenvalues are 1), in a suitable basis something like1 ψ0 1)isin the image of ρ <strong>and</strong> has order l, a c<strong>on</strong>tradicti<strong>on</strong>. Because of finiteness k(ρ) = 2<strong>and</strong> N(ρ) = 1 which is a c<strong>on</strong>tradicti<strong>on</strong> since there are no forms of weight 2 <strong>and</strong>level 1.Next we c<strong>on</strong>sider what happens if E[3] is reducible. There are two cases toc<strong>on</strong>sider. First suppose E[5] is also reducible. Then E c<strong>on</strong>tains a rati<strong>on</strong>al subgroupof order 15. We can check by h<strong>and</strong> that all such curves are modular. The key resultthat is that X 0 (15)(Q) is finite.The sec<strong>on</strong>d possibility is that E[5] is irreducible. In this case we first find a curveE ′ which is semistable over Q such that• E ′ [5] ∼ = E[5] (this is easy to do because of the lucky coincidence that X 0 (5)has genus 0)• E ′ [3] is irreducibleNext we discover that E ′ is modular since E ′ [3] is irreducible. This implies E ′ [5]is modular hence E[5] is modular. Theorem B then implies E is modular.


23Deformati<strong>on</strong>s23.1 Introducti<strong>on</strong>For the rest of the semester let l be an odd prime. Let ρ : G → GL(2, F l ν ) be suchthat• ρ is modular• ρ is irreducible• ρ is semistableThe representati<strong>on</strong> ρ is semistable if• for all p ≠ l,( )ρ|I p∼ 1 ∗ = ,0 1(∗ is typically trivial since most primes are unramified.)• k(ρ) = 2 or l + 1.This means that there are 2 possibilities.1. ρ is finite at D l .2.(ρ|D l∼ α ∗)=0 βwhere β is unramified. (Since det(ρ) = χ we can add that α|I l = χ.)If k(ρ) = 2 then possibility 1 occurs. If k(ρ) = l + 1 then we are in case 2, butbeing in case 2 does not imply that k(ρ) = l + 1. If ρ comes from an elliptic curveE/Q, then case 1 occurs if E has good reducti<strong>on</strong> at l whereas case 2 occurs if


230 23. Deformati<strong>on</strong>sE has ordinary multiplicative reducti<strong>on</strong> [[I am not sure about this last asserti<strong>on</strong>because I missed it in class.]].What is a deformati<strong>on</strong> of ρ <strong>and</strong> when can we prove that it is modular?Let A be a complete local Noetherian ring with maximal ideal m <strong>and</strong> residuefield F l ν (so A is furnished with a map A/m −→ ∼ F l ν ). Let˜ρ A : G → GL(2, A)be a representati<strong>on</strong> which is ramified at <strong>on</strong>ly finitely many primes. Assume ˜ρ = ˜ρ Alifts ρ, i.e., the reducti<strong>on</strong> of ˜ρ mod m gives ρ.Theorem 23.1.1. Let the notati<strong>on</strong> be as above, then ˜ρ is modular if <strong>and</strong> <strong>on</strong>ly ifit satisfies (∗).Neither of the terms in this theorem have been defined yet.23.2 C<strong>on</strong>diti<strong>on</strong> (∗)Letρ : G = Gal(Q/Q) → GL(2, F l ν )be a Galois representati<strong>on</strong>.The statement we wish to underst<strong>and</strong> is“All ˜ρ which satisfy (∗) are modular.”Let A be a complete local Noetherian ring with residue field F l ν . This meansthat we are given a map A/m ∼ = F l ν . Suppose ˜ρ : G → GL(2, A) satisfies thefollowing c<strong>on</strong>diti<strong>on</strong>s:• ˜ρ lifts ρ,• det ˜ρ = ˜χ,• ˜ρ is ramified at <strong>on</strong>ly finitely many primes, <strong>and</strong>• c<strong>on</strong>diti<strong>on</strong> (∗).What is c<strong>on</strong>diti<strong>on</strong> (∗)? It the requirement that ˜ρ have the same qualitativeproperties as ρ locally at l. There are two cases to c<strong>on</strong>sider.Case 1. (arising from supersingular reducti<strong>on</strong> at l) Suppose ρ is finite <strong>and</strong> flatat l. Then ρ|I l is given by the 2 fundamental charactersI l → F ∗ l 2of level 2 (instead of from powers of these characters because of the semistabilityassumpti<strong>on</strong>). C<strong>on</strong>diti<strong>on</strong> (∗) is that the lift of ρ is also c<strong>on</strong>strained to be finite <strong>and</strong>flat. This means that for every n ≥ 1,˜ρ|D l mod m n : D l → GL(2, A/m n )is finite <strong>and</strong> flat, i.e., it comes from a finite flat group scheme over Z l which isprovided with an acti<strong>on</strong> of A/m n as endomorphisms.


23.3 Classes of liftings 231Case 2. (bad multiplicative reducti<strong>on</strong> at l or good ordinary reducti<strong>on</strong> at l) Incase 2(ρ|D l∼ α ∗)=0 βwhere β is unramified (equivalently α|I l = χ). [[In the case of good ordinaryreducti<strong>on</strong> β is given by the acti<strong>on</strong> of Galois <strong>on</strong> E[l] in characteristic l.]] C<strong>on</strong>diti<strong>on</strong>(∗) is the requirement that˜ρ|D l∼(˜α ∗)= ,0 ˜βwhere ˜β is unramified, ˜α|I l = ˜χ. It follows automatically that ˜β lifts β.23.2.1 Finite flat representati<strong>on</strong>sIn general what does it mean for ρ to be finite <strong>and</strong> flat. It means that there existsa finite flat group scheme G over Z l such that G(Q l ) is the representati<strong>on</strong> spaceof ρ. This definiti<strong>on</strong> is subtle.Coleman asked if there is a way to reformulate the definiti<strong>on</strong> without menti<strong>on</strong>inggroup schemes. Ribet menti<strong>on</strong>ed Hopf algebras but then stopped. Buzzardsuggested some of the subtlety of the definiti<strong>on</strong> by claiming that in some situati<strong>on</strong>χ is finite flat whereas χ 2 is not. Ogus vaguely c<strong>on</strong>jectured that F<strong>on</strong>taine’slanguage is the way to underst<strong>and</strong> this.It is possible in case 2 above for ρ|D l to be finite flat without ˜ρ finite flat. Thequintessential example is an elliptic curve E with supersingular reducti<strong>on</strong> at l suchthat l| ord l ∆ E .23.3 Classes of liftingsLet Σ be a finite set of prime numbers. We characterize a class of liftings ˜ρ whichdepends <strong>on</strong> Σ. What does it mean for ˜ρ to be in the class of deformati<strong>on</strong>s corresp<strong>on</strong>dingto Σ?23.3.1 The case p ≠ lFirst we talk about the case when p ≠ l. If p ∈ Σ then there is no special c<strong>on</strong>diti<strong>on</strong><strong>on</strong> ˜ρ|I p . If p ∉ Σ <strong>on</strong>e requires that ˜ρ is qualitatively the same as ρ. This means1. If ρ is unramified at p (which is the usual case), then we just require that ˜ρis unramified at p.2. If ρ is ramified but unipotent at p so(ρ|I p∼ 1 ∗)=0 1we require that( )˜ρ|I p∼ 1 ∗ = .0 1This situati<strong>on</strong> can occur with an elliptic curve which has multiplicative reducti<strong>on</strong>at p <strong>and</strong> for which l ∤ ord p ∆ E .


232 23. Deformati<strong>on</strong>sAt this point I menti<strong>on</strong> the prime example.Example 23.3.1. Suppose ˜ρ = ρ f,λ is the λ-adic representati<strong>on</strong> attached to f. Whatcan we say about ord p N(f)? We know thatord p N(f) = ord p N(ρ) + dim(ρ) Ip − dim(˜ρ) Ipwhere (ρ) Ip means the inertia invariants in the representati<strong>on</strong> space of ρ. If ρ issemistable thenord p N(ρ) + dim(ρ) Ip = 2.Since we are assuming ρ is semistable, ord p N(f) ≤ 2. Furthermore, the c<strong>on</strong>diti<strong>on</strong>p ∉ Σ is a way of sayingord p N(f) = ord p N(ρ).Thus the requirement that p ∉ Σ is that the error term dim(ρ) Ip −dim(˜ρ) Ip vanish.Note that ord p N(˜ρ) = ord p N(f) by Carayol’s theorem. Thus ord p N(f) is justa different way to write ord p N(˜ρ).Example 23.3.2. Imagine ρ is ramified at p <strong>and</strong> ord p N(ρ) = 1. Then ord p N(f) iseither 1 or 2. The requirement that p ∉ Σ is that ord p N(f) = 1.23.3.2 The case p = lNext we talk about the case p = l. There are two possibilities: either l ∈ Σ orl ∉ Σ. If l ∈ Σ then we impose no further c<strong>on</strong>diti<strong>on</strong> <strong>on</strong> ˜ρ (besides the alreadyimposed c<strong>on</strong>diti<strong>on</strong> (∗), semistability at l, etc.). If l ∉ Σ <strong>and</strong> ρ if finite <strong>and</strong> flat(which is not always the case) then we require ˜ρ to be finite <strong>and</strong> flat. If l ∉ Σ <strong>and</strong>ρ is not finite flat then no further restricti<strong>on</strong> (this is the Tate curve situati<strong>on</strong>).Suppose ˜ρ = ρ f,λ , <strong>and</strong> ˜ρ bel<strong>on</strong>gs to the class defined by Σ. We want to guess(since there is no Carayol theorem) an integer N Σ such that N(f)|N Σ . What isN Σ ? It will bep 2 · ∏p ordp N(ρ) · l δ .N Σ = ∏ p≠lp∈Σp≠lp∉ΣHere ord p N(ρ) is 1 if <strong>and</strong> <strong>on</strong>ly if ρ is ramified at p. The exp<strong>on</strong>ent δ is either 0 or1. It is 1 if <strong>and</strong> <strong>on</strong>ly if k(ρ) = l + 1 or l ∈ Σ <strong>and</strong> ρ is ordinary at l, i.e.,(ρ|D l∼ α ∗)= .0 β[[This definiti<strong>on</strong> could be completely wr<strong>on</strong>g. It was definitely not presented clearlyin class.]]There is an exercise associated with this. It is to justify a priori the definiti<strong>on</strong>of δ. Suppose, for example, that ρ f,λ satisfies (∗), then we want to show thatl 2 ∤ N(f).Theorem 23.3.3. Every ˜ρ of class Σ comes from S 2 (Γ 0 (N Σ )).Define an approximati<strong>on</strong> T to the <strong>Hecke</strong> algebra by lettingT = Z[. . . , T n , . . .] ⊂ End(S 2 (Γ 0 (N Σ )))


23.4 Wiles’s <strong>Hecke</strong> algebra 233where we adjoin <strong>on</strong>ly those T n for which (n, lN Σ ) = 1. For some reas<strong>on</strong> there existsa mapT → F l ν : T r ↦→ tr ρ(Frob r ).Why should such a map exist? The point is that we know by the theorem that ρcomes by reducti<strong>on</strong> from a ρ f,λ with f ∈ S 2 (Γ 0 (N Σ )).But there is a wrinkle. [[I do not underst<strong>and</strong>: He says: “Clearly N(ρ)|N Σ . k(ρ) =2 or l + 1. If k(ρ) = l + 1 then δ = 1 so l|N Σ . Have to slip over to weight 2 inorder to get f.” This does not make any sense to me.]]23.4 Wiles’s <strong>Hecke</strong> algebraComposing the map T → O Ef with reducti<strong>on</strong> mod λ from O Ef to F l ν we obtaina map T → F l ν . Let m be the kernel. Then m is a maximal ideal of T. Wiles’s<strong>Hecke</strong> algebra is the completi<strong>on</strong> T m of T at m. [[For some reas<strong>on</strong>]] there exists˜ρ : G = Gal(Q/Q) → GL(2, T m )such that tr(˜ρ(Frob r )) = T r . What makes this useful is that ˜ρ is universal for liftsof type Σ. This means that given any lift τ of type Σ there exists a mapϕ : GL(2, T m ) → GL(2, A)such that τ = ϕ˜ρ.Another key idea that the approximati<strong>on</strong> T obtained by just adjoining thoseT n with (n, lN Σ ) = 1 is, after completing at certain primes, actually equal to thewhole <strong>Hecke</strong> algebra.


234 23. Deformati<strong>on</strong>s


24The <strong>Hecke</strong> Algebra T Σ24.1 The <strong>Hecke</strong> algebraThroughout this lecture l ≠ p <strong>and</strong> l ≥ 3. We are studying the representati<strong>on</strong>ρ : G → GL(2, F l ν ). This is an irreducible representati<strong>on</strong>, l is odd, ρ is semistable,<strong>and</strong> det ρ = χ. To single out certain classes of liftings we let Σ be a finite set ofprimes. Let A be a complete local Noetherian ring with residue filed F l ν . We takeliftings ˜ρ : G → GL(2, A) such that ˜ρ reduces down to ρ, det ˜ρ = ˜χ, <strong>and</strong> ˜ρ is “like”ρ away from Σ. For example, if ρ is unramified at p we also want ˜ρ unramified atp, etc.Assume ˜ρ is modular. We guess the serious divisibility c<strong>on</strong>diti<strong>on</strong> that N(f)|N Σ .Recall thatp 2 · ∏p ordp N(ρ) · l δ .N Σ = ∏ p≠lp∈ΣTo define δ c<strong>on</strong>sider two cases.p≠lp∉Σ• level 1 case. Take δ = 1 if l ∈ Σ or if ρ is not finite at l. Take δ = 0 otherwise.• level 2 case. This is the case when ρ|I l has order l 2 − 1. Take δ = 0.A priori nobody seems to know how to prove that N(f)|N Σ given <strong>on</strong>ly that ˜ρis modular. In the end we will show that all modular ˜ρ in fact come fromS 2 (Γ 0 (N Σ )).This can be regarded as a proof that N(ρ)|N Σ .Last time we tried to get things going by defining the anemic <strong>Hecke</strong> algebraT = Z[. . . , T n , . . .] ⊂ End(S 2 (Γ 0 (N Σ )))where we <strong>on</strong>ly adjoin those T n for which (n, lN Σ ) = 1. By some level loweringtheorem there exists an f ∈ S w (Γ 0 (N Σ )) giving ˜ρ so we obtain a map T → F. The


236 24. The <strong>Hecke</strong> Algebra T Σmap sends T n to the reducti<strong>on</strong> modulo λ of its eigenvalue <strong>on</strong> f. (λ is a prime ofthe ring of integers of E f lying over l.) [[... something about needing an f of theright level = N(ρ).]]Let m ⊂ T be the kernel of the above defined map T → F. Then m is a maximalideal. Let T m be the completi<strong>on</strong> of T at m <strong>and</strong> note thatT m ↩→ T ⊗ Z Z l .We need to know that T m is Gorenstein. This is d<strong>on</strong>e by comparing T m to somefull <strong>Hecke</strong> ring. ThusT ⊂ R = Z[. . . , T n , . . .] ⊂ End(S 2 (Γ 0 (N Σ )))where the full <strong>Hecke</strong> ring R is obtained by adjoining the T n for all integers n. Weshould think of R asR = T[T l , {U P : p|N Σ }].Note that T l may or may not be a U l depending <strong>on</strong> if l|N Σ .Lemma 24.1.1. If l ∤ N Σ then R = T[U p , . . .]. Thus if T l is not a U l then we d<strong>on</strong>ot need T l .Proof. “This lemma is an interesting thing <strong>and</strong> the proof goes as follows. Oooh.Sorry, this is not true. Ummm.... hmm.”The ring T[U p , . . .] is clearly of finite index in R since: if q is a r<strong>and</strong>om primenumber c<strong>on</strong>sider R ⊗ Z Q q compared to T[U p , . . .] ⊗ Z Q q . [[I do not know how todo this argument. The lemma as stated above probably isn’t really true. The pointis that the following lemma is the <strong>on</strong>e we need <strong>and</strong> it is true.]]Let T[U p , . . .] be the ring obtained by adjoining to T just those U p with p|N Σ .Lemma 24.1.2. If l ∤ N Σ then the index (R : T[U p , . . .]) is prime to l. Note thatwe assume l ≥ 3.Proof. We must show that the map T[U p , . . .] → R/lR is surjective. [[I thoughtabout it for a minute <strong>and</strong> did not see why this suffices. Am I being stupid?]] LetA = F l [T n : (n, l) = 1] be the image of T[U p , . . .] in R/lR so we have a diagramT[U p , . . .]R/lR AWe must show that A = R/lR. There is a beautiful dualityR/lR × S 2 (Γ 0 (N Σ ); F l ) −→ F l(perfect pairing).Thus A ⊥ = 0 if <strong>and</strong> <strong>on</strong>ly if A = R/lR.Suppose 0 ≠ f ∈ A ⊥ , then a n (f) = 0 for all n such that (n, l) = 1. Thusf = ∑ a nl q nl . Let θ = q d dqbe the theta operator. Since the characteristic is l,θ(f) = 0. On the other h<strong>and</strong> w(f) = 2 <strong>and</strong> since l ≥ 3, l ∤ 2. Thus w(θf) =w(f)+l+1 = 2+l+1 which is a c<strong>on</strong>tradicti<strong>on</strong> since θf = 0 <strong>and</strong> w(0) = 0 ≠ 3+l.[[The weight is an integer not a number mod l, right?]]


24.2 The Maximal ideal in R 237Example 24.1.3. The lemma <strong>on</strong>ly applies if l ≥ 3. Suppose l = 2 <strong>and</strong> c<strong>on</strong>siderS 2 (Γ 0 (23)). ThenT[U p , . . .] = Z[ √ 5] ⊂ R = Z[ 1 + √ 5]2so (R : T[U p , . . .]) is not prime to 2.Remark 24.1.4. If N = N Σ thenrank Z T = ∑ M|NS 2 (Γ 0 (M)) new<strong>and</strong>There is an injecti<strong>on</strong>rank Z R = dim S 2 (Γ 0 (N)).⊕S 2 (Γ 0 (M)) new ↩→ S 2 (Γ 0 (N))M|Nbut ⊕S 2 (Γ 0 (M)) new is typically much smaller than S 2 (Γ 0 (N)).24.2 The Maximal ideal in RThe plan is to find a special maximal ideal m R of R lying over m.m RRmTOnce we finally find the correct m <strong>and</strong> m R we will be able to show that the mapT m → R mR is an isomorphism. In finding m R we will not invoke some abstractgoing up theorem but we will “produce” m R by some other process. The ideal mwas defined by a newform f level M|N Σ . The coefficients of f lie inO λ = (O Ef ) λwhere E f is the coefficient ring of f <strong>and</strong> λ is a prime lying over l. Thus O λ is anl-adic integer ring. Composing the residue class map O λ → F with the eigenvaluemap T → O λ we obtain the map T → F.In order to obtain the correct m R we will make a sequence of changes to f tomake some good newform. [[Is the motivati<strong>on</strong> for all this that the lemma will notapply if l|N Σ ?]]24.2.1 Strip away certain Euler factorsWrite f = ∑ a n q n . Replace f byh =∑a n q ncertain n


238 24. The <strong>Hecke</strong> Algebra T Σwhere the sum is over those n which are prime to each p ∈ Σ. What does this mean?If we think about the L-functi<strong>on</strong> L(f, s) = ∏ p L p(f, s), then h has L-functi<strong>on</strong>L(h, s) = ∏ p∉ΣL p (f, s).Furthermore making this change does not take us out of S 2 (Γ 0 (N Σ )), i.e., h ∈S 2 (Γ 0 (N Σ )). [[He explained why but my notes are very incomplete. They say:Why. Because h = (f ⊗ ε) ⊗ ε, (ε Dirichlet character ramified at primes in N Σ ).Get a form of level lcm( ∏ p∈Σ p2 , N(f))|N Σ . Can strip <strong>and</strong> stay in space since N Σhas correct squares built into it.]]Remark 24.2.1. Suppose that p||N Σ . Then p ∉ Σ <strong>and</strong> p||N(ρ). The level of f is{N(ρ), if k(ρ) = 2N(f) =N(ρ)l, if k(ρ) = l + 1 .If p||N(ρ) then f|T p = a p (f)f. Thus h is already an eigenform for T p unless p ∈ Σin which the eigenvalue is 0.[[I do not underst<strong>and</strong> this remark. Why would the eigenvalue being 0 mean thatT p is not an eigenform? Furthermore, what is the point of this remark in the widerc<strong>on</strong>text of transforming f into a good newform.]]24.2.2 Make into an eigenform for U lWe perform this operati<strong>on</strong> to f to obtain a form g then apply the above operati<strong>on</strong>to get the ultimate h having the desired properties. Do this if l|N Σ but l ∤ N(f).This happens precisely if l ∈ Σ <strong>and</strong> ρ is good <strong>and</strong> ordinary at l. Then f|U l isjust some r<strong>and</strong>om junk. C<strong>on</strong>sider g = f + ∗f(q l ) where ∗ is some coefficient. Wesee that if ∗ is chosen correctly then g|U l = Cg for some c<strong>on</strong>stant C. There are 2possible choices for ∗ which lead to 2 choices for C. Let a l = a l (f), then C can beeither root ofX 2 + a l X + l = 0.This equati<strong>on</strong> has exactly <strong>on</strong>e unit root in O λ . The reas<strong>on</strong> is because we are inthe ordinary situati<strong>on</strong> so a l is a unit. But l is not a unit. The sum of the roots isa unit but the product is not. [[Even this is not clear to me right now. Definitelycheck this later.]] Make the choice of ∗ so that real root is C. Then we obtain a gsuch thatg|U l = (unit) · g.Next apply the above procedure to strip g <strong>and</strong> end up with an h such that• h|U l = (unit) · h,• h|U p = 0 for p ∈ Σ (p ≠ l), <strong>and</strong>• h|T p = a p (f) · h for p ∉ Σ.Now take the form h. It gives a map R → O λ which extends T → O λ . Let m Rbe the kernel of the map R → F obtained by composing R → O λ with O λ → F.A lot of further analysis shows that T m → R mr is an isomorphism. We end uphaving to show separately that the map is injective <strong>and</strong> surjective.[[In this whole lecture p ≠ l.]][[Wiles’s notati<strong>on</strong>: His T mR is my R <strong>and</strong> his T ′ is my T m .


24.3 The Galois representati<strong>on</strong>24.3 The Galois representati<strong>on</strong> 239We started with a representati<strong>on</strong> ρ, chose a finite set of primes Σ <strong>and</strong> then made thecompleted <strong>Hecke</strong> algebra T m . Our goal is to c<strong>on</strong>struct the universal deformati<strong>on</strong>of ρ of type Σ. The universal deformati<strong>on</strong> is a representati<strong>on</strong>˜ρ : G → GL(2, T m )such that for all primes p with p ∤ lN Σ , ˜ρ(Frob p ) has trace T p ∈ T m <strong>and</strong> determinantp.We now proceed with the c<strong>on</strong>structi<strong>on</strong> of ˜ρ. LetT = Z[. . . , T n , . . .], (n, lN Σ ) = 1be the anemic <strong>Hecke</strong> algebra. Then T ⊗ Q decomposes as a product of fieldsT ⊗ Q = ∏ fE fwhere the product is over a set of representatives for the Galois c<strong>on</strong>jugacy classesof newforms of weight 2, trivial character, <strong>and</strong> level dividing N Σ . Since T is integral(it is for example a finite rank Z-module), T ↩→ ∏ f O f . Since Z l is a flat Z-module,T ⊗ Z l ↩→ ∏ fO f ⊗ Z l = ∏ f,λO f,λwhere the product is over a set of representatives f <strong>and</strong> all λ|l.T m is a direct factor of T ⊗ Z l . [[This is definitely not the asserti<strong>on</strong> that T m isan O f,λ . What exactly is it the asserti<strong>on</strong> of really?]]We can restrict the product to a certain finite set S <strong>and</strong> still obtain an injecti<strong>on</strong>T m ↩→∏O f,λ .(f,λ)∈SThe finite set S c<strong>on</strong>sists of those (f, λ) such that the prime λ of O f pulls back to munder the map T → O f obtained by composing T → ∏ f O f with the projecti<strong>on</strong><strong>on</strong>to O f . [[Why is this enough so that T m still injects in?]] Restricting to a finiteproduct is needed so that∏[ O f,λ : T m ] < ∞.(f,λ)∈SGiven f <strong>and</strong> λ there exists a representati<strong>on</strong>ρ f,λ : G → GL(2, O f,λ ).It is such that tr ρ f,λ (Frob p ) = a p is the image of T p under the inclusi<strong>on</strong>T ⊗ Z l ↩→∏O f,λ .(f,λ)∈SPut some of these ρ f,λ together to create a new representati<strong>on</strong>∏(f,λ)∈Sρ f,λ : Gρ ′−−−−−−−−→ GL(2, ∏ O f,λ ) ⊂ GL(2, T m ⊗ Q).


240 24. The <strong>Hecke</strong> Algebra T ΣThe sought after universal deformati<strong>on</strong> ˜ρ is a map making the following diagramcommuteρ ′G GL(2, T m ⊗ Q)˜ρGL(2, T m )Theorem 24.3.1. ρ ′ is equivalent to a representati<strong>on</strong> taking values in GL(2, T m ).One way to [[try to]] prove this theorem is by invoking a general theorem ofCarayol. [[<strong>and</strong> then what? does this way work? why is it not a good way?]] Butthe right way to prove the theorem is Wiles’s way.24.3.1 The Structure of T mJust as an aside let us review the structure of T m .• T m is local.• T m is not necessarily a discrete valuati<strong>on</strong> ring.• T m ⊗ Q is a product of finite extensi<strong>on</strong>s of Q l .• T m is not necessarily a product of rings O f,λ .• T m need not be integral.24.3.2 The Philosophy in this pictureChoose c to be a complex c<strong>on</strong>jugati<strong>on</strong> in G = Gal(Q/Q). Since l is odd det(c) = −1is a very str<strong>on</strong>g c<strong>on</strong>diti<strong>on</strong> which rigidifies the situati<strong>on</strong>.24.3.3 Massage ρChoose two 1-dimensi<strong>on</strong>al subspaces so thatρ(c) =( ) −1 00 1with respect to any basis c<strong>on</strong>sisting of <strong>on</strong>e vector from each subspace. For anyσ ∈ G write( ) aσ bρ(σ) = σ.c σ d σThen a σ d σ <strong>and</strong> b σ c σ are somehow intrinsically defined. This is becauseρ(σc) =( ) −aσ ?? d σsoa σ =tr(ρ(σ)) − tr(ρ(σc))2


24.3 The Galois representati<strong>on</strong> 241<strong>and</strong>tr(ρ(σ)) + tr(ρ(σc))d σ = .2Since we know the determinant it follows that b σ c σ is also intrinsically known.[[The point is that we know certain things about these matrices in terms of theirtraces <strong>and</strong> determinants.]]Propositi<strong>on</strong> 24.3.2. There exists g ∈ G such that b g c g ≠ 0.Proof. Since ρ is irreducible there exists σ 1 such that b σ1 ≠ 0 <strong>and</strong> there exists σ 2such that c σ2 ≠ 0. If b σ2 ≠ 0 or c σ1 ≠ 0 then we are d<strong>on</strong>e. So the <strong>on</strong>ly problemcase is when b σ1 = 0 <strong>and</strong> c σ2 = 0. Easy linear algebra shows that in this situati<strong>on</strong>g = σ 1 σ 2 has the required property.Now rigidify by choosing a basis so that b g = 1. Doing this does not fix a basisbecause they are many ways to choose such a basis.24.3.4 Massage ρ ′Choose a basis of (T m ⊗ Q) 2 so thatFor any σ ∈ G writeρ ′ (c) =ρ ′ (σ) =( −1 0).0 1( ) aσ b σc σ d σUsing an argument as above shows that a σ , d σ ∈ T m since the traces live in T m .Furthermore b σ c σ ∈ T m since the determinant is in T m .The key observati<strong>on</strong> is that b σ c σ reduces mod m to give the previous b σ c σ ∈ Fcorresp<strong>on</strong>ding to ρ(σ). This is because the determinants <strong>and</strong> traces of ρ ′ are liftsof the <strong>on</strong>es from ρ. Since m is the maximal ideal of a local ring <strong>and</strong> b g c g reducesmod m to something n<strong>on</strong>zero it follows that b g c g is a unit in T m .Choose a basis so that( )ρ ′ −1 0(c) =0 1<strong>and</strong> also so thatHere u is a unit in T m .Propositi<strong>on</strong> 24.3.3. Writeρ ′ (g) =( ag 1)∈ GL(2, Tu d m ).gρ ′ (σ) =( ) aσ b σc σ d σwith respect to the basis chosen above. Then a σ , b σ , c σ , d σ ∈ T m .Proof. We already know that a σ , d σ ∈ T m . The questi<strong>on</strong> is how to show thatb σ , c σ ∈ T m . Since(ρ ′ aσ a(σg) = g + b σ u ?)? c σ + d σ d g


242 24. The <strong>Hecke</strong> Algebra T Σwe see that a σ a g + b σ u ∈ T m . Since a σ a g ∈ T m it follows that b σ u ∈ T m . Since uis a unit in T m this implies b σ ∈ T m . Similarly c σ + d σ d g ∈ T m so c σ ∈ T m .As you now see, in this situati<strong>on</strong> we can prove Carayol’s theorem with just somematrix computati<strong>on</strong>s. [[This is basically a field lowering representati<strong>on</strong> theorem.The thing that makes it easy is that there exists something (namely c) with distincteigenvalues which is rati<strong>on</strong>al over the residue field. Schur’s paper, models oversmaller fields. “Schur’s method”.]]24.3.5 Representati<strong>on</strong>s from modular forms mod lIf you remember back in the 70’s people would take an f ∈ S 2 (Γ 0 (N); F) which isan eigenform for almost all the <strong>Hecke</strong> operatorsThe questi<strong>on</strong> is then: Can you findT p f = c p f for almost all p, <strong>and</strong> c p ∈ F.ρ : G → GL(2, F)such thattr(ρ(Frob p )) = c p <strong>and</strong> det(ρ(Frob p )) = pfor all but finitely many p? The answer is yes. The idea is to find ρ by taking ρ f,λ[[which was c<strong>on</strong>structed by Shimura?]] for some f <strong>and</strong> reducing mod λ. The <strong>on</strong>lyspecial thing that we need is a lemma saying that the eigenvalues in characteristicl lift to eigenvalues in characteristic 0.24.3.6 Representati<strong>on</strong>s from modular forms mod l nSerre <strong>and</strong> Deligne asked: “What happens mod l n ?”More precisely, let R be a local finite Artin ring such that l n R = 0 for some n.Take f ∈ S 2 (Γ 0 (N); R) satisfying the hypothesis{r ∈ R : rf = 0} = {0}.This is d<strong>on</strong>e to insure that certain eigenvalues are unique. Assume that for almostall p <strong>on</strong>e has T p f = c p f with c p ∈ R. The problem is to find ρ : G → GL(2, R)such thattr(ρ(Frob p )) = c p <strong>and</strong> det(ρ(Frob p )) = pfor almost all p.The big stumbling block is that ρ need not be the reducti<strong>on</strong> of some ρ g,λ for anyg, λ. [[I couldn’t underst<strong>and</strong> why – I wrote “can mix up f’s from characteristic 0so can not get <strong>on</strong>e which reduces correctly.”]]Let T = Z[. . . , T p , . . .] where we <strong>on</strong>ly adjoin those T p for which f is an eigenvector[[I made this last part up, but it seems very reas<strong>on</strong>able]]. Then f obviouslygives a rise to a mapT → R : T ↦→ eigenvalue of T <strong>on</strong> f .


24.4 ρ ′ is of type Σ 243The strange hypothesis <strong>on</strong> f insures that the eigenvalue is unique. Indeed, supposeT f = af <strong>and</strong> T f = bf, then 0 = T f −T f = af −bf = (a−b)f so by the hypothesisa − b = 0 so a = b.Since the pullback of the maximal ideal of R is a maximal ideal of T we get amap T m → R for some m. [[I do not underst<strong>and</strong> why we suddenly get this map<strong>and</strong> I do not know why the pullback of the maximal ideal is maximal.]]Now the problem is solved. Take ρ ′ : G → GL(2, T m ) with the sought after trace<strong>and</strong> determinant properties. Then let ρ be the map obtained by composing withthe map GL(2, T m ) → GL(2, R).24.4 ρ ′ is of type ΣLet ρ be modular irreducible <strong>and</strong> semistable mod l representati<strong>on</strong> with l > 2. Let Σbe a finite set of primes. Then N(ρ)|N Σ . We c<strong>on</strong>structed the anemic <strong>Hecke</strong> algebraT which c<strong>on</strong>tains a certain maximal ideal m. We then c<strong>on</strong>sider the completi<strong>on</strong> T mof T at m. Next we c<strong>on</strong>structedlifting ρ. Thus the diagramcommutes.Some defining properties of ρ ′ are• det ρ ′ = ˜χ.ρ ′ : G → GL(2, T m )ρ ′G GL(2, T m )ρ GL(2, F)• tr ρ ′ (Frob r ) = T r . Since topologically T m is generated by the Frob r this is atight c<strong>on</strong>diti<strong>on</strong>.• ρ ′ is a lift of type Σ.To say ρ ′ is a lift of type Σ entails that ρ ′ is unramified outside primes p|N Σ .This is true because ρ ′ is c<strong>on</strong>structed from various ρ f,λ with N(f)|N Σ . If p ≠ l<strong>and</strong> p|N(ρ) then p||N Σ . Recall that( ) α ∗ρ|D p ∼0 βwhere α = βχ <strong>and</strong> α <strong>and</strong> β are unramified. For ρ ′ to be a lift of type Σ we requirethat) (˜α ∗ρ ′ |D p ∼0 ˜βwhere ˜α <strong>and</strong> ˜β are unramified lifts <strong>and</strong> ˜α = ˜βχ. [[I find it mighty odd that αis χ times an unramified character <strong>and</strong> yet α is not ramified! How can that be?


244 24. The <strong>Hecke</strong> Algebra T ΣRestricted to inertia β <strong>and</strong> α would be trivial but χ would not be.]] Is this true ofρ ′ ? Yes since by a theorem of Langl<strong>and</strong>s the factors) (˜α ∗ρ f,λ |D p ∼ .0 ˜β[[Ribet said more about this but it does not form a cohesive whole in my mind. Hereis what I have got. Since ρ f,λ obviously ramified at p, p||N(f)|N Σ . ρ f,λ |D p is like anelliptic curve with bad multiplicative reducti<strong>on</strong> at p. That ρ f,λ |D p ∼ ( )a bc d ˜α∗0 ˜βreally comes down to Deligne-Rapaport. If write f = ∑ a n q n , then a p ≠ 0 <strong>and</strong>˜β(Frob p ) = a p , ˜α(Frob p ) = pa p . Thus a 2 p = 1 since ˜α ˜β = χ. Thus a p = ±1 <strong>and</strong>a p mod λ = β(Frob p ) = ±1 independent of (f, λ). So we have these numbersa p = a p (f) = ±1, independent of λ. ]]24.5 Isomorphism between T m <strong>and</strong> R mRLet T ⊂ R = Z[. . . , T n , . . .] be the anemic <strong>Hecke</strong> algebra with maximal ideal m.The difference between T <strong>and</strong> R is that R c<strong>on</strong>tains all the <strong>Hecke</strong> operators whereasT <strong>on</strong>ly c<strong>on</strong>tains the T p with p ∤ lN Σ . Wiles proved that the map T m → R mR isan isomorphism. Which <strong>Hecke</strong> operators are going to hit the missing T p ? If we dothe analysis in R mR we see that [[I think for p ≠ l!]]{±1, for p|N Σ , p ∉ ΣT p =.0, for p ∈ ΣThis takes care of everything except T l . In proving the surjectivity of T m → R mRwe are quite happy to know that T p = ±1 or 0. The n<strong>on</strong>trivial proof is given in[DDT94].C<strong>on</strong>sider the commuting diagram T m ∏Of,λR mRThe map R mR → ∏ O f,λ is c<strong>on</strong>structed by massaging f by stripping away certainEuler factors so as to obtain an eigenvector for all the <strong>Hecke</strong> operators. Thisdiagram forces T m → R mR to be injective.[[Ogus: Is it clearly surjective <strong>on</strong> the residue field? Ribet: Yes. Ogus: OK, thenwe just need to prove it is ètale.]]From the theory of the θ operator we already know two-thirds of the times thatT c<strong>on</strong>tains T l .Suppose l|N Σ . This entails that we are in the ordinary case, ρ is not finite at l,or l ∈ Σ. We did not prove in this situati<strong>on</strong> that T l ∈ Z[. . . , T n , . . . : (n, l) = 1].Using generators <strong>and</strong> relati<strong>on</strong>s <strong>and</strong> brute force <strong>on</strong>e shows that R mR → ∏ O f,λis an injecti<strong>on</strong>. Then we can compare everything in ∏ O f,λ . Now) (˜α ∗ρ f,λ |D l ∼0 ˜β


24.6 Deformati<strong>on</strong>s 245<strong>and</strong>˜β(Frob l ) = T l ∈ ∏ O f,λ .Using arguments like last time <strong>on</strong>e shows that ˜β(Frob l ) can be expressed in termsof the traces of various operators. This proves surjectivity in this case.Ultimately we have T m∼ = RmR . The virtue of T m is that it is generated bytraces. The virtue of R mR is that it is Gorenstein. We have seen this if l ∤ N Σ . Infact it is Gorenstein even if l||N Σ . [[Ribet: As I st<strong>and</strong> here today I do not know howto prove this last asserti<strong>on</strong> in exactly <strong>on</strong>e case. Ogus: You mean there is anothergap in Wiles’s proof. Ribet: No, it is just something I need to work out.]] Whenl||N Σ there are 2 cases. Either ρ is not finite at l or it is. The case when ρ is notfinite at l was taken care of in [MR91]. A proof that R mR is Gorenstein when ρ isfinite at l (l ∈ Σ) is not in the literature.Now forget R mR <strong>and</strong> just think of T m in both ways: trace generated <strong>and</strong> Gorenstein.24.6 Deformati<strong>on</strong>sFix an absolutely irreducible modular mod l representati<strong>on</strong> ρ <strong>and</strong> a finite set ofprimes Σ. C<strong>on</strong>sider the category C of complete local Noetherian W (F)-algebras A(with A/m = F). Here F = F l ν <strong>and</strong> W (F) is the ring of Witt vectors, i.e., the ringof integers of an unramified extensi<strong>on</strong> of Q l of degree ν.Define a functor F : C → Set by sending A in C to the set of equivalence classesof lifts˜ρ : G → GL(2, A)of ρ of type Σ. The equivalence ( ) relati<strong>on</strong> is that ˜ρ 1 ∼ ˜ρ 2 if <strong>and</strong> <strong>on</strong>ly if there exists1 0M ∈ GL(2, A) with M ≡ (mod m) such that ˜ρ0 11 = M −1 ˜ρ 2 M.Mazur proved [Maz89] that F is representable.Theorem 24.6.1. There exists a liftρ univ : G → GL(2, R Σ )of type Σ such that given any lift ˜ρ : G → GL(2, A) there exists a unique homomorphismϕ : R Σ → A such that ϕ◦ρ univ ∼ ˜ρ in the sense of the above equivalencerelati<strong>on</strong>.Lenstra figured out how to c<strong>on</strong>cretely c<strong>on</strong>struct R Σ .Back in the student days of Ribet <strong>and</strong> Ogus, Schlessinger wrote a widely quotedthesis which gives c<strong>on</strong>diti<strong>on</strong>s under which a certain class of functors can be representable.Mazur checks these c<strong>on</strong>diti<strong>on</strong>s in his paper.[[Buzzard: What happened to Schlessinger anyways? Ribet: He ended up atUniversity of North Carolina, Chapel Hill.]]We have c<strong>on</strong>structed ˜ρ = ρ ′ : G → GL(2, T m ). By the theorem there exists aunique morphism ϕ : R Σ → T m such that ρ ′ = ϕ ◦ ρ univ .Theorem 24.6.2. ϕ is an isomorphism thus ρ ′ is the universal deformati<strong>on</strong> <strong>and</strong>T m is the universal deformati<strong>on</strong> ring.


246 24. The <strong>Hecke</strong> Algebra T ΣThis will imply that any lift of type Σ is modular.The morphism ϕ is surjective sinceT p = tr ˜ρ(Frob p ) = tr ϕ ◦ ρ univ (Frob p ) = ϕ(tr(ρ univ (Frob p ))).We have two very abstractly defined local Noetherian rings. How would youprove they are isomorphic? Most people would be terrified by this questi<strong>on</strong>. Wilesdealt with it.24.7 Wiles’s main c<strong>on</strong>jecture“We are like a train which is trying to reach Fermat’s Last Theorem.Of course it has not made all of its scheduled stops. But it is <strong>on</strong> itsway.”We have a representati<strong>on</strong> ρ : G → GL(2, F). Take F = F l for our applicati<strong>on</strong>stoday. Then the ring of Witt vectors is W (F) = Z l . The <strong>Hecke</strong> algebra can beembedded asT m ⊂∏O g,µ .(g,µ)∈AThe <strong>Hecke</strong> algebra T m has the following properties.• The index of T m in ∏ O g,µ is finite.• Gorenstein as a Z l -module, i.e., there exists an isomorphism Hom Zl (T m , Z l ) ∼ =T m .• T m is generated by the T r with r prime <strong>and</strong> (r, lN Σ ) = 1.We have c<strong>on</strong>structed a representati<strong>on</strong>ρ ′ : G → GL(2, T m ).Composing appropriately with the map T m ↩→ ∏ O g,µ gives a mapG → ∏GL(2, O g,µ ).(g,µ)This is the product of representati<strong>on</strong>s ∏ ρ g,µ . The triangle isρ ′G GL(2, T m )∏ ρg,µ∏ GL(2, Og,µ )Moreover, ρ ′ is a deformati<strong>on</strong> of ρ of type Σ so it lifts ρ <strong>and</strong> satisfies certain “niceas ρ” properties at primes p ∉ Σ.Letρ univ : G → GL(2, R Σ )


24.7 Wiles’s main c<strong>on</strong>jecture 247be the universal deformati<strong>on</strong> of ρ of type Σ. Lenstra gave a very c<strong>on</strong>crete paper[dSL97] c<strong>on</strong>structing this ρ univ . Before his paper there was <strong>on</strong>ly Schlesinger’s thesis.By the definiti<strong>on</strong> of ρ univ there exists a unique map ϕ : R Σ → T m = T Σ such thatϕ ◦ ρ univ = ρ ′ . By ϕ ◦ ρ univ we mean the compositi<strong>on</strong> of ρ univ with the mapGL(2, R Σ ) → GL(2, T m ) induces by ϕ. As noted last time it is easy to see that ϕis surjective.Theorem 24.7.1 (Wiles’s main ‘c<strong>on</strong>jecture’). ϕ is an isomorphism (for each Σ).The theorem implies the following useful result.Theorem 24.7.2. Suppose E is a semistable elliptic curve over Q <strong>and</strong> that forsome l > 2 the representati<strong>on</strong> ρ = ρ l,E <strong>on</strong> E[l] is irreducible <strong>and</strong> modular. ThenE is modular.Proof. The representati<strong>on</strong>˜ρ = ρ l∞ ,E : G → GL(2, Z l )<strong>on</strong> the l-power torsi<strong>on</strong> E[l ∞ ] = ∪E[l n ] is a lift ofρ : G → Aut(E[l]) = GL(2, F l ).Furthermore, ˜ρ is a deformati<strong>on</strong> of ρ of type Σ. Applying universality <strong>and</strong> usingthe theorem that R Σ∼ = TΣ we get a mapT Σ → Z l : T r ↦→ a r = a r (E) = Tr ˜ρ(Frob r ).The relevant diagram isR Σ T Σwhere the map R Σ → Z l is given by Z lTr ρ univ (Frob r ) ↦→ a r .Now the full <strong>Hecke</strong> algebra Z[. . . , T n , . . .] embeds into the completi<strong>on</strong> T Σ . Composingthis with the map T Σ → Z l above we obtain a mapα : Z[. . . , T n , . . .] → Z l .Because of the duality between the <strong>Hecke</strong> algebra <strong>and</strong> modular forms there exists amodular form h ∈ S 2 (Γ 0 (N Σ ), Z l ) corresp<strong>on</strong>ding to α. Since α is a homomorphismh is a normalized eigenform. Furthermore a r (h) = a r (E) ∈ Z for all primes r ∤lN Σ . Since almost all coefficients of h are integral it follows that h is integral.Because we know a lot about eigenforms we can massage h to an eigenform inS 2 (Γ 0 (N E ), Z).[[Some undigested comments follow.]]• Once there is any c<strong>on</strong>necti<strong>on</strong> between ρ <strong>and</strong> a modular form <strong>on</strong>e can proveTaniyama-Shimura in as str<strong>on</strong>g a form as desired. See the article Numbertheory as Gadfly.


248 24. The <strong>Hecke</strong> Algebra T Σ• Take the abelian variety A h attached to h. The λ-adic representati<strong>on</strong> willhave pieces with the same representati<strong>on</strong>s. Using Tate’s c<strong>on</strong>jecture we seethat E is isogenous to A h . Use at some points Carayol’s theorem: If g is aform giving rising to the abelian variety A then the c<strong>on</strong>ductor of A is thesame as the c<strong>on</strong>ductor of g.• Tate proved that if two elliptic curves have isomorphic ρ l ∞they are isogenous.for some l then24.8 T Σ is a complete intersecti<strong>on</strong>Recall the c<strong>on</strong>structi<strong>on</strong> of T Σ = T m . Let T be the anemic <strong>Hecke</strong> algebra. ThenT ⊗ Z l ↩→ ∏ O g,µwhere the product is over a complete set of representatives (for the acti<strong>on</strong> of Galois<strong>on</strong> eigenforms) g <strong>and</strong> primes µ lying over l. We found a specific (f, λ) for Σ = ∅such that ρ λ,f = ρ. The maximal ideal m was defined as follows. The form finduces a map T → O f,λ . Taking the quotient of O f,λ by its maximal ideal weobtain a map T → F l . Then m is the kernel of this map. The diagram isT O f,λ F lThe map O f,λ → F l isa r (f) ↦→ Tr ρ(Frob r ).To fix ideas we cheat <strong>and</strong> suppose O f,λ = Z l . [[In Wiles’s optic this is OK sincehe can work this way then tensor everything at the end.]]Now ρ f,λ is a distinguished deformati<strong>on</strong> of ρ [[“Distinguished” is not meant ina mathematical sense]]. The map f gives rise to a map T Σ → O = Z l which wealso denote by fϕR ΣT Σ fO = Z l24.9 The Inequality #O/η ≤ #℘ T /℘ 2 T ≤ #℘ R/℘ 2 RLetρ : G = Gal(Q/Q) → GL(2, F l )be irreducible <strong>and</strong> modular with l > 2. Let Σ be a finite set of primes. We assumethere is a modular form f of weight 2 with coefficients in Z l which gives rise ρ. Letρ f,λ : G → GL(2, Z l )


24.9 The Inequality #O/η ≤ #℘ T /℘ 2 T ≤ #℘ R/℘ 2 R 249be the representati<strong>on</strong> coming from f, then ρ f,λ reduces to ρ modulo l.Let R Σ be the universal deformati<strong>on</strong> ring, so every deformati<strong>on</strong> of ρ of type Σfactors through R Σ in an appropriate sense. Let T Σ be the <strong>Hecke</strong> ring associatedto Σ. It is a Z l -algebra which is free of finite rank. FurthermoreT Σ ⊂∏∏O g,µ = O f,µ ×(g,µ)∈A(g,µ)∈A−{(f,µ)}O g,µwhere A is as defined before. Define projecti<strong>on</strong>s pr 1 <strong>and</strong> pr 2 <strong>on</strong>to the first <strong>and</strong> restof the factors, respectively∏pr 1 : O g,µ → O = O f,λpr 2 :(g,µ)∈A∏(g,µ)∈AO g,µ →∏(g,µ)≠(f,λ)O g,µLet ϕ : R Σ → T Σ be the map coming from the universal property of R Σ . Thismap is surjective. The famous triangle which dominates all of the theory isR ΣϕT Σpr 1O = Z l24.9.1 The Definiti<strong>on</strong>s of the idealsWe now define two ideals. View T Σ as sitting in the product ∏ O g,µ .1. The c<strong>on</strong>gruence ideal η ⊂ O isη := O ∩ T Σ = ker(pr 2 : T Σ → ∏ )O g,µ(g,µ)≠(f,λ)2. The prime ideal ℘ T ⊂ T Σ is()℘ T = ker pr 1 : T Σ → OIt is true that#O/η ≤ #℘ T /℘ 2 T .The c<strong>on</strong>diti<strong>on</strong> for equality is a theorem of Wiles.Theorem 24.9.1. T Σ is a complete intersecti<strong>on</strong> if <strong>and</strong> <strong>on</strong>ly if #O/η = #℘ T /℘ 2 T .There is an analogous c<strong>on</strong>structi<strong>on</strong> forψ = pr 1 ◦ϕ : R Σ → O.The diagram isR Σϕ−→ TΣψ ↘ ↓ pr 1O.


250 24. The <strong>Hecke</strong> Algebra T ΣLet ℘ R be the kernel of ψ. From the commutativity of the above diagram we seethat ψ maps ℘ R → ℘ T . Thus we have an induced map ψ <strong>on</strong> “tangent spaces”It follows thatThere is an analogous theorem.ψ : ℘ R /℘ 2 R → ℘ T /℘ 2 T .#O/η ≤ #℘ T /℘ 2 T ≤ #℘ R /℘ 2 R.Theorem 24.9.2. The above inequalities are all equalities iff• ϕ : R Σ → T Σ is an isomorphism, <strong>and</strong>• T Σ is a complete intersecti<strong>on</strong> ring.24.9.2 Aside: Selmer groupsLet M be the set of matrices in M(2, Q l /Z l ) which have trace 0. Then GL(2, Z l )operates <strong>on</strong> M by c<strong>on</strong>jugati<strong>on</strong>. Thus G acts <strong>on</strong> M via the representati<strong>on</strong> ρ ′ : G →GL(2, Z l ). To ℘ R /℘ 2 R there corresp<strong>on</strong>ds the Selmer group which is a subgroup ofH 1 (Gal(Q/Q), M). The subgroup isH 1 Σ(G, M) = Hom O (℘ R /℘ 2 R, Q l /Z l ) ⊂ H 1 (Gal(Q/Q), M).[[Since Flach’s thesis there has been a problem of trying to get an upper boundfor the Selmer group. Wiles c<strong>on</strong>verted it into the above problem.]]24.9.3 Outline of some proofsWe outline the key steps in the proof that #℘ R /℘ 2 R ≤ #O/η.Step 1: Σ = ∅The key step is the minimal case when Σ = ∅. This is d<strong>on</strong>e in [TW95]. They claimto be proving the apparently weaker statement that T Σ a complete intersecti<strong>on</strong>implies#℘ T /℘ 2 T = #O/η.But in Wiles’s paper [Wil95] he obtains the inequalityCombining these two shows that#℘ R /℘ 2 R ≤ (#℘ T /℘ 2 T )2.#O/η#℘ R /℘ 2 R ≤ #O/η.In an appendix to [TW95] Faltings proves directly that#℘ R /℘ 2 R ≤ #O/η.At this point there were some remarks about why Wiles might have taken acircuitous route in his Annals paper. Ribet replied,“As Serre says, it is sometimes better to leave out any psychologicalbehavior related to how people did something but instead just report<strong>on</strong> what they did.”


Step 2: Passage from Σ = ∅ to σ general24.9 The Inequality #O/η ≤ #℘ T /℘ 2 T ≤ #℘ R/℘ 2 R 251The sec<strong>on</strong>d step is the inducti<strong>on</strong> step in which we must underst<strong>and</strong> what happensas Σ grows. Thus Σ is replaced by Σ ′ = Σ ∪ {q} where q is some prime not in Σ.We will use the following notati<strong>on</strong>. The object attached to Σ ′ will be denotedthe same way as the object attached to Σ but with a ′ . Thus (℘ R /℘ 2 R )′ denotesthe Selmer group for Σ ′ .The change in the Selmer group when Σ is replaced by Σ ′ is completely governedby some local cohomology group. There is a c<strong>on</strong>stant c q such thatSo we just need to know that#(℘ R /℘ 2 R) ′ ≤ c q #(℘ R /℘ 2 R) ≤ c q #O/η.#O/η ′ ≥ c q #O/η,i.e., that η ′ is small as an ideal in O. We need a formula for the ratio of the twoorders.Let T be the anemic ring of <strong>Hecke</strong> operators <strong>on</strong> S 2 (Γ 0 (N Σ )) obtained by adjoiningto Z all the <strong>Hecke</strong> operators T n with n prime to lN Σ . Let T ′ be the anemic<strong>Hecke</strong> ring of <strong>Hecke</strong> operators <strong>on</strong> S 2 (Γ 0 (N Σ ′)).Since N Σ |N Σ ′ there is an inclusi<strong>on</strong>S 2 (Γ 0 (N Σ )) ↩→ S 2 (Γ 0 (N Σ ′)).There is <strong>on</strong>e subtlety, this injecti<strong>on</strong> is not equivariant for all of the <strong>Hecke</strong> operators.But this is no problem because T <strong>and</strong> T ′ are anemic. So the inclusi<strong>on</strong> induces arestricti<strong>on</strong> map r : T ′ → T.We now introduce a relative versi<strong>on</strong> of η which is an ideal I ⊂ T. One way tothink of I is as T ∩ T ′ where T <strong>and</strong> T ′ are both viewed as subrings of ∏ O g,µT ′ r T ′ m ′ T T m ∏(g,µ) O g,µ ∏more (g,µ) O g,µThe definiti<strong>on</strong> Lenstra would give is thatThe amazing formula isI := r(Ann T ′(ker(r))).η ′ = η · f(I)where f : T → O is the map induced by the modular form f. [[After introducingthis definiti<strong>on</strong> Ogus was very curious about how deep it is, in particular, aboutwhether its proof uses the Gorensteiness of T. Ribet said, “somehow I do not thinkthis formula can possibly be profound.”]]


252 24. The <strong>Hecke</strong> Algebra T ΣWe pause with an aside to c<strong>on</strong>sider Wiles’s original definiti<strong>on</strong> of η. By dualitythe map f : T Σ → Z l inducesf ∨ : Hom Zl (Z l , Z l ) → Hom Zl (T Σ , Z l ) ∼ = T Σ .Because T Σ is Gorenstein there is an isomorphism Hom Zl (T Σ , Z l ) ∼ = T Σ . Nowf ∨ (id) ∈ T Σ so f(f ∨ (id)) ∈ O = Z l . Wiles let η = (f(f ∨ (id))) be the idealgenerated by f(f ∨ (id)).To finish step 2 we must show that #O/f(I) ≥ c q , i.e., that “I is small”. [[I d<strong>on</strong>ot see how this actually finishes step 2, but it is reas<strong>on</strong>able that it should. Howdoes this index relate to the index of f(I)η in O?]]Let J = J 0 (N Σ ) <strong>and</strong> J ′ = J 0 (N Σ ′). SinceS 2 (Γ 0 (N Σ )) ↩→ S 2 (Γ 0 (N Σ ′))functoriality of the Jacobian induces a map J ↩→ J ′ . By autoduality we alsoobtain an injecti<strong>on</strong> J ∨ ↩→ J ′ <strong>and</strong> J ∩ J ∨ is a finite subgroup of J ′ . [[I definitely d<strong>on</strong>ot underst<strong>and</strong> why J is not just equal to J ⊥ . Where does the other embeddingJ ⊥ ↩→ J ′ come from?]]It can be seen that J ∩ J ∨ = J[δ] for some δ ∈ T. It turns out thatAnn T (J ∩ J ∨ ) = T ∩ δ End(J) ⊇ δT.It is an observable fact that f(δT) is an ideal of O of norm c q . The heart of thewhole matter is to see that the inclusi<strong>on</strong>δT ⊆ T ∩ δ End(J)is an equality after localizati<strong>on</strong> at m. To do this we have to know that Tate m (J) ∼ =T 2 m. This is equivalent to the Gorenstein business. With this in h<strong>and</strong> <strong>on</strong>e can justcheck this equality.Unfortunately, it is the spring of 1996 <strong>and</strong> we are now 10 minutes past when thecourse should end. Realizing this, Ken brings the course to a close. In the gr<strong>and</strong>Berkeley traditi<strong>on</strong>, the room fills with applause.


25Computing with <strong>Modular</strong> <strong>Forms</strong> <strong>and</strong>Abelian Varieties


254 25. Computing with <strong>Modular</strong> <strong>Forms</strong> <strong>and</strong> Abelian Varieties


26The <strong>Modular</strong> Curve X 0 (389)Let N be a positive integer, <strong>and</strong> let X 0 (N) be the compactified coarse moduli spacethat classifies pairs (E, C) where E is an elliptic curve <strong>and</strong> C is a cyclic subgroupof order N. The space X 0 (N) has a can<strong>on</strong>ical structure of algebraic curve over Q,<strong>and</strong> its properties have been very well studied during the last forty years. Forexample, Breuil, C<strong>on</strong>rad, Diam<strong>on</strong>d, Taylor, <strong>and</strong> Wiles proved that every ellipticcurve over Q is a quotient of some X 0 (N).The smallest N such that the Jacobian of X 0 (N) has positive Mordell-Weil rankis 37, <strong>and</strong> Zagier studied the genus-two curve X 0 (37) in depth in his paper [Zag85b].From this viewpoint, the next modular curve deserving intensive investigati<strong>on</strong> isX 0 (389), which is the first modular curve whose Jacobian has Mordell-Weil ranklarger than that predicted by the signs in the functi<strong>on</strong>al equati<strong>on</strong>s of the L-seriesattached to simple factors of its Jacobian; in fact, 389 is the smallest c<strong>on</strong>ductorof an elliptic curve with Mordell-Weil rank 2. Note that 389 is prime <strong>and</strong> X 0 (389)has genus g = 32, which is much larger than the genus 2 of X 0 (37), which makesexplicit investigati<strong>on</strong> more challenging.Work of Kolyvagin [Kol88b, Kol88a] <strong>and</strong> Gross-Zagier [GZ86] has completelyresolved the rank asserti<strong>on</strong> of the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecture (see,e.g., [Tat66]) for elliptic curves E with ord s=1 L(E, s) ≤ 1. The lowest-c<strong>on</strong>ductorelliptic curve E that doesn’t submit to the work of Kolyvagin <strong>and</strong> Gross-Zagieris the elliptic curve E of c<strong>on</strong>ductor 389 menti<strong>on</strong>ed in the previous paragraph. Atpresent we d<strong>on</strong>’t even have a c<strong>on</strong>jectural natural c<strong>on</strong>structi<strong>on</strong> of a finite-indexsubgroup of E(Q) analogous to that given by Gross <strong>and</strong> Zagier for rank 1 (but seeMazur’s work <strong>on</strong> universal norms, which might be used to c<strong>on</strong>struct E(Q) ⊗ Z pfor some auxiliary prime p).Inspired by the above observati<strong>on</strong>s, <strong>and</strong> with an eye towards providing helpfuldata for any<strong>on</strong>e trying to generalize the work of Gross, Zagier, <strong>and</strong> Kolyvagin,in this paper we compute everything we can about the modular curve X 0 (389).Some of the computati<strong>on</strong>s of this paper have already proved important in severalother papers: the discriminant of the <strong>Hecke</strong> algebra attached to X 0 (389) plays


256 26. The <strong>Modular</strong> Curve X 0(389)a roll in [Rib99], the verificati<strong>on</strong> of c<strong>on</strong>diti<strong>on</strong> 3 in [MS01], <strong>and</strong> the remark afterTheorem 1 of [?]; also, the arithmetic of J 0 (389) provides a key example in [AS02,§4.2]. Finally, this paper serves as an entry in an “encyclopaedia, atlas or hiker’sguide to modular curves”, in the spirit of N. Elkies (see [Elk98, pg. 22]).We hilight several surprising “firsts” that occur at level 389. The discriminantof the <strong>Hecke</strong> algebra attached to S 2 (Γ 0 (389)) has the apparently unusual propertythat it is divisible by p = 389 (see Secti<strong>on</strong> 26.2.1). Also N = 389 is the smallestinteger such that the order of vanishing of L(J 0 (N), s) at s = 1 is larger thanpredicted by the functi<strong>on</strong>al equati<strong>on</strong>s of eigenforms (see Secti<strong>on</strong> 26.1.3). The authorc<strong>on</strong>jectures that N = 389 is the smallest level such that an optimal newform factorof J 0 (N) appears to have Shafarevich-Tate group with n<strong>on</strong>trivial odd part (seeSecti<strong>on</strong> 26.4.1). Atkin c<strong>on</strong>jectures that 389 is the largest prime such that the cuspof X 0 + (389) fails to be a Weierstrass point (see Secti<strong>on</strong> 26.4.2).26.1 Factors of J 0 (389)To each newform f ∈ S 2 (Γ 0 (389)), Shimura [Shi73] associated a quotient A f ofJ 0 (389), <strong>and</strong> J 0 (389) is isogeneous to the product ∏ A f , where the product runsover the Gal(Q/Q)-c<strong>on</strong>jugacy classes of newforms. Moreover, because 389 is primeeach factor A f cannot be decomposed further up to isogeny, even over Q (see[Rib75]).26.1.1 Newforms of level 389There are five Gal(Q/Q)-c<strong>on</strong>jugacy classes of newforms in S 2 (Γ 0 (389)). The firstclass corresp<strong>on</strong>ds to the unique elliptic curve of c<strong>on</strong>ductor 389, <strong>and</strong> its q-exansi<strong>on</strong>beginsf 1 = q − 2q 2 − 2q 3 + 2q 4 − 3q 5 + 4q 6 − 5q 7 + q 9 + 6q 10 + · · · .The sec<strong>on</strong>d has coefficients in the quadratic field Q( √ 2), <strong>and</strong> has q-expansi<strong>on</strong>f 2 = q + √ 2q 2 + ( √ 2 − 2)q 3 − q 5 + (−2 √ 2 + 2)q 6 + · · · .The third has coefficients in the cubic field generated by a root α of x 3 − 4x − 2:f 3 = q + αq 2 − αq 3 + (α 2 − 2)q 4 + (−α 2 + 1)q 5 − α 2 q 6 + · · · .The fourth has coefficients that generate the degree-six field defined by a root βof x 6 + 3x 5 − 2x 4 − 8x 3 + 2x 2 + 4x − 1 <strong>and</strong> q-expansi<strong>on</strong>f 4 = q + βq 2 + (β 5 + 3β 4 − 2β 3 − 8β 2 + β + 2)q 3 + · · · .The fifth <strong>and</strong> final newform (up to c<strong>on</strong>jugacy) has coefficients that generate thedegree 20 field defined by a root off 5 = x 20 − 3x 19 − 29x 18 + 91x 17 + 338x 16 − 1130x 15 − 2023x 14 + 7432x 13+6558x 12 − 28021x 11 − 10909x 10 + 61267x 9 + 6954x 8 − 74752x 7+1407x 6 + 46330x 5 − 1087x 4 − 12558x 3 − 942x 2 + 960x + 148.


26.1 Factors of J 0(389) 257f12,522f5222f223f331f4FIGURE 26.1.1. C<strong>on</strong>gruences Between NewformsC<strong>on</strong>gruencesThe vertices in Figure 26.1.1 corresp<strong>on</strong>d to the newforms f i ; there is an edgebetween f i <strong>and</strong> f j labeled p if there is a maximal ideal ℘ | p of the field generatedby the Fourier coefficients of f i <strong>and</strong> f j such that f i ≡ f j (mod ℘).26.1.2 Isogeny structureWe deduce from the above determinati<strong>on</strong> of the newforms in S 2 (Γ 0 (389)) thatJ 0 (389) is Q-isogenous to a product of Q-simple abelian varietiesJ ∼ A 1 × A 2 × A 3 × A 4 × A 5 .View the duals A ∨ i of the A i as abelian subvarieties of J 0 (389). Using modularsymbols as in [AS05, §3.4] we find that, for i ≠ j, a prime p divides #(A ∨ i ∩ A∨ j )if <strong>and</strong> <strong>on</strong>ly if f i ≡ f j (mod ℘) for some prime ℘ | p (recall that the c<strong>on</strong>gruenceprimes are given in Figure 26.1.1 above).26.1.3 Mordell-Weil ranksSuppose f ∈ S 2 (Γ 0 (N)) is a newform of some level N. The functi<strong>on</strong>al equati<strong>on</strong> forL(f, s) implies that ord s=1 L(f, s) is odd if <strong>and</strong> <strong>on</strong>ly if the sign of the eigenvalueof the Atkin-Lehner involuti<strong>on</strong> W N <strong>on</strong> f is +1.Propositi<strong>on</strong> 26.1.1. If f ∈ S 2 (Γ 0 (N)) is a newform of level N < 389, thenord s=1 L(f, s) is either0 or 1.Proof. The proof amounts to a large computati<strong>on</strong>, which divides into two parts:1. Verify, for each newform f of level N < 389 such that W N (f) = −f, thatL(f, 1) = ∗ ∫ i∞f(z)dz (for some n<strong>on</strong>zero ∗) is n<strong>on</strong>zero. This is a purely0algebraic computati<strong>on</strong> involving modular symbols.


258 26. The <strong>Modular</strong> Curve X 0(389)2. Verify, for each newform f of level N < 389 such that W N (f) = f, thatL ′ (f, 1) ≠ 0 (see [FpS + 01, §4.1], which points to [Cre97, §2.11,§2.13]). We dothis by approximating an infinite series that c<strong>on</strong>verges to L ′ (f, 1) <strong>and</strong> notingthat the value we get is far from 0.Thus N = 389 is the smallest level such that the L-series of some factor A f ofJ 0 (N) has order of vanishing higher than that which is forced by the sign in thefuncti<strong>on</strong>al equati<strong>on</strong>.Propositi<strong>on</strong> 26.1.2. The following table summarizes the dimensi<strong>on</strong>s <strong>and</strong> Mordell-Weil ranks (over the image of the <strong>Hecke</strong> ring) of the newform factors of J 0 (N):A 1 A 2 A 3 A 4 A 5Dimensi<strong>on</strong> 1 2 3 6 20Rank 2 1 1 1 0Proof. The elliptic curve A 1 is 389A in Crem<strong>on</strong>a’s tables, which is the elliptic curveof smallest c<strong>on</strong>ductor having rank 2. For A 5 we directly compute whether or notthe L-functi<strong>on</strong> vanishes using modular symbols, by taking an inner product withthe winding element e w = −{0, ∞}. We find that the L-functi<strong>on</strong> does not vanish.By Kolyvagin-Logachev, it follows that A 5 has Mordell-Weil rank 0.For each of the other three factors, the sign of the functi<strong>on</strong>al equati<strong>on</strong> is odd,so the analytic ranks are odd. As in the proof of Propositi<strong>on</strong> 26.1.1, we verify thatthe analytic rank is 1 in each case. By work of Gross, Zagier, <strong>and</strong> Kolyvagin itfollows that the ranks are 1.26.2 The <strong>Hecke</strong> algebra26.2.1 The Discriminant is divisible by pLet N be a positive integer. The <strong>Hecke</strong> algebra T ⊂ End(S 2 (Γ 0 (N))) is the subringgenerated by all <strong>Hecke</strong> operators T n for n = 1, 2, 3, . . .. We are c<strong>on</strong>cerned with thediscriminant of the trace pairing (t, s) ↦→ Tr(ts).When N is prime, T Q = T ⊗ Z Q is a product K 1 × · · · × K n of totally realnumber fields. Let ˜T denote the integral closure of T in T Q ; note that ˜T = ∏ O iwhere O i is the ring of integers of K i . Then disc(T) = [ ˜T : T] · ∏ni=1 disc(K i).Propositi<strong>on</strong> 26.2.1. The discriminant of the <strong>Hecke</strong> algebra associated to S 2 (Γ 0 (389))is2 53 · 3 4 · 5 6 · 31 2 · 37 · 389 · 3881 · 215517113148241 · 477439237737571441.Proof. By [?], the <strong>Hecke</strong> algebra T is generated as a Z-module by T 1 , T 2 , . . . T 65 .To compute disc(T), we proceed as follows. First, compute the space S 2 (Γ 0 (389))of cuspidal modular symbols, which is a faithful T-module. Choose a r<strong>and</strong>omelement x ∈ S 2 (Γ 0 (389)) + of the +1-quotient of the cuspidal modular symbols,then compute the images v 1 = T 1 (x), v 2 = T 2 (x), . . . , v 65 = T 65 (x). If these d<strong>on</strong>’tspan a space of dimensi<strong>on</strong> 32 = rank Z T choose a new r<strong>and</strong>om element x <strong>and</strong>repeat. Using the Hermite Normal Form, find a Z-basis b 1 , . . . , b 32 for the Z-span


26.2 The <strong>Hecke</strong> algebra 259of v 1 , . . . , v 65 . The trace pairing <strong>on</strong> T induces a trace pairing <strong>on</strong> the v i , <strong>and</strong> hence<strong>on</strong> the b i . Then disc(T) is the discriminant of this pairing <strong>on</strong> the b i . The reas<strong>on</strong>we embed T in S 2 (Γ 0 (389)) + as Tx is because directly finding a Z-basis for Twould involve computing the Hermite Norm Form of a list of 65 vectors in a 1024-dimensi<strong>on</strong>al space, which is unnecessarily difficult (though possible).We compute this discriminant by applying the definiti<strong>on</strong> of discriminant to a matrixrepresentati<strong>on</strong> of the first 65 <strong>Hecke</strong> operators T 1 , . . . , T 65 . Matrices representingthese <strong>Hecke</strong> operators were computed using the modular symbols algorithmsdescribed in [Cre97]. [Sturm, On the c<strong>on</strong>gruence of modular forms].In the case of X 0 (389), T ⊗ Q = K 1 × K 2 × K 3 × K 6 × K 20 , where K d hasdegree d over Q. We haveK 1 = Q,K 2 = Q( √ 2)K 3 = Q(β), β 3 − 4β − 2 = 0,K 6 = Q(γ), γ 6 + 3γ 5 − 2γ 4 − 8γ 3 + 2γ 2 + 4γ − 1 = 0,K 20 = Q(δ), δ 20 − 3δ 19 − 29δ 18 + 91δ 17 + 338δ 16 − 1130δ 15 − 2023δ 14 + 7432δ 13<strong>and</strong>+6558δ 12 − 28021δ 11 − 10909δ 10 + 61267δ 9 + 6954δ 8 − 74752δ 7+1407δ 6 + 46330δ 5 − 1087δ 4 − 12558δ 3 − 942δ 2 + 960δ + 148 = 0.The discriminants of the K i areK 1 K 2 K 3 K 61 2 3 2 2 · 37 5 3 · 3881disc(K 20 ) = 2 14 · 5 · 389 · 215517113148241 · 477439237737571441.Observe that the discriminant of K 20 is divisible by 389. The product of the discriminantsis2 19 · 5 4 · 37 · 389 · 3881 · 215517113148241 · 477439237737571441.This differs from the exact discriminant by a factor of 2 34 · 3 4 · 5 2 · 31 2 , so the indexof T in its normalizati<strong>on</strong> is[ ˜T : T] = 2 17 · 3 2 · 5 · 31.Notice that 389 does not divide this index, <strong>and</strong> that 389 is not a “c<strong>on</strong>gruenceprime”, so 389 does not divide any modular degrees.Questi<strong>on</strong> 26.2.2. Is there a newform optimal quotient A f of J 0 (p) such that pdivides the modular degree of A f ? (No, if p < 14000.)26.2.2 C<strong>on</strong>gruences primes in S p+1 (Γ 0 (1))K. Ono asked the following questi<strong>on</strong>, in c<strong>on</strong>necti<strong>on</strong> with Theorem 1 of [?].Questi<strong>on</strong> 26.2.3. Let p be a prime. Is p ever a c<strong>on</strong>gruence prime <strong>on</strong> S p+1 (Γ 0 (1))?More precisely, if K is the number field generated by all the eigenforms of weightp + 1 <strong>on</strong> Γ 0 (1), can there be a prime ideal ℘ | p for which f ≡ g (mod ℘) fordistinct eigenforms f, g ∈ S p+1 (Γ 0 (1))?


260 26. The <strong>Modular</strong> Curve X 0(389)The answer is “yes”. There is a st<strong>and</strong>ard relati<strong>on</strong>ship between S p+1 (Γ 0 (1))<strong>and</strong> S 2 (Γ 0 (p)). As noted in Secti<strong>on</strong> 26.2.1, p = 389 is a c<strong>on</strong>gruence prime forS 2 (Γ 0 (389)), so we investigate S 389+1 (Γ 0 (1)).Propositi<strong>on</strong> 26.2.4. There exist distinct newforms f, g ∈ S 389+1 (Γ 0 (1)) <strong>and</strong> aprime ℘ of residue characteristic 389 such that f ≡ g (mod ℘).Proof. We compute the characteristic polynomial f of the <strong>Hecke</strong> operator T 2 <strong>on</strong>S 389+1 (Γ 0 (1)) using nothing more than [Ser73, Ch. VII]. We find that f factorsmodulo 389 as follows:f = (x + 2)(x + 56)(x + 135)(x + 158)(x + 175) 2 (x + 315)(x + 342)(x 2 + 387)(x 2 + 97x + 164)(x 2 + 231x + 64)(x 2 + 286x + 63)(x 5 + 88x 4 + 196x 3 + 113x 2 + 168x + 349)(x 11 + 276x 10 + 182x 9 + 13x 8 + 298x 7 + 316x 6 + 213x 5+248x 4 + 108x 3 + 283x 2 + x + 101)Moreover, f is irreducible <strong>and</strong> 389 || disc(f), so the square factor (x+175) 2 impliesthat 389 is ramified in the degree-32 field L generated by a single root of f. Thusthere are exactly 31 distinct homomorphisms from the ring of integers of L toF 389 . That is, there are exactly 31 ways to reduce the q-expansi<strong>on</strong> of a newform inS 390 (Γ 0 (1)) to obtain a q-expansi<strong>on</strong> in F 389 [[q]]. Let K be the field generated byall eigenvalues of the 32 newforms g 1 , . . . g 32 ∈ S 390 (Γ 0 (1)), <strong>and</strong> let ℘ be a prime ofO K lying over 389. Then the subset {g 1 (mod ℘), g 2 (mod ℘), . . . , g 32 (mod ℘)}of F 389 [[q]] has cardinality at most 31, so there exists i ≠ j such that g i ≡ g j(mod ℘).26.3 Supersingular points in characteristic 38926.3.1 The Supersingular j-invariants in characteristic 389Let α be a root of α 2 + 95α + 20. Then the 33 = g(X 0 (389)) + 1 supersingularj-invariants in F 389 2 are0, 7, 16, 17, 36, 121, 154, 220, 318, 327, 358, 60α + 22, 68α + 166, 80α + 91, 86α + 273,93α + 333, 123α + 350, 123α + 375, 129α + 247, 131α + 151, 160α + 321, 176α + 188,213α + 195, 229α + 292, 258α + 154, 260α + 51, 266α + 335, 266α + 360, 296α + 56,303α + 272, 309α + 271, 321α + 319, 329α + 157.26.4 Miscellaneous26.4.1 The Shafarevich-Tate groupUsing visibility theory [AS02, §4.2], <strong>on</strong>e sees that #X(A 5 ) is divisible by an oddprime, because(Z/5Z) 2 ≈ A 1 (Q)/5A 1 (Q) ⊂ X(A 5 ).Additi<strong>on</strong>al computati<strong>on</strong>s suggest the following c<strong>on</strong>jecture.


26.4 Miscellaneous 261C<strong>on</strong>jecture 26.4.1. N = 389 is the smallest level such that there is an optimalnewform quotient A f of J 0 (N) with #X(A f ) divisible by an odd prime.26.4.2 Weierstrass points <strong>on</strong> X + 0 (p)Oliver Atkin has c<strong>on</strong>jectured that 389 is the largest prime such that the cusp <strong>on</strong>X 0 + (389) fails to be a Weierstrass point. He verified that the cusp of X+ 0 (389)is not a Weierstrass point but that the cusp of X 0 + (p) is a Weierstrass point forall primes p such that 389 < p ≤ 883 (see, e.g., [Elk98, pg.39]). In additi<strong>on</strong>, theauthor has extended the verificati<strong>on</strong> of Atkin’s c<strong>on</strong>jecture for all primes < 3000.Explicitly, this involves computing a reduced-echel<strong>on</strong> basis for the subspace ofS 2 (Γ 0 (p)) where the Atkin-Lehner involuti<strong>on</strong> W p acts as +1, <strong>and</strong> comparing thelargest valuati<strong>on</strong> of an element of this basis with the dimensi<strong>on</strong> of the subspace.These numbers differ exactly when the cusp is a Weierstrass point.26.4.3 A Property of the plus part of the integral homologyFor any positive integer N, let H + (N) = H 1 (X 0 (N), Z) + be the +1 eigen-submodulefor the acti<strong>on</strong> of complex c<strong>on</strong>jugati<strong>on</strong> <strong>on</strong> the integral homology of X 0 (N). ThenH + (N) is a module over the <strong>Hecke</strong> algebra T. LetF + (N) = coker ( H + (N) × Hom(H + (N), Z) → Hom(T, Z) )where the map sends (x, ϕ) to the homomorphism t ↦→ ϕ(tx). Then #F + (p) ∈{1, 2, 4} for all primes p < 389, but #F + (389) = 8.26.4.4 The Field generated by points of small prime order <strong>on</strong> anelliptic curveThe prime 389 arises in a key way in the verificati<strong>on</strong> of c<strong>on</strong>diti<strong>on</strong> 3 in [MS01].


262 26. The <strong>Modular</strong> Curve X 0(389)


References[AS02][AS05][BC04]A. Agashe <strong>and</strong> W. <strong>Stein</strong>, Visibility of Shafarevich-Tate groups ofabelian varieties, J. Number Theory 97 (2002), no. 1, 171–185., Visible evidence for the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer c<strong>on</strong>jecturefor modular abelian varieties of analytic rank zero, Math. Comp. 74(2005), no. 249, 455–484 (electr<strong>on</strong>ic), With an appendix by J. Crem<strong>on</strong>a<strong>and</strong> B. Mazur. MR 2085902Kevin Buzzard <strong>and</strong> Frank Calegari, A counterexample to the Gouvêa-Mazur c<strong>on</strong>jecture, C. R. Math. Acad. Sci. Paris 338 (2004), no. 10,751–753. MR 2059481 (2005g:11070)[BCDT01] C. Breuil, B. C<strong>on</strong>rad, F. Diam<strong>on</strong>d, <strong>and</strong> R. Taylor, On the modularityof elliptic curves over Q: wild 3-adic exercises, J. Amer. Math. Soc. 14(2001), no. 4, 843–939 (electr<strong>on</strong>ic). MR 2002d:11058[Bir65][Bir71][BK90][BLR90]B. J. Birch, C<strong>on</strong>jectures c<strong>on</strong>cerning elliptic curves, Proceedings of Symposiain Pure Mathematics, VIII, Amer. Math. Soc., Providence, R.I.,1965, pp. 106–112. MR 30 #4759B. J. Birch, Elliptic curves over Q: A progress report, 1969 NumberTheory Institute (Proc. Sympos. Pure Math., Vol. XX, State Univ.New York, St<strong>on</strong>y Brook, N.Y., 1969), Amer. Math. Soc., Providence,R.I., 1971, pp. 396–400.S. Bloch <strong>and</strong> K. Kato, L-functi<strong>on</strong>s <strong>and</strong> Tamagawa numbers of motives,The Grothendieck Festschrift, Vol. I, Birkhäuser Bost<strong>on</strong>, Bost<strong>on</strong>, MA,1990, pp. 333–400.S. Bosch, W. Lütkebohmert, <strong>and</strong> M. Raynaud, Nér<strong>on</strong> models, Springer-Verlag, Berlin, 1990. MR 91i:14034


264 References[BpR91][Buz96]N. Bost<strong>on</strong>, H. W. Lenstra, Jr., <strong>and</strong> K. A. Ribet, Quotients of grouprings arising from two-dimensi<strong>on</strong>al representati<strong>on</strong>s, C. R. Acad. Sci.Paris Sér. I Math. 312 (1991), no. 4, 323–328.Kevin Buzzard, On the eigenvalues of the <strong>Hecke</strong> operator T 2 , J. NumberTheory 57 (1996), no. 1, 130–132. MR 96m:11033[CE98] R. F. Coleman <strong>and</strong> B. Edixhoven, On the semi-simplicity of the U p -operator <strong>on</strong> modular forms, Math. Ann. 310 (1998), no. 1, 119–127.[CF99][CM98][CM00][C<strong>on</strong>01]J. B. C<strong>on</strong>rey <strong>and</strong> D. W. Farmer, <strong>Hecke</strong> operators <strong>and</strong> the n<strong>on</strong>vanishingof L-functi<strong>on</strong>s, Topics in number theory (University Park, PA, 1997),Math. Appl., vol. 467, Kluwer Acad. Publ., Dordrecht, 1999, pp. 143–150. MR 2000f:11055R. Coleman <strong>and</strong> B. Mazur, The Eigencurve, Galois representati<strong>on</strong>s inarithmetic algebraic geometry (Durham, 1996), Cambridge Univ. Press,Cambridge, 1998, pp. 1–113. MR 1 696 469J. E. Crem<strong>on</strong>a <strong>and</strong> B. Mazur, Visualizing elements in the Shafarevich-Tate group, Experiment. Math. 9 (2000), no. 1, 13–28. MR 1 758 797B. C<strong>on</strong>rad, The shimura c<strong>on</strong>structi<strong>on</strong> in weight 2 (appendix to Ribet-<strong>Stein</strong>, <str<strong>on</strong>g>Lectures</str<strong>on</strong>g> <strong>on</strong> Serre’s C<strong>on</strong>jecture), Arithmetic algebraic geometry(Park City, UT, 1999), IAS/Park City Math. Ser., vol. 9, Amer. Math.Soc., Providence, RI, 2001, pp. 205–232. MR 2002h:11047[C<strong>on</strong>03] K. C<strong>on</strong>rad, Partial Euler products <strong>on</strong> the critical line, Preprint (2003).[Cp86][CR62][Cre97][CV92][DDT94][DI95][DK73]J. W. S. Cassels <strong>and</strong> A. Fröhlich (eds.), Algebraic number theory, L<strong>on</strong>d<strong>on</strong>,Academic Press Inc. [Harcourt Brace Jovanovich Publishers],1986, Reprint of the 1967 original.C. W. Curtis <strong>and</strong> I. Reiner, Representati<strong>on</strong> theory of finite groups <strong>and</strong>associative algebras, Interscience Publishers, a divisi<strong>on</strong> of John Wiley& S<strong>on</strong>s, New York-L<strong>on</strong>d<strong>on</strong>, 1962, Pure <strong>and</strong> Applied Mathematics, Vol.XI.J. E. Crem<strong>on</strong>a, Algorithms for modular elliptic curves, sec<strong>on</strong>d ed., CambridgeUniversity Press, Cambridge, 1997,http://www.maths.nott.ac.uk/pers<strong>on</strong>al/jec/book/.R. F. Coleman <strong>and</strong> J. F. Voloch, Compani<strong>on</strong> forms <strong>and</strong> Kodaira-Spencer theory, Invent. Math. 110 (1992), no. 2, 263–281.H. Darm<strong>on</strong>, F. Diam<strong>on</strong>d, <strong>and</strong> R. Taylor, Fermat’s last theorem, Currentdevelopments in mathematics, 1995 (Cambridge, MA), Internat. Press,Cambridge, MA, 1994, pp. 1–154.F. Diam<strong>on</strong>d <strong>and</strong> J. Im, <strong>Modular</strong> forms <strong>and</strong> modular curves, Seminar<strong>on</strong> Fermat’s Last Theorem, Providence, RI, 1995, pp. 39–133.P. Deligne <strong>and</strong> W. Kuyk (eds.), <strong>Modular</strong> functi<strong>on</strong>s of <strong>on</strong>e variable. II,Springer-Verlag, Berlin, 1973, Lecture Notes in Mathematics, Vol. 349.


References 265[DK95][DR73][DS05][dSL97][Edi92a][Edi92b][Eis95][Elk98]F. Diam<strong>on</strong>d <strong>and</strong> K. Kramer, <strong>Modular</strong>ity of a family of elliptic curves,Math. Res. Lett. 2 (1995), no. 3, 299–304.P. Deligne <strong>and</strong> M. Rapoport, Les schémas de modules de courbes elliptiques,<strong>Modular</strong> functi<strong>on</strong>s of <strong>on</strong>e variable, II (Proc. Internat. SummerSchool, Univ. Antwerp, Antwerp, 1972) (Berlin), Springer, 1973,pp. 143–316. Lecture Notes in Math., Vol. 349.Fred Diam<strong>on</strong>d <strong>and</strong> Jerry Shurman, A first course in modular forms,Graduate Texts in Mathematics, vol. 228, Springer-Verlag, New York,2005.B. de Smit <strong>and</strong> Jr. Lenstra, H. W., Explicit c<strong>on</strong>structi<strong>on</strong> of universaldeformati<strong>on</strong> rings, <strong>Modular</strong> forms <strong>and</strong> Fermat’s last theorem (Bost<strong>on</strong>,MA, 1995), Springer, New York, 1997, pp. 313–326.B. Edixhoven, Nér<strong>on</strong> models <strong>and</strong> tame ramificati<strong>on</strong>, Compositio Math.81 (1992), no. 3, 291–306. MR 93a:14041, The weight in Serre’s c<strong>on</strong>jectures <strong>on</strong> modular forms, Invent.Math. 109 (1992), no. 3, 563–594.D. Eisenbud, Commutative algebra with a view toward algebraic geometry,Springer-Verlag, New York, 1995. MR 97a:13001N. D. Elkies, Elliptic <strong>and</strong> modular curves over finite fields <strong>and</strong> relatedcomputati<strong>on</strong>al issues, Computati<strong>on</strong>al perspectives <strong>on</strong> number theory(Chicago, IL, 1995), Amer. Math. Soc., Providence, RI, 1998, pp. 21–76.[Ell02] J. Ellenberg, q-curves <strong>and</strong> Galois Representati<strong>on</strong>s,http://www.math.princet<strong>on</strong>.edu/ ellenber/papers.html#MCAV(2002).[ES00] J. Ellenberg <strong>and</strong> C. Skinner, On the <strong>Modular</strong>ity of Q-curves,http://www.math.princet<strong>on</strong>.edu/ ellenber/papers.html#QCURVE(2000).[FJ02][FM99][FpS + 01][Frö67]D. W. Farmer <strong>and</strong> K. James, The irreducibility of some level 1 <strong>Hecke</strong>polynomials, Math. Comp. 71 (2002), no. 239, 1263–1270 (electr<strong>on</strong>ic).MR 2003e:11046G. Frey <strong>and</strong> M. Müller, Arithmetic of modular curves <strong>and</strong> applicati<strong>on</strong>s,Algorithmic algebra <strong>and</strong> number theory (Heidelberg, 1997), Springer,Berlin, 1999, pp. 11–48.E. V. Flynn, F. Leprévost, E. F. Schaefer, W. A. <strong>Stein</strong>, M. Stoll, <strong>and</strong>J. L. Wetherell, Empirical evidence for the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyerc<strong>on</strong>jectures for modular Jacobians of genus 2 curves, Math. Comp. 70(2001), no. 236, 1675–1697 (electr<strong>on</strong>ic). MR 1 836 926A. Fröhlich, Local fields, Algebraic Number Theory (Proc. Instructi<strong>on</strong>alC<strong>on</strong>f., Bright<strong>on</strong>, 1965), Thomps<strong>on</strong>, Washingt<strong>on</strong>, D.C., 1967, pp. 1–41.


266 References[Ghi][Gol82][Gro90][Gro91][GZ86]Alex Ghitza, Maeda’s c<strong>on</strong>jecture for weight up to 4096, http://wstein.org/Tables/charpoly_level1/t2/ghitza.html.D. Goldfeld, Sur les produits partiels eulériens attachés aux courbeselliptiques, C. R. Acad. Sci. Paris Sér. I Math. 294 (1982), no. 14,471–474. MR 84d:14031B. H. Gross, A tameness criteri<strong>on</strong> for Galois representati<strong>on</strong>s associatedto modular forms (mod p), Duke Math. J. 61 (1990), no. 2, 445–517., Kolyvagin’s work <strong>on</strong> modular elliptic curves, L-functi<strong>on</strong>s <strong>and</strong>arithmetic (Durham, 1989), Cambridge Univ. Press, Cambridge, 1991,pp. 235–256.B. Gross <strong>and</strong> D. Zagier, Heegner points <strong>and</strong> derivatives of L-series,Invent. Math. 84 (1986), no. 2, 225–320. MR 87j:11057[Har77] R. Hartshorne, Algebraic Geometry, Springer-Verlag, New York, 1977,Graduate Texts in Mathematics, No. 52.[HH]Bill Hart <strong>and</strong> David Harvey, Flint: Fast library for number theory,http://www.flintlib.org/.[Hid00] H. Hida, <strong>Modular</strong>ity Problems of Q-Motives <strong>and</strong> Base-Change,Preprint, http://www.math.ucla.edu/ hida/ (2000).[Igu56] J. Igusa, Fibre systems of Jacobian varieties, Amer. J. Math. 78 (1956),171–199.[Joc82]N. Jochnowitz, A study of the local comp<strong>on</strong>ents of the <strong>Hecke</strong> algebramod l, Trans. Amer. Math. Soc. 270 (1982), no. 1, 253–267.[Kat81] N. Katz, Serre-Tate local moduli, Algebraic surfaces (Orsay, 1976–78),Lecture Notes in Math., vol. 868, Springer, Berlin, 1981, pp. 138–202.MR 638600 (83k:14039b)[KM85][Kna92][Kol88a][Kol88b][KW08][Lan91]N. M. Katz <strong>and</strong> B. Mazur, Arithmetic moduli of elliptic curves, Princet<strong>on</strong>University Press, Princet<strong>on</strong>, N.J., 1985.A. W. Knapp, Elliptic curves, Princet<strong>on</strong> University Press, Princet<strong>on</strong>,NJ, 1992.V. A. Kolyvagin, Finiteness of E(Q) <strong>and</strong> X(E, Q) for a subclass ofWeil curves, Izv. Akad. Nauk SSSR Ser. Mat. 52 (1988), no. 3, 522–540,670–671. MR 89m:11056, The Mordell-Weil <strong>and</strong> Shafarevich-Tate groups for Weil ellipticcurves, Izv. Akad. Nauk SSSR Ser. Mat. 52 (1988), no. 6, 1154–1180,1327. MR 90f:11035C. Khare <strong>and</strong> J.-P. Wintenberger, Serre’s modularity c<strong>on</strong>jecture (i),Preprint (2008).S. Lang, Number theory. III, Springer-Verlag, Berlin, 1991, Diophantinegeometry. MR 93a:11048


[Lan93][Lan94]References 267, Algebra, third ed., Addis<strong>on</strong>-Wesley Publishing Co., Reading,Mass., 1993., Algebraic number theory, sec<strong>on</strong>d ed., Springer-Verlag, NewYork, 1994.[Lan95] , Introducti<strong>on</strong> to modular forms, Springer-Verlag, Berlin, 1995,With appendixes by D. Zagier <strong>and</strong> W. Feit, Corrected reprint of the1976 original.[Li75] W-C. Li, Newforms <strong>and</strong> functi<strong>on</strong>al equati<strong>on</strong>s, Math. Ann. 212 (1975),285–315.[LR11] Álvaro Lozano-Robledo, Elliptic Curves, <strong>Modular</strong> <strong>Forms</strong> <strong>and</strong> their L-functi<strong>on</strong>s, American Mathematical Society, 2011.[Man72][Maz72][Maz77]J. I. Manin, Parabolic points <strong>and</strong> zeta functi<strong>on</strong>s of modular curves, Izv.Akad. Nauk SSSR Ser. Mat. 36 (1972), 19–66. MR 47 #3396B. Mazur, Rati<strong>on</strong>al points of abelian varieties with values in towers ofnumber fields, Invent. Math. 18 (1972), 183–266., <strong>Modular</strong> curves <strong>and</strong> the Eisenstein ideal, Inst. Hautes ÉtudesSci. Publ. Math. (1977), no. 47, 33–186 (1978).[Maz78] , Rati<strong>on</strong>al isogenies of prime degree (with an appendix by D.Goldfeld), Invent. Math. 44 (1978), no. 2, 129–162.[Maz89], Deforming Galois representati<strong>on</strong>s, Galois groups over Q(Berkeley, CA, 1987), Springer, New York, 1989, pp. 385–437.[McC91] W. G. McCallum, Kolyvagin’s work <strong>on</strong> Shafarevich-Tate groups, L-functi<strong>on</strong>s <strong>and</strong> arithmetic (Durham, 1989), Cambridge Univ. Press,Cambridge, 1991, pp. 295–316. MR 92m:11062[Mer94][MFK94][Mil86][Miy89][MR91][MS01]L. Merel, Universal Fourier expansi<strong>on</strong>s of modular forms, On Artin’sc<strong>on</strong>jecture for odd 2-dimensi<strong>on</strong>al representati<strong>on</strong>s, Springer, 1994,pp. 59–94.D. Mumford, J. Fogarty, <strong>and</strong> F. Kirwan, Geometric invariant theory,third ed., Springer-Verlag, Berlin, 1994.J. S. Milne, Abelian varieties, Arithmetic geometry (Storrs, C<strong>on</strong>n.,1984), Springer, New York, 1986, pp. 103–150.T. Miyake, <strong>Modular</strong> forms, Springer-Verlag, Berlin, 1989, Translatedfrom the Japanese by Yoshitaka Maeda.B. Mazur <strong>and</strong> K. A. Ribet, Two-dimensi<strong>on</strong>al representati<strong>on</strong>s in thearithmetic of modular curves, Astérisque (1991), no. 196-197, 6,215–255 (1992), Courbes modulaires et courbes de Shimura (Orsay,1987/1988). MR MR1141460 (93d:11056)L. Merel <strong>and</strong> W. A. <strong>Stein</strong>, The field generated by the points of smallprime order <strong>on</strong> an elliptic curve, Internat. Math. Res. Notices (2001),no. 20, 1075–1082. MR 1 857 596


268 References[MSD74][Mum70][Ogg71]B. Mazur <strong>and</strong> P. Swinnert<strong>on</strong>-Dyer, Arithmetic of Weil curves, Invent.Math. 25 (1974), 1–61. MR 50 #7152D. Mumford, Abelian varieties, Published for the Tata Institute of FundamentalResearch, Bombay, 1970, Tata Institute of Fundamental ResearchStudies in Mathematics, No. 5.A. P. Ogg, Rati<strong>on</strong>al points of finite order <strong>on</strong> elliptic curves, Invent.Math. 12 (1971), 105–111. MR 45 #178[Pyl] E. Pyle, Abelian varieties over Q with large endomorphismalgebras <strong>and</strong> their simple comp<strong>on</strong>ents over Q,http://math.berkeley.edu/ ribet/pyle thesis.pdf.[Que77][Rib75][Rib77][Rib92][Rib94][Rib99][Ros86][RS01][Rub89][Sch06][Sch65]C. Queen, The existence of p-adic Abelian L-functi<strong>on</strong>s, Number theory<strong>and</strong> algebra (New York), Academic Press, 1977, pp. 263–288.K. A. Ribet, Endomorphisms of semi-stable abelian varieties over numberfields, Ann. Math. (2) 101 (1975), 555–562. MR 51 #8120, Galois representati<strong>on</strong>s attached to eigenforms with Nebentypus,<strong>Modular</strong> functi<strong>on</strong>s of <strong>on</strong>e variable, V (Proc. Sec<strong>on</strong>d Internat. C<strong>on</strong>f.,Univ. B<strong>on</strong>n, B<strong>on</strong>n, 1976) (Berlin), Springer, 1977, pp. 17–51. LectureNotes in Math., Vol. 601., Abelian varieties over Q <strong>and</strong> modular forms, Algebra <strong>and</strong>topology 1992 (Taejŏn), Korea Adv. Inst. Sci. Tech., Taejŏn, 1992,pp. 53–79. MR 94g:11042, Report <strong>on</strong> mod l representati<strong>on</strong>s of Gal(Q/Q), Motives (Seattle,WA, 1991), Amer. Math. Soc., Providence, RI, 1994, pp. 639–676., Torsi<strong>on</strong> points <strong>on</strong> J 0 (N) <strong>and</strong> Galois representati<strong>on</strong>s, Arithmetictheory of elliptic curves (Cetraro, 1997), Springer, Berlin, 1999,pp. 145–166. MR 2001b:11054M. Rosen, Abelian varieties over C, Arithmetic geometry (Storrs,C<strong>on</strong>n., 1984), Springer, New York, 1986, pp. 79–101.K. A. Ribet <strong>and</strong> W. A. <strong>Stein</strong>, <str<strong>on</strong>g>Lectures</str<strong>on</strong>g> <strong>on</strong> Serre’s c<strong>on</strong>jectures, Arithmeticalgebraic geometry (Park City, UT, 1999), IAS/Park City Math.Ser., vol. 9, Amer. Math. Soc., Providence, RI, 2001, pp. 143–232. MR2002h:11047K. Rubin, The work of Kolyvagin <strong>on</strong> the arithmetic of elliptic curves,Arithmetic of complex manifolds (Erlangen, 1988), Springer, Berlin,1989, pp. 128–136.I. Schur, Arithmetische Untersuchungen über endliche Gruppen linearerSubstituti<strong>on</strong>en, Sitz. Pr. Akad. Wiss. (1906), 164–184, Gesam. Abhl.,I, 177–197, Springer-Verlag, Berlin-Heidelberg-New York-Tokyo, 1973.O. F. G. Schilling (ed.), Arithmetical algebraic geometry. (Proceedingsof a C<strong>on</strong>ference held at Purdue University, December 5–7, 1963),Harper & Row Publishers, New York, 1965.


[SD67][SD74]References 269P. Swinnert<strong>on</strong>-Dyer, The c<strong>on</strong>jectures of Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer,<strong>and</strong> of Tate, Proc. C<strong>on</strong>f. Local Fields (Driebergen, 1966), Springer,Berlin, 1967, pp. 132–157. MR 37 #6287H. P. F. Swinnert<strong>on</strong>-Dyer, Analytic theory of abelian varieties, CambridgeUniversity Press, L<strong>on</strong>d<strong>on</strong>, 1974, L<strong>on</strong>d<strong>on</strong> Mathematical SocietyLecture Note Series, No. 14. MR 51 #3180[Ser73] J-P. Serre, A Course in Arithmetic, Springer-Verlag, New York, 1973,Translated from the French, Graduate Texts in Mathematics, No. 7.[Ser77][Ser79][Ser87][Ser88][Ser98], Linear representati<strong>on</strong>s of finite groups, Springer-Verlag, NewYork, 1977, Translated from the sec<strong>on</strong>d French editi<strong>on</strong> by Le<strong>on</strong>ard L.Scott, Graduate Texts in Mathematics, Vol. 42., Local fields, Springer-Verlag, New York, 1979, Translated fromthe French by Marvin Jay Greenberg., Sur les représentati<strong>on</strong>s modulaires de degré 2 de Gal(Q/Q),Duke Math. J. 54 (1987), no. 1, 179–230., Algebraic groups <strong>and</strong> class fields, Springer-Verlag, New York,1988, Translated from the French., Abelian l-adic representati<strong>on</strong>s <strong>and</strong> elliptic curves, A K PetersLtd., Wellesley, MA, 1998, With the collaborati<strong>on</strong> of Willem Kuyk <strong>and</strong>John Labute, Revised reprint of the 1968 original.[Shi59] G. Shimura, Sur les intégrales attachées aux formes automorphes, J.Math. Soc. Japan 11 (1959), 291–311.[Shi73][Shi94][Sil92][Sil94][ST68][Ste04][Ste07], On the factors of the jacobian variety of a modular functi<strong>on</strong>field, J. Math. Soc. Japan 25 (1973), no. 3, 523–544., Introducti<strong>on</strong> to the arithmetic theory of automorphic functi<strong>on</strong>s,Princet<strong>on</strong> University Press, Princet<strong>on</strong>, NJ, 1994, Reprint of the 1971original, Kan Memorial <str<strong>on</strong>g>Lectures</str<strong>on</strong>g>, 1.J. H. Silverman, The arithmetic of elliptic curves, Springer-Verlag, NewYork, 1992, Corrected reprint of the 1986 original., Advanced topics in the arithmetic of elliptic curves, Springer-Verlag, New York, 1994.J-P. Serre <strong>and</strong> J. T. Tate, Good reducti<strong>on</strong> of abelian varieties, Ann. ofMath. (2) 88 (1968), 492–517.W. <strong>Stein</strong>, Studying the Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture for <strong>Modular</strong>Abelian Varieties Using MAGMA, to appear in J. Cann<strong>on</strong>, ed.,Computati<strong>on</strong>al Experiments in Algebra <strong>and</strong> Geometry, Springer-Verlag(2004).<strong>William</strong> <strong>Stein</strong>, <strong>Modular</strong> forms, a computati<strong>on</strong>al approach, GraduateStudies in Mathematics, vol. 79, American Mathematical Society,Providence, RI, 2007, With an appendix by Paul E. Gunnells. MRMR2289048


270 References[SW97][Tat66][TW95][V´72][Wal85][Wil95][Wil00][Zag85a][Zag85b]C. M. Skinner <strong>and</strong> A. J. Wiles, Ordinary representati<strong>on</strong>s <strong>and</strong> modularforms, Proc. Nat. Acad. Sci. U.S.A. 94 (1997), no. 20, 10520–10527.J. Tate, On the c<strong>on</strong>jectures of Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer <strong>and</strong> a geometricanalog, Séminaire Bourbaki, Vol. 9, Soc. Math. France, Paris,1965/66, pp. Exp. No. 306, 415–440.R. Taylor <strong>and</strong> A. J. Wiles, Ring-theoretic properties of certain <strong>Hecke</strong>algebras, Ann. of Math. (2) 141 (1995), no. 3, 553–572.J. Vélu, Groupes de Galois attachés aux points d’ordre fini des courbeselliptiques sur un corps de nombres, d’après J. P. Serre, Séminairede Théorie des Nombres, 1971–1972 (Univ. Bordeaux I, Talence), Exp.No. 30, Lab. Théorie des Nombres, Centre Nat. Recherche Sci., Talence,1972, p. 12.J.-L. Waldspurger, Quelques propriétés arithmétiques de certainesformes automorphes sur GL(2), Compositio Math. 54 (1985), no. 2,121–171. MR 87g:11061aA. J. Wiles, <strong>Modular</strong> elliptic curves <strong>and</strong> Fermat’s last theorem, Ann. ofMath. (2) 141 (1995), no. 3, 443–551., The Birch <strong>and</strong> Swinnert<strong>on</strong>-Dyer C<strong>on</strong>jecture,http://www.claymath.org/prize problems/birchsd.htm.D. Zagier, <strong>Modular</strong> parametrizati<strong>on</strong>s of elliptic curves, Canad. Math.Bull. 28 (1985), no. 3, 372–384. MR 86m:11041, <strong>Modular</strong> points, modular curves, modular surfaces <strong>and</strong> modularforms, Workshop B<strong>on</strong>n 1984 (B<strong>on</strong>n, 1984), Springer, Berlin, 1985,pp. 225–248.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!