作者: hxypqr
A crash introduction to BSD conjecture
The pdf version is A crash introduction to BSD conjecture .
We begin with the Weierstrass form of elliptic equation, i.e. look it as an embedding cubic curve in .
Definition 1 (Weierstrass form)
, In general the form is given by,
If
, then, we have a much more simper form,
Remark 1
Where
.
We have two way to classify the elliptic curve living in a fix field
. \paragraph{j-invariant} The first one is by the isomorphism in
. i.e. we say two elliptic curves
is equivalent iff
is a isomorphism such that .
Definition 2 (j-invariant) For a elliptic curve
, we have a j-invariant of
, given by,
Why j-invariant is important, because j-invariant is the invariant depend the equivalent class of under the classify of isomorphism induce by
. But in one equivalent class, there also exist a structure, called twist.
Definition 3 (Twist) For a elliptic curve
, all elliptic curve twist with
is given by,
So the twist of a given elliptic curve
is given by:
Remark 2 Of course a elliptic curve
is the same as
, induce by the map
.
But this moduli space induce by the isomorphism of is not good, morally speaking is because of the abandon of universal property. see \cite{zhang}. \paragraph{Level
structure} We need a extension of the elliptic curve
, this is given by the integral model.
Definition 4 (Integral model)
,
.
is regular and minimal, the construction of
is by the following way, we first construct
and then blow up.
is given by the Weierstrass equation with coefficent in
.
Remark 3 The existence of integral model need Zorn’s lemma.
Definition 5 (Semistable) the singularity of the minimal model of
are ordinary double point.
Remark 4 Semistable is a crucial property, related to Szpiro’s conjecture.
Definition 6 (Level
structure)
The weil pairing of
is given by a unit in cycomotic fields, i.e.
What happen if ? In this case we have a analytic isomorphism:
Given by,
Where , and the Weierstrass equation
is given by
. The full n tructure of it is given by
and the value of
, i.e.
Where is induce by
The key point is following:
Theorem 7
, the moduli of elliptic curves with full level n-structure is identified with
Now we discuss the Mordell-Weil theorem.
Theorem 8 (Mordell-Weil theorem)
The proof of the theorem divide into two part:
Remark 5 The proof is following the ideal of infinity descent first found by Fermat. The height is called Faltings height, introduce by Falting. On the other hand, I point out, for elliptic curve
, there is a naive height come from the coefficient of Weierstrass representation, i.e.
.
While the torsion part have a very clear understanding, thanks to the work of Mazur. The rank part of is still very unclear, we have the BSD conjecture, which is far from a fully understanding until now.
But to understanding the meaning of the conjecture, we need first constructing the zeta function of elliptic curve, .
\paragraph{Local points} We consider a local field , and a locally value map
, then we have the short exact sequences,
Topologically, we know are union of disc indexed by
,
. Define , then we have Hasse principle:
Theorem 9 (Hasse principle)
Remark 6 I need to point out, the Hasse principle, in my opinion, is just a uncertain principle type of result, there should be a partial differential equation underlying mystery.
So count the points in reduce to count points in
, reduce to count the Selmer group
. We have a short exact sequences to explain the issue.
I mention the Goldfold-Szipiro conjecture here. , there
such that:
\paragraph{L-series} Now I focus on the construction of , there are two different way to construct the L-series, one approach is the Euler product.
Where or
when
has bad reduction on
.
The second approach is the Galois presentation, one of the advantage is avoid the integral model. Given is a fixed prime, we can consider the Tate module:
Then by the transform of different embedding of , we know
, decompose it into a lots of orbits, so we can define
, the decomposition group of
(extension of
to
). We define
is the inertia group of
.
Then is generated by some Frobenius elements
So we can define
And then .
Faltings have proved is the invariant depending the isogenous class in the follwing meaning:
Theorem 10 (Faltings)
is an isogenous ivariant, i.e.
isogenous to
iff
,
.
Where come from an automorphic representation for
. Now we give the statement of BSD onjecture.
is the regulator of
, i.e. the volume of fine part of
with respect to the Neron-Tate height pairing.
be the volume of
Then we have,
.
.
Here is an explictly positive integer depending only on
for
dividing
.
SL_2(Z) and its congruence subgroups
The pdf version is SL_2(Z) and its congruence subgroups.
We know we can always do the following thing:
Remark 1 Why it is
but not 1? if it is 1, then the action
distribute is not trasitive on
, i.e. every element in unite group present a connected component
Now we consider the subgroup .
We are most interested in the case . So how to investigate
? We can look at the action of it on something, for particular, we look at the action of it on Riemann sphere, i.e.
given by fraction linear map:
Remark 2 What is fraction linear map? This action carry much more information than the action on vector, thanks for the exist of multiplication in
and the algebraic primitive theorem. Due to I always looks the fraction linear map as something induce by the permutation of the roots of polynomial of degree 2, this is true at least for fix points, and could natural extension. So how about the higher dimension generate? consider the transform of
tuples induce by polynomial with degree
?
Remark 3
, then
action faithful on
, i.e. except identity, every action is nontrivial. This is easy to be proved, observed,
- Up half plane
is invariant under the action of
, i.e.
,
. The proof is following,
Now we focus on or the same,
. All the argument for
make sense for
Fix , define,
Then is the kernel of map
, i.e. we have short exact sequences,
Remark 4 The relationship of
is just like
.
Definition 1 (Congruence group) A subgroup of
is called a congruence group iff
,
.
Example 1 We give two examples of congruence subgroups here.
Definition 2 (Fundamental domain)
Now here is a theorem charistization the fundamental domain.
Theorem 3 This domain
is a fundamental domain of
![]()
Proof: Ths key point is have two generators,
.
.
Thanks to this two generator exactly divide the action of on
into a lots of scales, then
is a fundamental domain is a easy corollary.
Remark 5 This is not rigorous,
need be replace by
, but this is very natural to get a modification to a right one.
Remark 6
are
equivalent iff
and
or if
on the unit circle and
![]()
Remark 7 If
, then
expect in the following three case:
if
.
if
.
if
.
Where
.
Remark 8 The group
is generated by the two elements
,
. In other word, any fraction linear transform is a “word” induce by
. But not free group, we have relationship
.
The natural function space on is the memorphic function, under the map:
, it has a
-expension,
And there are only finite many negative such that
.
Dirichlet hyperbola method
A pdf version is Dirichlet hyperbola method.
1. Introduction
Remark 1 I thought this problem initial 5 years ago, cost me several days to find a answer, I definitely get something without the argument of Dirchlet hyperbola method and which is weaker but morally the same camparable with the result get by Dirichlet hyperbola method.
Remark 2 How to get the formula:
In fact,
Which is the integer lattices under or lying on the hyperbola
.
Remark 3 By trivial argument, we can bound the quantity as following way,
The error term is
, which is too big. But fortunately we can use the symmetry of hyperbola to improve the error term.
Proof:
Given a natural number k, use the hyperbola method together
with induction and partial summation to show that
where
denotes a polynomial of degree
with leading term
.
Remark 4
is the residue of
at
.
Proof:
We can establish the dimension 3 case directly, which is the following asymptotic formula,
The approach is following, we first observe that
The problem transform to get a asymptotic formula for the lattices under 3 dimension hyperbola. The first key point is, morally is the central point under the hyperbola.
Then we can divide the range into 3 parts, and try to get a asymptotic formula for each part then add them together. Assume we have:
.
.
.
Then the task transform to get a asymptotic formula,
But we can do the same thing for and then integral it. This end the proof. For general
, the story is the same, by induction.
Induction on and use the Fubini theorem to calculate
.
There is a major unsolved problem called Dirichlet divisor problem.
What is the error term? The conjecture is the error term is , it is known that
is not right.
Remark 5
To beats this problem, need some tools in algebraic geometry.
2. Several problems
, is there a asymptotic formula for
?
,
is a polynomial with degree
, is there a asymptotic formula for
?
,
is a polynomial with degree
, is there a asymptotic formula for
?
Proof:
Now we try to estimate
In fact, we have,
Where ,
.
So by 1 we know,
So we have,
Remark 6 In fact we can get
, by combining the theorem 3 and 1.
3. Lattice points in ball
Gauss use the cube packing circle get a rough estimate,
In the same way one can obtain,
Remark 7 Where
is the volume of the unit ball in
dimension.
Dirchlet’s hyperbola method works nicely for the lattic points in a ball of dimension . Langrange proved that every natural number can be represented as the sum of four squares, i.e.
, and Jacobi established the exact formula for the number of representations
Hence we derive,
This result extend easily for any , write
as the additive convolution of
and
, i.e.
Apply the above result for and execute the summation over the remaining
squares by integration.
Remark 8 Notice that this improve the formula 12 which was obtained by the method of packing with a unit square. The exponent
in 16 is the best possible because the individual terms of summation can be as large as the error term (apart from
), indeed for
we have
if
is odd by the Jacobi formula. The only case of the lattice point problem for a ball which is not yet solved (i.e. the best possible error terms are not yet established) are for the circle(
) and the sphere (
).
Theorem 4
4. Application in finite fields
Suppose is a irreducible polynomial. And for each prime
, let
By Langrange theorem we know . Is there a asymptotic formula for
A general version, we can naturally generated it to algebraic variety.
Is there a asymptotic formula for
Example 1 We give an example to observe what is involved.
. We know
is solvable iff
or
. One side is easy, just by Fermat little theorem, the other hand need Fermat descent procedure, which of course could be done by Willson theorem. In this case,
Which is a special case of Dirichlet prime theorem.
Let be an algebraic number field, i.e. the finite field extension of rational numbers, let
Theorem 5
is a ring, we call it the ring of integer of
.
- He showed further every non-zero ideal of
could write as the product of prime ideal in
uniquely.
- the index of every non-zero ideal
in
is finite, i.e.
, and we can define the norm induce by index.
Then the norm is a multiplication function in the space of ideal, i.e.
.
- Now he construct the Dedekind Riemann zeta function,
Now we consider the analog of the prime number theorem. Let , does the exist a asymptotic formula,
Given a prime , we may consider the prime ideal
Where is different prime ideal in
. But the question is how to find these
? For the question, there is a satisfied answer.
Lemma 6 (existence of primitive element) There always exist a primetive elements in
, such that,
Where
is some algebraic number, which’s minor polynomial
.
Theorem 7 (Dedekind recipe) Take the polynomial
, factorize it in the polynomial ring
,
Consider
. Then apart from finite many primes, we have,
Where
.
Remark 9 The apart primes are those divide the discriminant.
Now we can argue that 4 is morally the same as counting the ideals whose norm is divide by in a certain algebraic number theory.
And we have following, which is just the version in algebraic number fields of 2.
Theorem 8 (Weber)
of ideals of
with norm
equal to,
Diophantine approximation
I explain some general ideal in the theory of diophantine approximation, some of them is original by myself, begin with a toy model, then consider the application on folklore Swirsing-Schmidt conjecture.
\tableofcontents
1. Dirichlet theorem, the toy model
The very basic theorem in the theory of Diophantine approximation is the well known Dirichlet approximation theorem, the statement is following.
Theorem 1 (Dirichlet theorem) for all
is a irrational number, we have infinity rational number
such that:
Remark 1 It is easy to see the condition of irrational is crucial. There is a best constant version of it, said, instead of
, the best constant in the suitable sense for the theorem 1 should be
and arrive by
at least. The strategy of the proof of the best constant version involve the Frey sequences.
Now we begin to explain the strategies to attack the problem.
\paragraph{Argument 1, boxes principle} We begin with a easiest one, i.e. by the argument of box principle, the box principle is following,
Theorem 2 (Boxes principle) Given
and two finite sets
, set
, if we have a map:
Then there exists a element
such that there exist at least two element
,
.
Proof: The proof is trivial.
Now consider, , the sequences
, then
. Divide
in an average way to
part:
. Then the linear structure involve (which, in fact play a crucial role in the approach). And the key point is to look at
and integers.
\paragraph{Argument 2, continue fractional} We know, for irrational number ,
have a infinite long continue fractional:
And we have,
Then .
\paragraph{Argument 3, Bohr set argument} We begin with some kind of Bohr set:
The key point is the shift of Bohr set, on the vertical line i.e. is very slow, and can be explained by
So:
in But in fact they are not really independent, as the number of Bohr sets increase, then you can calculate the correlation, thanks to the harmonic sires increasing very slowly, wwe can get something non trivial by this argument, but it seems not enough to cover the whole theorem 1.
\paragraph{Argument 4, mountain bootstrap argument} This argument is more clever than 3, although both two arguments try to gain the property we want in 1 from investigate the whole space but not
, this argument is more clever.
Now I explain the main argument, it is nothing but sphere packing, with the set of balls
and define its subset
Then , and
. If we can proof,
Remark 2 If we can proof 3, it is easy to see the theorem 1 follows.
Proof: The proof follows very standard in analysis, may be complex analysis? Key point is we start with a ball , whatever it is, this is not important, the important thing is we can take some ball
with the center of
in
, then try to consider
to extension
and then we find the boudary is also larger then we can extension again, step by step just like mountain bootstrap argument. So we involve in two possible ending,
- The extension process could extension
to whole space.
- we can not use the extension argument to extension to the whole space.
If we are in the first situation, then we are safe, there is nothing need proof. If we are in second case, anyway we take a ball . Then try to find good ball
to approximate
, but this is difficult…
Remark 3 Argument 1 is too clever to be true in generalization, argument 2 is standard, by the power of renormalization. argument 3 and argument 4 have gap… I remember I have got a proof similar to argument 4 here many years ago, but I forgot how to get it…
2. Schimidt conjecture
The Schimidt conjecture could be look as the generalization of Dirchlet approximation theorem 1 to algebraic number version, to do this, we need define the height of a algebraic number.
Definition 4 We say a number
is a
order algebraic number if and only is the minimal polynomial of
,
have degree
.
Definition 5 (Height) Now we define the height of a
th order algebraic number as
, Where
Now we state the conjecture:
Theorem 6 (Swiring-Schimidt conjecture) For all transendental number
, there is infinitely
are
th algebraic number such that:
Where
is a constant only related to
but not
.
I point out the conjecture is very related to the map:
Where is the
th symmetric sum.
Remark 4
is a map
, what we consider is its inverse,
, but
is not smooth, it occur singularity when
for some
. And the map, as we know, the singularity depend on the quantity
.
Remark 5 I then say something about the geometric behaviour of the map
, as we know, what we have in mind is consider the map
as a distortion
, Then
is just the pullback of the canonical metric on
(morally) to
.
Discrete harmonic function in Z^n
There is some gap, in fact I can improve half of the argument of Discrete harmonic function , the pdf version is Discrete harmonic function in Z^n, but I still have some gap to deal with the residue half…
1. The statement of result
First of all, we give the definition of discrete harmonic function.
Definition 1 (Discrete harmonic function) We say a function
is a discrete harmonic function on
if and only if for any
, we have:
In dimension 2, the definition reduce to:
Definition 2 (Discrete harmonic function in
) We say a function
is a discrete harmonic function on
if and only if for any
, we have:
The result establish in \cite{paper} is following:
Theorem 3 (Liouville theorem for discrete harmonic functions in
) Given
. There exists a constant
related to
such that, given a discrete harmonic function
in
satisfied for any ball
with radius
, there is
portion of points
satisfied
. then
is a constant function in
.
Remark 1 This type of result contradict to the intuition, at least there is no such result in
. For example. the existence of poisson kernel and the example given in \cite{paper} explain the issue.
Remark 2 There are reasons to explain why there could not have a result in
but in
,
- The first reason is due to every radius
there is only
lattices in
in
so the mass could not concentrate very much in this setting.
- The second one is due to there do not have infinite scale in
but in
.
- The third one is the function in
is automatically locally integrable.
The generation is following:
Theorem 4 (Liouville theorem for discrete harmonic functions in
) Given
. There exists a constant
related to
such that, given a discrete harmonic function
in
satisfied for any ball
with radius
, there is
portion of points
satisfied
. then
is a constant function in
.
In this note, I give a proof of 4, and explicit calculate a constant satisfied the condition in 3, this way could also calculate a constant
satisfied 4. and point the constant calculate in this way is not optimal both in high dimension and 2 dimension.
2. some element properties with discrete harmonic function
We warm up with some naive property with discrete harmonic function. The behaviour of bad points could be controlled, just by isoperimetric inequality and maximum principle we have following result.
Definition 5 (Bad points) We divide points of
into good part and bad part, good part
is combine by all point
such that
, and
is the residue one. So
.
For all
, we define
for convenient.
Theorem 6 (The distribution of bad points) For all bad points
in
, they will divide into several connected part, i.e.
and every part
satisfied
.
Remark 3 We say
is connected in
iff there is a path in
connected
.
Remark 4 the meaning that every point So the behaviour of bad points are just like a tree structure given in the gragh.
Proof: A very naive observation is that for all is a connected compact domain, then there is a function
This could be proved by induction on the diameter if . Then, if there is a connected component of
such that contradict to theorem 6 for simplify assume the connected component is just
, then use the formula 5we know
The last line is due to consider around . But this lead to:
which is contradict to the definition of
. So we get the proof.
Now we begin another observation, that is the freedom of extension of discrete harmonic function in is limited.
Theorem 7 we can say something about the structure of harmonic function space of
, the cube, you will see, if add one value, then you get every value, i.e. we know the generation space of
![]()
Proof: For two dimension case, the proof is directly induce by the graph. The case of dimensional is similar.
Remark 5 The generation space is well controlled. In fact is just like n orthogonal direction line in n dimensional case.
3. sktech of the proof for \ref
}
The proof is following, by looking at the following two different lemmas establish by two different ways, and get a contradiction.
\paragraph{First lemma}
Lemma 8 (Discrete poisson kernel) the poisson kernel in
. We point out there is a discrete poisson kernel in
, this is given by:
And the following properties is true:
,
.
Remark 6 The proof could establish by central limit theorem, brown motion, see the material in the book of Stein \cite{stein}. The key point why this lemma 8 will be useful for the proof is due to this identity always true
, So we will gain a lots of identity, These identity carry information which is contract by another argument.
\paragraph{Second lemma} The exponent decrease of mass.
Lemma 9 The mass decrease at least for exponent rate.
Remark 7 the proof reduce to a random walk result and a careful look at level set, reduce to the worst case by brunn-minkowski inequality or isoperimetry inequality.
\paragraph{Final argument} By looking at lemma 1 and lemma 2, we will get a contradiction by following way, first the value of on
increasing too fast, exponent increasing by lemma2, but on the other hand, it lie in the integral expresion involve with poisson kernel, but the pertubation of poisson kernel is slow, polynomial rate in fact…
\newpage
{99} \bibitem{paper} A DISCRETE HARMONIC FUNCTION BOUNDED ON A LARGE PORTION OF Z2 IS CONSTANT
\bibitem{stein} Functional analysis
Log average sarnak conjecture
This is a note concentrate on the log average Sarnak conjecture, after the work of Matomaki and Raziwill on the estimate of multiplication function of short interval. Given a overview of the presented tools and method dealing with this conjectue.
1. Introduction
Sarnak conjecture \cite{Sarnak} assert that for any obersevable come from a determination systems
, where
,
. The correlation of it and the Liuvillou function is 0, i.e. they are orthongonal to each other, more preseicesly it is just to say,
This is a very natural raised conjecture, Liuville function is the presentation of primes, due to we always believe the distribution of primes in should be randomness.
It has been known as observed by Landau \cite{Laudau} that the simplest case,
already equivalent to the prime number theorem. It is not difficult to deduce the spetial case of Sarnak conjecture when with the obersevation in $latex {(1)}&fg=000000$ come from finite dynamic system is equivalent to the prime number theorem in athremetic progress by the similar argument. Besides this two classical result, may be the first new result was established by Davenport,
Theorem 1 Let
,
is a inrational, then the obersevation come from
is orthogonal to Mobius function. due to
is a basis of
, suffice to proof,
There is a lots of spetial situations of Sarnak’s conjecture have been established, The parts I mainly cared is the following:
- Interval exchange map.
- Skew product flow.
- Obersevable come from One dimensional zero entropy flow.
- Nilsequences.
But in this note, I do not want to explain the tecnical and tools to establish this result, but considering an equivalent conjecture of Sarnak conjecture, named Chowla conjecture, and explain the underlying insight of the suitable weak statement, i.e. the log average Chowla conjecture and the underlying insight of it.
The note is organized as following way, in the next section $latex {(2)}&fg=000000$, we give a self-contained introduction on the tools called Bourgain-Sarnak-Ziegler critation, explain the relationship of this critation and the sum-product phenomenon, also given some more general critation along the philosephy use in establish the Bourgain-Sarnak-Ziegler critation, which maybe useful in following development combine with some other tools. The key point is transform the sum from linear sum to bilinear sum and decomposition the bilinear sum into diagonal part and off-diagonal part, use the assume in the critation to argue the off-diagonal part is small and on the orther hand the diagonal part is also small by the trivial estimate and the volume of diogonal is small, this is very similar to a suitable Caderon-Zugmund decomposition.
In section $latex {(4)}&fg=000000$, I try to give a proof sketch of the result of Matomaki and Raziwill, which is also a key tools to understanding the Sarnak conjecture, or equivalent the Chowla conjecture. The key points of the proof contains following:
- Find a suitable fourier indentity
- Construct a multiplication-addition dense subset
, and proof that the theorem MR hold we need only to proof it hold for
instead of
- Involve the power of euler product formula. divide the whole interval into a lot of small interval with smaller and smaller scale and a residue part. We look the part come from every small scale as a major term and look the residue part as minor term.
- Deal with the major term at every scale, by a combitorios identity and second moments method.
- find a enough decay estimate from a scale to the next smaller scale.
- Deal with the minor term by the H… lemma.
Due to the theorem of MR do not exausted the method they developed, we trying to make some more result with their method, Tao and Matomaki attain the average version of Chowla conjecture is true by this way, and combine this argument and the entropy decresment argument they established the 2 partten of the log average Chowla conjecture is true. Very recently Tao and his coperator proved the odd partten case of log average chowla conjecture is true, combine an argument of frustenberg crresponding principle and entopy decresment argument. But it seems the even and large than 2 case is much difficult and seems need something new to combine with the method of MR and entropy decresment and frunstenberg corresponfing principle to make some progress.
So, in section $latex {(5)}&fg=000000$, we give a self-contain introduction to the entropy decresment argument of Tao, and combine with the frustenberg corresponding principle.
In the last section $latex {(6)}&fg=000000$, I state some result and method and phylosphy of them I get on nilsequences and wish to combine them with the previous method to make some progress on log average Chowla conjecture on the even partten case.
\newpage
2. Bourgain-Sarnak-Zieglar creation
We begin with the easiest one, this is the main result established in \cite{BSZ}, I try to give the main ideal under the proof, but with a no quantitative version is the following,
Theorem 2 (Bourgain-Sarnak-Zieglar creation, not quantitative version) if for all primes
we have:
Then for multiplication function
we have
Remark 1 For simplify we identify
.
The idea is following, break the sum into a bilinear one, so, of course, we multiplication it with itself. i.e. we consider to control,
To control 4, we need exhausted the mutiplication property of
, we have
. We can not get good estimate for all term,
The condition in our hand if following,
So, just like the situation of Cotlar-Stein lemma \cite{Cotlar-Stein lemma}, we wish to estimate like following:
Then we consider divide the sum into diagonal part and non-diagonal part, as following,
But the first part is small, i.e.
Because of
and the second part is small, i.e.
Because diagonal part is small in
and trivial inequality
But the method in remark 2 is not always make sense in any situation, we need to construct two suitable sets and then break up
into
, this mean,
But this could be construct in this situation, thanks to the prime number theorem,
Theorem 3 (Prime number theorem)
Morally speaking, this is the statement that the primes, which is the generator of multiplication function, is not very sparse.
3. Van der curpurt trick
There is the statement of Van der carport theorem:
Theorem 4 (Van der curpurt trick) Given a sequences
in
, if
,
is uniformly distributed, then
is uniformly distributed.
I do not know how to establish this theorem with no extra condition, but this result is true at least for polynomial flow. \newpage Proof:
This type of trick could also establish the following result, which could be understand as a discretization of the Vinegradov lemma.
Remark 3
Uniformly distribution result of
: Given
,
coverages to a uniformly distribution in
as
.
Remark 4 But I definitely do not know how to establish the similar result when
.
Remark 5
This trick could also help to establish estimate of correlation of low complexity sequences and multiplicative function, such as result:
Maybe with the help of B-Z-S theorem.
\newpage
4. Matomaki and Raziwill’s work
In this section we explain the main idea underlying the paper \cite{KAISA MATOMA 虉KI AND MAKSYM RADZIWILL}. But play with a toy model, i.e. the corresponding corollary of the original result on Liouville鈥檚 function.
Definition 5 (Lioville’s function)
Remark 6
is equivalent to the prime number theorem 3.
The most important beakgrouth of analytic number theory is the new understanding of multiplication function on share interval, this result is established by Kaisa Matom盲ki and Maksym Radziwill. Two very young and intelligent superstars.
The main theorem in them article is :
Theorem 6 (Matomaki,Radziwill) As soon as
when
, one has:
for almost all
.
In my understanding of the result, the main strategy is:
- Parseval indetity, transform to Dirchelet polynomial.
- Involved by multiplication property, spectral decomposition.
- From linear to multilinear , Cauchy schwarz inequality.
- major term estimate.
- Estimate the contribution of area which is not filled.
4.1. Parseval indetity, transform to Dirchelet polynomial
We wish to establish the equality,
This is the norm, by Chebyschev inequality, this could be control by
norm, so we only need to establish the following,
We wish to transform from the discretization sum to a continue sum, that is,
Remark 7 There are two points to understand why 19 and 18 are the same.
.
and
is to make that
.
So the Magnitude of 18 and 19 are the same. i.e.
Now we try to transform 19 by Parseval indetity, this is something about the norms of the quality we wish to charge. It is just trying to understanding 19 as a quantity in physical space by a more chargeable quality in frequency space. Image,
Then . Note that,
So by Parseval identity, we have,
Remark 8 We know the Fejer kernel satisfied,
So morally speaking, we get the following identity.
In fact we do a cutoff, the quality we really consider is just:
established the monotonically inequality:
Theorem 7 (Paserval type identity)
Remark 9
In my understanding, This is a perspective of the quality, due to the quality is a multiplicative function integral on a domain
with additive structure, it could be looked as a lots of wave with the periodic given by primes, so we could do a orthogonal decomposition in the fractional space, try to prove the cutoff is a error term and we get such a monotonically inequality.
But at once we get the monotonically inequality, we could look it as a聽compactification process and this process still carry most of the information so lead to the inequality.
It seems something similar occur in the attack of the moments estimate of zeta function by the second author. And it is also could be looked as something similar to the 聽spectral decomposition with some basis come from multiplication generators, i.e. primes.
4.2. Involved by multiplication property, spectral decomposition
I called it is “spectral decomposition”, but this is not very exact. Anyway, the thing I want to say is that for multiplication function , we have Euler-product formula:
But anyway, we do not use the whole power of multiplication just use it on primes, i.e. leads to following result:
This is a identity about the function , the point is it is not just use the multiplication at a point,i.e.
, but take average at a area which is natural generated and compatible with multiplication, this identity carry a lot of information of the multiplicative property. Which is crucial to get a good estimate for the quality we consider about.
4.3. From linear to multilinear , Cauchy schwarz
Now, we do not use one sets , but use several sets
which is carefully chosen. And we do not consider [X,2X] with linear structure anymore , instead reconsider the decomposition:
On every it equipped with a bilinear structure. And
is a very small set,
which is in fact have much better estimate.
Now we just use a Cauchy-Schwarz:
4.4. major term estimate
4.5. estimate the contribution of area which is not filled
\newpage
5. Entropy dcrement argument
\newpage
6. Correlation with nilsequences
I wish to establish the following estimate: is the liouville function we wish the following estimate is true.
Where we have as
,
is a compact space.
I do not know how to prove this but this is result is valuable to consider, because by a Fourier identity we could transform the difficulty of (log average) Chowla conjecture to this type of result.
There is some clue to show this type of result could be true, the first one is the result established by Matomaki and Raziwill in 2015:
Theorem 8 (multiplication function in short interval)
is a multiplicative function, i.e.
.
as
, then we have the following result,
And there also exists the result which could be established by Vinagrodov estimate and B-S-Z critation :
Theorem 9 (correlation of multiplication function and nil-sequences in long interval)
is a multiplicative function, i.e.
.
is a polynomial function then we have the following result,
\newpage {9} \bibitem{Sarnak} Peter Sarnak, Mobius Randomness and Dynamics.
\texttt{https://publications.ias.edu/sites/default/files/Mahler }. \bibitem{Laudau} JA 虂NOS PINTZ (BUDAPEST). LANDAU鈥橲 PROBLEMS ON PRIMES.
\texttt{https://users.renyi.hu/~pintz/pjapr.pdf} \bibitem{BSZ} Knuth: Computers and Typesetting,
\texttt{http://www-cs-faculty.stanford.edu/\~{}uno/abcde.html}
\bibitem{Cotlar-Stein lemma} Almost orthogonality
\texttt{https://hxypqr.wordpress.com/2017/12/18/almost-orthogonality/}
\bibitem{KAISA MATOMA 虉KI AND MAKSYM RADZIWILL} KAISA MATOMA 虉KI AND MAKSYM RADZIWIL, MULTIPLICATIVE FUNCTIONS IN SHORT INTERVALS.
\texttt{https://arxiv.org/abs/1501.04585v4/}.
Two stupid question
The story of the infinite dimensional space of $\Delta$ is following, we eliminate ourself with compact smooth non-boundary manifold $M$ with metric $g$, then we have Betrami-Laplace operator $\Delta_g$. We could instead $\Delta_g$ by hodge laplace $dd^*+d^*d$, but let we consider $\Delta_g$ the eigenvalue problem:
$$\Delta_g u=\lambda u$$
A classical way to investigate the eigenvalue problem is according to consider variational principle and max-min principle. We equip the path integral on the function space $C^{\infty}(M)$:
$$E(f)=\frac{\int_M |\nabla u|^2}{\int_M |u|^2 }$$
Then it have a sequences of eigenvalue, negative of course: $$0<-\lambda_1<-\lambda_2<…<\lambda_k<…$$
Then things become interesting, the morse theory of infinite space involve, called the infinite space as $X$, so at least, shrink the far place of $X$ as a point, in physics, this mean, cut off at fix scale. And we can take the scale to infinite small, we use the cutoff one to approximation the real one. What I can do is the following, I can proof the eigenvalue function is uniformly distributed in $L^2(M_g)$ (after rescaling of course) and the classical weyl law(although can not give a good error term estimate), but thing become more complicated when I try to consider the infinite space $X_{M_g}$’s topology, at finite scale at least, i.e. $X_{M_g}^{h}$ which is the cut off at scale $h$. Among the other thing, I believe the following issue is true, but without ability to proof it:
>**Problem**
for every manifold $M$ and metric $g$ on $M$, the topology of infinite space $X_{M_g}$ is the same, beside this, the inverse could be true, i.e. If $X_{M_1},X_{M_2}$ is not homomorphism for some scale $h$ then $M_1,M_2$ is not homomorphism.
By intuition, I think it is depend by the underling manifold’s topology. But I do not have a rigorous proof, I definitely have a non-rigorous one, if ignore the coverage…
As I find this problem when I try to give a proof of weyl law, I do not check the reference, may be this problem is a classical one? As always, I will appreciate to any interesting comments and answers, thanks a lots!A
2.
We begin with our favorite situation, the Dirchlet problem on bounded simple-connected domain $\Omega$ in $\mathbb R^n$. Let $\lambda_1$ be the first eigenvalue of $$\Delta u=\lambda u \ in\ \Omega$$
$$u=0\ \ on\ \partial\Omega$$
Rescaling $u$ such that $\sup_{\Omega} u=1$, I think the following property of the first eigenvalue is true.
>**Problem**
We have, the Minkowski functional of $\Omega$, called $M_{\Omega}$ and the Minkowski functional with the ball $B$ such that $vol(B)=vol(\Omega)$, then along the level set of $u$, i.e. the fiber: $$\Omega=\cup_{t\in [0,1]}l_t, l_t:=\{t|x\in \Omega, u(x)=t\}$$
We pretend for the isolate point $l_1$ to be a ball with radius 0, so equipped it with the uniformly density at every direction in $S^1$, i.e. the mass distribution given by $M_B$ and the total mass coincide with the total mass induce by $M_{\Omega}$ in $l_0$, i.e.
$$\int_{e\in S_1}M_{\Omega}(e)d\mu=\int_{e\in S_1}M_{B}(e)d\mu$$
The measure $d\mu$ equipped on $S^1$ is the natural Haar measure. And the cost function is given by $c(x,y)=\|x-y\|^2$. Then, among this setting,
I wish the following property to be true:
Along the direction $1\to 0$, the transport of density $\partial_{t_0} M_{\cup_{t=t_0}^1l_t}$ given the unique optimal transport of the natural measure induce by $M(\Omega)$ and $M(B)$.
**Remark 1** As point out by SebastianGoette, the multiplicity of the first eigenvalue must be one, thanks to the eigenfunction never change the symbol, so we are in the best case.
**Remark 2**:I am not very sure this property could always true, there may be a center example when $\Omega$ is not convex, but I tend to believe it is true at least when $\Omega$ is convex.
**Remark 3**: As point out by Dirk, when you try to consider the optimal transport problem, you always need to point out the cost function $c(x,y)$ defined on $\Omega \times \Omega$, for there, I think the naive choice is $c(x,y)=\|x-y\|^2$
The thing I can proof is the following, the level set of $u$ should be convex by brunn-minkowski inequality, and some type of monotonically property, i.e. more and more like a ball when the level set is more and more shirking smaller form $\partial \Omega$ to the point $f$ arrive maximum.
I will appreciate for any relevant comments and answer, thanks!
Uncertainty principle
The pdf version is Uncertainty principle. The nice note of terrence tao seems given a nice answer for the problem below.
1. Introduction
Is there a Brunn-Minkowski inequality approach to the phenomenon charged by uncertainty principle? More precisely, is it possible to say some thing about the Gaussian distribution
to be the best choice that arrive minimum?
Remark 1 Or some other suitable distance space on reasonable function (may be some gromov hausdorff distance? Any way, to say the guassian distribution is the best function to defect the influence of uncertain principle.
I do not know the answer of the problem 1, but this is a phenomenon of a universal phylosphy, aid, uncertainty principle, heuristic:
It is not possible for both function and its Foriour transform
to be localized on small set.
Now let me give some approach by intuition to explain why the phenomenon of “uncertainty principle” could happen.
The approach is based on:
- level set decomposition.
- area formula (or coarea formula), anyway, some kind of change variable formula.
- integral by part.
- Basic understanding on exponential sum.
Let our function the Shwarz space, we begin with a intuition (not very rigorous) calculate:
Now we try to understanding the result of the calculate, it is,
The calculate is wrong, but not very far from the thing that is true, the key point is now the exponential sum involve. We could use the pole coordinate in the frequence space and get some very rough intuition of why the the uncertainty principle could occur.
Remark 2 Why we consider the level set decomposition, due to the integral is a combination of linear sum of the integral on every level set, so shape of level set is the key point.
The part of in 2 is a rotation on the level set, a wave correlation of it and the christization function
of level set
in the whole space, this is of course a exponential sum.
Now we can begin the final intuition explain of the phenomenon of uncertainty principle. If the density of function is very focus on some small part of the physics space, then it is the case for level sets of
, but we could say some thing for the exponential sum
3 related to the level set, just by very simply argument with hardy litterwood circle method or Persaval identity? Any way, something similar to this argument will make sense, due to if the diameter of level set focus ois small, then we can not get a decay estimate for
when
along one direction in frequency space, in fact we could say the inverse, i.e. it could not decay very fast.
2. Bernstein’s bound and Heisenberg uncertainty principle
2.1. Motivation and Bernstein’s bound
There is two different Bernstein’s bound, we discuss the first with the motivation, and proof the second rigorously. \paragraph{Form 1} is a invertible affine map, then for a ball
,
is a ellipsoid.
By a orthogonal transform we could make to be a diagonal matrix, i.e.
. It is said, for
or
is a smooth bump function,
, so we have,
We define dual of ,
.
Remark 3 Why there we use the metric
but not the standard inner product
? How to understand the choice?
Proposition 1 We have the following property:
.
Remark 4
This is a norm of
related to
.
Proof: Suffice to proof 2.
More quantitative we have rigorous one: \paragraph{Form 2} If ,
, then it is not possible for
to be concentrate on a scale much less than
.
Proposition 2 (Bernstein’s bound) Suppose
,
. Then,
Proof: case is trivial by Paserval identity, which said on
, fourier transform is a isometry,
. For general case, integral by part, and use trivial estimate,
2.2. Heisenberg inequality
Theorem 3 (Heisenberg uncertain principle)
, so
,
. then for any
, every direction, we have
Remark 5 We could understand the inequality by the following way. suffice to prove it with
and then by approximation argument.
, define
. then we have the following:
Remark 6 The inequality is shape, the extremizers being precisely given by the modulated Gaussians: arbitrary
There are two proof strategies I have tried, I try them for several hour but not work out with a satisfied answer, the method more involve, I explain what happen in section 1, I have not tried, I will try it later. Both this two strategies i face some difficulties, I explain why I can not work out them with a proof: \paragraph{Strategy 1} The first one is, we could work with of course, by approximation, then we find, by Paserval,
and are both true. then we use our favourite way to use Cauchy-Schwarz, the difficulty is we can not use a integral by part argument directly, even after restrict ourselves with monotonic radical symmetry inequality and by a rearrangement inequality argument, it seems reasonable due to rearrangement decreasing the kinetic energy as said in Lieb’s book. But even work with monotonic one, then one involve with some complicated form, try to use Fubini theorem to rechange the order of integral try to say something, it is possible to work out by this way but I do not know how to do. There is some calculate under this way,
but you know, at a point we have , the reasonable calculate is following,
We want , Then
Seems to be … I do not know.
\paragraph{Strategy 2} The second strategy is, in the quantity we lose two cone very near
, we need use the extra thing to make up them. May be effective argument come from some geometric inequality.
3. The Amerein-Berthier theorem
Next we investigate following problem, the problem is following: if are of finite measure, can there be a nonzero
with
and
? Some argument is folowing: Observe that:
Assume that: then
. So we have, at least
. Some dirty calculate show:
By Fubini, we calculate the Hilbert-Schmidt norm:
So is a compact operator and its
operator norm satisfied
. So if
then we can canculate we can not have
in the original question.
The story is in fact more interesting, the answer of the question is no even for , so in all case. We have the following quatitative theorem:
Theorem 4
finite measure in
, then
for some constant
.
Remark 7 There is a naive approach for this theorem: Area formula trick, the shape of level set. Obvioudly we have:
Let us do some useless further calculate:
So suffice to have:
But there is connter example given by modified scaling Gaussian distribution… The point is form 15 to 16 is too loose.
Following I given a right approach, following by my sprite on level set and area formula argument and discritization.
Proof: The story is the same for a discretization one. We need point out, change the space to
, then every thing become a discretization one, and the change could been argue as a approximation way. What happen then, we have a naive picture in mind which is:
What is the case with norm, it become the standard nner product on
, and the scale involve, i.e. we have the following basic estimate:
Now image if the density of concentrate in a very small area, then by a cut off argument we consider the supp of
,
is very small, then use the argument 18, we could conclute the density of
could not very concentrate in the fraquence space. The constant
could be given presicely by this way, but I do not care about it.
4. Logvinenko-Sereda theorem
Next we formulate some result that provide further evidence of the non-concentration property of functions with Fourier support on .
4.1. A toy model
Theorem 5 Let
an suppose that
satisfies,
If
satisfies
then
Where
as
.
Proof: This is a easy corollary of the argument I give in the proof of Amerein-Berthier theorem 4.
4.2. A refine version
Theorem 6 Suppose that a measurable set
satisfies the following “thinkness” condition: there exists
such that
where
is arbitrary but fixed. Assume that
. Then
where the constant
depends only on
and
.
Remark 8 This proof need some very good estimate come from several complex variables.
5. The Malgrange-Ehrenpreis theorem
Theorem 7 Let
be a bounded domain in
and let
be a polynomial, Then, for all
, there exists
such that
in a distribution sence.
Brunn-Minkowski inequality
In this short note, I posed a conjecture on Brunn-Minkwoski inequality and explain why we could be interested in this inequality, what is it meaning for further developing of some fully nonlinear elliptic equation come from geometry. The main part of the note devoted to discuss several different proof of classical Brunn-Minkowski inequality.
1. Introduction
I believe, every type of Brunn-Minkowski inequality, type of Brunn-Minkowski inequality is in some special sense and will be explained later, will be crucial with a corresponding regularity result of a fully nonlinear elliptic equation which could be realizable by geometric way which will also explained in further note.
So the key point is that Brunn-Minkowski inequality is crucial and have potential application, I posed a problem there and then consider the classical Brunn-Minkowski inequality, we give several proof of the classical Brunn-Minkowski inequality, everyone could help us to have a more refine understanding of the original difficulty with different angle.
Theorem 1 (conjecture) We have a map
We are willing to called the function
as the hamiltonian function. then we could consider the hamiltonian flow of the function
, but this could only true for a even dimension manifold to make there exists
that
is a non-degenerate closed
form.
Anyway we consider the level set of
, we get a foliation i.e
. we consider the gradient flow with
, called the gradient flow begin with
as
. And we wish the gradient flow have a addition structure on itself then we could consider what is the Brunn-Minkowski inequality in this setting, the condition is a group structure on the space of level set
, i.e.
Remark 1 take
in 1, this conjecture reduce to the toy model, i.e. classical Brunn-Minkowski inequality.
Remark 2 We could generate the problem to the problem which is charged by several energy function
, if the induced gradient flow is amenable, then this is somewhat similar with the one dimension case, I wish if we could do something for the single function
, then we can say something for the several functions involved case.
Remark 3 This could also generate to amenable group action case and quantization of it.
Meaning, the cohomology induce by a hamiltonian system on some special foliation on fiber of geometric bundle. This type of result could help to establish the vanish of the cohomology, the get the existence theory and regularity result for corresponding elliptic nonlinear differential equation. And solve the original problem I consider.
Now we given the statement of Brunn-Minkowski inequality.
Theorem 2 (brunn minkowski inequality) For
measurale set in
. we have following,
“>
The general spproach of Brunn-Minkwoski inequality is following,
- divide the measurable set
into small cubes.
- Shinking trick, transform the set into convex one.
for the first one, we have the following lemma,
Lemma 3
,
![]()
measurable set,
,
, and
, and
Proof: The proof of the lemma is a easy corollary of the construction of Lesbegue(or Borel) measurable algebra.
Remark 4 The existence of the property given in the lemma is not the key point, the key point is
.
Has this two simplify in hand, we could give several approach to proof the inequality and these proof carry information more than just a proof, they carry some information with the structure of space . \newpage
2. A proof with discretization
There is a lots of ways to attack the Brunn-Minkowski inequality, the most natural one is discretization. But unfortunately there is some technique obstacle for proof or even state the discretization version of “Brunn-Minkowski” inequality.
The “boundary” and “area” should not compatible.
And we need use the fact,
Now we just state what we expect it should transform in, because we have a fully understanding with the discretization model, there is a result named Cauchy-Daveport inequality.
Theorem 4 (cauchy-daveport inequality) There are two case, one in
, one in finite field
.
Proof: for the case, the story is more or less trivial, just do to a observation, if
, then
There exists a strictly increasing chain of length at least .
For the case, following is a graph to explain what happen, basically we define a operation on tuples, i.e.
, and make the additive energy
decreasing. after induction with this transform and the transform from a tuple to the minimum additive energy by translation, the additive energy decreasing and decreasing then arrive the global minimum. But it is easy to conclude in this case one of
become null set and then the inequality 8 follows.
But when we discrete the Brunn-Minkowski inequality, we expect a high dimension generation of the inequality 4. Naively we wish,
Theorem 5 (naive generation of cauchy-daveport inequality) For
, and
are finite sets,
But this is not the case, there is a counterexample for 5. We could construct some such that
, consider they be very thin line.
So why we are in this worse situation? because we lose the information of ,
. So they have the trend tending to make the “boundary” campatible with “area”. Two thin line in the same direction is exactly the worst case, which is just a equal condition of 1-dimension case.
One natural way to except the situation is to bounded the “isperimetric constant”, to assume varies in a subset of measurable set, with addition condition that
is bounded by some constant. But this is also not the suitable set for our inequality, I explain how to capture the information of the G-H coverage.
Now assume are convex bounded set, and we take a global orthogonal basis in
. named
. We give the definition of
discretization of
, named
.
Definition 6 (
discretization) The construction of
from
is following:
- divide
into
,
is the
cubes.
- use
or
instead of
depending on iff
, where
is a given number only rely on
. i.e.
- glue them, define
.
Now we describe the condition of rigorous meaning campatible with
discretization.
Under the basis, there is a coordinate we could know iff is the cube center at
if it is in
. Due to
is convex,
is lipchitz. So you will have some locolization property, said, at every fix discretization scale
, the position of
is morally known so the number of cubes in
in the one dimensional affine space
which is the subspace of
the number
is asymptopic to the
dimensional hausdorff measure of
. So at least,
Property 11 is crucial, which mean is really a n-dimensional space and automatically we have the bounded on isoperimetric constant
.
Now we can look at every and take limit
. In fact we a in the situation with Accumulation of wood to make the product have smallest volume. Not to optimized the tuples
but fix one of it, said
, optimized the other one, said
. This is the key point of proof, a little bit different from the argument of
dimensional 4 where we optimized the tuple.
Key point:
- we can ignore “small core”.
- This inequality is said, due to
, the convex of the functional
on convex set.
The way of discretization could not handle the problem but definitely said that the difficulty occur with the shape of boundaries .
3. A proof with “central of mass” and Minkowski functional
Definition 7 (Central of mass) The central of mass
of measurable set
, if exist, satisfied,
, there is a subspace
with codimension 1 divide
into two connected part
such that
then
.
Remark 5 For a measurable set
, if central of mass
exists, then there exist only one. This is a easy observation do to the definition of
, i.e. the intersection of suitable affine subspace in every direction.
Proof: It is easy to attain by take
different directions in
, then easy to proof every line
across it be definition of
.
Definition 9 (Minkowski functional) for a measurable set
and a point
, define
on
, such that
Remark 6 If
is convex, then
is a convex function on
, so it is lipchitz.
we have following formula for the measure of .
Theorem 10
Proof: trivial.
Now the task reduce fixing and
to optimized
make
small. It is the same as make
small when fix
and
. Due to
This lead to the whole story, given a proof of 3.
4. A proof with multi-scale analysis
This approach is a nonstandard one, due to I believe the renormlization or continue fractional or multilinear estimate is everywhere. We first play with a toy model, the rectangle.
Theorem 11 Brunn-Minkowski inequality is right for
are rectangles.
Proof:
$latex \displaystyle \begin{array}{rcl} 16 RHS & \overset{A-G}\leq &\frac{1}{d}\sum_i\frac{a_i}{a_i+b_i}+ \frac{1}{d}\sum_i\frac{b_i}{a_i+b_i}\\ & = & 1. \end{array} &fg=000000$
The story is following,
5. connection of Brunn-Minkowski inequality and Sobolev inequality, the firth proof
We begin with a calculate based on intuition and it is not rigorous.
The second line is due to I believe there such that it is a equality, by the equal condition of Minkowski inequality, in fact this is morally inverse of Minkowski inequality. The second reason in general case why the second inequality is true is due to a rescaling argument, change
, by the rescaling argument we conclude if there is a such inequality, the index of it must be the case.