Chat-GPT is a new impressive AI chatbot released by Open AI. Impressive applications of it can be found all over the internet. But can it do math?
By math, we do not mean simply perform computations. Its own design ensures that there will be computational problems it will be unable to solve. What I am more interested in is if it can solve problems that require some mathematical reasoning.
To choose our problems, we use the MMLU dataset. Galactica, a large language model recently released by Meta AI, has achieved some good results on this dataset. Their findings are in section 5.3 of their paper.
In what follows, we’ll pick a few problems and see how Chat-GPT handles them. If it fails, we will try to reword the question in hopes of achieving some partial results.
We note that the MMLU data set has multiple choice questions, but we just present the questions to chat-GPT. We just choose question from the abstract algebra section of the MMLU. Afterwards, we’ll prod chat-GPT for intuition and a proof. We’ll end by suggesting a few applications of chat-GPT for mathematicians.
Question 1
Me: The inverse of -i in the multiplicative group, is
Chat-GPT: The inverse of -i in the multiplicative group is
. The inverse of an element
in a group is an element
such that $latex xy=yx=1$, where
is the identity element of the group. In this group,
is the identity element, so the inverse of an element
is an element
such that
.
For example, the inverse of is
, because
. Similarly, the inverse of
is
, because
.
Remark: Chat-GPT answers this question perfectly and gives an excellent explanation. To be fair, this is an easier question.
Question 2
Me: How many homomorphisms are there of Z into Z_2? Please write in latex format.
Chat-GPT: There are two homomorphisms of into
. They are defined as follows:
such that
for all
such that
for all
Remark: Again, Chat-GPT is spot on. However, I asked this question a few times and it once said 1, and another time said two but gave a false justification:
There are only two homomorphisms of Z (the integers) into Z2 (the integers modulo 2).
One homomorphism sends every integer to 0, and the other homomorphism sends every integer to 1.
Question 3
Me: Let p = (1, 2, 5, 4)(2, 3) in S_5 . Find the index of <p> in S_5. Please write your answer in latex.
Chat-GPT:
The index of <p> in S_5 is the number of elements in S_5 that come before <p> in lexicographic order.
To find the index of <p>, we must first determine the cycle type of p. The cycle type of a permutation is a list of the lengths of its disjoint cycles, listed in decreasing order.
The cycle type of p is [2,2], since p consists of a 2-cycle and a 2-cycle.
Now we can use the cycle index of S_5 to determine the number of permutations in S_5 with cycle type [2,2].
The cycle index of S_5 is given by:
where is the number of permutations in S_5 with i cycles, and
is a variable representing a cycle of length i.
Substituting the cycle type [2,2] into the cycle index gives:
This simplifies to:
The coefficient of in this expression is
, so the number of permutations in S_5 with cycle type [2,2] is
.
Since <p> is the only permutation in S_5 with cycle type [2,2], it is the -th permutation in S_5.
Therefore, the index of <p> in S_5 is .
Remark: Woah! This is very wrong! The very first sentence is completely wrong. Also, its final conclusion, is not even an integer, which is certainly not right. Indeed, the index of a subgroup counts something and most be a positive integer.
We are able to salvage something though. Suppose we computed that the order of is 5, using our algebra skills. Then we can prompt chat-GPT with the following.
Me: Now suppose is a subgroup of order 5. What is the index of
in
?
Chat-GPT: The index of a subgroup in a group
is the number of left cosets of
in
. This is also equal to the number of right cosets of
in
.
Since the order of a subgroup divides the order of the group, the order of must divide the order of
, which is 120. The only divisors of 120 that are less than 120 are 1, 2, 3, 4, 5, 6, 8, 10, 12, 15, 20, 24, 30, 40, and 120. Therefore, the only possible orders for
are 1, 2, 3, 4, 5, 6, 8, 10, 12, 15, 20, 24, 30, 40, and 120.
Since the order of is 5, the index of
in
is
.
Remark: Here chat-GPT answers perfectly with a great explanation. Perhaps it would be nice to mention that the main fact being used is Lagrange’s theorem, though this can easily be obtained by asking it “What theorem did you use above?”
But what’s the intuition?
What makes a research mathematician great is a certain “intuition” for problems. Let’s see if chat-GPT can explain any of that.
Me: Can you explain the intuition behind the Cauchy-Schwarz inequality? Please write any math formula in latex.
Chat-GPT: Sure! The Cauchy-Schwarz inequality is a useful inequality that arises in various areas of mathematics, such as vector algebra, inner product spaces, and probability theory. It states that for any two vectors x
and y
, the following inequality holds:
Intuitively, the Cauchy-Schwarz inequality states that the inner product of two vectors x
and y
is “controlled” by the lengths of the vectors x
and y
. To see this, consider the special case where x
and y
are unit vectors (i.e., vectors of length 1). In this case, the inequality becomes:
which means that the inner product of x
and y
is always less than or equal to 1. This is consistent with our intuition that the inner product of two unit vectors should be “small” if they are “orthogonal” (i.e., perpendicular) and “large” if they are “parallel” (i.e., pointing in the same direction).
I hope this helps! Let me know if you have any questions.
Remark: This is not too bad, but could certainly be improved. For instance, explaining when equality holds in Cauchy-Schwarz (when one vector is a multiple of the other) helps give a more precise meaning the the last paragraph. One can see this first few paragraphs of this blog post of Terence Tao to see an explanation from one of the world’s leading mathematicians. Also, here is a math stack exchange on the topic.
Prove it.
Me: Can you prove it?
Chat-GPT: Sure! Here is a proof of the Cauchy-Schwarz inequality:
First, we can rewrite the inequality as:
Next, we can expand the left-hand side as follows:
This simplifies to:
Now, we can apply the arithmetic-geometric mean inequality to each term on the right-hand side:
This simplifies to:
Finally, we can apply the arithmetic-geometric mean inequality again to the term on the right-hand side:
This simplifies to:
And we are done! This completes the proof of the Cauchy-Schwarz inequality.
I hope this helps! Let me know if you have any questions.
Remark: Unfortunately, this proof is not quite correct. Both the proof method, as well as the details (i.e. its application of AM-GM) are severely flawed.
Conclusion
To be honest, Chat-GPT is quite impressive for a model that is not tailored for mathematics in any way. I think it could already, in its current format, help a mathematician to prepare lecture notes or even easier parts of a paper. Perhaps its greatest utility would come in automating certain parts of grants and job applications