Can ChatGPT Do My Math Homework?

AI is rapidly gaining prominence within the computer science community. Sophisticated software can now generate convincingly human-like prose for news stories, articles, ad copy, and even fiction, although the quality varies significantly. Software that can solve computer science textbook problems is a different matter. Though numerous AI programs claim to be able to do schoolwork, their answers typically bear little resemblance to what would receive full credit. My colleagues and I turned to the problem of giving computers the capability to solve algebra word problems, and here we present our Algebra Word Problem Tutor (AWPT). Because solving algebra word problems depends on natural language understanding, AWPT is relevant to a wider set of math word problems. In recent years, computer solvers have been implemented to solve algebra word problems, but these applications generate a single result that may not be easy for humans to follow and is typically not the solution a teacher expects. These solvers fail to communicate with users and hence cannot offer step-by-step explanations of how to get the answer. Step-by-step math word problem solver apps, such as Photomath, have the right idea but mostly give incorrect solutions and show no understanding of why one equation is chosen instead of another. This type of app can have devastating effects on a student and is often a hindrance to learning. AWPT aims to provide the ideal solution for the user and to build a better understanding of algebra concepts.

According to OpenAI, math questions are handled by a special model and rarely involve new equations. ChatGPT can attempt symbolic math and calculus problems, but often gets them wrong. It is better at more advanced math like linear algebra. When asked a math problem, GPT will provide an answer along with an explanation of how it got there, but in many cases the explanation will not be correct. An incorrect answer may stem from an inability to keep track of numbers that appeared fifteen words earlier, for example when it is asked for the speed of a train after the positions of the trains have been described. The model can fail to recall specific details when answering a multi-step math question. Because the model relies on attention to select the information it needs from the earlier context, it can fail on tasks with a large number of steps and on tasks where the question builds upon something said earlier. An example of a problem the model is likely to get wrong is one in which two students compare their ages two years ago and in three years’ time to find out whether they will be the same age in three years’ time. Tracking the quantities across those steps can be too much for the model.
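For reference, the age comparison described above reduces to a simple invariant: shifting both ages by the same number of years leaves the gap between them unchanged, so two students are the same age in three years only if they are the same age now. Here is a minimal Python sketch of that reasoning. The student names, ages, and function names are hypothetical and chosen only for illustration.

```python
def ages_at(offset_years, current_ages):
    """Shift every current age by the same number of years (negative = past)."""
    return {name: age + offset_years for name, age in current_ages.items()}


def same_age_at(offset_years, current_ages):
    """Return True if everyone has the same age at the given offset from now."""
    shifted = ages_at(offset_years, current_ages)
    return len(set(shifted.values())) == 1


# Hypothetical students and ages (assumptions, not taken from the article).
current = {"Ana": 14, "Ben": 15}

two_years_ago = ages_at(-2, current)
in_three_years = ages_at(3, current)

# The age gap is the same at both points in time.
gap_then = two_years_ago["Ben"] - two_years_ago["Ana"]
gap_later = in_three_years["Ben"] - in_three_years["Ana"]

print(gap_then == gap_later)
print(same_age_at(3, current))
```

Keeping the offset as an explicit parameter makes each step of the word problem a separate, checkable quantity, which is the kind of bookkeeping a multi-step question demands.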

The abstraction and precision of mathematics provide the most prominent tools for understanding the world around us. Mathematical skill is valuable not only for its own sake: the development of mathematical thinking also contributes to powerful problem-solving techniques in a wide variety of situations. Because of its pervasive importance, the best time to learn mathematics is as early as possible. If a student continues to build on their mathematical “foundation,” they will be far better off in terms of thinking skills, ability to solve complicated problems, and understanding of complex situations. People often say they have had an aversion to mathematics from a young age, and Moore’s research provides a good explanation for this. If an adult is averse to mathematics, they will often unknowingly behave in ways that discourage a child from learning it. Often this is simply a matter of the adult’s lack of confidence in their own mathematical ability. The most damaging statements are those indicating that the person was simply “never good at math.” A comparable statement about literacy would be socially unacceptable, yet this attitude toward mathematics is commonly tolerated with little question. And thus a cycle is perpetuated. One must break out of the cycle, and using a tool such as ChatGPT for math homework may, in fact, be a good place to start.

#### 4.1 Overview

Given the rapid development of information technology, ChatGPT was designed to enhance the communication capabilities of machines, with each iteration polished to feel closer to chatting with a human. OpenAI has continued its research on ChatGPT, experimenting with several iterations whose results, up to GPT-3.5, can be accessed at [https://beta.openai.com/](https://beta.openai.com/) or [https://github.com/openai](https://github.com/openai). This is an interesting breakthrough and an opportunity for mathematicians to use this software to automate some of their tasks, improve data retrieval and analysis, or even tutor students. This brief note covers some tips and tricks for using ChatGPT more effectively, especially for solving math problems, drawn from my own experience chatting with this AI. Who knows, in a future release OpenAI itself may provide a feature designed specifically for mathematicians, with support for math symbols and formulas.

In the future, a version of GPT-3 with a better understanding of mathematical problems could be helpful as a student’s homework aid. Such a tool would make it easy to get hints on difficult problems or to see a full solution when the student is stuck. Given the potential demand for such a service, and GPT-3’s versatility beyond solving math problems, it is likely that ChatGPT’s capabilities will be expanded toward this end.

Given the difficulties involved, a student is, for now, better off doing their math homework themselves. GPT-3’s math abilities are best used for quickly checking whether work the student has already completed is correct. This can be done by typing the math problem, or more simply the answer, as plain English input and seeing whether GPT-3’s solution matches the student’s.
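Whichever tool produces the answer, a result can also be confirmed independently by substituting it back into the original equation. Here is a minimal Python sketch, assuming a linear equation of the form ax + b = c. The function name and the example problem are hypothetical and are not part of ChatGPT or any tool mentioned here.

```python
from fractions import Fraction


def check_linear_solution(a, b, c, candidate):
    """Return True if `candidate` satisfies a*x + b = c exactly.

    Fraction arithmetic avoids floating-point rounding, so answers
    such as 7/2 are compared exactly.
    """
    x = Fraction(candidate)
    return Fraction(a) * x + Fraction(b) == Fraction(c)


# Hypothetical homework item (an assumption for illustration): 3x + 4 = 19.
student_answer = 5
print(check_linear_solution(3, 4, 19, student_answer))  # True
print(check_linear_solution(3, 4, 19, 6))               # False
```

Substitution only confirms that a final answer satisfies the equation. It says nothing about whether the intermediate steps were sound, which is where the explanations discussed above tend to go wrong.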

As of now, GPT-3, and thus ChatGPT, is not capable of reliably doing math homework. It can solve a wide array of word problems at all levels (elementary school to college), provided the question is stated in a way it understands. However, it may well interpret a math problem the wrong way and thus give the wrong answer. It is also incapable of understanding or solving math problems that would be done without a calculator. This problem is compounded by the fact that chatting with GPT-3 often results in it steering the conversation toward something else. Depending on the user’s input, it may treat a math problem as too off-topic and not attempt it at all.

