Several readers asked me whether they could use AI to calculate taxes after reading my previous post AI Gives Better Answers Than Google. They want a tax projection to help with setting tax withholding, paying estimated taxes, or planning for Roth conversions.
My immediate reaction was that calculating taxes isnโt the best use of AI, because it falls under โverifiable factsโ and โlatest developmentโ categories. The retirement plan contribution limits, tax brackets, the maximum deduction amounts, etc., are all verifiable facts. The IRS sets them every year, and they are what they are. Just go to the IRS website for the latest numbers, or Google. If you ask AI, it had better get the latest numbers online anyway, because AIโs training lags.
On the other hand, tax rules are complicated. After you have all the latest numbers from the IRS, you still need to apply the complex rules โ what counts and what doesnโt, and which rates apply to which income. Online tax calculators usually donโt cover the current year until late in the year, and they can be either too simple or too complicated. It would be nice to have AI calculate taxes specifically for the types of income and deductions we have: not too simple with only limited inputs, and not too complicated with everything under the sun.
The Quiz
I thought I would test how well AI calculates taxes. I came up with this quiz question:
Jill, single, age 63, has these incomes in 2026:
- $30,000 from Social Security
- $10,000 from interest and pre-tax IRA withdrawals
- $37,000 from qualified dividends and long-term capital gains
- $1,000 from muni bond interest
Jill contributes $4,400 to her HSA and donates $1,500 in cash to charities. Whatโs Jillโs federal income tax in 2026?
I designed this question carefully to cover several calculations. Less than 85% of Social Security is taxable. The income consists of ordinary income, tax-exempt income, and investment income taxed at preferential rates. The ordinary income component goes across two tax brackets. So does the investment income. One deduction is above-the-line, and the other is below-the-line.
I added these instructions to encourage AI to get the latest tax numbers from the IRS:
Itโs important to calculate it accurately. Please use tax amounts only for 2026 and only from official IRS sources.
Initial Results
I sent the quiz to all four major AI chatbots: ChatGPT, Claude, Gemini, and Grok. I only used the free version in each one, with the free Thinking, Pro, or Expert mode enabled. I compared the answers to the correct result from my calculator in Calculator: How Much of My Social Security Benefits Is Taxable?
| Source | Answer |
|---|---|
| ChatGPT 5.4 Thinking Mini | $325 |
| Claude Sonnet 4.6 Extended | $1,910 |
| Gemini 3 Pro | $1,910 |
| Grok 4.20 Expert | $1,910 |
| My Tax Calculator | $1,640 |
My quiz was too hard! All four major AI models failed to give the correct answer. The $1,500 cash donation threw them off. It falls squarely in the โlatest developmentโ category. The charity donation deduction for non-itemizers is new. The AI models all have โdonations are deductible only when you itemizeโ imprinted in their training. Claude, Gemini, and Grok all would have given the correct answer if I hadnโt included the cash donation. ChatGPT was off on how much of Social Security is taxable.
When you see AI not giving you a deduction, you can raise a question and ask it to double-check. I followed up with this statement:
I heard that cash donations are deductible for people using the standard deduction, starting in 2026, up to a limit.
I also added this request for ChatGPT:
Another AI model calculated a different amount for how much Social Security is taxable. Please double-check your calculation to see whoโs right.
Round 2
All four AI chatbots double-checked and revised their answers.
| Source | Answer |
|---|---|
| ChatGPT 5.4 Thinking Mini | $1,640 โ |
| Claude Sonnet 4.6 Extended | $1,640 โ |
| Gemini 3 Pro | $1,411 |
| Grok 4.20 Expert | $1,411 |
| My Tax Calculator | $1,640 |
ChatGPT and Claude got the correct answer. Gemini and Grok mistakenly treated the cash donation deduction as above-the-line and used it to reduce the Social Security taxable amount.
I asked Gemini and Grok to clarify whether the HSA contribution and the charity donation should or shouldnโt be included in calculating the Social Security taxable amount. Gemini didnโt realize its mistake. It said both should be included. Grok refused to go further because I reached the message limit for not having an account.
Here are the full chat transcripts. They make an interesting read.
[No sharable link for Grok because I donโt have an account.]
Impressions
I donโt think we can draw definitive conclusions based on only one test. I call these impressions.
1. Whether AI can calculate taxes correctly depends on the complexity. Three out of the four major AI chatbots wouldโve given the correct answer in one go if the question didnโt include the new and tricky charity donation deduction. The explicit request to retrieve the latest information from official IRS sources and turning on the Thinking/Pro/Expert mode probably helped in improving accuracy.
2. AI works better when you make it a conversation. Donโt stop and say itโs useless when you spot some errors in the initial answer. Help the tool help you. Both ChatGPT and Claude reached the correct answer in Round 2 after I asked them to double-check.
3. It takes little effort to send the same question to two or more AI models. You donโt need to know which AI is correct when you get different answers. You can tell one AI that another AI said something different. Itโll re-examine.
4. AI was wrong because human sources were wrong. Gemini and Grok treated the donation deduction as above-the-line because many human sources incorrectly called it above-the-line. If you Googled, you would encounter those incorrect sources too. ChatGPT and Claude used more reliable sources and correctly identified the difference between above-the-line and not requiring itemizing deductions.
5. An online tax calculator is faster and more accurate if you know which one to use. Thatโs a big โif.โ Only saying โDonโt use AI because itโs often wrongโ doesnโt say where you will find more reliable sources.
As much as Iโd like to see everyone use my tax calculator, letโs face it: 99.99% of people donโt know that I exist. They have little chance of finding the few sources that do it accurately because Google doesnโt rank tax calculators by timeliness or accuracy.
Hereโs a small challenge for you:
Pretend youโre not familiar with tax calculations. Find another way to answer the quiz question. Donโt say โI have a spreadsheet,โ or โI use the super-duper Case Study Spreadsheet or Excel1040.โ 99.99% of people donโt have those. Please tell me in the comments section how you could do it otherwise.
Short of finding the good sources, the results from AI arenโt that bad (see #1).
6. AIโs general approach was correct even when it was wrong in some nuances. All four AI chatbots followed these correct steps:
- Calculate how much of Social Security is taxable
- Calculate AGI
- Calculate taxable income
- Separate ordinary income and preferential investment income, and know how they stack
- Apply different tax brackets
This general approach is โcommon knowledge of an insider.โ You can learn the steps from AI if youโre not familiar with tax calculations. Thatโs more valuable than just having a final number. When I saw that ChatGPT didnโt include details of some interim steps, I asked it to show its work. ChatGPT explained the details faster and better than I could (read the transcript).
An online tax calculator gives you a number but doesnโt explain the steps. AI is a better tool in teaching you how to fish than giving you a fish.
Learn the Nuts and Bolts
I put everything I use to manage my money in a book. My Financial Toolbox guides you to a clear course of action.
Read Reviews