Science

erronis

(20,099 posts) Thu Apr 24, 2025, 06:23 PM Apr 24

'Squared blunder': Google engineer withdraws preprint after getting called out for using AI

https://retractionwatch.com/2025/04/24/google-ai-engineer-withdraws-arxiv-preprint-tortured-phrases-genai/

Two of the phrases in the paper identified as AI-generated

Extraction Watch is a good site to follow (I prefer RSS). Some of the rank foolishness exposed is amazing - and perhaps terrifying, as these published documents get picked up and cited elsewhere.

An expert in AI at Google has admitted he used the technology to help write a preprint manuscript that commenters on PubPeer found to contain a slew of AI-generated phrases like “squared blunder” and “info picture.”

The paper, “Leveraging GANs For Active Appearance Models Optimized Model Fitting,” appeared on arXiv.org in January but was withdrawn April 7. The author, Anurag Awasthi, is an engineering lead in AI infrastructure at Google. In a PubPeer comment, he described the paper as a “personal learning exercise.”

In March 2025, sleuth Guillaume Cabanac, creator of the Problematic Paper Screener, pointed out in a PubPeer comment the paper included several tortured phrases. These phrases indicate AI use and occur when large language models try to find synonyms for common phrases. In Awasthi’s paper, “linear regression” became “straight relapse,” and “error rate” became “blunder rate,” among others.

Awasthi replied to the comment saying “phrasing issues were unintentional artifacts from an earlier revision where automated tools were used to rephrase for variety.”

. . .

6 replies

= new reply since forum marked as read

Highlight:

'Squared blunder': Google engineer withdraws preprint after getting called out for using AI (Original Post) erronis Apr 24 OP

At least, he did not try to compute the "vile true blunder" for anything: that calculation can be evil. xocetaceans Apr 24 #1

Paper mill quality in - review mill quality out. Nice comment! erronis Apr 24 #2

It is interesting to read from the paper and try to guess what the AI was substituting into the text. xocetaceans Apr 24 #4

More pollution of our information ecosystem. Even if all the twits using generative AI stopped now, highplainsdem Apr 24 #3

I agree. n/t xocetaceans Apr 24 #5

I love retraction watch. The scientists in the lab where my wife works really appreciated her turning them on to it. NNadir Apr 25 #6

xocetaceans

(4,191 posts)

1. At least, he did not try to compute the "vile true blunder" for anything: that calculation can be evil.

Reply to erronis (Original post)

Thu Apr 24, 2025, 07:57 PM

Apr 24

It would be interesting to see if any of his plots sport "blunder bars"...

Upon looking (in a very cursory manner...still seemingly more deeply than Awasthi did) at Awasthi's paper to see about the much-hoped presence of "blunder bars", another humorous contribution to the paper's (seemingly high) "blunder rate" is the citation in this shortly following passage. (Someone should let the unfortunate Simon Baker know that apparently he is to be cited in the literature as "Bread Cook".)

Leveraging GANs For Active Appearance
Models Optimized Model Fitting
Anurag Awasthi
Google, USA
anuragaw@google.com

...

Fitting AAMs includes taking care of a non-straight [sic: nonlinear?] opti-
mization issue, where the objective is to limit (or expand) a
worldwide mistake [sic: global error?] (or closeness) measure between the infor-
mation picture and the underlying model occurrence.

...

Enhancement-based techniques, presented by Matthews and
Bread Cook [2], utilize Compositional Angle Drop (CGD)
calculations to scientifically limit the blunder measure.

...

REFERENCES

...

[2] I. Matthews and S. Baker, “Active appearance mod-
els revisited,” International Journal of Computer Vision
(IJCV), 2004.

...

https://arxiv.org/pdf/2501.11218v1

Does this count as (smart-)phoning in one's research? The AI-generated text is so glaringly bad that it seems unlikely that one could read it while being conversant with the technical jargon of the field and come away with anything other than the idea that such a paper is not even close to being publishable, even as a preprint. As noted above, citing a man named Baker as Bread Cook is purely risible.

Lastly, the commentary on PubPeer is very interesting to read:

https://pubpeer.com/publications/F19A87982B29CB5C819040BFD7E8EA

erronis

(20,099 posts)

2. Paper mill quality in - review mill quality out. Nice comment!

Reply to xocetaceans (Reply #1)

Thu Apr 24, 2025, 08:18 PM

Apr 24

Of course crap like this has existed ever since we were copying texts on papyrus or clay tablets. But since it can now be automated the crap has expanded billions of time (cite?).

I did enjoy looking at the pubpeer paper you listed. If I had known that "linear regression" was just a "relapse technique" I probably could have aced my stats classes.

xocetaceans

(4,191 posts)

4. It is interesting to read from the paper and try to guess what the AI was substituting into the text.

Reply to erronis (Reply #2)

Thu Apr 24, 2025, 09:10 PM

Apr 24

The following is still from the first page of the paper:

...

II. ACTIVE APPEARANCE MODELS

Dynamic Appearance Models (AAMs) [1, 2] are generative
parametric models intended to catch varieties in shape and
appearance for a particular class of items. AAMs are built
utilizing a bunch of pictures where the spatial places of
important milestones, xi = (xi, yi)T ∈ R2, are characterized
to address the item’s shape. These tourist spots are explained
physically ahead of time.

...

https://arxiv.org/pdf/2501.11218v1

Clearly, the AI has substituted "dynamic" for "active", and it also seems that "catch" is substituted for "detect", "varieties" for "changes", and "pictures" for "images".

The phrase "the spatial places of important milestones" might be something like "locations of waypoints(?)", but it is hard to know.
That phrase "the spatial places of important milestones" seems to correlate with the phrase "these tourist spots" in the final quoted sentence above. I am guessing "are explained physically ahead of time" might make the last sentence a twisted version of "These locations/waypoints are predetermined.".

But I (oh so sarcastically) say, "What unnamed person has cognizable epistemic epiphanies along this ground-based trajectory?"