I made this and thought you all might enjoy it, happy hacking!
I tried running this with some output from a Wizard-Vicuna-7B-Uncensored model and it returned
('Human', 0.06035491254523517)
So I don’t think this hits the mark. To be fair, I got it to generate something really dumb, but a perfect LLM detection tool will likely never exist.
The good thing is that it didn’t false-positive on my own words.
Below is the output of my LLM; there’s a decent amount of swearing, so heads up.
Edit:
Tried with a more sensible question and still got a false negative:
('Human', 0.03917657845587952)
Lmao what the hell. That was hilarious.
That is a genius approach, though a big challenge will probably be selecting the correct corpus.
It’s not, if you’re aware of Generative Adversarial Networks (GANs). It’s a losing game for the detector: anything you use to detect generated content just becomes a training signal that strengthens the generator, so by building a detector you’re making the model better at evading detection. At least for language models, people are already looking at how GANs can be applied.
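To make that concrete, here is a minimal, purely illustrative PyTorch sketch (toy tensors stand in for text, and none of it is this project's actual code): the detector's loss is exactly the gradient the generator trains against, so every improvement to the detector feeds its evasion.

    import torch
    import torch.nn as nn

    generator = nn.Linear(16, 32)                             # stand-in for a text generator
    detector = nn.Sequential(nn.Linear(32, 1), nn.Sigmoid())  # "machine vs. human" classifier

    gen_opt = torch.optim.Adam(generator.parameters(), lr=1e-3)
    det_opt = torch.optim.Adam(detector.parameters(), lr=1e-3)

    for step in range(100):
        fake = generator(torch.randn(8, 16))

        # Detector step: learn to flag generated samples as machine (label 1).
        # (Training on real human-written samples is omitted for brevity.)
        det_opt.zero_grad()
        det_loss = nn.functional.binary_cross_entropy(
            detector(fake.detach()), torch.ones(8, 1))
        det_loss.backward()
        det_opt.step()

        # Generator step: push the *same* detector's output toward "human" (0),
        # so every improvement to the detector becomes fuel for evading it.
        gen_opt.zero_grad()
        gen_loss = nn.functional.binary_cross_entropy(
            detector(fake), torch.zeros(8, 1))
        gen_loss.backward()
        gen_opt.step()

In a real adversarial setup the detector would also be trained on human-written samples; that side is left out here to keep the sketch short.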