However, at Mercer | Mettl, we also believe that relying solely on the "AI Sense" score for evaluation is incorrect. While these scores assist in assessing the probability of AI usage in generating the code, they do not serve as definitive proof of plagiarism. Instead, they indicate similarity. To verify this, a human evaluator must review the code. Additionally, the "AI Sense" score should not be used as the sole criterion for candidate selection or rejection. Other factors such as test cases, plagiarism detection, proctoring flags, and browsing tolerance should also be taken into consideration.
How are “AI Sense” scores categorized?
Currently, a candidate's “AI Sense” score is labelled as one of the following:
If the AI Sense score is >= 80%, there is a very high probability that this code was generated by AI.
If the AI Sense score is < 80% and >= 60%, there is a high probability that this code was generated by AI.
If the AI Sense score is < 60% and >= 30%, there is a moderate probability that this code was generated by AI.
If the AI Sense score is < 30% and > 0%, there is a low probability that this code was generated by AI.
The AI Sense score will be marked as N/A in the following situations:
The candidate has scored zero or fewer marks.
We have not added support for the programming language used by the candidate.
The AI Sense score is 0%.
AI Sense calculation is in progress.
AI Sense was not enabled for the assessment when the report was generated.
Support for programming languages
As an experimental feature, we currently support the calculation of the “AI Sense” score for the following programming languages: C, C18, CPP, CPP17, Java6, Java7, Java8, Java11, Python2, Python3, PHP, PHP8, JavaScript8, JavaScript19, and Node.
For all other languages, the “AI Sense” score is marked as N/A.