
[Bug]: order of similarity matters, WHY? #648

Open
dhandhalyabhavik opened this issue Sep 9, 2024 · 6 comments
@dhandhalyabhavik

Current Behavior

Using the default ONNX model, with the following score function:

def get_score(a, b):
    return evaluation.evaluation(
        {
            'question': a
        },
        {
            'question': b
        }
    )
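
For context, the evaluation object used above is not shown in the report; presumably it is GPTCache's default ONNX similarity evaluation. A minimal setup might look like the following (the import path is an assumption and may vary across GPTCache versions):

# Assumed setup (not part of the original report): GPTCache's
# default ONNX-based similarity evaluation used by get_score().
from gptcache.similarity_evaluation.onnx import OnnxModelEvaluation

evaluation = OnnxModelEvaluation()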

Case 1:

a = 'What is neural network?'
b = 'Explain neural network and its components.'
c = 'What are the key components of neural network?'
print(get_score(a, b))
print(get_score(a, c))
print(get_score(b, c))

0.7585506439208984
0.02885962650179863
0.0909486636519432

Case 2:

a = 'What is neural network?'
b = 'Explain neural network and its components.'
c = 'What are the key components of neural network?'
print(get_score(b, a))
print(get_score(c, a))
print(get_score(c, b))

0.17746654152870178
0.013074617832899094
0.8378676772117615

I only swapped the arguments from (x, y) to (y, x) when calling get_score; why do the scores change so drastically?

Expected Behavior

No response

Steps To Reproduce

No response

Environment

No response

Anything else?

No response

@SimFG
Collaborator

SimFG commented Sep 10, 2024

It seems you did not actually test swapping x, y to y, x. For a fair comparison you should compare
print(get_score(a, b)) with print(get_score(b, a)).

@dhandhalyabhavik
Author

I did; look at these lines:

print(get_score(a, b))  # in case 1
0.7585506439208984
print(get_score(b, a))  # in case 2
0.17746654152870178

@SimFG
Collaborator

SimFG commented Sep 11, 2024

It's surprising that such a phenomenon exists!

@Ali-Parandeh

Is it because the LLM replies differently when the ranking/ordering of content differs in a RAG application?

@SimFG
Collaborator

SimFG commented Sep 13, 2024

I don't know much about this part. In theory, the score should be computed from the distance between the two vectors, so swapping their positions should not affect it.
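
To illustrate that point: a plain vector-distance score such as cosine similarity is symmetric by construction, so swapping the arguments cannot change it. A minimal NumPy sketch (illustrative only, not GPTCache's actual code path):

import numpy as np

def cosine_similarity(u, v):
    # Symmetric by definition: cos(u, v) == cos(v, u).
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

u, v = np.random.rand(384), np.random.rand(384)
assert abs(cosine_similarity(u, v) - cosine_similarity(v, u)) < 1e-9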

@wxywb
Collaborator

wxywb commented Sep 13, 2024

We trained a cross-encoder model to evaluate similarity; conceptually, the score should not depend on the order of the pair. However, since it is BERT-based, some unusual behavior can occur: it is a lightweight transformer with no constraint enforcing that symmetry.
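
One way to see why the order can matter: a cross-encoder scores a pair by feeding both sentences through BERT as a single concatenated sequence ([CLS] a [SEP] b [SEP]), so (a, b) and (b, a) are literally different inputs and can produce different logits. A common workaround (a sketch built on the get_score helper from the report, not an official GPTCache API) is to average the two directions:

def symmetric_score(a, b):
    # Score both orderings and average them so the result no longer
    # depends on which sentence is passed first.
    return (get_score(a, b) + get_score(b, a)) / 2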
