identities: refactor is_malicious check to read minimal amount of data #6346

dshulyak · 2024-09-23T12:57:58Z

part of #6326

in the current implementation IsMalicious has to read whole identities table twice whenever new atx is received.
even if it table relatively small (not GB), it has to scan whole table on every query, best case it is served from memory and then only wastes cpu, in worst case it goes to disk if total amount of memory is low.

changes:

migration to add rowid to identities table, because the table stores a "large" proof and soon it will store many marriages, querying it without rowid is very inefficient
two new indexes, index that has only malicious identities, this will make IsMalicious check to read as little as possible from database, and will likely never hit the disk. index with marriages, this is used to filter or update identities with same marriage, as otherwise any query will have to scan whole identities table
refactor to invoke IsMalicious once
mark any new identity as malicious if marriage is malicious, while proof is stored in one place wherever original equivocation happened
mark all identities within marriage as malicious, when any becomes malicious

dshulyak · 2024-09-24T07:20:51Z

bors try

poszu · 2024-09-24T07:32:18Z

hare3/hare.go

+		err := h.db.WithTx(context.Background(), func(tx sql.Transaction) error {
+			return identities.SetMalicious(
+				h.db, equivocation.Messages[0].SmesherID, codec.MustEncode(proof), time.Now())
+		})
+		if err != nil {


Could you please explain why a transaction is used? Btw, the tx isn't used as h.db is passed to identities.SetMalicious() - is it on purpose?

SetMalicious runs more than one statement, if they are executed without transaction then on failure the state will be inconsistent, e.g only one of the atxs will be marked as malicious

OK makes sense, but in this case it would be great if SetMalicious enforced being run in a transaction by taking a sql.Transaction type instead of sql.Executor, wdyt? Such API would prevent mistakes like in this snippet.

poszu · 2024-09-24T07:54:09Z

activation/handler_v2.go

+	if atx.MarriageATX != nil {
+		receivedProof, err := identities.IsMarriageMalicious(tx, *atx.MarriageATX)
+		if err != nil {
+			return false, fmt.Errorf("checking if marriage ATX is malicious: %w", err)
+		}
+		if receivedProof != nil {
+			err = identities.SetMaliciousBecauseOfMarriage(tx, atx.SmesherID, *receivedProof)
+			if err != nil {
+				return false, fmt.Errorf("setting node as malicious because of marriage: %w", err)
+			}
+			return true, nil
+		}
+	}


I think this might not be sufficient. The marriage ATX (the one that marries IDs - it includes the marriage certificates) must also be marked as malicious when it is processed if any of the married IDs is already malicious. This was previously covered by matching by marriage ATX in the IsMalicious() func.

so there are 2 things now:

all identities with same marriage_atx that are present in db at the moment of SetMalicious are marked malicious

if some identity is learned after SetMalicious and it has same marriage_atx we update it here as malicious

it doesn't cover the case you described?

so the case is MarriageAtx is nil, so that it doesn't point to itself?
but instead it has some certificates, does it work if pick id of such atx as marriage_atx and run everything the same way?

although i don't see that covered in this query

SELECT 1 FROM identities WHERE ( marriage_atx = (SELECT marriage_atx FROM identities WHERE pubkey = ?1 AND marriage_atx IS NOT NULL) AND proof IS NOT NULL ) OR (pubkey = ?1 AND marriage_atx IS NULL AND proof IS NOT NULL);

i feel stupid there is even a failing test and i can't figure what is going on

The problem is that an ATX that contains a marriage and is syntactically valid marks all identities in the marriage set as malicious if at least one identity in the set is already malicious.

I do have a fix for the problem but it requires calling IsMalicious on all identities in a marriage ATX before updating the marriage set (which I think is OK, it should be much faster then inserting the knowledge about the set anyway)

spacemesh-bors · 2024-09-24T07:56:47Z

try

Build failed:

systest-status

hare4/hare.go

activation/handler_v1.go

fasmat

For now I believe this is good enough to merge. For the new malfeasance syncer we probably need to revisit the DB logic without making the performance of the ATX handler worse:

When a peer asks for the set of identities that are malicious we should reconciliate on the full set of identities (not just one identity per marriage set), because nodes might not agree on who's part of a set and who is not
Per marriage set we only send one proof to a peer asking together with a list of identities we consider to also be part of the marriage set the identity we proved malicious belongs to. This list might be incomplete in the view of the requesting node (it might now that additional identities belong to any marriage set).
All proofs served must cover the full set the nodes reconciliated on.

Example: two nodes compare their local list of malicious identities and node A realizies it is missing proofs for identities 100 - 120 while node B realizes that it doesn't consider identities 200 - 210 malfeasant yet

Node A requests proof for identity 100 from node B, node b sends proof together with proof that identities 101 - 110 also belong to that set. Node A now has proof for 100 - 110 and continues with 111 until it has proofs for all identities that B claims to know to be malfeasant and A doesn't.
Node B then requests proof for identity 200, Node A serves a proof for identity 220 (which it already knows to be malfeasant) and proof that 200 belongs to the same marriage set, it just updates it local set and continues from there.

fasmat

After thinking about the change again I think I have identified issues with the way we are updating information about the marriage set and just fixing the failing test is not sufficient to address those 🙁

fasmat · 2024-10-08T18:31:12Z

Superseded by #6378

dshulyak added 3 commits September 23, 2024 14:50

identities: refactor is_malicious check to read minimal amount of data

20510a0

finish with updates

5e3a43f

duplicate column

0bcec1d

dshulyak force-pushed the simplify-is-malicious branch from 811741f to 0bcec1d Compare September 24, 2024 07:16

dshulyak marked this pull request as ready for review September 24, 2024 07:17

dshulyak requested review from fasmat, poszu, ivan4th and acud as code owners September 24, 2024 07:17

spacemesh-bors bot added a commit that referenced this pull request Sep 24, 2024

Try #6346:

bc30278

poszu reviewed Sep 24, 2024

View reviewed changes

fasmat reviewed Sep 24, 2024

View reviewed changes

hare4/hare.go Outdated Show resolved Hide resolved

fasmat reviewed Sep 24, 2024

View reviewed changes

activation/handler_v1.go Outdated Show resolved Hide resolved

dshulyak added 2 commits September 24, 2024 11:39

overlooked hare db usage

5f7ba94

fix error handling after checkDoublePublish

d3f0825

dshulyak force-pushed the simplify-is-malicious branch from 20ea60c to d3f0825 Compare September 24, 2024 09:53

experiment

c03d1a1

fasmat approved these changes Oct 7, 2024

View reviewed changes

fasmat requested changes Oct 7, 2024

View reviewed changes

fasmat added a commit that referenced this pull request Oct 8, 2024

Simplify v1 malfeasance and integrate improvements from #6346

32c05bf

fasmat mentioned this pull request Oct 8, 2024

[Merged by Bors] - Malfeasance proof db update #6378

Closed

4 tasks

fasmat closed this Oct 8, 2024

fasmat deleted the simplify-is-malicious branch October 8, 2024 18:31

fasmat added a commit that referenced this pull request Oct 9, 2024

Simplify v1 malfeasance and integrate improvements from #6346

21eda9a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

identities: refactor is_malicious check to read minimal amount of data #6346

identities: refactor is_malicious check to read minimal amount of data #6346

dshulyak commented Sep 23, 2024 •

edited

Loading

dshulyak commented Sep 24, 2024

poszu Sep 24, 2024

dshulyak Sep 24, 2024

poszu Sep 24, 2024

poszu Sep 24, 2024

dshulyak Sep 24, 2024 •

edited

Loading

dshulyak Sep 24, 2024

dshulyak Sep 24, 2024

fasmat Oct 7, 2024

spacemesh-bors bot commented Sep 24, 2024

fasmat left a comment

fasmat left a comment

fasmat commented Oct 8, 2024

identities: refactor is_malicious check to read minimal amount of data #6346

identities: refactor is_malicious check to read minimal amount of data #6346

Conversation

dshulyak commented Sep 23, 2024 • edited Loading

dshulyak commented Sep 24, 2024

poszu Sep 24, 2024

Choose a reason for hiding this comment

dshulyak Sep 24, 2024

Choose a reason for hiding this comment

poszu Sep 24, 2024

Choose a reason for hiding this comment

poszu Sep 24, 2024

Choose a reason for hiding this comment

dshulyak Sep 24, 2024 • edited Loading

Choose a reason for hiding this comment

dshulyak Sep 24, 2024

Choose a reason for hiding this comment

dshulyak Sep 24, 2024

Choose a reason for hiding this comment

fasmat Oct 7, 2024

Choose a reason for hiding this comment

spacemesh-bors bot commented Sep 24, 2024

try

fasmat left a comment

Choose a reason for hiding this comment

fasmat left a comment

Choose a reason for hiding this comment

fasmat commented Oct 8, 2024

dshulyak commented Sep 23, 2024 •

edited

Loading

dshulyak Sep 24, 2024 •

edited

Loading