Use only one rpc client at time #612

AndreiEres · 2023-12-11T11:32:30Z

Changes have been made to improve the stability of the subxt client (including the light client mode). It's done in two specific parts.

Updated subxt client implementation
Improved rpc node requests

Updated subxt client implementation

We used to spawn a client for a number of watched parachains or even more. Despite the memory and CPU usage it was a working approach until we decided to switch to a light client instead of a full RPC node. Spawning a light client for each parachain leads to overuse of resources and as a result - crashes the entire application.

What changed:

Subxt wrapper replaced by frontend-backend realization of executor.
Backend containing subxt client is initialized at startup.
Frontend can be cloned between services without duplicating the subxt client.
As a result, we have only one subxt client for the entire application.

Improved rpc node requests

Implementing the new subxt client revealed a hidden problem with the number of requests. One client was unable to process all requests at the same speed as many clients and became a bottleneck. After investigation it was found that the most common request is to check the HRMP channels of a given parachain (about 80%).

We used to fetch the ingress and egress channel index of each parachain and then check each channel. It could be optimized because most of the attempts were empty, i.e. not every open channel was used to send a message in a given block.

What changed:

To retreive the inbound channels, we moved from checking the inbound channel index to the HRMP channel digest. It shows which paras sent a message at the given block number for a given recipient. So we exclude empty attempts when fetching inbound channels.
We joined requests whenever possible instead of awaiting them in a loop.
In the meantime, we found a bug with incorrect channel id for outgoing messages: current para_id was set as receiver, but should be sender.

New implementation of HRMP channel requesting works well with a full RPC node, but light client can't hold that load. Since we currently only display HRMP metrics in CLI mode and don't send them to Prometheus, we decided to make them optional for a while. More work on improving HRMP fetching on the light client is coming.

Results

Processing of HRMP channels for one block, while watching all parachains in the Polkadot network.

	Requests	Time
Before	361	60.606s
After	180	0.518s

Memory usage, 5 minutes while watching all parachains in the Polkadot network.

	Memory, MB	Graph
Before	80.86
After, channels enabled	27.92
After, channels disabled	26.45

This reverts commit 02562ef.

sandreim

Nice! Did you measure startup time and resource usage vs before this change ?

sandreim · 2023-12-12T15:28:05Z

essentials/src/api/executor.rs

+}
+
+impl RequestExecutorError {
+	pub fn should_repeat(&self) -> bool {


Suggested change

pub fn should_repeat(&self) -> bool {

pub fn should_retry(&self) -> bool {

sandreim · 2023-12-12T15:28:51Z

essentials/src/api/executor.rs

+
+		loop {
+			tokio::select! {
+				message = from_frontend.recv() => {


you don't really need a select! here, you can just let message = from_frontend.recv().await

sandreim · 2023-12-12T15:39:27Z

essentials/src/api/executor.rs

+	}
+}
+
+pub trait RequestExecutorNodes {


I am a bit slow to understand what does the String represent?

is it the rpc node url ?

Yes, It's a bit odd way to allow pass different types of node urls as an argument.

sandreim · 2023-12-12T15:42:11Z

essentials/src/api/executor.rs

+	) {
+		let client = match build_client(&url, api_client_mode, &self.retry).await {
+			Some(v) => v,
+			None => return,


Shouldn't we log an error here ?

AndreiEres added 19 commits December 8, 2023 18:39

Add executor

f8e1ad7

Add more methods

0d1add7

Remove subxt wrapper

7b84a92

Update

a8bc600

Remove redundant Box

02562ef

Revert "Remove redundant Box"

c051cbb

This reverts commit 02562ef.

Fix lint errors

16b694f

Add state to rpc executor

77015cb

Update

4b112b2

Remove usage of raw executor

8e8d1a4

Remove handle from executor initialization

e4ab8b4

Update initialization

6fca2c2

Update executor building

48136ff

Update error handling

916bcc1

Update

06a01c6

Move build to executor

04fd76c

Update variables

71fd5bc

Update

2d7f2a6

Update building

1a4d6e4

AndreiEres changed the title ~~[WIP] Use only one rpc client at time~~ Use only one rpc client at time Dec 12, 2023

AndreiEres requested a review from sandreim December 12, 2023 11:26

Update executor traits

e049971

sandreim approved these changes Dec 12, 2023

View reviewed changes

AndreiEres added 7 commits December 12, 2023 17:11

Rename

ed8eb75

Fix misusing of select

c88b0ec

Log error with rpc client

26b1f20

Update fetching hrmp channels

e310cd3

Interim

05ab24b

Update variables

ea358a0

Use joined futures for HRMP requests

f61b42a

AndreiEres added 4 commits December 15, 2023 13:17

Refactor method

d69fcfb

Make metrics for hrmp_channels optional

51dc61d

Address clippy errors

3b557e1

FIx import

3e60cf4

AndreiEres merged commit 102fcad into master Jan 11, 2024
6 checks passed

AndreiEres deleted the AndreiEres/single-client branch January 11, 2024 12:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use only one rpc client at time #612

Use only one rpc client at time #612

AndreiEres commented Dec 11, 2023 •

edited

Loading

sandreim left a comment

sandreim Dec 12, 2023

sandreim Dec 12, 2023

sandreim Dec 12, 2023

sandreim Dec 12, 2023

AndreiEres Dec 12, 2023

sandreim Dec 12, 2023

	pub fn should_repeat(&self) -> bool {
	pub fn should_retry(&self) -> bool {

Use only one rpc client at time #612

Use only one rpc client at time #612

Conversation

AndreiEres commented Dec 11, 2023 • edited Loading

Updated subxt client implementation

Improved rpc node requests

Results

Processing of HRMP channels for one block, while watching all parachains in the Polkadot network.

Memory usage, 5 minutes while watching all parachains in the Polkadot network.

sandreim left a comment

Choose a reason for hiding this comment

sandreim Dec 12, 2023

Choose a reason for hiding this comment

sandreim Dec 12, 2023

Choose a reason for hiding this comment

sandreim Dec 12, 2023

Choose a reason for hiding this comment

sandreim Dec 12, 2023

Choose a reason for hiding this comment

AndreiEres Dec 12, 2023

Choose a reason for hiding this comment

sandreim Dec 12, 2023

Choose a reason for hiding this comment

AndreiEres commented Dec 11, 2023 •

edited

Loading