Less constrained ordered task prep #1335

carver · 2018-09-28T23:59:45Z

What was wrong?

While sketching out a skeleton header sync, I found myself wanting OrderedTaskPreparation but with some constraints removed:

The ability to register tasks out of order (eg~ register a header when we don't have its parent yet)
The ability to register tasks without any prereqs (eg~ once we have a header registered, we don't need anything else besides all its ancestors)

How was it fixed?

Reworked how pruning was done, which didn't support out-of-order tasks registration.
Fairly simple to support tasks without prereqs, just had to check at register time if the task was already done
More tests to cover these cases (and the combination of these cases)

I decided that often we don't want to permit out-of-order tasks. It's useful to get the warnings in the downstream sync steps. So permitting out-of-order tasks is an opt-in flag.

This comes at some cost to pruning time, which could be reduced with some more effort, if it seems to be a bottleneck.

Cute Animal Picture

pipermerriam

I honestly still have trouble digesting this API you've created. That isn't to say there is anything wrong with it, more that I haven't actually worked directly with it in any way.

What I don't see in this PR is an update to the docstrings. Can you do a quick audit to see if they need to be updated for these changes?

pipermerriam · 2018-10-01T16:35:14Z

trinity/utils/datastructures.py

+        if task_id not in self._tasks:
+            raise ValidationError(f"No task {task_id} is present")
+        else:
+            return task_id


Returning the task_id here seems non-standard. Most of our validation functions either raise exceptions or return None.

pipermerriam · 2018-10-01T16:37:24Z

trinity/utils/datastructures.py

+
+    def _find_oldest_unpruned_task_id(self, finished_task_id: TTaskID) -> TTaskID:
+        get_dependency_of_id = compose(
+            self._validate_has_task,


Looks like this is the reason you're returning the task_id from the validate function. You can use cytoolz.do to accomplish that.

from cytoolz.curried import do get_dependency_of_id = compose( do(self._validate_has_task), ... )

Hah, I just ran across that last week actually. Pretty handy. Until ethereum/eth-utils#136 is done, this is my workaround:

curry(do)(self._validate_has_task)

pipermerriam · 2018-10-01T16:42:12Z

trinity/utils/datastructures.py

+            self._tasks.get,
+        )
+        ancestry_pipeline = repeat(get_dependency_of_id, self._max_depth)
+        return pipe(finished_task_id, *ancestry_pipeline)


Just functional programming nitpicking. I think you can do the same thing in a less readable manner as follows. Code may have an off-by-one error.

from cytoolz import iterate, nth return nth(self._max_depth, iterate(get_dependency_of_id, finished_task_id))

Fancy!

Hah, nice, yeah I like it better! I was actually looking for something like iterate the other day when I found do. 👍

pipermerriam · 2018-10-01T16:44:58Z

trinity/utils/datastructures.py

+        """
+        root_candidate = task_id
+        get_dependency_of_id = compose(self._dependency_of, attrgetter('task'), self._tasks.get)
+        for depth in count():


Can define an upper bound on this loop? The infiniteness of it seems unnecessary and it seems like it would be easier to debug in that case if it were to explicitly break and throw an exception at some reasonable upper bound.

carver · 2018-10-01T21:42:46Z

I honestly still have trouble digesting this API you've created. That isn't to say there is anything wrong with it, more that I haven't actually worked directly with it in any way.

Yeah, the API is completely designed around being handy from the point of view of the syncer. But still, it's worth iterating on if it's not pretty immediately grok-able. Just maybe not right now: I don't have any better ideas, and it's doing its job of making the syncer easier to write/comprehend.

carver force-pushed the less-constrained-ordered-task-prep branch from b0be735 to 80700b2 Compare September 29, 2018 00:10

pipermerriam approved these changes Oct 1, 2018

View reviewed changes

carver added 2 commits October 1, 2018 14:39

Add out-of-order tasks, in prep for skeleton sync

4242234

Use 0-prereqs, in prep for skeleton sync

7b5601d

carver force-pushed the less-constrained-ordered-task-prep branch from 80700b2 to 7b5601d Compare October 1, 2018 21:40

Update OrderedTaskPreparation docs

5da29cb

carver merged commit 4a2cadc into ethereum:master Oct 1, 2018

carver deleted the less-constrained-ordered-task-prep branch October 1, 2018 23:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Less constrained ordered task prep #1335

Less constrained ordered task prep #1335

carver commented Sep 28, 2018

pipermerriam left a comment

pipermerriam Oct 1, 2018

pipermerriam Oct 1, 2018

carver Oct 1, 2018

pipermerriam Oct 1, 2018

carver Oct 1, 2018

pipermerriam Oct 1, 2018

carver commented Oct 1, 2018

Less constrained ordered task prep #1335

Less constrained ordered task prep #1335

Conversation

carver commented Sep 28, 2018

What was wrong?

How was it fixed?

Cute Animal Picture

pipermerriam left a comment

Choose a reason for hiding this comment

pipermerriam Oct 1, 2018

Choose a reason for hiding this comment

pipermerriam Oct 1, 2018

Choose a reason for hiding this comment

carver Oct 1, 2018

Choose a reason for hiding this comment

pipermerriam Oct 1, 2018

Choose a reason for hiding this comment

carver Oct 1, 2018

Choose a reason for hiding this comment

pipermerriam Oct 1, 2018

Choose a reason for hiding this comment

carver commented Oct 1, 2018