Launchpad itself

Merge lp:~mwhudson/launchpad/more-task-scheduled-bug-408638 into lp:launchpad

more-task-scheduled-bug-408638
Merge into devel

Proposed by Michael Hudson-Doyle on 2009-08-06

Status:	Merged
Approved by:	Jonathan Lange on 2009-08-06
Approved revision:	no longer in the source branch.
Merged at revision:	not available
Proposed branch:	lp:~mwhudson/launchpad/more-task-scheduled-bug-408638
Merge into:	lp:launchpad
Diff against target:	None lines
To merge this branch:	bzr merge lp:~mwhudson/launchpad/more-task-scheduled-bug-408638
Related bugs:	Link a bug report

Reviewer	Date Requested	Status
Jonathan Lange (community)	2009-08-06	Approve on 2009-08-06
Canonical Launchpad Engineering	2009-08-06	Pending
Review via email: mp+9749@code.launchpad.net

Revision history for this message

Michael Hudson-Doyle (mwhudson) wrote on 2009-08-06:

EPIGRAMS IN PROGRAMMING:

58. Fools ignore complexity. Pragmatists suffer it. Some can avoid it. Geniuses remove it.

I hope I'm being a pragmatist here.

Hi Jono,

I hope you can make time to have a look at the branch some time today. If you don't have time for a full review, I'll get Tim to look at it tomorrow. But you already know what a DeferredLock is :)

If the code and tests don't make sense, then I've failed in my mission, but there are two issues addressed here:

1) Some race conditions around the puller exiting while a request for a job is pending (basically, change ITaskSource.stop to return deferred)
2) Addressing the behaviour where only one job gets pulled per run of the puller (see bug 408638)

Cheers,
mwh

Revision history for this message

Jonathan Lange (jml) wrote on 2009-08-06:

Download full text (12.4 KiB)

On Thu, Aug 6, 2009 at 10:18 AM, Michael Hudson<email address hidden> wrote:
> You have been requested to review the proposed merge of lp:~mwhudson/launchpad/more-task-scheduled-bug-408638 into lp:launchpad/devel.
>
> EPIGRAMS IN PROGRAMMING:
>
> 58. Fools ignore complexity. Pragmatists suffer it. Some can avoid it. Geniuses remove it.
>
> I hope I'm being a pragmatist here.
>

I think so.

> Hi Jono,
>
> I hope you can make time to have a look at the branch some time today. If you don't have time for a full review, I'll get Tim to look at it tomorrow. But you already know what a DeferredLock is :)
>
> If the code and tests don't make sense, then I've failed in my mission, but there are two issues addressed here:
>
> 1) Some race conditions around the puller exiting while a request for a job is pending (basically, change ITaskSource.stop to return deferred)
> 2) Addressing the behaviour where only one job gets pulled per run of the puller (see bug 408638)
>

Wow, concurrency really is quite hard!

However, I really do think that this branch is as simple as possible, barring any paradigm breakthroughs. Thanks for doing such a good job with it.

I've got a few comments and questions that I'd like you to address before this lands.

> === modified file 'lib/canonical/twistedsupport/task.py'
> --- lib/canonical/twistedsupport/task.py 2009-07-17 00:26:05 +0000
> +++ lib/canonical/twistedsupport/task.py 2009-08-06 09:08:47 +0000
> @@ -37,6 +37,12 @@
> """Stop generating tasks.
>
> Any subsequent calls to `stop` are silently ignored.
> +
> + :return: A Deferred that will fire when the source is stopped. It is
> + possible that tasks may be produced until this deferred fires.
> + The deferred will fire with a boolean; True if the source is still
> + stopped, False if the source has been restarted since stop() was
> + called.

Do we know this for sure?

What I mean is, is it possible for the source to have been restarted but for
this to return False. It smells like a possible race condition.

> """
>
>
> @@ -100,10 +106,13 @@
> clock = reactor
> self._clock = clock
> self._looping_call = None
> + self._polling_lock = defer.DeferredLock()

I think it's worth adding a comment on how this lock is used & why.

> + self._started = False
>

And maybe for this one too.

On Thu, Aug 6, 2009 at 10:18 AM, Michael Hudson<michael.hudson@canonical.com> wrote:
> You have been requested to review the proposed merge of lp:~mwhudson/launchpad/more-task-scheduled-bug-408638 into lp:launchpad/devel.
>
> EPIGRAMS IN PROGRAMMING:
>
> 58. Fools ignore complexity. Pragmatists suffer it. Some can avoid it. Geniuses remove it.
>
> I hope I'm being a pragmatist here.
>

I think so.

> Hi Jono,
>
> I hope you can make time to have a look at the branch some time today.  If you don't have time for a full review, I'll get Tim to look at it tomorrow.  But you already know what a DeferredLock is :)
>
> If the code and tests don't make sense, then I've failed in my mission, but there are two issues addressed here:
>
> 1) Some race conditions around the puller exiting while a request for a job is pending (basically, change ITaskSource.stop to return deferred)
> 2) Addressing the behaviour where only one job gets pulled per run of the puller (see bug 408638)
>

Wow, concurrency really is quite hard!

However, I really do think that this branch is as simple as possible, barring any paradigm breakthroughs. Thanks for doing such a good job with it.

I've got a few comments and questions that I'd like you to address before this lands.

> === modified file 'lib/canonical/twistedsupport/task.py'
> --- lib/canonical/twistedsupport/task.py	2009-07-17 00:26:05 +0000
> +++ lib/canonical/twistedsupport/task.py	2009-08-06 09:08:47 +0000
> @@ -37,6 +37,12 @@
>          """Stop generating tasks.
>  
>          Any subsequent calls to `stop` are silently ignored.
> +
> +        :return: A Deferred that will fire when the source is stopped.  It is
> +            possible that tasks may be produced until this deferred fires.
> +            The deferred will fire with a boolean; True if the source is still
> +            stopped, False if the source has been restarted since stop() was
> +            called.

Do we know this for sure?

What I mean is, is it possible for the source to have been restarted but for
this to return False. It smells like a possible race condition.

>          """
>  
>  
> @@ -100,10 +106,13 @@
>              clock = reactor
>          self._clock = clock
>          self._looping_call = None
> +        self._polling_lock = defer.DeferredLock()

I think it's worth adding a comment on how this lock is used & why.

> +        self._started = False
>

And maybe for this one too.

>      def start(self, task_consumer):
>          """See `ITaskSource`."""
>          self.stop()
> +        self._started = True
>          self._looping_call = LoopingCall(self._poll, task_consumer)
>          self._looping_call.clock = self._clock
>          self._looping_call.start(self._interval)
> @@ -122,15 +131,21 @@
>              # If task production fails, we inform the consumer of this, but we
>              # don't let any deferred it returns delay subsequent polls.
>              task_consumer.taskProductionFailed(reason)
> -        d = defer.maybeDeferred(self._task_producer)
> -        d.addCallbacks(got_task, task_failed)
> -        return d
> +        def poll():
> +            if self._started:
> +                d = defer.maybeDeferred(self._task_producer)
> +                return d.addCallbacks(got_task, task_failed)
> +        return self._polling_lock.run(poll)
>  
>      def stop(self):
>          """See `ITaskSource`."""
>          if self._looping_call is not None:
>              self._looping_call.stop()
>              self._looping_call = None
> +        self._started = False
> +        def _return_still_stopped():
> +            return not self._started
> +        return self._polling_lock.run(_return_still_stopped)
>

Reading this and the function above it, the logic seems essentially to say,
"don't return from stop while we're still polling".

This makes me think of my comment on the interface docstring above. Is this
behaviour a part of the contract, or is it an implementation detail?

>  
>  class AlreadyRunningError(Exception):
> @@ -164,6 +179,19 @@
>          self._worker_limit = worker_limit
>          self._worker_count = 0
>          self._terminationDeferred = None
> +        self._stopping_lock = None
> +
> +    def _stop(self):
> +        def _release_or_stop(still_stopped):
> +            if still_stopped and self._worker_count == 0:
> +                self._terminationDeferred.callback(None)
> +                # Note that in this case we don't release the lock: we don't
> +                # want to try to fire the _terminationDeferred twice!

I don't quite follow this. Why would releasing the lock fire the
_terminationDeferred again?

> +            else:
> +                self._stopping_lock.release()
> +        def _call_stop(ignored):
> +            return self._task_source.stop().addCallback(_release_or_stop)

Does this callback have to be added here? Couldn't it also be added directly
to the _stopping_lock.acquire() callback? I think that adding the callbacks in
sequence would make the code slightly easier to read.

> +        return self._stopping_lock.acquire().addCallback(_call_stop)
>

>      def consume(self, task_source):
>          """Start consuming tasks from 'task_source'.
> @@ -178,9 +206,7 @@
>              raise AlreadyRunningError(self, self._task_source)
>          self._task_source = task_source
>          self._terminationDeferred = defer.Deferred()
> -        # This merely begins polling. This means that we acquire our initial
> -        # batch of work at the rate of one task per polling interval. As long
> -        # as the polling interval is small, this is probably OK.

Why isn't this comment relevant any more?

> +        self._stopping_lock = defer.DeferredLock()
>          task_source.start(self)
>          return self._terminationDeferred
>

> === modified file 'lib/canonical/twistedsupport/tests/test_task.py'
> --- lib/canonical/twistedsupport/tests/test_task.py	2009-07-17 00:26:05 +0000
> +++ lib/canonical/twistedsupport/tests/test_task.py	2009-08-06 09:08:47 +0000
> @@ -7,7 +7,7 @@
>  
>  import unittest
>  
> -from twisted.internet.defer import Deferred
> +from twisted.internet.defer import Deferred, succeed
>  from twisted.internet.task import Clock
>  
>  from zope.interface import implements
> @@ -49,14 +49,19 @@
>  
>      implements(ITaskSource)
>  
> -    def __init__(self, log):
> +    def __init__(self, log, stop_deferred=None):
>          self._log = log
> +        if stop_deferred is None:
> +            self.stop_deferred = succeed(True)
> +        else:
> +            self.stop_deferred = stop_deferred
>  
>      def start(self, consumer):
>          self._log.append(('start', consumer))
>  
>      def stop(self):
>          self._log.append('stop')
> +        return self.stop_deferred
>  
>  
>  class TestPollingTaskSource(TestCase):
> @@ -144,6 +149,16 @@
>          # No more calls were made.
>          self.assertEqual(0, self._num_task_producer_calls)
>  
> +    def test_stop_deferred_fires_immediately_if_no_polling(self):
> +        # Calling stop when the source is not polling returns a deferred that
> +        # fires immediately with True.
> +        task_source = self.makeTaskSource()
> +        task_source.start(NoopTaskConsumer())
> +        stop_deferred = task_source.stop()
> +        stop_calls = []
> +        stop_deferred.addCallback(stop_calls.append)
> +        self.assertEqual([True], stop_calls)
> +
>      def test_start_multiple_times_polls_immediately(self):
>          # Starting a task source multiple times polls immediately.
>          clock = Clock()
> @@ -241,6 +256,74 @@
>          clock.advance(interval)
>          self.assertEqual(len(produced_deferreds), 2)
>  
> +    def test_stop_deferred_doesnt_fire_until_polling_finished(self):
> +        # If there is a call to the task producer outstanding when stop() is
> +        # called, stop() returns a deferred that fires when the poll finishes.
> +        # The value fired with is True if the source is still stopped when the
> +        # deferred fires.
> +        produced_deferred = Deferred()
> +        def producer():
> +            return produced_deferred
> +        task_source = self.makeTaskSource(task_producer=producer)
> +        task_source.start(NoopTaskConsumer())
> +        # The call to start calls producer.  It returns produced_deferred
> +        # which has not been fired, so stop returns a deferred that has not
> +        # been fired.
> +        stop_deferred = task_source.stop()
> +        stop_called = []
> +        stop_deferred.addCallback(stop_called.append)
> +        self.assertEqual([], stop_called)
> +        # When the task producing deferred fires, the stop deferred fires with
> +        # 'True' to indicate that the source is still stopped.
> +        produced_deferred.callback(None)
> +        self.assertEqual([True], stop_called)
> +
> +    def test_stop_deferred_fires_with_false_if_source_restarted(self):
> +        # If there is a call to the task producer outstanding when stop() is
> +        # called, stop() returns a deferred that fires when the poll finishes.
> +        # The value fired with is False if the source is no longer stopped
> +        # when the deferred fires.
> +        produced_deferred = Deferred()
> +        def producer():
> +            return produced_deferred
> +        task_source = self.makeTaskSource(task_producer=producer)
> +        task_source.start(NoopTaskConsumer())
> +        # The call to start calls producer.  It returns produced_deferred
> +        # which has not been fired so stop returns a deferred that has not
> +        # been fired.
> +        stop_deferred = task_source.stop()
> +        stop_called = []
> +        stop_deferred.addCallback(stop_called.append)
> +        # Now we restart the source.
> +        task_source.start(NoopTaskConsumer())
> +        self.assertEqual([], stop_called)
> +        # When the task producing deferred fires, the stop deferred fires with
> +        # 'False' to indicate that the source has been restarted.
> +        produced_deferred.callback(None)
> +        self.assertEqual([False], stop_called)
> +
> +    def test_stop_start_stop_when_polling_doesnt_poll_again(self):
> +        # XXX

Docstring here.

> +        produced_deferreds = []
> +        def producer():
> +            d = Deferred()
> +            produced_deferreds.append(d)
> +            return d
> +        task_source = self.makeTaskSource(task_producer=producer)
> +        # Start the source.  This calls the producer.
> +        task_source.start(NoopTaskConsumer())
> +        self.assertEqual(1, len(produced_deferreds))
> +        task_source.stop()
> +        # If we start it again, this does not call the producer because
> +        # the above call is still in process.
> +        task_source.start(NoopTaskConsumer())
> +        self.assertEqual(1, len(produced_deferreds))
> +        # If we now stop the source and the initial poll for a task completes,
> +        # we don't poll again.
> +        task_source.stop()
> +        produced_deferreds[0].callback(None)
> +        self.assertEqual(1, len(produced_deferreds))
> +
>      def test_taskStarted_deferred_doesnt_delay_polling(self):
>          # If taskStarted returns a deferred, we don't wait for it to fire
>          # before polling again.
> @@ -354,7 +437,7 @@
>          consumer.consume(source)
>          self.assertRaises(AlreadyRunningError, consumer.consume, source)
>  
> -    def test_consume_returns_deferred_doesnt_fire_until_tasks(self):
> +    def test_consumer_doesnt_finish_until_tasks_finish(self):
>          # `consume` returns a Deferred that fires when no more tasks are
>          # running, but only after we've actually done something.
>          consumer = self.makeConsumer()
> @@ -363,7 +446,7 @@
>          d.addCallback(log.append)
>          self.assertEqual([], log)
>  
> -    def test_consume_returns_deferred_fires_when_tasks_done(self):
> +    def test_consumer_finishes_when_tasks_done(self):
>          # `consume` returns a Deferred that fires when no more tasks are
>          # running.
>          consumer = self.makeConsumer()
> @@ -373,7 +456,7 @@
>          consumer.taskStarted(lambda: None)
>          self.assertEqual([None], task_log)
>  
> -    def test_consume_returns_deferred_fires_if_no_tasks_found(self):
> +    def test_consumer_finishes_if_no_tasks_found(self):
>          # `consume` returns a Deferred that fires if no tasks are found when
>          # no tasks are running.
>          consumer = self.makeConsumer()
> @@ -383,7 +466,37 @@
>          consumer.noTasksFound()
>          self.assertEqual([None], task_log)

These method renames are nice, thanks.

cheers,
jml

review: Needs Fixing

Revision history for this message

Michael Hudson-Doyle (mwhudson) wrote on 2009-08-06:

Download full text (13.9 KiB)

Jonathan Lange wrote:
> Review: Needs Fixing
> On Thu, Aug 6, 2009 at 10:18 AM, Michael Hudson<email address hidden> wrote:
>> You have been requested to review the proposed merge of lp:~mwhudson/launchpad/more-task-scheduled-bug-408638 into lp:launchpad/devel.
>>
>> EPIGRAMS IN PROGRAMMING:
>>
>> 58. Fools ignore complexity. Pragmatists suffer it. Some can avoid it. Geniuses remove it.
>>
>> I hope I'm being a pragmatist here.
>>
>
> I think so.

Good :) Some genius can clean up my mess later.

>> Hi Jono,
>>
>> I hope you can make time to have a look at the branch some time today. If you don't have time for a full review, I'll get Tim to look at it tomorrow. But you already know what a DeferredLock is :)
>>
>> If the code and tests don't make sense, then I've failed in my mission, but there are two issues addressed here:
>>
>> 1) Some race conditions around the puller exiting while a request for a job is pending (basically, change ITaskSource.stop to return deferred)
>> 2) Addressing the behaviour where only one job gets pulled per run of the puller (see bug 408638)
>>
>
> Wow, concurrency really is quite hard!

Yeah, no kidding.

> However, I really do think that this branch is as simple as possible, barring any paradigm breakthroughs. Thanks for doing such a good job with it.
>
> I've got a few comments and questions that I'd like you to address before this lands.
>
>> === modified file 'lib/canonical/twistedsupport/task.py'
>> --- lib/canonical/twistedsupport/task.py 2009-07-17 00:26:05 +0000
>> +++ lib/canonical/twistedsupport/task.py 2009-08-06 09:08:47 +0000
>> @@ -37,6 +37,12 @@
>> """Stop generating tasks.
>>
>> Any subsequent calls to `stop` are silently ignored.
>> +
>> + :return: A Deferred that will fire when the source is stopped. It is
>> + possible that tasks may be produced until this deferred fires.
>> + The deferred will fire with a boolean; True if the source is still
>> + stopped, False if the source has been restarted since stop() was
>> + called.
>
> Do we know this for sure?

Well it's an interface docstring, so we can say so.

> What I mean is, is it possible for the source to have been restarted but for
> this to return False. It smells like a possible race condition.

I think in a twisted world, it makes sense to talk about the state of
the source at the moment the deferred is fired. I guess you have to be
careful about any callbacks you add that themselves return deferred.

But anyway, until I did it like this, I had no idea how I was going to
make this all work.

>> """
>>
>>
>> @@ -100,10 +106,13 @@
>> clock = reactor
>> self._clock = clock
>> self._looping_call = None
>> + self._polling_lock = defer.DeferredLock()
>
> I think it's worth adding a comment on how this lock is used & why.
>
>> + self._started = False
>>
>
> And maybe for this one too.

I realized that "_loopingcall is not None" was identical with "_started"
so I deleted _started.

>> def start(self, task_consumer):
>> """See `ITaskSource`."""
>> self.stop()
>> + self._sta...

Jonathan Lange wrote:
> Review: Needs Fixing
> On Thu, Aug 6, 2009 at 10:18 AM, Michael Hudson<michael.hudson@canonical.com> wrote:
>> You have been requested to review the proposed merge of lp:~mwhudson/launchpad/more-task-scheduled-bug-408638 into lp:launchpad/devel.
>>
>> EPIGRAMS IN PROGRAMMING:
>>
>> 58. Fools ignore complexity. Pragmatists suffer it. Some can avoid it. Geniuses remove it.
>>
>> I hope I'm being a pragmatist here.
>>
> 
> I think so.

Good :)  Some genius can clean up my mess later.

>> Hi Jono,
>>
>> I hope you can make time to have a look at the branch some time today.  If you don't have time for a full review, I'll get Tim to look at it tomorrow.  But you already know what a DeferredLock is :)
>>
>> If the code and tests don't make sense, then I've failed in my mission, but there are two issues addressed here:
>>
>> 1) Some race conditions around the puller exiting while a request for a job is pending (basically, change ITaskSource.stop to return deferred)
>> 2) Addressing the behaviour where only one job gets pulled per run of the puller (see bug 408638)
>>
> 
> Wow, concurrency really is quite hard!

Yeah, no kidding.

> However, I really do think that this branch is as simple as possible, barring any paradigm breakthroughs. Thanks for doing such a good job with it.
> 
> I've got a few comments and questions that I'd like you to address before this lands.
> 
>> === modified file 'lib/canonical/twistedsupport/task.py'
>> --- lib/canonical/twistedsupport/task.py	2009-07-17 00:26:05 +0000
>> +++ lib/canonical/twistedsupport/task.py	2009-08-06 09:08:47 +0000
>> @@ -37,6 +37,12 @@
>>          """Stop generating tasks.
>>  
>>          Any subsequent calls to `stop` are silently ignored.
>> +
>> +        :return: A Deferred that will fire when the source is stopped.  It is
>> +            possible that tasks may be produced until this deferred fires.
>> +            The deferred will fire with a boolean; True if the source is still
>> +            stopped, False if the source has been restarted since stop() was
>> +            called.
> 
> Do we know this for sure?

Well it's an interface docstring, so we can say so.

> What I mean is, is it possible for the source to have been restarted but for
> this to return False. It smells like a possible race condition.

I think in a twisted world, it makes sense to talk about the state of
the source at the moment the deferred is fired.  I guess you have to be
careful about any callbacks you add that themselves return deferred.

But anyway, until I did it like this, I had no idea how I was going to
make this all work.

>>          """
>>  
>>  
>> @@ -100,10 +106,13 @@
>>              clock = reactor
>>          self._clock = clock
>>          self._looping_call = None
>> +        self._polling_lock = defer.DeferredLock()
> 
> I think it's worth adding a comment on how this lock is used & why.
> 
>> +        self._started = False
>>
> 
> And maybe for this one too.

I realized that "_loopingcall is not None" was identical with "_started"
so I deleted _started.

>>      def start(self, task_consumer):
>>          """See `ITaskSource`."""
>>          self.stop()
>> +        self._started = True
>>          self._looping_call = LoopingCall(self._poll, task_consumer)
>>          self._looping_call.clock = self._clock
>>          self._looping_call.start(self._interval)
>> @@ -122,15 +131,21 @@
>>              # If task production fails, we inform the consumer of this, but we
>>              # don't let any deferred it returns delay subsequent polls.
>>              task_consumer.taskProductionFailed(reason)
>> -        d = defer.maybeDeferred(self._task_producer)
>> -        d.addCallbacks(got_task, task_failed)
>> -        return d
>> +        def poll():
>> +            if self._started:
>> +                d = defer.maybeDeferred(self._task_producer)
>> +                return d.addCallbacks(got_task, task_failed)
>> +        return self._polling_lock.run(poll)
>>  
>>      def stop(self):
>>          """See `ITaskSource`."""
>>          if self._looping_call is not None:
>>              self._looping_call.stop()
>>              self._looping_call = None
>> +        self._started = False
>> +        def _return_still_stopped():
>> +            return not self._started
>> +        return self._polling_lock.run(_return_still_stopped)
>>
> 
> Reading this and the function above it, the logic seems essentially to say,
> "don't return from stop while we're still polling".
> 
> This makes me think of my comment on the interface docstring above. Is this
> behaviour a part of the contract, or is it an implementation detail?

I think it's part of the contract: in a message queue version,
presumably unsubscribing from the queue involves the network, i.e. a
deferred will be in there somewhere (even stopListening() returns
deferred!), so you can't call stop() and be sure that a task won't be
arriving just as you do so.

>>  
>>  class AlreadyRunningError(Exception):
>> @@ -164,6 +179,19 @@
>>          self._worker_limit = worker_limit
>>          self._worker_count = 0
>>          self._terminationDeferred = None
>> +        self._stopping_lock = None
>> +
>> +    def _stop(self):
>> +        def _release_or_stop(still_stopped):
>> +            if still_stopped and self._worker_count == 0:
>> +                self._terminationDeferred.callback(None)
>> +                # Note that in this case we don't release the lock: we don't
>> +                # want to try to fire the _terminationDeferred twice!
> 
> I don't quite follow this. Why would releasing the lock fire the
> _terminationDeferred again?

I'm not completely sure about this, but if you manage to call _stop()
twice, you could end up here twice.

>> +            else:
>> +                self._stopping_lock.release()
>> +        def _call_stop(ignored):
>> +            return self._task_source.stop().addCallback(_release_or_stop)
> 
> Does this callback have to be added here? Couldn't it also be added directly
> to the _stopping_lock.acquire() callback? I think that adding the callbacks in
> sequence would make the code slightly easier to read.

Yeah, I think you're right.

>> +        return self._stopping_lock.acquire().addCallback(_call_stop)
>>
> 
>>      def consume(self, task_source):
>>          """Start consuming tasks from 'task_source'.
>> @@ -178,9 +206,7 @@
>>              raise AlreadyRunningError(self, self._task_source)
>>          self._task_source = task_source
>>          self._terminationDeferred = defer.Deferred()
>> -        # This merely begins polling. This means that we acquire our initial
>> -        # batch of work at the rate of one task per polling interval. As long
>> -        # as the polling interval is small, this is probably OK.
> 
> Why isn't this comment relevant any more?

Because of the change in taskStarted:

>> +        self._stopping_lock = defer.DeferredLock()
>>          task_source.start(self)
>>          return self._terminationDeferred

This one.

> 
>> === modified file 'lib/canonical/twistedsupport/tests/test_task.py'
>> --- lib/canonical/twistedsupport/tests/test_task.py	2009-07-17 00:26:05 +0000
>> +++ lib/canonical/twistedsupport/tests/test_task.py	2009-08-06 09:08:47 +0000
>> @@ -7,7 +7,7 @@
>>  
>>  import unittest
>>  
>> -from twisted.internet.defer import Deferred
>> +from twisted.internet.defer import Deferred, succeed
>>  from twisted.internet.task import Clock
>>  
>>  from zope.interface import implements
>> @@ -49,14 +49,19 @@
>>  
>>      implements(ITaskSource)
>>  
>> -    def __init__(self, log):
>> +    def __init__(self, log, stop_deferred=None):
>>          self._log = log
>> +        if stop_deferred is None:
>> +            self.stop_deferred = succeed(True)
>> +        else:
>> +            self.stop_deferred = stop_deferred
>>  
>>      def start(self, consumer):
>>          self._log.append(('start', consumer))
>>  
>>      def stop(self):
>>          self._log.append('stop')
>> +        return self.stop_deferred
>>  
>>  
>>  class TestPollingTaskSource(TestCase):
>> @@ -144,6 +149,16 @@
>>          # No more calls were made.
>>          self.assertEqual(0, self._num_task_producer_calls)
>>  
>> +    def test_stop_deferred_fires_immediately_if_no_polling(self):
>> +        # Calling stop when the source is not polling returns a deferred that
>> +        # fires immediately with True.
>> +        task_source = self.makeTaskSource()
>> +        task_source.start(NoopTaskConsumer())
>> +        stop_deferred = task_source.stop()
>> +        stop_calls = []
>> +        stop_deferred.addCallback(stop_calls.append)
>> +        self.assertEqual([True], stop_calls)
>> +
>>      def test_start_multiple_times_polls_immediately(self):
>>          # Starting a task source multiple times polls immediately.
>>          clock = Clock()
>> @@ -241,6 +256,74 @@
>>          clock.advance(interval)
>>          self.assertEqual(len(produced_deferreds), 2)
>>  
>> +    def test_stop_deferred_doesnt_fire_until_polling_finished(self):
>> +        # If there is a call to the task producer outstanding when stop() is
>> +        # called, stop() returns a deferred that fires when the poll finishes.
>> +        # The value fired with is True if the source is still stopped when the
>> +        # deferred fires.
>> +        produced_deferred = Deferred()
>> +        def producer():
>> +            return produced_deferred
>> +        task_source = self.makeTaskSource(task_producer=producer)
>> +        task_source.start(NoopTaskConsumer())
>> +        # The call to start calls producer.  It returns produced_deferred
>> +        # which has not been fired, so stop returns a deferred that has not
>> +        # been fired.
>> +        stop_deferred = task_source.stop()
>> +        stop_called = []
>> +        stop_deferred.addCallback(stop_called.append)
>> +        self.assertEqual([], stop_called)
>> +        # When the task producing deferred fires, the stop deferred fires with
>> +        # 'True' to indicate that the source is still stopped.
>> +        produced_deferred.callback(None)
>> +        self.assertEqual([True], stop_called)
>> +
>> +    def test_stop_deferred_fires_with_false_if_source_restarted(self):
>> +        # If there is a call to the task producer outstanding when stop() is
>> +        # called, stop() returns a deferred that fires when the poll finishes.
>> +        # The value fired with is False if the source is no longer stopped
>> +        # when the deferred fires.
>> +        produced_deferred = Deferred()
>> +        def producer():
>> +            return produced_deferred
>> +        task_source = self.makeTaskSource(task_producer=producer)
>> +        task_source.start(NoopTaskConsumer())
>> +        # The call to start calls producer.  It returns produced_deferred
>> +        # which has not been fired so stop returns a deferred that has not
>> +        # been fired.
>> +        stop_deferred = task_source.stop()
>> +        stop_called = []
>> +        stop_deferred.addCallback(stop_called.append)
>> +        # Now we restart the source.
>> +        task_source.start(NoopTaskConsumer())
>> +        self.assertEqual([], stop_called)
>> +        # When the task producing deferred fires, the stop deferred fires with
>> +        # 'False' to indicate that the source has been restarted.
>> +        produced_deferred.callback(None)
>> +        self.assertEqual([False], stop_called)
>> +
>> +    def test_stop_start_stop_when_polling_doesnt_poll_again(self):
>> +        # XXX
> 
> Docstring here.

Dammit, I thought I'd got all of them.  Added.

>> +        produced_deferreds = []
>> +        def producer():
>> +            d = Deferred()
>> +            produced_deferreds.append(d)
>> +            return d
>> +        task_source = self.makeTaskSource(task_producer=producer)
>> +        # Start the source.  This calls the producer.
>> +        task_source.start(NoopTaskConsumer())
>> +        self.assertEqual(1, len(produced_deferreds))
>> +        task_source.stop()
>> +        # If we start it again, this does not call the producer because
>> +        # the above call is still in process.
>> +        task_source.start(NoopTaskConsumer())
>> +        self.assertEqual(1, len(produced_deferreds))
>> +        # If we now stop the source and the initial poll for a task completes,
>> +        # we don't poll again.
>> +        task_source.stop()
>> +        produced_deferreds[0].callback(None)
>> +        self.assertEqual(1, len(produced_deferreds))
>> +
>>      def test_taskStarted_deferred_doesnt_delay_polling(self):
>>          # If taskStarted returns a deferred, we don't wait for it to fire
>>          # before polling again.
>> @@ -354,7 +437,7 @@
>>          consumer.consume(source)
>>          self.assertRaises(AlreadyRunningError, consumer.consume, source)
>>  
>> -    def test_consume_returns_deferred_doesnt_fire_until_tasks(self):
>> +    def test_consumer_doesnt_finish_until_tasks_finish(self):
>>          # `consume` returns a Deferred that fires when no more tasks are
>>          # running, but only after we've actually done something.
>>          consumer = self.makeConsumer()
>> @@ -363,7 +446,7 @@
>>          d.addCallback(log.append)
>>          self.assertEqual([], log)
>>  
>> -    def test_consume_returns_deferred_fires_when_tasks_done(self):
>> +    def test_consumer_finishes_when_tasks_done(self):
>>          # `consume` returns a Deferred that fires when no more tasks are
>>          # running.
>>          consumer = self.makeConsumer()
>> @@ -373,7 +456,7 @@
>>          consumer.taskStarted(lambda: None)
>>          self.assertEqual([None], task_log)
>>  
>> -    def test_consume_returns_deferred_fires_if_no_tasks_found(self):
>> +    def test_consumer_finishes_if_no_tasks_found(self):
>>          # `consume` returns a Deferred that fires if no tasks are found when
>>          # no tasks are running.
>>          consumer = self.makeConsumer()
>> @@ -383,7 +466,37 @@
>>          consumer.noTasksFound()
>>          self.assertEqual([None], task_log)
> 
> These method renames are nice, thanks.

Without the renaming some of my new tests had names that got over the
80th column all by themselves :)

Cheers,
mwh

more-task-scheduled-bug-408638-progress.diff

Revision history for this message

Jonathan Lange (jml) wrote on 2009-08-06:

Thanks Michael, this looks great.

review: Approve

Preview Diff

[H/L] Next/Prev Comment, [J/K] Next/Prev File, [N/P] Next/Prev Hunk

Subscribers

People subscribed via source and target branches

to all changes:

Barki Mustapha

Celso Providelo

Christian Reis

Christy Awad

Colin Watson

Harpianto,ANDI

James Troup

John A Meinel

Kevin bush

Launchpad code reviewers

Launchpad code reviewers from Canonical

Matthew Tanner

Maximiliano Bertacchini

Michael Hudson-Doyle

Oguz Ersoz

Simon Brakhane

Ubuntu-BR DevOps

William Grant

alhawiti

api.ng

pedro cavazos

todaioan

wenjingwen

to status/vote changes:

Tzaddi

Tzaddi Belding

Launchpad itself

Merge lp:~mwhudson/launchpad/more-task-scheduled-bug-408638 into lp:launchpad

Commit message

Description of the change

Preview Diff

Subscribers

1	=== modified file 'lib/canonical/twistedsupport/task.py'
2	--- lib/canonical/twistedsupport/task.py 2009-08-06 09:08:47 +0000
3	+++ lib/canonical/twistedsupport/task.py 2009-08-06 10:38:24 +0000
4	@@ -106,13 +106,14 @@
5	clock = reactor
6	self._clock = clock
7	self._looping_call = None
8	+ # _polling_lock is used to prevent concurrent attempts to poll for
9	+ # work, and to delay the firing of the deferred returned from stop()
10	+ # until any poll in progress at the moment of the call is complete.
11	self._polling_lock = defer.DeferredLock()
12	- self._started = False
13
14	def start(self, task_consumer):
15	"""See `ITaskSource`."""
16	self.stop()
17	- self._started = True
18	self._looping_call = LoopingCall(self._poll, task_consumer)
19	self._looping_call.clock = self._clock
20	self._looping_call.start(self._interval)
21	@@ -132,7 +133,9 @@
22	# don't let any deferred it returns delay subsequent polls.
23	task_consumer.taskProductionFailed(reason)
24	def poll():
25	- if self._started:
26	+ # If stop() has been called before the lock was acquired, don't
27	+ # actually poll for more work.
28	+ if self._looping_call:
29	d = defer.maybeDeferred(self._task_producer)
30	return d.addCallbacks(got_task, task_failed)
31	return self._polling_lock.run(poll)
32	@@ -142,9 +145,8 @@
33	if self._looping_call is not None:
34	self._looping_call.stop()
35	self._looping_call = None
36	- self._started = False
37	def _return_still_stopped():
38	- return not self._started
39	+ return self._looping_call is None
40	return self._polling_lock.run(_return_still_stopped)
41
42
43	@@ -190,8 +192,11 @@
44	else:
45	self._stopping_lock.release()
46	def _call_stop(ignored):
47	- return self._task_source.stop().addCallback(_release_or_stop)
48	- return self._stopping_lock.acquire().addCallback(_call_stop)
49	+ return self._task_source.stop()
50	+ d = self._stopping_lock.acquire()
51	+ d.addCallback(_call_stop)
52	+ d.addCallback(_release_or_stop)
53	+ return d
54
55	def consume(self, task_source):
56	"""Start consuming tasks from 'task_source'.
57
58	=== modified file 'lib/canonical/twistedsupport/tests/test_task.py'
59	--- lib/canonical/twistedsupport/tests/test_task.py 2009-08-06 09:08:47 +0000
60	+++ lib/canonical/twistedsupport/tests/test_task.py 2009-08-06 10:41:48 +0000
61	@@ -303,7 +303,9 @@
62	self.assertEqual([False], stop_called)
63
64	def test_stop_start_stop_when_polling_doesnt_poll_again(self):
65	- # XXX
66	+ # If, while task acquisition is in progress, stop(), start() and
67	+ # stop() again are called in sequence, we shouldn't try to acquire
68	+ # another job when the first acquisition completes.
69	produced_deferreds = []
70	def producer():
71	d = Deferred()

 === modified file 'lib/canonical/twistedsupport/task.py'
 --- lib/canonical/twistedsupport/task.py	2009-07-17 00:26:05 +0000
 +++ lib/canonical/twistedsupport/task.py	2009-08-06 09:08:47 +0000
@@ -37,6 +37,12 @@
          """Stop generating tasks.
          Any subsequent calls to `stop` are silently ignored.
++
++        :return: A Deferred that will fire when the source is stopped.  It is
++            possible that tasks may be produced until this deferred fires.
++            The deferred will fire with a boolean; True if the source is still
++            stopped, False if the source has been restarted since stop() was
++            called.
          """
@@ -100,10 +106,13 @@
              clock = reactor
          self._clock = clock
          self._looping_call = None
++        self._polling_lock = defer.DeferredLock()
++        self._started = False
      def start(self, task_consumer):
          """See `ITaskSource`."""
          self.stop()
++        self._started = True
          self._looping_call = LoopingCall(self._poll, task_consumer)
          self._looping_call.clock = self._clock
          self._looping_call.start(self._interval)
@@ -122,15 +131,21 @@
              # If task production fails, we inform the consumer of this, but we
              # don't let any deferred it returns delay subsequent polls.
              task_consumer.taskProductionFailed(reason)
--        d = defer.maybeDeferred(self._task_producer)
--        d.addCallbacks(got_task, task_failed)
--        return d
++        def poll():
++            if self._started:
++                d = defer.maybeDeferred(self._task_producer)
++                return d.addCallbacks(got_task, task_failed)
++        return self._polling_lock.run(poll)
      def stop(self):
          """See `ITaskSource`."""
          if self._looping_call is not None:
              self._looping_call.stop()
              self._looping_call = None
++        self._started = False
++        def _return_still_stopped():
++            return not self._started
++        return self._polling_lock.run(_return_still_stopped)
  class AlreadyRunningError(Exception):
@@ -164,6 +179,19 @@
          self._worker_limit = worker_limit
          self._worker_count = 0
          self._terminationDeferred = None
++        self._stopping_lock = None
++
++    def _stop(self):
++        def _release_or_stop(still_stopped):
++            if still_stopped and self._worker_count == 0:
++                self._terminationDeferred.callback(None)
++                # Note that in this case we don't release the lock: we don't
++                # want to try to fire the _terminationDeferred twice!
++            else:
++                self._stopping_lock.release()
++        def _call_stop(ignored):
++            return self._task_source.stop().addCallback(_release_or_stop)
++        return self._stopping_lock.acquire().addCallback(_call_stop)
      def consume(self, task_source):
          """Start consuming tasks from 'task_source'.
@@ -178,9 +206,7 @@
              raise AlreadyRunningError(self, self._task_source)
          self._task_source = task_source
          self._terminationDeferred = defer.Deferred()
--        # This merely begins polling. This means that we acquire our initial
--        # batch of work at the rate of one task per polling interval. As long
--        # as the polling interval is small, this is probably OK.
++        self._stopping_lock = defer.DeferredLock()
          task_source.start(self)
          return self._terminationDeferred
@@ -196,7 +222,9 @@
              raise NotRunningError(self)
          self._worker_count += 1
          if self._worker_count >= self._worker_limit:
--            self._task_source.stop()
++            self._stop()
++        else:
++            self._task_source.start(self)
          d = defer.maybeDeferred(task)
          # We don't expect these tasks to have interesting return values or
          # failure modes.
@@ -213,8 +241,7 @@
          find any jobs, if we actually start any jobs then the exit condition
          in _taskEnded will always be reached before this one.
          """
--        if self._worker_count == 0:
--            self._terminationDeferred.callback(None)
++        self._stop()
      def taskProductionFailed(self, reason):
          """See `ITaskConsumer`.
@@ -236,9 +263,7 @@
          """
          if self._task_source is None:
              raise NotRunningError(self)
--        self._task_source.stop()
--        if self._worker_count == 0:
--            self._terminationDeferred.callback(None)
++        self._stop()
      def _taskEnded(self, ignored):
          """Handle a task reaching completion.
@@ -252,8 +277,7 @@
          """
          self._worker_count -= 1
          if self._worker_count == 0:
--            self._task_source.stop()
--            self._terminationDeferred.callback(None)
++            self._stop()
          elif self._worker_count < self._worker_limit:
              self._task_source.start(self)
          else:
 === modified file 'lib/canonical/twistedsupport/tests/test_task.py'
 --- lib/canonical/twistedsupport/tests/test_task.py	2009-07-17 00:26:05 +0000
 +++ lib/canonical/twistedsupport/tests/test_task.py	2009-08-06 09:08:47 +0000
@@ -7,7 +7,7 @@
  import unittest
--from twisted.internet.defer import Deferred
++from twisted.internet.defer import Deferred, succeed
  from twisted.internet.task import Clock
  from zope.interface import implements
@@ -49,14 +49,19 @@
      implements(ITaskSource)
--    def __init__(self, log):
++    def __init__(self, log, stop_deferred=None):
          self._log = log
++        if stop_deferred is None:
++            self.stop_deferred = succeed(True)
++        else:
++            self.stop_deferred = stop_deferred
      def start(self, consumer):
          self._log.append(('start', consumer))
      def stop(self):
          self._log.append('stop')
++        return self.stop_deferred
  class TestPollingTaskSource(TestCase):
@@ -144,6 +149,16 @@
          # No more calls were made.
          self.assertEqual(0, self._num_task_producer_calls)
++    def test_stop_deferred_fires_immediately_if_no_polling(self):
++        # Calling stop when the source is not polling returns a deferred that
++        # fires immediately with True.
++        task_source = self.makeTaskSource()
++        task_source.start(NoopTaskConsumer())
++        stop_deferred = task_source.stop()
++        stop_calls = []
++        stop_deferred.addCallback(stop_calls.append)
++        self.assertEqual([True], stop_calls)
++
      def test_start_multiple_times_polls_immediately(self):
          # Starting a task source multiple times polls immediately.
          clock = Clock()
@@ -241,6 +256,74 @@
          clock.advance(interval)
          self.assertEqual(len(produced_deferreds), 2)
++    def test_stop_deferred_doesnt_fire_until_polling_finished(self):
++        # If there is a call to the task producer outstanding when stop() is
++        # called, stop() returns a deferred that fires when the poll finishes.
++        # The value fired with is True if the source is still stopped when the
++        # deferred fires.
++        produced_deferred = Deferred()
++        def producer():
++            return produced_deferred
++        task_source = self.makeTaskSource(task_producer=producer)
++        task_source.start(NoopTaskConsumer())
++        # The call to start calls producer.  It returns produced_deferred
++        # which has not been fired, so stop returns a deferred that has not
++        # been fired.
++        stop_deferred = task_source.stop()
++        stop_called = []
++        stop_deferred.addCallback(stop_called.append)
++        self.assertEqual([], stop_called)
++        # When the task producing deferred fires, the stop deferred fires with
++        # 'True' to indicate that the source is still stopped.
++        produced_deferred.callback(None)
++        self.assertEqual([True], stop_called)
++
++    def test_stop_deferred_fires_with_false_if_source_restarted(self):
++        # If there is a call to the task producer outstanding when stop() is
++        # called, stop() returns a deferred that fires when the poll finishes.
++        # The value fired with is False if the source is no longer stopped
++        # when the deferred fires.
++        produced_deferred = Deferred()
++        def producer():
++            return produced_deferred
++        task_source = self.makeTaskSource(task_producer=producer)
++        task_source.start(NoopTaskConsumer())
++        # The call to start calls producer.  It returns produced_deferred
++        # which has not been fired so stop returns a deferred that has not
++        # been fired.
++        stop_deferred = task_source.stop()
++        stop_called = []
++        stop_deferred.addCallback(stop_called.append)
++        # Now we restart the source.
++        task_source.start(NoopTaskConsumer())
++        self.assertEqual([], stop_called)
++        # When the task producing deferred fires, the stop deferred fires with
++        # 'False' to indicate that the source has been restarted.
++        produced_deferred.callback(None)
++        self.assertEqual([False], stop_called)
++
++    def test_stop_start_stop_when_polling_doesnt_poll_again(self):
++        # XXX
++        produced_deferreds = []
++        def producer():
++            d = Deferred()
++            produced_deferreds.append(d)
++            return d
++        task_source = self.makeTaskSource(task_producer=producer)
++        # Start the source.  This calls the producer.
++        task_source.start(NoopTaskConsumer())
++        self.assertEqual(1, len(produced_deferreds))
++        task_source.stop()
++        # If we start it again, this does not call the producer because
++        # the above call is still in process.
++        task_source.start(NoopTaskConsumer())
++        self.assertEqual(1, len(produced_deferreds))
++        # If we now stop the source and the initial poll for a task completes,
++        # we don't poll again.
++        task_source.stop()
++        produced_deferreds[0].callback(None)
++        self.assertEqual(1, len(produced_deferreds))
++
      def test_taskStarted_deferred_doesnt_delay_polling(self):
          # If taskStarted returns a deferred, we don't wait for it to fire
          # before polling again.
@@ -354,7 +437,7 @@
          consumer.consume(source)
          self.assertRaises(AlreadyRunningError, consumer.consume, source)
--    def test_consume_returns_deferred_doesnt_fire_until_tasks(self):
++    def test_consumer_doesnt_finish_until_tasks_finish(self):
          # `consume` returns a Deferred that fires when no more tasks are
          # running, but only after we've actually done something.
          consumer = self.makeConsumer()
@@ -363,7 +446,7 @@
          d.addCallback(log.append)
          self.assertEqual([], log)
--    def test_consume_returns_deferred_fires_when_tasks_done(self):
++    def test_consumer_finishes_when_tasks_done(self):
          # `consume` returns a Deferred that fires when no more tasks are
          # running.
          consumer = self.makeConsumer()
@@ -373,7 +456,7 @@
          consumer.taskStarted(lambda: None)
          self.assertEqual([None], task_log)
--    def test_consume_returns_deferred_fires_if_no_tasks_found(self):
++    def test_consumer_finishes_if_no_tasks_found(self):
          # `consume` returns a Deferred that fires if no tasks are found when
          # no tasks are running.
          consumer = self.makeConsumer()
@@ -383,7 +466,37 @@
          consumer.noTasksFound()
          self.assertEqual([None], task_log)
--    def test_consume_deferred_no_fire_if_no_tasks_found_and_job_running(self):
++    def test_consumer_doesnt_finish_until_stop_deferred_fires(self):
++        # The Deferred returned by `consume` does not fire until the deferred
++        # returned by the source's stop() method fires with True to indicate
++        # that the source is still stopped.
++        consumer = self.makeConsumer()
++        consume_log = []
++        stop_deferred = Deferred()
++        source = LoggingSource([], stop_deferred)
++        d = consumer.consume(source)
++        d.addCallback(consume_log.append)
++        consumer.noTasksFound()
++        self.assertEqual([], consume_log)
++        stop_deferred.callback(True)
++        self.assertEqual([None], consume_log)
++
++    def test_consumer_doesnt_finish_if_stop_doesnt_stop(self):
++        # The Deferred returned by `consume` does not fire when the deferred
++        # returned by the source's stop() method fires with False to indicate
++        # that the source has been restarted.
++        consumer = self.makeConsumer()
++        consume_log = []
++        stop_deferred = Deferred()
++        source = LoggingSource([], stop_deferred)
++        d = consumer.consume(source)
++        d.addCallback(consume_log.append)
++        consumer.noTasksFound()
++        self.assertEqual([], consume_log)
++        stop_deferred.callback(False)
++        self.assertEqual([], consume_log)
++
++    def test_consumer_doesnt_finish_if_no_tasks_found_and_job_running(self):
          # If no tasks are found while a job is running, the Deferred returned
          # by `consume` is not fired.
          consumer = self.makeConsumer()
@@ -402,7 +515,7 @@
          del log[:]
          # Finishes immediately, all tasks are done.
          consumer.taskStarted(lambda: None)
--        self.assertEqual(['stop'], log)
++        self.assertEqual(1, log.count('stop'))
      def test_taskStarted_before_consume_raises_error(self):
          # taskStarted can only be called after we have started consuming. This
@@ -427,6 +540,18 @@
          consumer.taskStarted(lambda: log.append('task'))
          self.assertEqual(['task'], log)
++    def test_taskStarted_restarts_source(self):
++        # If, after the task passed to taskStarted has been started, the
++        # consumer is not yet at its worker_limit, it starts the source again
++        # in order consume as many pending jobs as we can as quickly as we
++        # can.
++        log = []
++        consumer = self.makeConsumer()
++        consumer.consume(LoggingSource(log))
++        del log[:]
++        consumer.taskStarted(self._neverEndingTask)
++        self.assertEqual([('start', consumer)], log)
++
      def test_reaching_working_limit_stops_source(self):
          # Each time taskStarted is called, we start a worker. When we reach
          # the worker limit, we tell the source to stop generating work.
@@ -437,10 +562,10 @@
          consumer.consume(source)
          del log[:]
          consumer.taskStarted(self._neverEndingTask)
--        self.assertEqual([], log)
++        self.assertEqual(0, log.count('stop'))
          for i in range(worker_limit - 1):
              consumer.taskStarted(self._neverEndingTask)
--        self.assertEqual(['stop'], log)
++        self.assertEqual(1, log.count('stop'))
      def test_passing_working_limit_stops_source(self):
          # If we have already reached the worker limit, and taskStarted is