Merge lp:~jtv/storm/profile-fetches into lp:storm

Proposed by Jeroen T. Vermeulen
Status: Needs review
Proposed branch: lp:~jtv/storm/profile-fetches
Merge into: lp:storm
Diff against target: 1109 lines (+903/-7) (has conflicts)
8 files modified
storm/database.py (+1/-0)
storm/fetch_profile.py (+255/-0)
storm/references.py (+12/-4)
storm/store.py (+97/-3)
tests/fetch_context.py (+164/-0)
tests/fetch_profile.py (+64/-0)
tests/fetch_statistics.py (+99/-0)
tests/store/base.py (+211/-0)
Text conflict in tests/store/base.py
To merge this branch: bzr merge lp:~jtv/storm/profile-fetches
Reviewer Review Type Date Requested Status
Storm Developers Pending
Review via email: mp+43323@code.launchpad.net

Description of the change

= Fetch-Profiling =

Profile dependencies between object fetches from the database.

This work was raised on the launchpad-dev mailing list and, at a later stage, discussed with Jamu Kakar, Stuart Bishop, and others.

== The problem ==

Profiling will help map out and optimize data needs. For instance, consider this loop:

    def get_x_for(item):
        return item.other_object.x

    # ...

    for item in query_items():
        total += get_x_for(item)

This will fetch item.other_object individually for each item coming out of query_items(). It's a common anti-pattern in ORM performance, and easy enough to optimize: just outer-join item.other_object inside query_items, so that it will already be in cache when get_x_for needs it.
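
To make the cost concrete, here is a minimal, self-contained sketch (no Storm required) of why this pattern is called "N+1": a toy store counts the queries issued by per-item dereferencing versus a single batched prefetch. The ToyStore class and the data are illustrative, not part of the branch.

```python
class ToyStore:
    """Toy stand-in for an ORM store: counts queries, caches by primary key."""
    def __init__(self, table):
        self.table = table    # id -> row, simulating the database
        self.queries = 0      # number of SELECTs "issued"
        self.cache = {}       # primary-key cache, like Storm's alive-object cache

    def get(self, obj_id):
        if obj_id in self.cache:      # cache hit: no fetch, not profiled
            return self.cache[obj_id]
        self.queries += 1             # one query per uncached object
        self.cache[obj_id] = self.table[obj_id]
        return self.cache[obj_id]

    def prefetch(self, obj_ids):
        missing = [i for i in set(obj_ids) if i not in self.cache]
        if missing:
            self.queries += 1         # one query warms the cache for the batch
            for i in missing:
                self.cache[i] = self.table[i]

items = [("a", 1), ("b", 2), ("c", 1), ("d", 3)]  # (item, other_object id)
table = {1: "x1", 2: "x2", 3: "x3"}

# Anti-pattern: one query per distinct, uncached other_object.
naive = ToyStore(table)
for _, ref in items:
    naive.get(ref)

# Optimized: a single batched query, then every dereference hits cache.
batched = ToyStore(table)
batched.prefetch(ref for _, ref in items)
for _, ref in items:
    batched.get(ref)

print(naive.queries, batched.queries)   # → 3 1
```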

Keeping track of all such dependencies is tedious, brittle, and a source of major abstraction leaks. Conventional ways of dealing with them involve profiling, analysis, matching query patterns to code paths, mapping out data requirements, and identifying downside risks of optimization. The result is a single "point-solution" fix. The effort produces experience as a side effect, but transfer of such experience from one human to others is relatively ineffective.

Then, after that's all done and the code has been optimized, it's difficult to keep track of which optimizations are still relevant. Code is alive, and the more intricate and beautiful an optimization is, the easier it is to break. Testing for such "semantically-neutral" breakage is often difficult, and monitoring the relevance of the tests themselves can be costly.

Refactorings in particular raise questions: will I need to port this optimization to the new structure? Will I still be hitting the right indexes? Could the new structure make the optimization unnecessary or less relevant? Am I prefetching a lot of objects that I don't need? And after I've made all those choices, how can I compare the new code's performance to the old code's performance as it's been tuned over time?

Fetch-profiling brings us closer to solving all those problems, but be patient. For now, it simplifies the mapping of data requirements by eliminating the tracing and the matching of query patterns to code paths. Read on for where we go next.

== Visibility improvements ==

Profiling would expose the problem pattern in the example very clearly. After running through the loop a few times, you'd inspect the store's statistics. These would tell you how many item.other_object instances were loaded from the database (for "item"s returned by query_items), and how that number compares, as a percentage, to the number of "item"s loaded by query_items.

The highest "item" numbers will identify the places most in need of pre-fetching optimizations. Among those, the highest percentages of "item.other_object" loads identify the places where simple prejoins are most likely to be beneficial. Lower percentages may indicate that many of the foreign-key references are null, or that most of the objects they refer to are already covered by other caching, or that most of the objects you might prefetch in the query would be irrelevant.
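
The analysis step can be sketched in a few lines: given per-context counts shaped like the branch's FetchStatistics dicts (original_fetches keyed by fetched class, derived_fetches keyed by an (origin, source, reference) tuple), compute the percentage that flags prejoin candidates. The class and reference names below are illustrative, not taken from the branch.

```python
# Counts as the profiler would accumulate them for one context.
stats_original = {"Item": 1200}                          # original fetches per class
stats_derived = {("Item", "Item", "other_object"): 744}  # (origin, source, reference)

report = {}
for (origin, source, ref), count in stats_derived.items():
    # Percentage of origin loads that went on to trigger this derived fetch.
    report[(source, ref)] = 100.0 * count / stats_original[origin]

print(report)   # → {('Item', 'other_object'): 62.0}
```

A high percentage like this one suggests a simple prejoin; a low one suggests the reference is often null or already cached.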

== Future improvements ==

This is phase 1 of a proposed multi-phase development. For phase 2 I'd like to automate the prefetching so that code like this (using features layered on top of the existing Storm API) will optimize itself, without requiring manual tweaking. After that come policy tuning and automated context definition (see below).

With automated prefetching, the most basic optimizations will no longer be specified in the application. They will work themselves out automatically based on feedback from a real, running production system.

It is at that point where the big problems resolve themselves. After a code change the individual optimizations will re-apply themselves as appropriate. There is no need to track their relevance manually.

== Concepts ==

Cast of characters:
 • A "fetch" is the retrieval and de-marshalling of an object from the database. Reading just a foreign key (e.g. to check for NULL) is not a fetch from the table it refers to; neither is retrieving an object from cache.
 • A "fetch context" is some indicator of what part of the application is executing a query. To the application, this is managed as a "call stack" of strings.
 • An "original fetch" is a free-form query, as performed using Store.find or Store.execute.
 • A "fetch origin" (within a context) is a class participating in an original fetch.
 • A "derived fetch" is the retrieval of objects that are clearly derived (directly or indirectly) from an original fetch through a chain of reference.

In the example loop, query_items() would contain at least an original fetch. The reference to item.other_object inside get_x_for is a derived fetch (derived directly from the original fetch, as it happens). Derived fetches can also be tracked across stores.

There's only room for one fetch context in this example, since derived fetches are associated with the same context as their original fetches. In a web application, the most useful context would probably be the request type, but for detailed optimization you'll want more fine-grained contexts. The typical ideal granularity for automated optimization would be just one original query per context.

Contexts form a hierarchy so as to suit all these use cases, as well as "drilling down" during analysis of data requirements. A context manager helps mark regions of code as being a specific context. Another idea would be a decorator (probably at the application level though, where it's easier to find the right store) and an optional argument to find() that selects a context for just one query.

Original fetches are identified by the fetched class as well as the context. This makes it possible to associate derived fetches with individual classes in a join, and track their dependent fetches separately.
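
The context hierarchy and the context manager can be condensed as follows (adapted from the branch's fetch_profile module, with dict iteration updated for Python 3, and a fake store standing in for a real Storm Store):

```python
class FetchContext:
    """A node in the context hierarchy; the root (name=None) disables profiling."""
    def __init__(self, name, parent=None):
        self.name, self.parent, self.children = name, parent, {}

    @property
    def is_root(self):
        return self.parent is None

    def get_child(self, name):
        # Find a child context of the given name, or create one.
        if name not in self.children:
            self.children[name] = FetchContext(name, parent=self)
        return self.children[name]

class FakeStore:
    """Stand-in for a Storm Store: tracks only the active fetch context."""
    def __init__(self):
        self.fetch_context = FetchContext(None)
    def push_fetch_context(self, name):
        self.fetch_context = self.fetch_context.get_child(name)
    def pop_fetch_context(self):
        self.fetch_context = self.fetch_context.parent

class fetch_context:
    """Context manager marking a region of code as a fetch context."""
    def __init__(self, store, name):
        self.store, self.name = store, name
    def __enter__(self):
        self.store.push_fetch_context(self.name)
    def __exit__(self, *exc):
        self.store.pop_fetch_context()

store = FakeStore()
with fetch_context(store, "process_salaries"):
    with fetch_context(store, "get_employees"):
        inner = store.fetch_context   # the nested context, while active

print(inner.name, inner.parent.name, store.fetch_context.is_root)
# → get_employees process_salaries True
```

A "get_employees" nested under a different parent would be a distinct node with its own statistics, which is what keeps the same code separately profiled in different situations.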

== Implementation notes ==

I'm not planning to map out full dependency chains from fetch origins to derived fetches for now; that would probably become too costly. We'd have to see how useful that information is in practice.

You may note how fetch_context is tracked in Stores, in Results, and in ObjectInfos. The reason for this is that objects may be fetched from a result set long after the store has moved on to a different context. An object fetch should be associated with the context that the result set was produced in, which in turn is the context the store was in at the time.

All interesting analysis and optimization work will be done outside the performance-critical path of query execution. Profiling costs should be minimal, limited to simple dict lookups and counter increments.

Jeroen

Revision history for this message
Robert Collins (lifeless) wrote :

Hi Jeroen, thanks for pointing me at this.

I think this is a very interesting project. I think the stats will be useful for manual optimisation in the short term in Launchpad.

As far as the autotuning goes long term, the JIT-VM style learn-and-improve doesn't interest me for Launchpad: https://dev.launchpad.net/LEP/PersistenceLayer contains my broad plans for addressing systematic performance issues in Launchpad. I think a JIT-VM auto-tuning layer would be a fascinating project, but the warm-up time in many JITs can be substantial, is only ameliorated by loops running hundreds or thousands of times, and even then at best only approaches the efficiency available by writing in a more efficient language. Thus my interest in providing a more efficient DSL than Storm's bound-object approach. I'd love to see Storm become radically simpler in aid of that: faster short circuits in object caching, or optionally no object caching at all. Constrained and parameterised references would be awesome too.

Cheers,
Rob

Revision history for this message
Jeroen T. Vermeulen (jtv) wrote :

I'd be careful about assuming similarity with a JIT VM. Differences in relative overhead aside, JIT compilers don't specialize method calls much. There is actually a JVM that combines JIT with inlining of all code, and from what I hear it yields fantastic results. A lot of the startup overhead would be in the inlining, which doesn't come up per se in fetch profiling.

As an example of specialization: if Launchpad used automatic optimization based on this profiler, a given query method somewhere deep down the call stack would get optimized separately when used from the web service API and when used from the web UI. The two calls have very different needs, and we don't have any decent solution for that at the moment.

If you're willing to be so aggressive in fetching objects as to do it "statically," then you might as well use automatic optimization with a warm-up time of 1: generate optimization advice after a first pass through a stretch of code, then repeat periodically to cover any objects that are also needed but weren't referenced in that first run. To amortize startup cost over more requests, pickle the optimization choices and presto: profile-driven optimizations get reused across restarts.
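
A hypothetical sketch of that "warm-up of 1" idea: after one profiled pass, turn high-ratio derived fetches into prefetch advice and pickle it so the next process start can reuse the choices. The advise function, the threshold, and all names here are illustrative, not part of the branch.

```python
import pickle

def advise(original, derived, threshold=0.5):
    """Recommend prefetching references whose derived-fetch count is a
    large fraction of the origin class's original-fetch count."""
    advice = {}
    for (origin, source, ref), count in derived.items():
        if count >= threshold * original.get(origin, 0) > 0:
            advice.setdefault(origin, []).append(ref)
    return advice

# Counts from one profiled pass (shapes mirror the profiler's dicts).
original = {"Item": 100}
derived = {("Item", "Item", "other_object"): 80,   # hot: prefetch it
           ("Item", "Item", "rare_ref"): 3}        # cold: leave it alone

advice = advise(original, derived)
blob = pickle.dumps(advice)          # persist choices across restarts
print(pickle.loads(blob))            # → {'Item': ['other_object']}
```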

Separately from that, the term "efficient" is treacherous. A static approach is almost guaranteed to be more efficient in terms of computing power, yes, but less efficient when it comes to the human factors: flexibility, legibility, conceptual cleanliness. (Isn't that why we're using python in the first place?) A dynamic approach on the other hand can narrow the gap in computational efficiency without sacrificing any of those other efficiencies. It's also easier to deploy and fine-tune such optimizations across the entire application.

Jeroen

lp:~jtv/storm/profile-fetches updated
419. By Jeroen T. Vermeulen

Don't support derived_from without a known reference.

420. By Jeroen T. Vermeulen

Record origin, source, and reference; ignore cross-store dependencies.

421. By Jeroen T. Vermeulen

Cosmetic.

422. By Jeroen T. Vermeulen

Documentation; made is_root a @property.

Unmerged revisions

422. By Jeroen T. Vermeulen

Documentation; made is_root a @property.

421. By Jeroen T. Vermeulen

Cosmetic.

420. By Jeroen T. Vermeulen

Record origin, source, and reference; ignore cross-store dependencies.

419. By Jeroen T. Vermeulen

Don't support derived_from without a known reference.

418. By Jeroen T. Vermeulen

Aggregate stats by context name; nicer cumulate API.

417. By Jeroen T. Vermeulen

Context iteration.

416. By Jeroen T. Vermeulen

Test add_to_dict separately.

415. By Jeroen T. Vermeulen

Move profiling functions out of Store.

414. By Jeroen T. Vermeulen

Context manager.

413. By Jeroen T. Vermeulen

Track derived fetches across stores.

Preview Diff

1=== modified file 'storm/database.py'
2--- storm/database.py 2010-04-16 07:14:25 +0000
3+++ storm/database.py 2011-06-20 13:09:26 +0000
4@@ -52,6 +52,7 @@
5 def __init__(self, connection, raw_cursor):
6 self._connection = connection # Ensures deallocation order.
7 self._raw_cursor = raw_cursor
8+ self.fetch_origin = None
9 if raw_cursor.arraysize == 1:
10 # Default of 1 is silly.
11 self._raw_cursor.arraysize = 10
12
13=== added file 'storm/fetch_profile.py'
14--- storm/fetch_profile.py 1970-01-01 00:00:00 +0000
15+++ storm/fetch_profile.py 2011-06-20 13:09:26 +0000
16@@ -0,0 +1,255 @@
17+#
18+# Copyright (c) 2011 Canonical
19+#
20+# Written by Jeroen Vermeulen at Canonical.
21+#
22+# This file is part of Storm Object Relational Mapper.
23+#
24+# Storm is free software; you can redistribute it and/or modify
25+# it under the terms of the GNU Lesser General Public License as
26+# published by the Free Software Foundation; either version 2.1 of
27+# the License, or (at your option) any later version.
28+#
29+# Storm is distributed in the hope that it will be useful,
30+# but WITHOUT ANY WARRANTY; without even the implied warranty of
31+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
32+# GNU Lesser General Public License for more details.
33+#
34+# You should have received a copy of the GNU Lesser General Public License
35+# along with this program. If not, see <http://www.gnu.org/licenses/>.
36+#
37+
38+"""Fetch-profiling support.
39+
40+This profiles how objects are pulled from the database into the local cache.
41+Accesses to objects that are already cached are ignored.
42+
43+
44+= Original and Derived Fetches =
45+
46+The profiler distinguishes two ways in which an object can be retrieved from
47+the database: "original fetches" and "derived fetches."
48+
49+An original fetch happens during a free-form query, such as...
50+
51+ employees = store.find(Employee, Employee.name.startswith(name_pattern))
52+
53+A derived fetch happens when the program follows a reference from an object
54+that needs to be retrieved from the database. These are often a problem in
55+ORM performance, because object-oriented programs can easily request large
56+numbers of fetches in small, inefficient queries. For instance, this could
57+query all your company's Department records one by one in separate queries:
58+
59+ for emp in employees:
60+ print(emp.department.name)
61+
62+Here, the fetch that pulls emp.department into the cache is "derived" from
63+the original fetch that retrieved emp itself. A derived fetch can also be
64+derived from another derived fetch, but ultimately the chain leads back to
65+either an original fetch or a new object in memory.
66+
67+When it comes to optimizing your application, one of the things you'll want to
68+look at is reducing derived fetches. That simple loop over employees might be
69+many times faster if you loaded all of the departments you needed into cache
70+in one single query:
71+
72+ # "Pre-fetch" the employees' departments into Storm's cache.
73+ dept_ids = set([emp.department_id for emp in employees])
74+ list(store.find(Department, Department.id.is_in(dept_ids)))
75+ for emp in employees:
76+ # Faster, because emp.department is in cache now!
77+ print(emp.department.name)
78+
79+Sometimes it may even be most efficient to join the derived fetch into your
80+original query:
81+
82+ emps_and_depts = store.find(
83+ (Employee, Department),
84+ Department.id == Employee.department_id,
85+ Employee.name.startswith(name_pattern))
86+
87+ for emp, dept in emps_and_depts:
88+ print(dept.name)
89+
90+You'll recognize opportunities for this optimization when you see the same
91+derived fetch happen many times after the same original query. But it's
92+probably not worth doing this when the derived fetches are infrequent: the
93+code path that does the derived fetch could be rare, or the reference might
94+be None in most cases. Or more likely in this example, the query will return
95+many employees but only very few different departments. Most are probably
96+already in the cache, in which case the profile won't count them.
97+
98+The profile tells you how many objects were fetched at each step along any
99+data access path. So for example the profiler might report that the
100+original employees query pulled 1,233 objects into memory, but the
101+"emp.department" in the loop only fetched 62 departments: that means you can
102+save 61 queries by fetching all departments in one go. Or you might see zero
103+derived fetches because the departments are already in cache, in which case
104+the loop is not worth optimizing. You might even see about the same number
105+of departments fetched as you do employees, in which case your best option may
106+be to join Employee and Department together into one query. Maybe there is
107+too much data to hold in cache, and you need to get really creative.
108+
109+
110+= Fetch Contexts =
111+
112+When you spot a performance problem in the profile, you'll want to know
113+exactly where in your application it occurs. Sometimes the name of the
114+function is enough, but most of the time you'll want to know something more
115+about the context in which that function was called.
116+
117+To help you keep track of this information, the profiler lets you define
118+"fetch contexts," which are profiled separately. An original fetch is
119+counted in its current context, but a derived fetch is counted in the context
120+of the original fetch that it is ultimately derived from.
121+
122+So when you look at the profile for a particular context, you see not only the
123+data it queries (its original fetches) but also the "future" of that data: how
124+will this data be used after my function returns it? What can we start
125+prefetching here to speed up the code that consumes this data? It doesn't
126+matter whether that other code is in the same context or not.
127+
128+You can name contexts whatever you like. Contexts can also nest inside
129+contexts: you can have a context "process_salaries" with a context
130+"get_employees" inside it, and a different context "send_birthday_cards" also
131+with a "get_employees" inside it. Those two nested contexts are different
132+ones, even though they're both called "get_employees," and even though they
133+could actually be the same function.
134+
135+This lets you profile the same code separately in different situations, and
136+you can then decide whether they need the same optimizations or not. For
137+instance, a function in a web application could have different performance
138+characteristics depending on which page is being rendered or even who is
139+viewing it (an administrator might see more data than someone who is not
140+logged in, for example). In that case you could incorporate that information
141+in your context names, so that you'll be able to tell them apart in the
142+profile.
143+"""
144+
145+
146+__all__ = ["FetchContext", "FetchStatistics", "fetch_context"]
147+
148+
149+class fetch_context(object):
150+ """Context manager to mark a region of code as a fetch context."""
151+ def __init__(self, store, context_name):
152+ self.store = store
153+ self.context_name = context_name
154+
155+ def __enter__(self):
156+ """Start context `context_name`for the given `Store`."""
157+ self.store.push_fetch_context(self.context_name)
158+
159+ def __exit__(self, *args, **kwargs):
160+ """Close context."""
161+ self.store.pop_fetch_context()
162+
163+
164+class FetchContext(object):
165+ """A context in which database fetches are recorded for profiling.
166+
167+ Contexts nest in order to support detailed views and aggregation.
168+ However they need not exactly match the program's call stack.
169+
170+ Profiling is disabled in the root context.
171+
172+ :ivar name: The context's name.
173+ :ivar parent: The parent context.
174+ :ivar children: Child contexts, mapped by name.
175+ :ivar stats: `FetchStatistics` for this context.
176+ """
177+ def __init__(self, name, parent=None):
178+ self.name = name
179+ self.parent = parent
180+ self.children = {}
181+ self.stats = FetchStatistics()
182+
183+ def __iter__(self):
184+ for child in self.children.itervalues():
185+ yield child
186+ for grandchild in child:
187+ yield grandchild
188+
189+ @property
190+ def is_root(self):
191+ """Is this the root context?"""
192+ return self.parent is None
193+
194+ def get_child(self, name):
195+ """Find a child context of the given name, or create one."""
196+ child = self.children.get(name)
197+ if child is None:
198+ child = self.children[name] = FetchContext(name, parent=self)
199+ return child
200+
201+ def cumulate_stats(self):
202+ """Add `FetchStatistics` for this context and its children."""
203+ stats = self.stats.copy()
204+ for child in self:
205+ stats.merge(child.stats)
206+ return stats
207+
208+ def aggregate_stats_by_name(self):
209+ """Aggregate `FetchStatistics` for self and children by context name.
210+
211+ Returns a dict mapping context names to aggregated statistics for
212+ all `FetchContext`s of those respective names among self and its
213+ children.
214+ """
215+ names = set(child.name for child in self).union([self.name])
216+ stats = dict(
217+ (name, FetchStatistics())
218+ for name in names if name is not None)
219+ stats[self.name].merge(self.stats)
220+ for child in self:
221+ stats[child.name].merge(child.stats)
222+ return stats
223+
224+
225+def add_number_to_dict(dictionary, key, value=1):
226+ """Add `value` to `dictionary[key]`, defaulting to 0."""
227+ dictionary.setdefault(key, 0)
228+ dictionary[key] += value
229+
230+
231+def add_dict_to_dict(dest, addition):
232+ """Add the values from dict `addition` into dict `dest`."""
233+ for key, value in addition.iteritems():
234+ add_number_to_dict(dest, key, value)
235+
236+
237+class FetchStatistics(object):
238+ """Fetch profiling statistics.
239+
240+ Statistics are recorded for each context, but they can also be
241+ aggregated.
242+
243+ :ivar original_fetches: Maps fetch origin (i.e. a class being fetched)
244+ to a count of the number of original fetches on that class.
245+ :ivar derived_fetches: Maps derived fetches to their respective fetch
246+ counts. A derived fetch is represented as a tuple
247+ (origin, source, Reference).
248+ """
249+ def __init__(self):
250+ self.original_fetches = {}
251+ self.derived_fetches = {}
252+
253+ def copy(self):
254+ new = FetchStatistics()
255+ new.original_fetches = self.original_fetches.copy()
256+ new.derived_fetches = self.derived_fetches.copy()
257+ return new
258+
259+ def record_original_fetch(self, origin):
260+ """Record an original fetch in the statistics."""
261+ add_number_to_dict(self.original_fetches, origin)
262+
263+ def record_derived_fetch(self, origin, source, reference):
264+ """Record a derived fetch in the statistics."""
265+ fetch = (origin, source, reference)
266+ add_number_to_dict(self.derived_fetches, fetch)
267+
268+ def merge(self, other_stats):
269+ """For aggregation purposes: merge `other_stats` into `self`."""
270+ add_dict_to_dict(self.original_fetches, other_stats.original_fetches)
271+ add_dict_to_dict(self.derived_fetches, other_stats.derived_fetches)
272
273=== modified file 'storm/references.py'
274--- storm/references.py 2010-06-01 08:33:33 +0000
275+++ storm/references.py 2011-06-20 13:09:26 +0000
276@@ -1,5 +1,5 @@
277 #
278-# Copyright (c) 2006, 2007 Canonical
279+# Copyright (c) 2006-2011 Canonical
280 #
281 # Written by Gustavo Niemeyer <gustavo@niemeyer.net>
282 #
283@@ -22,7 +22,8 @@
284
285 from storm.exceptions import (
286 ClassInfoError, FeatureError, NoStoreError, WrongStoreError)
287-from storm.store import Store, get_where_for_args, LostObjectError
288+from storm.store import (
289+ Store, get_where_for_args, LostObjectError, record_derived_fetch)
290 from storm.variables import LazyValue
291 from storm.expr import (
292 Select, Column, Exists, ComparableExpr, LeftJoin, Not, SQLRaw,
293@@ -133,7 +134,8 @@
294 def __get__(self, local, cls=None):
295 if local is not None:
296 # Don't use local here, as it might be security proxied.
297- local = get_obj_info(local).get_obj()
298+ local_obj_info = get_obj_info(local)
299+ local = local_obj_info.get_obj()
300
301 if self._cls is None:
302 self._cls = _find_descriptor_class(cls or local.__class__, self)
303@@ -154,11 +156,17 @@
304
305 if self._relation.remote_key_is_primary:
306 remote = store.get(self._relation.remote_cls,
307- self._relation.get_local_variables(local))
308+ self._relation.get_local_variables(local),
309+ derived_from=(local, self))
310 else:
311 where = self._relation.get_where_for_remote(local)
312 result = store.find(self._relation.remote_cls, where)
313+ result.fetch_context = local_obj_info["fetch_context"]
314+ if not result.fetch_context.is_root:
315+ result.fetch_origin = local_obj_info.get("fetch_origin")
316 remote = result.one()
317+ if remote is not None:
318+ record_derived_fetch(self, local_obj_info, get_obj_info(remote))
319
320 if remote is not None:
321 self._relation.link(local, remote)
322
323=== modified file 'storm/store.py'
324--- storm/store.py 2011-05-16 10:45:52 +0000
325+++ storm/store.py 2011-06-20 13:09:26 +0000
326@@ -1,5 +1,5 @@
327 #
328-# Copyright (c) 2006, 2007 Canonical
329+# Copyright (c) 2006-2011 Canonical
330 #
331 # Written by Gustavo Niemeyer <gustavo@niemeyer.net>
332 #
333@@ -37,6 +37,7 @@
334 from storm.exceptions import (
335 WrongStoreError, NotFlushedError, OrderLoopError, UnorderedError,
336 NotOneError, FeatureError, CompileError, LostObjectError, ClassInfoError)
337+from storm.fetch_profile import FetchContext
338 from storm import Undef
339 from storm.cache import Cache
340 from storm.event import EventSystem
341@@ -49,6 +50,48 @@
342 PENDING_REMOVE = 2
343
344
345+def record_unfetched_object(fetch_context, obj_info):
346+ """Record creation of an object not fetched from the database."""
347+ obj_info["fetch_context"] = fetch_context
348+ if not fetch_context.is_root:
349+ obj_info["fetch_origin"] = obj_info.cls_info.cls
350+
351+
352+def record_original_fetch(obj_info, fetch_context, cls):
353+ """Record an original fetch.
354+
355+ The fetch context may or may not be store's currently active context; the
356+ active context may have changed between the point where the query occurs
357+ in the program and the point where it is actually issued to the database.
358+
359+ :param obj_info: `ObjectInfo` for the object being fetched.
360+ :param fetch_context: The `FetchContext` issuing the fetch.
361+ :param cls: The class that's being fetched.
362+ """
363+ obj_info["fetch_context"] = fetch_context
364+ if not fetch_context.is_root:
365+ obj_info["fetch_origin"] = cls
366+ fetch_context.stats.record_original_fetch(cls)
367+
368+
369+def record_derived_fetch(reference, local, remote):
370+ """Record a derived fetch.
371+
372+ :param reference: The `Reference` whose dereference triggers the fetch.
373+ :param local: `ObjectInfo` for the object whose reference to the object is
374+ being followed.
375+ :param remote: `ObjectInfo` for the object that's being fetched.
376+ """
377+ fetch_context = local["fetch_context"]
378+ remote["fetch_context"] = fetch_context
379+ if not fetch_context.is_root:
380+ fetch_origin = local["fetch_origin"]
381+ remote["fetch_origin"] = fetch_origin
382+ fetch_context.stats.record_derived_fetch(fetch_origin,
383+ local.cls_info.cls,
384+ reference)
385+
386+
387 class Store(object):
388 """The Storm Store.
389
390@@ -80,6 +123,7 @@
391 self._cache = cache
392 self._implicit_flush_block_count = 0
393 self._sequence = 0 # Advisory ordering.
394+ self.fetch_context = FetchContext(None)
395
396 def get_database(self):
397 """Return this Store's Database object."""
398@@ -137,13 +181,16 @@
399 self.invalidate()
400 self._connection.rollback()
401
402- def get(self, cls, key):
403+ def get(self, cls, key, derived_from=None):
404 """Get object of type cls with the given primary key from the database.
405
406 If the object is alive the database won't be touched.
407
408 @param cls: Class of the object to be retrieved.
409 @param key: Primary key of object. May be a tuple for composed keys.
410+ @param derived_from: For profiling purposes, an optional tuple of the
411+ object that this fetch is derived from and its reference property
412+ that linked to this object.
413
414 @return: The object found with the given primary key, or None
415 if no object is found.
416@@ -176,10 +223,28 @@
417 default_tables=cls_info.table, limit=1)
418
419 result = self._connection.execute(select)
420+
421+ if derived_from is None:
422+ result.fetch_context = self.fetch_context
423+ else:
424+ origin_obj, origin_ref = derived_from
425+ origin_obj_info = get_obj_info(origin_obj)
426+ result.fetch_context = origin_obj_info["fetch_context"]
427+ if not result.fetch_context.is_root:
428+ result.fetch_origin = origin_obj_info.get("fetch_origin")
429+
430 values = result.get_one()
431 if values is None:
432 return None
433- return self._load_object(cls_info, result, values)
434+
435+ obj = self._load_object(cls_info, result, values)
436+ if derived_from is not None:
437+ obj_info = get_obj_info(obj)
438+ if origin_obj_info["store"] == self:
439+ record_derived_fetch(origin_ref, origin_obj_info, obj_info)
440+ else:
441+ record_original_fetch(obj_info, self.fetch_context, cls)
442+ return obj
443
444 def find(self, cls_spec, *args, **kwargs):
445 """Perform a query.
446@@ -260,6 +325,7 @@
447 obj_info["pending"] = PENDING_ADD
448 self._set_dirty(obj_info)
449 self._enable_lazy_resolving(obj_info)
450+ record_unfetched_object(self.fetch_context, obj_info)
451 obj_info.event.emit("added")
452
453 return obj
454@@ -710,6 +776,10 @@
455 self._set_values(obj_info, cls_info.columns, result, values,
456 replace_unknown_lazy=True)
457
458+ if result.fetch_origin is None:
459+ # This is an original fetch.
460+ record_original_fetch(obj_info, result.fetch_context, cls)
461+
462 self._add_to_alive(obj_info)
463 self._enable_change_notification(obj_info)
464 self._enable_lazy_resolving(obj_info)
465@@ -895,6 +965,23 @@
466 self._set_values(obj_info, autoreload_columns,
467 result, result.get_one())
468
469+ def push_fetch_context(self, context_name):
470+ """Enter a fetch context.
471+
472+ If no fetch context was active previously, this enables
473+ profiling.
474+ """
475+ self.fetch_context = self.fetch_context.get_child(context_name)
476+
477+ def pop_fetch_context(self):
478+ """Leave the current fetch context.
479+
480+ If the current context was the outermost one, this disables
481+ profiling.
482+ """
483+ assert not self.fetch_context.is_root, "Popped root fetch context."
484+ self.fetch_context = self.fetch_context.parent
485+
486
487 class ResultSet(object):
488 """The representation of the results of a query.
489@@ -920,6 +1007,8 @@
490 self._distinct = False
491 self._group_by = Undef
492 self._having = Undef
493+ self.fetch_context = store.fetch_context
494+ self.fetch_origin = None
495
496 def copy(self):
497 """Return a copy of this ResultSet object, with the same configuration.
498@@ -976,6 +1065,7 @@
499 """Iterate the results of the query.
500 """
501 result = self._store._connection.execute(self._get_select())
502+ result.fetch_context = self.fetch_context
503 for values in result:
504 yield self._load_objects(result, values)
505
506@@ -1068,6 +1158,7 @@
507 select.limit = 1
508 select.order_by = Undef
509 result = self._store._connection.execute(select)
510+ result.fetch_context = self.fetch_context
511 values = result.get_one()
512 if values:
513 return self._load_objects(result, values)
514@@ -1081,6 +1172,7 @@
515 select = self._get_select()
516 select.limit = 1
517 result = self._store._connection.execute(select)
518+ result.fetch_context = self.fetch_context
519 values = result.get_one()
520 if values:
521 return self._load_objects(result, values)
522@@ -1122,6 +1214,7 @@
523 else:
524 select.order_by.append(Desc(expr))
525 result = self._store._connection.execute(select)
526+ result.fetch_context = self.fetch_context
527 values = result.get_one()
528 if values:
529 return self._load_objects(result, values)
530@@ -1140,6 +1233,7 @@
531 if select.limit is not Undef and select.limit > 2:
532 select.limit = 2
533 result = self._store._connection.execute(select)
534+ result.fetch_context = self.fetch_context
535 values = result.get_one()
536 if result.get_one():
537 raise NotOneError("one() used with more than one result available")
538
539=== added file 'tests/fetch_context.py'
540--- tests/fetch_context.py 1970-01-01 00:00:00 +0000
541+++ tests/fetch_context.py 2011-06-20 13:09:26 +0000
542@@ -0,0 +1,164 @@
543+# -*- coding: utf-8 -*-
544+
545+from storm.fetch_profile import FetchContext, FetchStatistics, fetch_context
546+
547+from tests.helper import TestHelper
548+
549+
550+class FakeStats(object):
551+ def __init__(self, contents=None):
552+ if contents is None:
553+ self.contents = set()
554+ else:
555+ self.contents = set(contents)
556+
557+ def merge(self, other_stats):
558+ self.contents = self.contents.union(other_stats.contents)
559+
560+
561+class FakeStore(object):
562+ def __init__(self):
563+ self.fetch_context = FetchContext(None)
564+
565+ def push_fetch_context(self, name):
566+ self.fetch_context = FetchContext(name, parent=self.fetch_context)
567+
568+ def pop_fetch_context(self):
569+ self.fetch_context = self.fetch_context.parent
570+
571+
572+class FetchContextTest(TestHelper):
573+ def get_relatives(self, context):
574+ """Return a tuple of `context`'s parent and children."""
575+ return (context.parent, context.children)
576+
577+ def test_initially_childless(self):
578+ self.assertEqual({}, FetchContext("context").children)
579+
580+ def test_iter_childless_context_yields_nothing(self):
581+ self.assertEqual([], list(FetchContext("context")))
582+
583+ def test_iter_does_not_yield_parent(self):
584+ parent = FetchContext("parent")
585+ child = parent.get_child("child")
586+ self.assertEqual([], list(child))
587+
588+ def test_iter_context_with_children_yields_children(self):
589+ root = FetchContext("root")
590+ one = root.get_child("one")
591+ two = root.get_child("two")
592+ self.assertEqual(set([one, two]), set(root))
593+
594+ def test_iter_context_includes_grandchildren(self):
595+ root = FetchContext("root")
596+ child = root.get_child("child")
597+ grandchild = child.get_child("grandchild")
598+ self.assertEqual(set([child, grandchild]), set(root))
599+
600+ def test_iter_context_includes_grand_grandchildren(self):
601+ root = FetchContext("root")
602+ child = root.get_child("child")
603+ grandchild = child.get_child("grandchild")
604+ grand_grandchild = grandchild.get_child("grand-grandchild")
605+ self.assertEqual(set([child, grandchild, grand_grandchild]), set(root))
606+
607+ def test_is_root_for_root(self):
608+ self.assertTrue(FetchContext("root").is_root())
609+
610+ def test_is_root_for_child(self):
611+ root = FetchContext("root")
612+ self.assertFalse(root.get_child("child").is_root())
613+
614+ def test_get_child_creates_first_child(self):
615+ parent = FetchContext("parent")
616+ child = parent.get_child("child")
617+
618+ self.assertEqual((None, {"child": child}), self.get_relatives(parent))
619+ self.assertEqual((parent, {}), self.get_relatives(child))
620+
621+ def test_get_child_adds_child(self):
622+ parent = FetchContext("parent")
623+ eldest = parent.get_child("eldest")
624+ youngest = parent.get_child("youngest")
625+
626+ children = {
627+ "eldest": eldest,
628+ "youngest": youngest,
629+ }
630+ self.assertEqual((None, children), self.get_relatives(parent))
631+ self.assertEqual((parent, {}), self.get_relatives(eldest))
632+ self.assertEqual((parent, {}), self.get_relatives(youngest))
633+ self.assertNotEqual(eldest, youngest)
634+
635+ def test_get_child_finds_child(self):
636+ parent = FetchContext("parent")
637+ child = parent.get_child("child")
638+ self.assertEqual(child, parent.get_child("child"))
639+
640+ def test_cumulate_stats_on_empty_context_yields_empty(self):
641+ stats = FetchContext("context").cumulate_stats()
642+ self.assertEqual({}, stats.original_fetches)
643+ self.assertEqual({}, stats.derived_fetches)
644+
645+ def test_cumulate_stats_includes_local_stats(self):
646+ context = FetchContext("context")
647+ context.stats.original_fetches = {("origin", "reference", "store"): 1}
648+ self.assertEqual(context.stats.original_fetches,
649+ context.cumulate_stats().original_fetches)
650+
651+ def test_cumulate_stats_includes_child_stats(self):
652+ parent = FetchContext("parent")
653+ child = parent.get_child("child")
654+ child.stats.original_fetches = {("origin", "reference", "store"): 1}
655+ self.assertEqual(child.stats.original_fetches,
656+ parent.cumulate_stats().original_fetches)
657+
658+ def test_aggregate_stats_by_name_includes_local_stats(self):
659+ context = FetchContext("context")
660+ fetch = ("origin", "reference", "store")
661+ context.stats.original_fetches[fetch] = 1
662+ stats = context.aggregate_stats_by_name()
663+ self.assertEqual(["context"], stats.keys())
664+ self.assertEqual({fetch: 1}, stats["context"].original_fetches)
665+
666+ def test_aggregate_stats_by_name_includes_child_stats(self):
667+ parent = FetchContext("parent")
668+ child = parent.get_child("child")
669+ fetch = ("origin", "reference", "store")
670+ child.stats.original_fetches[fetch] = 1
671+ stats = parent.aggregate_stats_by_name()
672+ self.assertEqual({fetch: 1}, stats["child"].original_fetches)
673+
674+ def test_aggregate_stats_by_name_aggregates(self):
675+ root = FetchContext("x")
676+ fetch = ("origin", "reference", "store")
677+ root.stats.original_fetches[fetch] = 1
678+ root.get_child("x").stats.original_fetches[fetch] = 1
679+ stats = root.aggregate_stats_by_name()
680+ self.assertEqual(["x"], stats.keys())
681+ self.assertEqual({fetch: 2}, stats["x"].original_fetches)
682+
683+ def test_context_manager_pushes_context(self):
684+ store = FakeStore()
685+ with fetch_context(store, "with"):
686+ current_context = store.fetch_context.name
687+ self.assertEqual("with", current_context)
688+
689+ def test_context_manager_pops_context_on_normal_exit(self):
690+ store = FakeStore()
691+ with fetch_context(store, "with"):
692+ pass
693+ self.assertTrue(store.fetch_context.is_root())
694+
695+ def test_context_manager_pops_context_on_exception(self):
696+ class ArbitraryException(Exception):
697+ pass
698+
699+ store = FakeStore()
700+ try:
701+ with fetch_context(store, "with"):
702+ raise ArbitraryException()
703+ except ArbitraryException:
704+ pass
705+
706+ self.assertTrue(store.fetch_context.is_root())
707
708=== added file 'tests/fetch_profile.py'
709--- tests/fetch_profile.py 1970-01-01 00:00:00 +0000
710+++ tests/fetch_profile.py 2011-06-20 13:09:26 +0000
711@@ -0,0 +1,64 @@
712+# -*- coding: utf-8 -*-
713+
714+from storm.store import Store, record_original_fetch, record_derived_fetch
715+
716+from tests.helper import TestHelper
717+
718+
719+class DummyDatabase(object):
720+
721+ def connect(self, event=None):
722+ return None
723+
724+
725+class FetchProfilingTest(TestHelper):
726+
727+ def test_initial_context_is_root(self):
728+ store = Store(DummyDatabase())
729+ self.assertTrue(store.fetch_context.is_root())
730+
731+ def test_push_fetch_context(self):
732+ store = Store(DummyDatabase())
733+ store.push_fetch_context("context")
734+ self.assertFalse(store.fetch_context.is_root())
735+
736+ def test_pop_fetch_context(self):
737+ store = Store(DummyDatabase())
738+ store.push_fetch_context("context")
739+ store.pop_fetch_context()
740+ self.assertTrue(store.fetch_context.is_root())
741+
742+ def test_record_original_fetch(self):
743+ store = Store(DummyDatabase())
744+ store.push_fetch_context("context")
745+ fake_object = {"store": store}
746+ record_original_fetch(fake_object, store.fetch_context, "class")
747+ self.assertEqual({"class": 1},
748+ store.fetch_context.stats.original_fetches)
749+ self.assertEqual(store.fetch_context, fake_object["fetch_context"])
750+
751+ def test_record_derived_fetch(self):
752+ class FakeObjInfo(dict):
753+ pass
754+ class FakeClsInfo(object):
755+ def __init__(self, cls):
756+ self.cls = cls
757+
758+ store = Store(DummyDatabase())
759+ store.push_fetch_context("context")
760+ fake_local_object = FakeObjInfo(store=store,
761+ fetch_context=store.fetch_context,
762+ fetch_origin="origin")
763+ fake_local_object.cls_info = FakeClsInfo("source")
764+ fake_remote_object = FakeObjInfo(store=store)
765+ record_derived_fetch("reference", fake_local_object, fake_remote_object)
766+
767+ self.assertEqual({("origin", "source", "reference"): 1},
768+ store.fetch_context.stats.derived_fetches)
769+ self.assertEqual("origin", fake_remote_object["fetch_origin"])
770+
771+ def test_root_context_does_not_profile(self):
772+ store = Store(DummyDatabase())
773+ fake_object = {"store": store}
774+ record_original_fetch(fake_object, store.fetch_context, "class")
775+ self.assertEqual({}, store.fetch_context.stats.original_fetches)
776
777=== added file 'tests/fetch_statistics.py'
778--- tests/fetch_statistics.py 1970-01-01 00:00:00 +0000
779+++ tests/fetch_statistics.py 2011-06-20 13:09:26 +0000
780@@ -0,0 +1,99 @@
781+# -*- coding: utf-8 -*-
782+
783+from storm.fetch_profile import add_to_dict, FetchStatistics
784+
785+from tests.helper import TestHelper
786+
787+
788+class AddToDictTest(TestHelper):
789+ def test_creates_entry(self):
790+ data = {}
791+ add_to_dict(data, "x", 1)
792+ self.assertEqual({"x": 1}, data)
793+
794+ def test_adds_to_entry(self):
795+ data = {"x": 1}
796+ add_to_dict(data, "x", 1)
797+ self.assertEqual({"x": 2}, data)
798+
799+
800+class FetchStatisticsTest(TestHelper):
801+ def test_initially_empty(self):
802+ empty = FetchStatistics()
803+ self.assertEqual({}, empty.original_fetches)
804+ self.assertEqual({}, empty.derived_fetches)
805+
806+ def test_record_original_fetch(self):
807+ stats = FetchStatistics()
808+ stats.record_original_fetch("origin")
809+ self.assertEqual({"origin": 1}, stats.original_fetches)
810+
811+ def test_record_derived_fetch(self):
812+ stats = FetchStatistics()
813+ fetch = ("origin", "source", "reference")
814+ stats.record_derived_fetch(*fetch)
815+ self.assertEqual({fetch: 1}, stats.derived_fetches)
816+
817+ def test_copy(self):
818+ stats = FetchStatistics()
819+ stats.record_original_fetch("origin")
820+ stats.record_derived_fetch("origin", "source", "reference")
821+ copy = stats.copy()
822+ self.assertEqual(stats.original_fetches, copy.original_fetches)
823+ self.assertEqual(stats.derived_fetches, copy.derived_fetches)
824+ self.assertNotEqual(stats, copy)
825+
826+ def test_merge_empty_does_nothing(self):
827+ stats = FetchStatistics()
828+ stats.original_fetches = {"origin": 1}
829+ derived_fetch = ("origin", "source", "reference")
830+ stats.derived_fetches = {derived_fetch: 1}
831+ empty = FetchStatistics()
832+ stats.merge(empty)
833+ self.assertEqual({"origin": 1}, stats.original_fetches)
834+ self.assertEqual({derived_fetch: 1}, stats.derived_fetches)
835+
836+ def test_merge_adds_counts(self):
837+ stats = FetchStatistics()
838+ other_stats = FetchStatistics()
839+ other_stats.original_fetches = {"other_origin": 1}
840+ derived_fetch = ("other_origin", "other_source", "reference")
841+ other_stats.derived_fetches = {derived_fetch: 1}
842+ stats.merge(other_stats)
843+ self.assertEqual(other_stats.original_fetches, stats.original_fetches)
844+ self.assertEqual(other_stats.derived_fetches, stats.derived_fetches)
845+
846+ def test_merge_leaves_existing_counts_in_place(self):
847+ stats = FetchStatistics()
848+ stats.original_fetches = {"origin": 1}
849+ derived_fetch = ("origin", "source", "reference")
850+ stats.derived_fetches = {derived_fetch: 1}
851+ other_stats = FetchStatistics()
852+ other_stats.original_fetches = {"other_origin": 1}
853+ other_derived_fetch = ("other_origin", "other_source", "reference")
854+ other_stats.derived_fetches = {other_derived_fetch: 1}
855+ stats.merge(other_stats)
856+
857+ cumulative_original_fetches = {
858+ "origin": 1,
859+ "other_origin": 1,
860+ }
861+ cumulative_derived_fetches = {
862+ derived_fetch: 1,
863+ other_derived_fetch: 1,
864+ }
865+ self.assertEqual(cumulative_original_fetches, stats.original_fetches)
866+ self.assertEqual(cumulative_derived_fetches, stats.derived_fetches)
867+
868+ def test_merge_sums_counts(self):
869+ stats = FetchStatistics()
870+ stats.original_fetches = {"origin": 1}
871+ derived_fetch = ("origin", "source", "reference")
872+ stats.derived_fetches = {derived_fetch: 1}
873+ other_stats = FetchStatistics()
874+ other_stats.original_fetches = {"origin": 1}
875+ other_stats.derived_fetches = {derived_fetch: 1}
876+ stats.merge(other_stats)
877+
878+ self.assertEqual({"origin": 2}, stats.original_fetches)
879+ self.assertEqual({derived_fetch: 2}, stats.derived_fetches)
880
881=== modified file 'tests/store/base.py'
882--- tests/store/base.py 2011-02-14 12:17:54 +0000
883+++ tests/store/base.py 2011-06-20 13:09:26 +0000
884@@ -29,8 +29,13 @@
885
886 from storm.references import Reference, ReferenceSet, Proxy
887 from storm.database import Result
888+<<<<<<< TREE
889 from storm.properties import (
890 Int, Float, RawStr, Unicode, Property, Pickle, UUID)
891+=======
892+from storm.fetch_profile import fetch_context
893+from storm.properties import Int, Float, RawStr, Unicode, Property, Pickle
894+>>>>>>> MERGE-SOURCE
895 from storm.properties import PropertyPublisherMeta, Decimal
896 from storm.variables import PickleVariable
897 from storm.expr import (
898@@ -6004,6 +6009,212 @@
899 result_to_remove = self.store.find(Foo, Foo.id <= 30)
900 self.assertEquals(result_to_remove.remove(), 3)
901
902+ def test_push_fetch_context(self):
903+ root = self.store.fetch_context
904+ self.store.push_fetch_context("child")
905+ self.assertEqual(root, self.store.fetch_context.parent)
906+
907+ def test_pop_fetch_context(self):
908+ root = self.store.fetch_context
909+ self.store.push_fetch_context("child")
910+ self.store.pop_fetch_context()
911+ self.assertEqual(root, self.store.fetch_context)
912+
913+ def test_fetch_context_manager(self):
914+ with fetch_context(self.store, "with-context"):
915+ context_name = self.store.fetch_context.name
916+ self.assertEqual("with-context", context_name)
917+
918+ def test_profile_find(self):
919+ self.store.push_fetch_context("test")
920+ obj = self.store.find(Foo).any()
921+ stats = self.store.fetch_context.stats
922+ self.assertEqual({Foo: 1}, stats.original_fetches)
923+ self.assertEqual({}, stats.derived_fetches)
924+
925+ def test_profile_get(self):
926+ self.store.push_fetch_context("test")
927+ obj = self.store.get(Foo, 10)
928+ stats = self.store.fetch_context.stats
929+ self.assertEqual({Foo: 1}, stats.original_fetches)
930+ self.assertEqual({}, stats.derived_fetches)
931+
932+ def test_profile_get_derived_from(self):
933+ self.store.push_fetch_context("test")
934+ bar = self.store.get(Bar, 100)
935+ foo = self.store.get(Foo, bar.foo_id, derived_from=(bar, Bar.foo))
936+ stats = self.store.fetch_context.stats
937+ self.assertEqual({Bar: 1}, stats.original_fetches)
938+ fetch = (Bar, Bar, Bar.foo)
939+ self.assertEqual({fetch: 1}, stats.derived_fetches)
940+
941+ def test_profile_dereference(self):
942+ self.store.push_fetch_context("test")
943+ bar = self.store.get(Bar, 100)
944+ foo = bar.foo
945+ stats = self.store.fetch_context.stats
946+ fetch = (Bar, Bar, Bar.foo)
947+ self.assertEqual({fetch: 1}, stats.derived_fetches)
948+
949+ def test_profile_indirect_derived_fetch_records_origin_and_source(self):
950+ self.store.execute("""
951+ CREATE TEMPORARY TABLE splat (id integer, bar_id integer)
952+ """)
953+ self.store.execute("INSERT INTO splat (id, bar_id) VALUES (1, 100)")
954+
955+ class Splat(object):
956+ __storm_table__ = "splat"
957+ id = Int(primary=True)
958+ bar_id = Int()
959+ bar = Reference(bar_id, Bar.id)
960+
961+ with fetch_context(self.store, "test"):
962+ splat = self.store.get(Splat, 1)
963+ context = self.store.fetch_context
964+
965+ foo = splat.bar.foo
966+
967+ expected_fetches = {
968+ (Splat, Splat, Splat.bar): 1,
969+ (Splat, Bar, Bar.foo): 1,
970+ }
971+ self.assertEqual(expected_fetches, context.stats.derived_fetches)
972+
973+ def test_profile_derived_get_records_origin_and_source(self):
974+ self.store.execute("""
975+ CREATE TEMPORARY TABLE splat (id integer, bar_id integer)
976+ """)
977+ self.store.execute("INSERT INTO splat (id, bar_id) VALUES (1, 100)")
978+
979+ class Splat(object):
980+ __storm_table__ = "splat"
981+ id = Int(primary=True)
982+ bar_id = Int()
983+ bar = Reference(bar_id, Bar.id)
984+
985+ with fetch_context(self.store, "test"):
986+ splat = self.store.get(Splat, 1)
987+ context = self.store.fetch_context
988+
989+ bar = splat.bar
990+ self.store.get(Foo, bar.foo_id, derived_from=(bar, Bar.foo))
991+
992+ expected_fetches = {
993+ (Splat, Splat, Splat.bar): 1,
994+ (Splat, Bar, Bar.foo): 1,
995+ }
996+ self.assertEqual(expected_fetches, context.stats.derived_fetches)
997+
998+
999+ def test_profile_new_object_is_origin_but_not_fetched(self):
1000+ self.store.push_fetch_context("test")
1001+ bar = Bar()
1002+ bar.id = 999
1003+ bar.foo_id = 10
1004+ self.store.add(bar)
1005+ foo = bar.foo
1006+ stats = self.store.fetch_context.stats
1007+ self.assertEqual({}, stats.original_fetches)
1008+ self.assertEqual({(Bar, Bar, Bar.foo): 1}, stats.derived_fetches)
1009+
1010+ def test_profile_cached_objects_not_fetched(self):
1011+ foo = self.store.get(Foo, 10)
1012+ bar = self.store.get(Bar, 100)
1013+ self.store.push_fetch_context("test")
1014+ same_foo = bar.foo
1015+ self.assertEqual({}, self.store.fetch_context.stats.original_fetches)
1016+ self.assertEqual({}, self.store.fetch_context.stats.derived_fetches)
1017+
1018+ def test_profile_derived_fetch_uses_original_context(self):
1019+ with fetch_context(self.store, "original-fetch-context"):
1020+ original_context = self.store.fetch_context
1021+ bar = self.store.get(Bar, 100)
1022+ with fetch_context(self.store, "later-fetch-context"):
1023+ later_context = self.store.fetch_context
1024+ foo = bar.foo
1025+ self.assertEqual({}, later_context.stats.derived_fetches)
1026+ self.assertEqual({(Bar, Bar, Bar.foo): 1},
1027+ original_context.stats.derived_fetches)
1028+
1029+ def test_profile_result_uses_original_context(self):
1030+ with fetch_context(self.store, "original-fetch-context"):
1031+ original_context = self.store.fetch_context
1032+ bar_result = self.store.find(Foo, Foo.id == 10)
1033+ with fetch_context(self.store, "later-fetch-context"):
1034+ later_context = self.store.fetch_context
1035+ bar = bar_result.one()
1036+ self.assertEqual({}, later_context.stats.original_fetches)
1037+ self.assertEqual({Foo: 1}, original_context.stats.original_fetches)
1038+
1039+ def test_profile_result_find_uses_original_context(self):
1040+ with fetch_context(self.store, "original-fetch-context"):
1041+ original_context = self.store.fetch_context
1042+ original_result = self.store.find(Foo, Foo.id == 10)
1043+ with fetch_context(self.store, "later-fetch-context"):
1044+ later_context = self.store.fetch_context
1045+ original_result.find(True).one()
1046+ self.assertEqual({}, later_context.stats.original_fetches)
1047+ self.assertEqual({Foo: 1}, original_context.stats.original_fetches)
1048+
1049+ def test_profile_contexts_persist(self):
1050+ with fetch_context(self.store, "context"):
1051+ foo = self.store.get(Foo, 10)
1052+ context = self.store.fetch_context
1053+ with fetch_context(self.store, "context"):
1054+ bar = self.store.get(Foo, 20)
1055+ self.assertEqual({Foo: 2}, context.stats.original_fetches)
1056+
1057+ def test_profile_does_not_count_empty_result(self):
1058+ self.store.push_fetch_context("context")
1059+ self.store.find(Foo, False).any()
1060+ self.assertEqual({}, self.store.fetch_context.stats.original_fetches)
1061+
1062+ def test_profile_counts_objects_fetched(self):
1063+ self.store.push_fetch_context("context")
1064+ list(self.store.find(Foo, Foo.id.is_in([10, 20])))
1065+ self.assertEqual({Foo: 2},
1066+ self.store.fetch_context.stats.original_fetches)
1067+
1068+ def test_profile_counts_all_objects_in_join(self):
1069+ self.store.push_fetch_context("context")
1070+ list(self.store.find((Foo, Bar), Foo.id == Bar.foo_id, Foo.id == 10))
1071+ expected_fetches = {
1072+ Foo: 1,
1073+ Bar: 1,
1074+ }
1075+ self.assertEqual(expected_fetches,
1076+ self.store.fetch_context.stats.original_fetches)
1077+
1078+ def test_profile_tracks_origin_within_join(self):
1079+ self.store.execute("UPDATE %s SET selfref_id = %d WHERE id = %d" % (
1080+ SelfRef.__storm_table__, 25, 15))
1081+ self.store.push_fetch_context("context")
1082+ query = self.store.find((Bar, SelfRef),
1083+ Bar.id == 100,
1084+ SelfRef.id == 15)
1085+ (bar, selfref) = query.one()
1086+ foo = bar.foo
1087+ expected_derived_fetches = {
1088+ (Bar, Bar, Bar.foo): 1,
1089+ }
1090+ self.assertEqual(expected_derived_fetches,
1091+ self.store.fetch_context.stats.derived_fetches)
1092+ other_selfref = selfref.selfref
1093+ expected_derived_fetches[(SelfRef, SelfRef, SelfRef.selfref)] = 1
1094+ self.assertEqual(expected_derived_fetches,
1095+ self.store.fetch_context.stats.derived_fetches)
1096+
1097+ def test_profile_derived_fetch_on_different_store_is_original_fetch(self):
1098+ self.store.push_fetch_context("context")
1099+ bar = self.store.get(Bar, 100)
1100+ other_store = self.create_store()
1101+ other_store.push_fetch_context("remote-context")
1102+ other_store.get(Foo, bar.foo_id, derived_from=(bar, Bar.foo))
1103+ self.assertEqual({}, self.store.fetch_context.stats.derived_fetches)
1104+ self.assertEqual({Foo: 1},
1105+ other_store.fetch_context.stats.original_fetches)
1106+ self.assertEqual({}, other_store.fetch_context.stats.derived_fetches)
1107+
1108
1109 class EmptyResultSetTest(object):
1110
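For reviewers: the counting and merge semantics that tests/fetch_statistics.py exercises can be summarized in a small standalone sketch. This mirrors the `add_to_dict` and `FetchStatistics` names from the diff's storm/fetch_profile.py, but it is an illustration of the expected behaviour, not the branch's actual implementation.

```python
def add_to_dict(data, key, count):
    """Add count to data[key], creating the entry if it does not exist."""
    data[key] = data.get(key, 0) + count


class FetchStatistics(object):
    """Per-context fetch counters, as the tests above expect them to behave."""

    def __init__(self):
        # origin class -> number of objects fetched directly from a query.
        self.original_fetches = {}
        # (origin, source, reference) -> number of dependent fetches.
        self.derived_fetches = {}

    def record_original_fetch(self, origin):
        add_to_dict(self.original_fetches, origin, 1)

    def record_derived_fetch(self, origin, source, reference):
        add_to_dict(self.derived_fetches, (origin, source, reference), 1)

    def merge(self, other):
        # Counts are summed key by key; existing entries stay in place,
        # matching test_merge_sums_counts and
        # test_merge_leaves_existing_counts_in_place.
        for key, count in other.original_fetches.items():
            add_to_dict(self.original_fetches, key, count)
        for key, count in other.derived_fetches.items():
            add_to_dict(self.derived_fetches, key, count)


stats = FetchStatistics()
stats.record_original_fetch("origin")
other = FetchStatistics()
other.record_original_fetch("origin")
other.record_derived_fetch("origin", "source", "reference")
stats.merge(other)
print(stats.original_fetches)  # {'origin': 2}
```

Cumulating a context tree's statistics then reduces to calling `merge` on a copy of each child's stats, which is why duplicate context names can be aggregated by name.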
