`
huangro
  • 浏览: 336986 次
  • 性别: Icon_minigender_1
  • 来自: 杭州
社区版块
存档分类
最新评论

Nevow-traversal (转)

阅读更多
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332
Nevow Object Traversal
======================

*Object traversal* is the process Nevow uses to determine what object to use to
render HTML for a particular URL. When an HTTP request comes in to the web
server, the object publisher splits the URL into segments, and repeatedly calls
methods which consume path segments and return objects which represent that
path, until all segments have been consumed. At the core, the Nevow traversal
API is very simple. However, it provides some higher level functionality layered
on top of this to satisfy common use cases.

* `Object Traversal Basics`_
* `locateChild in depth`_
* `childFactory method`_
* `child_* methods and attributes`_
* `Dots in child names`_
* `children dictionary`_
* `The default trailing slash handler`_
* `ICurrentSegments and IRemainingSegments`_

Object Traversal Basics
-----------------------

The *root resource* is the top-level object in the URL space; it conceptually
represents the URI "/". The Nevow *object traversal* and *object publishing*
machinery uses only two methods to locate an object suitable for publishing and
to generate the HTML from it; these methods are described in the interface
``nevow.inevow.IResource``::


  class IResource(compy.Interface):
      def locateChild(self, ctx, segments):
          """Locate another object which can be adapted to IResource
          Return a tuple of resource, path segments
          """

      def renderHTTP(self, ctx):
          """Render a request
          """

``renderHTTP`` can be as simple as a method which simply returns a string of HTML.
Let's examine what happens when object traversal occurs over a very simple root
resource::

  from zope.interface import implements

  class SimpleRoot(object):
      implements(inevow.IResource)

      def locateChild(self, ctx, segments):
          return self, ()

      def renderHTTP(self, ctx):
          return "Hello, world!"

This resource, when passed as the root resource to ``appserver.NevowSite`` or
``wsgi.createWSGIApplication``, will immediately return itself, consuming all path
segments. This means that for every URI a user visits on a web server which is
serving this root resource, the text "Hello, world!" will be rendered. Let's
examine the value of ``segments`` for various values of URI:

/foo/bar
  ('foo', 'bar')

/
  ('', )

/foo/bar/baz.html
  ('foo', 'bar', 'baz.html')

/foo/bar/directory/
  ('foo', 'bar', 'directory', '')

So we see that Nevow does nothing more than split the URI on the string '/' and
pass these path segments to our application for consumption. Armed with these
two methods alone, we already have enough information to write applications
which service any form of URL imaginable in any way we wish. However, there are
some common URL handling patterns which Nevow provides higher level support for.

``locateChild`` in depth
------------------------

One common URL handling pattern involves parents which only know about their
direct children. For example, a ``Directory`` object may only know about the
contents of a single directory, but if it contains other directories, it does
not know about the contents of them. Let's examine a simple ``Directory`` object
which can provide directory listings and serves up objects for child directories
and files::

  from zope.interface import implements           

  class Directory(object):
      implements(inevow.IResource)

      def __init__(self, directory):
          self.directory = directory

      def renderHTTP(self, ctx):
          html = ['<ul>']
          for child in os.listdir(self.directory):
              fullpath = os.path.join(self.directory, child)
              if os.path.isdir(fullpath):
                  child += '/'
              html.extend(['<li><a href="', child, '">', child, '</a></li>'])
          html.append('</ul>')
          return ''.join(html)

      def locateChild(self, ctx, segments):
          name = segments[0]
          fullpath = os.path.join(self.directory, name)
          if not os.path.exists(fullpath):
              return None, () # 404

          if os.path.isdir(fullpath):
              return Directory(fullpath), segments[1:]
          if os.path.isfile(fullpath):
              return static.File(fullpath), segments[1:]

Because this implementation of ``locateChild`` only consumed one segment and
returned the rest of them (``segments[1:]``), the object traversal process will
continue by calling ``locateChild`` on the returned resource and passing the
partially-consumed segments. In this way, a directory structure of any depth can
be traversed, and directory listings or file contents can be rendered for any
existing directories and files.

So, let us examine what happens when the URI "/foo/bar/baz.html" is traversed,
where "foo" and "bar" are directories, and "baz.html" is a file.

Directory('/').locateChild(ctx, ('foo', 'bar', 'baz.html'))
    Returns Directory('/foo'), ('bar', 'baz.html')

Directory('/foo').locateChild(ctx, ('bar', 'baz.html'))
    Returns Directory('/foo/bar'), ('baz.html, )

Directory('/foo/bar').locateChild(ctx, ('baz.html'))
    Returns File('/foo/bar/baz.html'), ()

No more segments to be consumed; ``File('/foo/bar/baz.html').renderHTTP(ctx)`` is
called, and the result is sent to the browser.
                       
``childFactory`` method
-----------------------

Consuming one URI segment at a time by checking to see if a requested resource
exists and returning a new object is a very common pattern. Nevow's default
implementation of ``IResource``, ``nevow.rend.Page``, contains an implementation of
``locateChild`` which provides more convenient hooks for implementing object
traversal. One of these hooks is ``childFactory``. Let us imagine for the sake of
example that we wished to render a tree of dictionaries. Our data structure
might look something like this::

    tree = dict(
        one=dict(
            foo=None,
            bar=None),
        two=dict(
            baz=dict(
                quux=None)))

Given this data structure, the valid URIs would be:

* /
* /one
* /one/foo
* /one/bar
* /two
* /two/baz
* /two/baz/quux

Let us construct a ``rend.Page`` subclass which uses the default ``locateChild``
implementation and overrides the ``childFactory`` hook instead::

  class DictTree(rend.Page):
      def __init__(self, dataDict):
          self.dataDict = dataDict

      def renderHTTP(self, ctx):
          if self.dataDict is None:
              return "Leaf"
          html = ['<ul>']
          for key in self.dataDict.keys():
              html.extend(['<li><a href="', key, '">', key, '</a></li>'])
          html.append('</ul>')
          return ''.join(html)

      def childFactory(self, ctx, name):
          if name not in self.dataDict:
              return rend.NotFound # 404
          return DictTree(self.dataDict[name])

As you can see, the ``childFactory`` implementation is considerably shorter than the
equivalent ``locateChild`` implementation would have been.

``child_*`` methods and attributes
----------------------------------

Often we may wish to have some hardcoded URLs which are not dynamically
generated based on some data structure. For example, we might have an
application which uses an external CSS stylesheet, an external JavaScript file,
and a folder full of images. The ``rend.Page`` ``locateChild`` implementation provides a
convenient way for us to express these relationships by using ``child``-prefixed
methods::

  class Linker(rend.Page):
      def renderHTTP(self, ctx):
          return """<html>
    <head>
      <link href="css" rel="stylesheet" />
      <script type="text/javascript" src="scripts" />
    <body>
      <img src="images/logo.png" />
    </body>
  </html>"""

      def child_css(self, ctx):
          return static.File('/Users/dp/styles.css')

      def child_scripts(self, ctx):
          return static.File('/Users/dp/scripts.js')

      def child_images(self, ctx):
          return static.File('/Users/dp/images/')

One thing you may have noticed is that all of the examples so far have returned
new object instances whenever they were implementing a traversal API. However,
there is no reason these instances cannot be shared. One could for example
return a global resource instance, an instance which was previously inserted in
a dict, or lazily create and cache dynamic resource instances on the fly. The
``rend.Page`` ``locateChild`` implementation also provides a convenient way to express
that one global resource instance should always be used for a particular url,
the ``child``-prefixed attribute::

  class FasterLinker(Linker):
      child_css = static.File('/Users/dp/styles.css')
      child_scripts = static.File('/Users/dp/scripts.js')
      child_images = static.File('/Users/dp/images/')

Dots in child names
-------------------

When a URL contains dots, which is quite common in normal URLs, it is simple
enough to handle these URL segments in ``locateChild`` or ``childFactory`` -- one of the
passed segments will simply be a string containing a dot. However, it is not
immediately obvious how one would express a URL segment with a dot in it when
using ``child``-prefixed methods. The solution is really quite simple::

  class DotChildren(rend.Page):
      return '<html><head><script type="text/javascript" src="scripts.js" /></head></html>'

  setattr(DotChildren, 'child_scripts.js', static.File('/Users/dp/scripts.js'))

The same technique could be used to install a child method with a dot in the
name.

children dictionary
-------------------

The final hook supported by the default implementation of locateChild is the
``rend.Page.children`` dictionary::

  class Main(rend.Page):
      children = {
          'people': People(),
          'jobs': Jobs(),
          'events': Events()}

      def renderHTTP(self, ctx):
          return """\
<html>
    <head>
        <title>Our Site</title>
    </head>
    <body>
        <p>bla bla bla</p>
    </body>
</html>"""


Hooks are checked in the following order:

  1. ``self.dictionary``
  2. ``self.child_*``
  3. ``self.childFactory``

The default trailing slash handler
----------------------------------

When a URI which is being handled ends in a slash, such as when the '/' URI is
being rendered or when a directory-like URI is being rendered, the string ''
appears in the path segments which will be traversed. Again, handling this case
is trivial inside either ``locateChild`` or ``childFactory``, but it may not be
immediately obvious what ``child``-prefixed method or attribute will be looked up.
The method or attribute name which will be used is simply ``child`` with a single
trailing underscore.

The ``rend.Page`` class provides an implementation of this method which can work in
two different ways. If the attribute ``addSlash`` is True, the default trailing
slash handler will return ``self``. In the case when ``addSlash`` is True, the default
``rend.Page.renderHTTP`` implementation will simply perform a redirect which adds
the missing slash to the URL.

The default trailing slash handler also returns self if ``addSlash`` is false, but
emits a warning as it does so. This warning may become an exception at some
point in the future.

``ICurrentSegments`` and ``IRemainingSegments``
-----------------------------------------------

During the object traversal process, it may be useful to discover which segments
have already been handled and which segments are remaining to be handled. This
information may be obtained from the ``context`` object which is passed to all the
traversal APIs. The interfaces ``nevow.inevow.ICurrentSegments`` and
``nevow.inevow.IRemainingSegments`` are used to retrieve this information. To
retrieve a tuple of segments which have previously been consumed during object
traversal, use this syntax::

  segs = ICurrentSegments(ctx)

The same is true of ``IRemainingSegments``. ``IRemainingSegments`` is the same value
which is passed as ``segments`` to ``locateChild``, but may also be useful in the
implementations of ``childFactory`` or a ``child``-prefixed method, where this
information would not otherwise be available.
 
Conclusion
==========

Nevow makes it easy to handle complex URL hierarchies. The most basic object
traversal interface, ``nevow.inevow.IResource.locateChild``, provides powerful and
flexible control over the entire object traversal process. Nevow's canonical
``IResource`` implementation, ``rend.Page``, also includes the convenience hooks
``childFactory`` along with ``child``-prefixed method and attribute semantics to
simplify common use cases.
 
分享到:
评论

相关推荐

    java-leetcode-107-binary-tree-level-order-traversal

    java java_leetcode-107-binary-tree-level-order-traversal

    java-leetcode-102-binary-tree-level-order-traversal

    java java_leetcode-102-binary-tree-level-order-traversal

    es-module-traversal:遍历es模块导入图

    es模块遍历从某些ES模块入口点收集所有导入的文件, 您还可以自定义resolver和read功能,以遍历来自静态服务器的导入图npm install es-module-traversal基准测试使用yarn test:bench命令运行基准yarn test:bench 它...

    js-leetcode题解之107-binary-tree-level-order-traversal-ii.js

    通过LeetCode题解的这个文件“js-leetcode题解之107-binary-tree-level-order-traversal-ii.js”,我们可以学习如何使用JavaScript编写二叉树层序遍历的变种,以及如何处理和操作队列数据结构。

    Pre-order-binary-tree-traversal.rar_pre_pre-orderC语言

    这个压缩包文件"Pre-order-binary-tree-traversal.rar_pre_pre-orderC语言"显然是为了教授如何使用C语言进行二叉树的前序遍历。下面将详细介绍前序遍历的概念、算法以及C语言实现的关键点。 1. **前序遍历的概念**...

    c语言-leetcode题解之0094-binary-tree-inorder-traversal.zip

    其中,二叉树的中序遍历(binary tree inorder traversal)是数据结构中的一个经典算法问题。中序遍历的特点是先访问左子树,然后是根节点,最后是右子树,这种遍历方式适用于二叉搜索树,能够得到排序的序列。 在...

    First-root-traversal-non-recursive.zip_ROOT

    关于"First-root-traversal-non-recursive.zip_ROOT"文件,根据其标题和描述,我们可以推断出该文件包含了关于二叉树非递归先根遍历的具体实现。文件可能提供了一种或多种编程语言的实现示例,详细阐述了算法的细节...

    fs-traversal-relationship-display:生成一个 fs-traversal 关系路径的 HTML 显示

    script src =" fs-traversal-relationship-display.js " &gt; &lt;/ script &gt;&lt; link rel =" stylesheet " href =" fs-traversal-relationship-display.css " &gt; 传递FSTraversal.的结果FSTraversal. 直接...

    python-leetcode题解之107-Binary-Tree-Level-Order-Traversal-II

    1. 二叉树的层级遍历(Level Order Traversal)是基础数据结构中常见的算法问题,它要求按照树的层次结构从上到下、从左到右遍历树中的节点。 2. 在LeetCode平台上,问题编号107的题目是“Binary Tree Level Order ...

    traverson-traversal:使用HypermediaHATEOAS API的Web组件

    $ bower install traverson-traversal --save 用法 导入Web组件polyfill: &lt; script src = "/bower_components/webcomponentsjs/webcomponents-lite.js" &gt; &lt; / script &gt; 导入Traverson遍历元素: &lt; ...

    complete-binary-tree--traversal.zip_最小完全树

    完全二叉树是一种特殊的二叉树结构,每个层(除了可能的最后一层)都是完全填充的,且所有结点都尽可能地集中在左侧。在完全二叉树中,如果最后一个层是不完全填充的,则所有结点都集中在左端。...

    python-leetcode题解之102-Binary-Tree-Level-Order-Traversal

    由于LeetCode平台提供了一个公共的编程问题库供开发者练习和提升编程能力,因此我们选择了其中的题目编号102,即"Binary Tree Level Order Traversal"(二叉树的层序遍历)。为了解决这个问题,我们通常需要掌握队列...

    binary-tree-traversal-binary-tree.zip_tree

    在"binary tree traversal binary tree.doc"文档中,可能会详细探讨这些概念并给出具体的代码示例,包括如何实现这些遍历方法以及在不同场景下如何选择合适的二叉树结构。通过学习和理解这些内容,开发者能够更好地...

    python-leetcode题解之103-Binary-Tree-Zigzag-Level-Order-Traversal

    今天我们将详细讨论Python实现LeetCode第103题——二叉树的锯齿形层序遍历(Binary Tree Zigzag Level Order Traversal)的解决方案。 二叉树的锯齿形层序遍历要求我们以不同于传统层序遍历的方式输出树节点值,即...

    python-leetcode题解之144-Binary-Tree-Preorder-Traversal

    在对二叉树进行前序遍历的过程中,首先需要了解前序遍历的基本概念。前序遍历是一种深度优先遍历二叉树的方法,按照“根节点 - 左子树 - 右子树”的顺序访问每个节点。在编写Python代码解决LeetCode上144题时,我们...

    js-leetcode题解之144-binary-tree-preorder-traversal.js

    对于LeetCode上编号为144的题目“Binary Tree Preorder Traversal”,我们可以采用递归或非递归的方式来解决。 首先,递归方法是前序遍历实现中最直观的一种。在递归函数中,我们首先将根节点的值添加到结果数组中...

    python-leetcode题解之145-Binary-Tree-Postorder-Traversal

    在计算机科学中,二叉树是一种重要的数据结构,用于模拟具有层次关系的数据。在二叉树中,每个节点最多有两个子节点,称为左子节点和右子节点。后序遍历是遍历二叉树的三种主要方式之一,其他两种是前序遍历和中序...

    js-leetcode题解之145-binary-tree-postorder-traversal.js

    在JavaScript中实现二叉树后序遍历,是众多算法练习者在LeetCode上常遇到的题目之一。题目的主要任务是遍历给定的二叉树,并按照后序遍历的规则输出节点的值。后序遍历,也就是先遍历左子树,再遍历右子树,最后访问...

    js-leetcode题解之102-binary-tree-level-order-traversal.js

    在讨论JavaScript解决LeetCode题解的实践中,我们关注的焦点是102题:二叉树的层序遍历。层序遍历是指按照树的层次从上到下,从左到右的顺序遍历树中的节点。它是一种广泛应用的二叉树遍历方法,尤其在树结构的算法...

    js-leetcode题解之94-binary-tree-inorder-traversal.js

    在LeetCode上,问题编号94的题目“Binary Tree Inorder Traversal”就是要求对给定的二叉树进行中序遍历,并返回遍历的结果。给定的二叉树用JavaScript对象来表示,每个节点的值为一个数字,节点结构包含值、左子...

Global site tag (gtag.js) - Google Analytics