scm002

Comparing two dictionaries with the Python dictdiffer module

Blog category: Python

http://dictdiffer.readthedocs.io/en/latest/

Dictdiffer

Dictdiffer is a helper module that helps you to diff and patch dictionaries.

Installation

Dictdiffer is on PyPI so all you need is:

$ pip install dictdiffer

Usage

Let’s start with an example of how to find the diff between two dictionaries using the diff() function:

from dictdiffer import diff, patch, swap, revert

first = {
    "title": "hello",
    "fork_count": 20,
    "stargazers": ["/users/20", "/users/30"],
    "settings": {
        "assignees": [100, 101, 201],
    }
}

second = {
    "title": "hellooo",
    "fork_count": 20,
    "stargazers": ["/users/20", "/users/30", "/users/40"],
    "settings": {
        "assignees": [100, 101, 202],
    }
}

result = diff(first, second)

assert list(result) == [
    ('change', ['settings', 'assignees', 2], (201, 202)),
    ('add', 'stargazers', [(2, '/users/40')]),
    ('change', 'title', ('hello', 'hellooo'))]
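Each diff item is an (action, path, value) tuple, where the path is either a dotted string or a list of keys. As a minimal sketch of how such items could be consumed, the hypothetical helper describe() below (not part of dictdiffer) turns the items from the example above into readable lines. It uses only the standard library and works on the literal tuples, so it makes no assumption about the library beyond the tuple shapes shown here.

```python
def describe(item):
    """Render one (action, path, value) diff item as a readable line."""
    action, path, value = item
    # Paths appear either as a dotted string or as a list of keys/indices.
    path_str = '.'.join(str(p) for p in path) if isinstance(path, list) else path
    if action == 'change':
        old, new = value
        return '{}: {!r} -> {!r}'.format(path_str, old, new)
    # 'add' and 'remove' carry a list of (key_or_index, value) pairs.
    sign = '+' if action == 'add' else '-'
    return '; '.join(
        '{} {}[{!r}] = {!r}'.format(sign, path_str, key, val)
        for key, val in value)


# Diff items copied from the example above.
items = [
    ('change', ['settings', 'assignees', 2], (201, 202)),
    ('add', 'stargazers', [(2, '/users/40')]),
    ('change', 'title', ('hello', 'hellooo')),
]
lines = [describe(item) for item in items]
for line in lines:
    print(line)
```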

Now we can apply the diff result with the patch() function:

result = diff(first, second)
patched = patch(result, first)

assert patched == second

We can also invert the diff result with the swap() function:

result = diff(first, second)
swapped = swap(result)

assert list(swapped) == [
    ('change', ['settings', 'assignees', 2], (202, 201)),
    ('remove', 'stargazers', [(2, '/users/40')]),
    ('change', 'title', ('hellooo', 'hello'))]

Let’s revert the last changes:

result = diff(first, second)
reverted = revert(result, patched)
assert reverted == first

A tolerance can be used to treat close values as equal. The tolerance parameter applies only to int and float values.

Let’s try with a tolerance of 10% with the values 10 and 10.5:

first = {'a': 10.0}
second = {'a': 10.5}

result = diff(first, second, tolerance=0.1)

assert list(result) == []

Now with a tolerance of 1%:

result = diff(first, second, tolerance=0.01)

assert list(result) == [('change', 'a', (10.0, 10.5))]
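To make the two outcomes above concrete, here is a minimal sketch of one plausible relative-tolerance check. The helper name roughly_equal is hypothetical, and the formula (difference compared against the tolerance times the larger magnitude) is an assumption chosen to match the 10% and 1% examples; it is not presented as dictdiffer's exact implementation.

```python
def roughly_equal(a, b, tolerance):
    """Return True when a and b differ by at most `tolerance`,
    measured relative to the larger of the two magnitudes (assumed rule)."""
    if a == b:
        return True
    return abs(a - b) <= tolerance * max(abs(a), abs(b))


# 10.0 vs 10.5 differ by 0.5; 10% of 10.5 is 1.05, so they count as equal.
within_ten_percent = roughly_equal(10.0, 10.5, 0.1)
# 1% of 10.5 is 0.105, which is smaller than 0.5, so they count as different.
within_one_percent = roughly_equal(10.0, 10.5, 0.01)
print(within_ten_percent, within_one_percent)
```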

API

Dictdiffer is a helper module to diff and patch dictionaries.

dictdiffer.diff(first, second, node=None, ignore=None, path_limit=None, expand=False, tolerance=2.220446049250313e-16)

Compares two dictionary/list/set objects and returns a diff result.

Return an iterator with differences between two objects. The diff items represent addition/deletion/change and the item value is a deep copy from the corresponding source or destination objects.

>>> from dictdiffer import diff
>>> result = diff({'a': 'b'}, {'a': 'c'})
>>> list(result)
[('change', 'a', ('b', 'c'))]

Keys can be skipped from the difference calculation when they are included in the ignore argument, which is of type collections.Container.

>>> list(diff({'a': 1, 'b': 2}, {'a': 3, 'b': 4}, ignore=set(['a'])))
[('change', 'b', (2, 4))]
>>> class IgnoreCase(set):
...     def __contains__(self, key):
...         return set.__contains__(self, str(key).lower())
>>> list(diff({'a': 1, 'b': 2}, {'A': 3, 'b': 4}, ignore=IgnoreCase('a')))
[('change', 'b', (2, 4))]

The difference calculation can be limited to a certain path:

>>> list(diff({}, {'a': {'b': 'c'}}))
[('add', '', [('a', {'b': 'c'})])]

>>> from dictdiffer.utils import PathLimit
>>> list(diff({}, {'a': {'b': 'c'}}, path_limit=PathLimit()))
[('add', '', [('a', {})]), ('add', 'a', [('b', 'c')])]

>>> from dictdiffer.utils import PathLimit
>>> list(diff({}, {'a': {'b': 'c'}}, path_limit=PathLimit([('a',)])))
[('add', '', [('a', {'b': 'c'})])]

>>> from dictdiffer.utils import PathLimit
>>> list(diff({}, {'a': {'b': 'c'}},
...           path_limit=PathLimit([('a', 'b')])))
[('add', '', [('a', {})]), ('add', 'a', [('b', 'c')])]

The patch can be expanded into smaller units, e.g. when adding multiple values:

>>> list(diff({'fruits': []}, {'fruits': ['apple', 'mango']}))
[('add', 'fruits', [(0, 'apple'), (1, 'mango')])]

>>> list(diff({'fruits': []}, {'fruits': ['apple', 'mango']}, expand=True))
[('add', 'fruits', [(0, 'apple')]), ('add', 'fruits', [(1, 'mango')])]
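The effect of expand=True on the two outputs above can be imitated on plain tuples. The sketch below is a hypothetical helper, expand_sketch, that splits every add or remove item into one item per (key, value) pair; it illustrates the transformation only and is not dictdiffer's own code.

```python
def expand_sketch(diff_items):
    """Split multi-value 'add'/'remove' items into single-value items."""
    for action, path, value in diff_items:
        if action in ('add', 'remove'):
            # One output item per (key_or_index, value) pair.
            for pair in value:
                yield action, path, [pair]
        else:
            # 'change' items already describe a single value.
            yield action, path, value


compact = [('add', 'fruits', [(0, 'apple'), (1, 'mango')])]
expanded = list(expand_sketch(compact))
print(expanded)
```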

Parameters:

first – The original dictionary, list or set.
second – New dictionary, list or set.
node – Key for comparison that can be used in dot_lookup().
ignore – List of keys that should not be checked.
path_limit – List of path limit tuples or dictdiffer.utils.PathLimit object to limit the diff recursion depth.
expand – Expand the patches.
tolerance – Threshold to consider when comparing two float numbers.

Changed in version 0.3: Added ignore parameter.

Changed in version 0.4: Arguments first and second can now contain a set.

Changed in version 0.5: Added path_limit parameter. Added expand parameter. Added tolerance parameter.

Changed in version 0.7: Diff items are deep copies of their corresponding objects.

dictdiffer.patch(diff_result, destination)

Apply the diff result to the old dictionary.

dictdiffer.swap(diff_result)

Swap the diff result.

It uses the following mapping:

remove -> add
add -> remove

In addition, the two values of each change item are swapped.

>>> from dictdiffer import swap
>>> swapped = swap([('add', 'a.b.c', [('a', 'b'), ('c', 'd')])])
>>> next(swapped)
('remove', 'a.b.c', [('c', 'd'), ('a', 'b')])

>>> swapped = swap([('change', 'a.b.c', ('a', 'b'))])
>>> next(swapped)
('change', 'a.b.c', ('b', 'a'))
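The mapping can be written out on plain tuples. This is a minimal sketch, assuming only what the doctests above show: add becomes remove with the items in reverse order, remove becomes add, and change swaps its two values. The name swap_sketch is hypothetical and the body is not taken from dictdiffer's source.

```python
def swap_sketch(diff_result):
    """Yield the inverse of each (action, path, value) diff item."""
    for action, path, value in diff_result:
        if action == 'add':
            # Reversed so that list items are removed from the end first,
            # matching the order shown in the doctest output.
            yield 'remove', path, list(reversed(value))
        elif action == 'remove':
            yield 'add', path, value
        else:
            old, new = value
            yield 'change', path, (new, old)


swapped_add = list(swap_sketch([('add', 'a.b.c', [('a', 'b'), ('c', 'd')])]))
swapped_change = list(swap_sketch([('change', 'a.b.c', ('a', 'b'))]))
print(swapped_add, swapped_change)
```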

dictdiffer.revert(diff_result, destination)

Calls the swap function to revert a patched dictionary object.

Usage example:

>>> from dictdiffer import diff, revert
>>> first = {'a': 'b'}
>>> second = {'a': 'c'}
>>> revert(diff(first, second), second)
{'a': 'b'}

dictdiffer.dot_lookup(source, lookup, parent=False)

Allows you to reach dictionary items with a string or list lookup.

Recursively finds the value by a lookup key split on ‘.’.

>>> from dictdiffer.utils import dot_lookup
>>> dot_lookup({'a': {'b': 'hello'}}, 'a.b')
'hello'

If the parent argument is True, returns the parent node of the matched object.

>>> dot_lookup({'a': {'b': 'hello'}}, 'a.b', parent=True)
{'b': 'hello'}

If the lookup is an empty value, returns the whole dictionary object.

>>> dot_lookup({'a': {'b': 'hello'}}, '')
{'a': {'b': 'hello'}}
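The three behaviours above (dotted lookup, parent lookup, empty lookup) can be reproduced with a short walk over the keys. The sketch below is a hypothetical re-implementation named lookup_sketch, written only from the documented behaviour; converting keys to integers when the current node is a list is an added assumption, not something stated above.

```python
def lookup_sketch(source, lookup, parent=False):
    """Follow a dotted string or a list of keys into nested containers."""
    if lookup is None or lookup == '':
        # Empty lookup: return the whole object.
        return source
    keys = lookup.split('.') if isinstance(lookup, str) else list(lookup)
    if parent:
        # Stop one level early to get the parent of the matched object.
        keys = keys[:-1]
    value = source
    for key in keys:
        if isinstance(value, list):
            key = int(key)  # assumption: list nodes are indexed by integer
        value = value[key]
    return value


data = {'a': {'b': 'hello'}, 'items': [10, 20]}
print(lookup_sketch(data, 'a.b'))
print(lookup_sketch(data, 'a.b', parent=True))
print(lookup_sketch(data, 'items.1'))
```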

Changes

Version 0.6.1 (released 2016-11-22)

Changes order of items for REMOVE section of generated patches when swap is called so the list items are removed from the end. (#85)
Improves API documentation for ignore argument in diff function. (#79)
Executes doctests during PyTest invocation.

Version 0.6.0 (released 2016-06-22)

Adds support for comparing NumPy arrays. (#68)
Adds support for comparing mutable mappings, sequences and sets from collections.abc module. (#67)
Updates package structure, sorts imports and runs doctests.
Fixes order in which handled conflicts are unified so that the Merger’s unified_patches can be always applied.

Version 0.5.0 (released 2016-01-04)

Adds tolerance parameter used when user wants to treat close values as equal.
Adds support for comparing numerical values and NaN. (#54) (#55)

Version 0.4.0 (released 2015-03-11)

Adds support for diffing and patching of sets. (#44)
New tests for diff on the same lists. (#48)
Fix for exception when dict has unicode keys and ignore parameter is provided. (#50)
PEP8 improvements.

Version 0.3.0 (released 2014-11-05)

Adds ignore argument to diff function that allows skipping check on specified keys. (#34 #35)
Fix for diffing of dict or list subclasses. (#37)
Better instance checking of diffing objects. (#39)

Version 0.2.0 (released 2014-09-29)

Fix for empty list instructions. (#30)
Regression test for empty list instructions.

Version 0.1.0 (released 2014-09-01)

Fix for list removal issues during patching caused by wrong iteration. (#10)
Fix for issues with multiple value types for the same key. (#10)
Fix for issues with strings handled as iterables. (#6)
Fix for integer keys. (#12)
Regression test for complex dictionaries. (#4)
Better testing with Travis CI, tox, pytest, code coverage. (#10)
Initial release of documentation on ReadTheDocs. (#21 #24)
Support for Python 3. (#15)

Version 0.0.4 (released 2014-01-04)

List diff behavior treats lists as lists instead of sets. (#3)
Objects of different types are now flagged as changed.
Swap function refactored.

Version 0.0.3 (released 2013-05-26)

Initial public release on PyPI.

Contributing

Bug reports, feature requests, and other contributions are welcome. If you find a demonstrable problem that is caused by the code of this library, please:

Search for already reported problems.
Check if the issue has been fixed or is still reproducible on the latest master branch.
Create an issue with a test case.

If you create a feature branch, you can run the tests to ensure everything is operating correctly:

$ ./run-tests.sh

...

Name                  Stmts   Miss  Cover   Missing
---------------------------------------------------
dictdiffer/__init__      88      0   100%
dictdiffer/version        2      0   100%
---------------------------------------------------
TOTAL                    90      0   100%

...

52 passed, 2 skipped in 0.44 seconds

License

Dictdiffer is free software; you can redistribute it and/or modify it under the terms of the MIT License quoted below.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

In applying this license, CERN does not waive the privileges and immunities granted to it by virtue of its status as an Intergovernmental Organization or submit itself to any jurisdiction.

Authors

Dictdiffer was originally developed by Fatih Erikli. It is now being developed and maintained by the Invenio collaboration. You can contact us at info@inveniosoftware.org.

Contributors:

Fatih Erikli <fatiherikli@gmail.com>
Brian Rue <brianrue@gmail.com>
Lars Holm Nielsen <lars.holm.nielsen@cern.ch>
Tibor Simko <tibor.simko@cern.ch>
Jiri Kuncar <jiri.kuncar@cern.ch>
Jason Peddle <jwpeddle@gmail.com>
Martin Vesper <martin.vesper@cern.ch>
Gilles DAVID <frodon1@gmail.com>
Alexander Mohr <amohr@farmersbusinessnetwork.com>

2017-03-04 17:51