`
swingfly
  • 浏览: 53590 次
  • 性别: Icon_minigender_1
  • 来自: 成都
社区版块
存档分类
最新评论

Understanding import

阅读更多

Understanding import

import hello
hello.say_hello(
 

 

The first thing we need to understand to do this is how Python’s import mechanism works; in other words, the exact steps Python goes through when it encounters an import statement. If you want all the gory details, the official Python documentation on importing isn’t too bad an explanation of this process, and Fredrik Lundh has an excellent write-up of all the nooks and crannies of Python’s importing mechanism . I highly recommend giving both of those a thorough read at some point, but for now let’s walk through the key points together.

When you have a statement like import hello in a Python program, Python goes through two steps to actually import it:

  1. Locate and, if necessary, initialize the module.
  2. Bind the resulting module object to a name in your current scope.

So when Python sees import hello , it wants to locate and possibly initialize a module named hello , then assign the resulting module object to the name hello in your program’s current scope. If the import statement ocurs at the top of a file, hello will become a module-global name, for example.

The first step — locating and initializing the module — can happen in either of a couple of ways:

  1. If the module hasn’t already been initialized, that needs to happen. For most Python modules, that simply consists of executing the code in the module so that, for example, any classes or functions it contains get defined.
  2. If the module has already been initialized, there will be a module object already in memory for it, and Python can simply grab that object.

Python figures out whether the module has already been initialized by looking at a dictionary named modules which lives inside the built-in module “sys” of the Python standard library; sys.modules has keys corresponding to the import paths of modules which have already been loaded and initialized, and the values are the resulting module objects.

So the actual mechanism is pretty simple: when we say import hello in a Python program, Python goes and looks for the key “hello” in the sys.modules dictionary. If that key exists, Python gets the already-initialized module object out of sys.modules[‘hello’] , and if it doesn’t then Python goes out to your file system and starts looking through the directories on your Python import path for a file or module named hello , which will — if found — be initialized and create an entry in sys.modules . If it isn’t found, Python will raise an ImportError .

One important thing to note here is that if you have a module which can conceivably be imported in multiple different ways — say, because both your project directory and application are directly on your Python path, so that both from myproject.blog.models import Entry and from blog.models import Entry will work — you can end up with a single module getting initialized more than once, and having more than one entry in sys.modules (one for each different way you’ve imported it). Significant sections of Django’s model-loading code exist to work around this and ensure that a given model class only gets initialized once.

Also, note that for module names which contain dots (e.g., import foo.bar ), the mechanism is slightly different: Python looks for an entry in sys.modules which matches up to, but not including, the right-most dot, then looks inside the resulting module object for the final part. So in the statement import foo.bar.baz , Python looks for the entry “foo.bar” in sys.modules , then looks for something named baz inside the resulting module object.

By now you might be wondering whether, since sys.modules is a dictionary, you can just go stick things into it. The answer is that you can: you’re free to do anything to sys.modules that’s legal to do to a Python dictionary, though it’s almost always a bad idea to go messing around with it. But this points the way to how we’re going to make our eventual import statement work: once we’ve constructed the hello module, we can simply stick it into sys.modules and Python will happily let us import it without ever bothering to check if an actual module of that name exists on the file system.

Understanding modules

If you’ve ever tried to access, say, a nonexistent Django setting, you’ve probably seen an error like this:

AttributeError: 'module' object has no attribute 'some_random_name'


And so far I’ve been using the phrase “module object” to refer to Python modules. Both of these give us a clue about how we can build a module on the fly: modules, like everything else in Python, are simply objects, and you can instantiate new module objects just as you can instantiate objects from classes you’ve defined in your applications, assuming you know where to look.

The place to look is another module from Python’s standard library : types , which contains the type objects for many of Python’s built-in types. If you know your way around the types module, you can dynamically build nearly any sort of standard Python object on the fly, even some objects that you can’t normally construct otherwise. In this case the one we’re interested in is types.ModuleType , which we can use to create a brand-new module object at runtime; it works the same as instantiating any other object, and requires at least one argument: the name of the module object to create. You can also optionally pass a second argument which will become the new module’s docstring, so that Python’s built-in help() function will be able to show documentation for it (and other automated documentation parsers will be able to extract its documentation), but we’ll leave that off for this example.

So let’s go ahead and start building our hello module. Pop open a Python interpreter and type in the following:

import types
hello_mod = types ModuleType('hello')
 

We now have a module object bound to the variable hello_mod ; you can check that it really is a module and has the correct name — “hello” — in the interpreter:

hello_mod<module 'hello' (built-in)>
 

At this point, the new module is simply a blank slate; we can stick anything into it that we like. So let’s define the say_hello() function we’re going to use:

def say_hello():
    print "Hello"
 

And then add it to our hello module:

hello_mod.say_hello = say_hello
 

You can call this function and verify that it works:

hello_mod.say_hello()
 

Putting it all together

Of course, we still need to make our new module importable via the name “hello”, but armed with an understanding of sys.modules this is easy:

import sys
sys.modules.setdefault('hello', hello_mod)

>>><module 'hello' (built-in)>
 

We’re using setdefault() here instead of assigning directly to sys.modules[‘hello’] , because if there’s already a loaded module named hello we shouldn’t overwrite it; the setdefault() method of a Python dictionary takes a key and and a value, and then either inserts the value into the dictionary with the given key, if that key wasn’t already in use in the dictonary, or else does nothing to the dictionary. In either case it returns the value which ends up in the dictionary, which provides an easy way to figure out if you added a new value or not. In this case, the return value of setdefault() was the module we just created, so we know it was added to sys.modules successfully.

And now we can use the “magic”:

import hello
 
hello.say_hello()

>>>Hello

 

This works because our dynamically-created module object is now in sys.modules under the name “hello”; since it’s in there, Python will simply return it any time we say import hello and never bother checking the file system to see if a “real” module of that name exists on the Python path. We can even use the alternate from syntax to import just the say_hello() function:

from hello import say_hello
say_hello()

>>>Hello
 

Once again, Python is simply giving us back the contents of the module object we created in memory; the fact that it exists in sys.modules again bypasses any need to check the file system. And as soon as you exit the Python interpreter, the hello module will simply disappear; since it only ever existed in memory inside this single Python process, it will go away as soon as that Python process exits.

And now you know

At this point you can probably work out how Django — back in the 0.90 and 0.91 days — used to create the “magic” model modules which were importable from django.models ; there was a lot more work going on to build up the things which eventually lived inside that module, but ultimately it boiled down to the same two things we just did: createing a new module object with types.ModuleType , and making it importable by inserting it into sys.modules .

I can’t stress enough that this is something you probably shouldn’t ever do in real-world code, because — as the example of Django’s old-style model system shows — it’s confusing and counterintuitive to mysteriously create modules where people aren’t expecting them. And messing with sys.modules , unless you really know what you’re doing, can also be dangerous; if you’re not careful you might accidentally delete or overwrite the entry for a module you were relying on, and then you’ll be in a real pickle.

But knowing how the process works — even if you never actually use it — helps to turn this from “magic” into a fairly straightforward application of a Python feature, and provides a useful glimpse into some of Python’s fundamental workings: knowing how Python’s import mechanism works, for example, is incredibly important and handy for a broad range of Python programming tasks, and shows off a major part of Python’s sometimes-downplayed dynamic nature.

分享到:
评论

相关推荐

    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data

    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data ISBN: 1491910399 | 2017 | True PDF | 522 pages | 32 MB What exactly is data science? With this book, you’ll gain a clear ...

    understanding-es6.pdf

    模块(Modules)是ES6中用于组织代码的一种方式,它允许开发者将代码分割成不同的文件,并通过导入(import)和导出(export)机制在这些文件之间共享代码。 **示例:** **math.js:** ```javascript export ...

    R for Data Science_ Import, Tidy, Transform, Visualize, and Model Data

    Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make ...

    R for Data Science Visualize Model Transform Tidy and Import Data.pdf

    Data Import with readr Chapter 9. Tidy Data with tidyr Chapter 10. Relational Data with dplyr Chapter 11. Strings with stringr Chapter 12. Factors with forcats Chapter 13. Dates and Times with ...

    math-understanding-tools

    项目目的 已经开发了数学理解工具来...import { GraphicVector , OVector2 , UpdateService } from "math-understanding-tools" ; let options = { antialias : true , autoResize : true } ; const app = new Appli

    Understanding ECMAScript 6 The Definitive Guide for JavaScript Developers pdf 0分

    7. 模块化(Modules):ES6标准化了JavaScript模块系统,提供了import和export关键字来导入和导出模块。这使得代码组织和复用变得更加方便,尤其是对于大型项目和库。 Nicholas C. Zakas在书中详细介绍了以上每一个...

    understanding ecmascript 6 中文版

    综上所述,《Understanding ECMAScript 6》是一本详尽介绍了 ES6 新特性的书籍,对于想要深入理解 ES6 的开发者来说是非常有价值的参考资料。这本书不仅覆盖了 ES6 的所有主要特性,还提供了一些实用的例子和深入的...

    Understanding Java servlets

    import javax.servlet.*; import javax.servlet.http.*; import java.io.*; public class HelloWorldServlet extends HttpServlet { public void doGet(HttpServletRequest request, HttpServletResponse response...

    understanding-ecmascript-6-zh-ver.pdf

    根据提供的文件信息,本文将基于《Understanding ECMAScript 6》中文版的内容概要来探讨ECMAScript 2015(简称ES6)的重要知识点。ECMAScript 2015是JavaScript的一个重大更新版本,引入了许多新的特性和语法改进,...

    understanding_tensorflow_nn:TenTensorFlow入门

    import pandas as pd import numpy as np import tensorflow as tf from collections import Counter from sklearn . datasets import fetch_20newsgroups TensorFlow的工作方式 import tensorflow as tf my_graph...

    Elasticsearch for Hadoop

    Table of Contents ...Creating the MapReduce job to import data from Elasticsearch to HDFS Writing the Tweets2Hdfs mapper Running the example Testing the job execution output Summary ...

    reading-understanding-computation-with-js

    ES6引入了模块系统,使用`import`和`export`关键字来导入和导出模块,这对于代码组织和复用非常有用。 6. **错误处理**: JavaScript使用`try...catch`语句进行异常处理,当代码块中出现错误时,可以捕获并处理...

    understanding-es6-zakas

    2. **模块(Modules)**:ES6的模块系统通过`import`和`export`关键字实现了代码的封装和重用。这使得代码组织更加清晰,提高了可维护性,同时避免了命名冲突。 3. **箭头函数(Arrow Functions)**:箭头函数的...

    MSN机器人源代码(C#)

    虽然C#自身不包含内置的NLP库,但你可以集成开源的NLP工具,如Stanford NLP或Microsoft的LUIS(Language Understanding Intelligent Service)来实现这一功能。 聊天机器人的响应策略也是关键。它可以基于预定义的...

    看懂java源码-udemy-javascript-understanding-the-weird-parts:JavaScript的源代码:

    5. **ES6及后续版本新特性**:箭头函数、模板字符串、解构赋值、类和模块系统(`import`和`export`)等,这些都是现代JavaScript开发的必备知识。 6. **JavaScript引擎工作原理**:V8引擎的优化技巧,如即时编译...

    谅解6ua:@nzakas的“ Understanding ES6”的乌克兰语翻译

    《谅解6ua》是《Understanding ES6》一书的乌克兰语译版,这本书由Nolan Zakas撰写,深入解析了ECMAScript 6(ES6)这一JavaScript编程语言的重要升级。ES6,又称为ES2015,是JavaScript语言历史上的一个里程碑,...

    Scala Data Analysis Cookbook(PACKT,2015)

    Starting with introductory recipes on utilizing the Breeze and Spark libraries, get to grips withhow to import data from a host of possible sources and how to pre-process numerical, string, and date ...

Global site tag (gtag.js) - Google Analytics