Configuration
Configuration of Parquet can be done using the setConf method on SparkSession or by running SET key=value commands using SQL.
Property Name Default Meaning
spark.sql.parquet.binaryAsString false Some other Parquet-producing systems, in particular Impala, Hive, and older versions of Spark SQL, do not differentiate between binary data and strings when writing out the Parquet schema. This flag tells Spark SQL to interpret binary data as a string to provide compatibility with these systems.
spark.sql.parquet.int96AsTimestamp true Some Parquet-producing systems, in particular Impala and Hive, store Timestamp into INT96. This flag tells Spark SQL to interpret INT96 data as a timestamp to provide compatibility with these systems.
spark.sql.parquet.cacheMetadata true Turns on caching of Parquet schema metadata. Can speed up querying of static data.
spark.sql.parquet.compression.codec snappy Sets the compression codec use when writing Parquet files. Acceptable values include: uncompressed, snappy, gzip, lzo.
spark.sql.parquet.filterPushdown true Enables Parquet filter push-down optimization when set to true.
spark.sql.hive.convertMetastoreParquet true When set to false, Spark SQL will use the Hive SerDe for parquet tables instead of the built in support.
spark.sql.parquet.mergeSchema false
When true, the Parquet data source merges schemas collected from all data files, otherwise the schema is picked from the summary file or a random data file if no summary file is available.
spark.sql.optimizer.metadataOnly true
When true, enable the metadata-only query optimization that use the table's metadata to produce the partition columns instead of table scans. It applies when all the columns scanned are partition columns and the query has an aggregate operator that satisfies distinct semantics.
分享到:
相关推荐
java SWT 界面开发环境配置常见错误发生时,可能用到的资源, org.eclipse.core.commands_3.6.1.选用了64位机器环境下eclipse开发用到的jar包。
wiley.ubuntu.linux.toolbox.1000.plus.commands.for.ubuntu.and.debian.power.users.nov.2007.pdf
BOOK6=HLP\DBG51.CHM("uVision2 Debug Commands",GEN) BOOK7=HLP\ISD51.CHM("ISD51 In System Debugger",GEN) BOOK8=HLP\FlashMon51.CHM("Flash Monitor",GEN) BOOK9=MON390\MON390.HTM("MON390: Dallas Contiguous ...
6. **T-SQL User Guide**:关于T-SQL语言的详细介绍,适用于Sybase ASE。 - [http://infocenter.sybase.com/help/index.jsp?topic=/com.sybase.infocenter.dc32300.1570/html/sqlug/title.htm]...
标题"PyPI 官网下载 | django-schedule-commands-2020.12.24.tar.gz" 提供的信息表明,这是一个从Python Package Index(PyPI)官方源下载的软件包,名为"django-schedule-commands",版本号为2020.12.24,且已打包...
Sams.MySQL.Phrasebook.Essential.Code.and.Commands.Mar.2006.chm
import org.eclipse.core.commands.ExecutionException; import com.teamcenter.rac.aif.AbstractAIFUIApplication; import com.teamcenter.rac.aifrcp.AIFUtility; import com.teamcenter.rac.kernel.TCSession...
使用想服务器上传jsp执行windows或者linux命令,来获取服务器的资源情况。 Commands with JSP.sjp
在本文中,我们将深入探讨一个名为"rt.commands"的Python库,该库的版本为0.1,已经封装在一个zip压缩包中。 "rt.commands-0.1.zip"是一个包含了"rt.commands"库的压缩文件,它可能是为了便于分发、存储和安装而...
这个压缩包提供了两个版本的安装程序,分别是Citrix.XenApp.Commands.Install_x64.msi和Citrix.XenApp.Commands.Install_x86.msi,分别适用于64位和32位的操作系统。 Citrix XenApp,原名为MetaFrame或Presentation...
《PyPI官网下载:datanommer.commands-0.3.0.tar.gz——探索Python库在分布式环境中的应用》 PyPI(Python Package Index),是Python开发者的重要资源库,它为全球的Python开发者提供了一个发布、查找和安装Python...
Juniper-commands-v2.xls
《PyPI官网下载:深入解析dodo_commands-0.10.3.tar.gz》 PyPI,Python Package Index,是Python开发者们分享和获取开源软件包的重要平台。在这个平台上,我们可以找到各种各样的Python库,方便地管理和使用。本文...
The shell is a fundamental tool in Linux, allowing users to interact with the operating system through commands. This chapter teaches readers how to: - **Understanding the Command Line**: Basic ...
createAccount=com.cloud.api.commands.CreateAccountCmd;3 deleteAccount=com.cloud.api.commands.DeleteAccountCmd;3 updateAccount=com.cloud.api.commands.UpdateAccountCmd;3 disableAccount=...
《PyPI官网下载:django-schedule-commands-2020.12.29.tar.gz》 在Python的世界里,PyPI(Python Package Index)是最重要的软件仓库,它为开发者提供了一个平台来发布和分享他们的开源Python项目。本文将深入探讨...
资源分类:Python库 所属语言:Python 资源全名:magic-commands-0.0.13.1.post2.tar.gz 资源来源:官方 安装方法:https://lanzao.blog.csdn.net/article/details/101784059
"Octave-Commands-源码.rar"是一个包含Octave命令源码的压缩文件,可能包含了Octave内置函数、扩展库或用户自定义函数的实现。尽管标签为空,我们可以从标题和描述中推测,这个压缩包的内容是关于Octave的命令源代码...
标题中的"PyPI 官网下载 | dodo_commands-0.14.3.tar.gz"表明这是一个在Python Package Index(PyPI)上发布的开源软件包,名为`dodo_commands`,版本为0.14.3,其打包格式是tar.gz。PyPI是Python开发者发布和分享...