`
saint13
  • 浏览: 39931 次
  • 来自: ...
文章分类
社区版块
存档分类
最新评论

Input Data Source中的role

 
阅读更多

Help中的内容如下:
Choose the data Role from the drop-list. The role determines how the data set is used throughout the process flow. The role may be any of the following:

  • RAW - used as raw input to the node (default).
  • TRAIN - used to fit initial models.
  • VALIDATE - used by default for model assessment. The VALIDATE data set is also used for fine tuning the model. Note:   The Decision Tree and Neural Network nodes have the capacity of over fitting the TRAIN data set. To prevent these nodes from over fitting the TRAIN data set, the VALIDATE data set is automatically used to retreat to a simpler fit than the fit based on the TRAIN data alone. The VALIDATE data set can also be used by the Regression node for fine tuning stepwise regression models.   
  • TEST - additional "hold out" data set that you can use for model assessment.
  • SCORE - used to score a new data set that may not contain the target.
If you do not have training, validation, and test data sets, then you can create them with a successor Data Partition node.
******************************************8

但是这些role的用法究竟如何?如何影响后续的节点?

 



分享到:
评论

相关推荐

    bootstrap4整合tagsinput和typeahead 的静态代码,保证可用

    这里 `data-role="tagsinput"` 是 Tagsinput 的触发器,它将把普通的文本输入转换为标签输入。 然后,为了实现 Typeahead 功能,我们需要配置数据源。数据源可以是静态数组,也可以是动态获取的数据。例如,如果...

    Professional C# 3rd Edition

    Updating the Data Source 763 Building a Schema 764 Other Common Requirements 770 Summary 778 Chapter 23: Manipulating XML 781 XML Standards Support in .NET 782 Introducing the System.Xml Namespace 782...

    C Programming

    - **Data Processing**: This involves the transformation of raw data into meaningful information through various steps such as input, processing, output, and storage. #### Programming Languages This ...

    基于python的开放领域事件抽取系统源码数据库论文.doc

    The system's backend design primarily relies on Python, an open-source language, ensuring flexibility for future upgrades and problem-solving. Employing a B/S (Browser/Server) architecture for the ...

    USB Complete 3rdEdition

    Device Endpoints: the Source and Sink of Data 38 Pipes: Connecting Endpoints to the Host 40 Types of Transfers 40 Stream and Message Pipes 42 Initiating a Transfer 43 Transactions: the Building Blocks...

    最全的oracle常用命令大全.txt

    使用INPUT命令可以在SQL缓冲区中增加一行或多行 SQL>i SQL>输入内容 e、调用外部系统编辑器 SQL>edit 文件名 可以使用DEFINE命令设置系统变量EDITOR来改变文本编辑器的类型,在login.sql文件中定义如下一行 DEFINE_...

    微软内部资料-SQL性能优化5

    The only source of any storage location information is the sysindexes table, which keeps track of the address of the root page for every index, and the first IAM page for the index or table....

    基于ssm+mysql的在线订花系统源码数据库论文.docx

    MySQL, a popular open-source relational database management system, is utilized for storing and managing data related to flowers, orders, users, and other essential entities in the system. It ensures ...

    TMS320C6000 Assembly Language Tools User Guider

    1. **Assembler Overview**: Provides an overview of the assembler’s role in the software development flow and its usage. 2. **Invoking the Assembler**: Explains how to invoke the assembler through ...

    Problem Solving with C++ (7th edition)

    - **Virtual Functions**: Discussion of virtual functions and their role in achieving polymorphism through inheritance. **Programming Project 15.12**: This project likely involves designing and ...

    Pro PHP Security(Pro)

    5. **Securing Sessions**: Session hijacking can compromise user data and disrupt operations. The book discusses techniques for managing sessions securely, including session fixation prevention and ...

    Itanium Architecture For Programmers

    Comparing Variants of a Source File Section 3.4. Assembler Statement Types Section 3.5. The Functions of a Symbolic Assembler Section 3.6. The Assembly Process Section 3.7. The Linking ...

    Address Book with Multiple Field Search(个人信息管理系统)

    Soft copy: Write up, test results data, and source code. Be ready to demonstrate the program. What you can share with each other Data files etc. Sample Program Run AddressBook mylist.csv Welcome to ...

    cucu-a compiler you can understand

    In the realm of computer science and programming, compilers play a crucial role in translating human-readable source code into machine-executable binary code. CUCU, as described in the given document,...

    数位板压力测试

    The “(1.1)” notation also marks the definitions of new functions, messages, and data structures. The nota-tion “1.1:” marks new text or commentaries explaining new functionality added to existing ...

    The.Fortran.2003.Handbook

    - **Fortran 77 (1978):** Standardized the language and added several new features, such as free-form source input and the `INTEGER` type. - **Fortran 90 (1991):** A major revision that included ...

    Yii.Application.Development.Cookbook.3rd.Edition

    Learn how to keep your application secure according to the general web application security principle "filter input, escape output. Write RESTfull Web Services using Yii2 and built-in features ...

    scipylectures

    - **Introduction:** Overview of the Matplotlib library and its role in data visualization. - **Simple Plot:** Creating basic plots with lines, markers, and labels. - **Figures, Subplots, Axes, and ...

    数据库操作处理笔记

    create tablespace tablespace_name logging datafile '/XXX/xxx/datafile_name1.ora' size 4095M; ``` - 临时表空间:主要用于排序操作或临时对象的存储。 ```sql create temporary tablespace tmp_tablespace...

Global site tag (gtag.js) - Google Analytics