本文主要是介绍sqoop从mysql导入数据到hive时tinyint(1)格式自动变成Boolean解决方案,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!
sqoop从mysql导入数据到hive时tinyint(1)格式自动变成Boolean解决方案
首先,来一段官网原文:
27.2.5. MySQL: Import of TINYINT(1) from MySQL behaves strangely
Problem: Sqoop is treating TINYINT(1) columns as booleans, which is for example causing issues with HIVE import. This is because by default the MySQL JDBC connector maps the TINYINT(1) to java.sql.Types.BIT, which Sqoop by default maps to Boolean.
Solution: A more clean solution is to force MySQL JDBC Connector to stop converting TINYINT(1) to java.sql.Types.BIT by adding tinyInt1isBit=false into your JDBC path (to create something like jdbc:mysql://localhost/test?tinyInt1isBit=false). Another solution would be to explicitly override the column mapping for the datatype TINYINT(1) column. For example, if the column name is foo, then pass the following option to Sqoop during import: --map-column-hive foo=tinyint. In the case of non-Hive imports to HDFS, use --map-column-java foo=integer.
附官网链接:https://sqoop.apache.org/docs/1.4.7/SqoopUserGuide.html
问题:
Mysql中存在tinyint(1)时,在数据导入到HDFS时,该字段默认会被转化为boolean数据类型,导致数据内容丢失(都变为NULL)。
解决方案:
方式1: 在jdbc的连接后面加上:tinyInt1isBit=false
–connect jdbc:mysql://192.168.9.80:3306/kgc_behivour_log?tinyInt1isBit=false
方式2:显式覆盖数据类型TINYINT(1)列的列映射。
例如,如果列名为foo,则在导入期间将以下选项传递给Sqoop:-- map-column-hive foo = tinyint
在非Hive导入HDFS的情况下,使用--map-column-java foo = integer
Shylin
这篇关于sqoop从mysql导入数据到hive时tinyint(1)格式自动变成Boolean解决方案的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!