Posts

Showing posts from May, 2020

Dynamic Partition Settings in Hive

SET hive.exec.dynamic.partition=true; SET hive.exec.dynamic.partition.mode=nonstrict;

Overview : About Kafka

-           What is Kafka - Kafka is a cluster. -           What is Cluster - It’s a group of system -           What all are the component inside Kafka - Kafka Broker and Zookeeper -           What is Zookeeper - Zookeeper is used to balance the load. Ex : If primary node server is down and it will automatically move it into secondary node… So, the application will not get impact. -           What is Kafka Broker - It will help to pass the message (data) into consumer. Overall flow is: Multiple producer will feed the data into Kafka cluster and multiple consumer will get the data from Kafka and will move it into data center. The data will be moved based on below identity. 1.        Producer + Topic + Offset ...

View table Stats Status in Teradata

HELP STATISTICS <tablename>; 

Index creation and Table level of AMP size in Teradata

How to create a table using unique primary index: CREATE MULTISET TABLE [databasename].[tablename] (                                       col_1    INTEGER,   col_2    VARCHAR(50) ) UNIQUE PRIMARY INDEX ( col_1 ); How to create a table using primary index: CREATE MULTISET TABLE [databasename].[tablename] (                                      col_1    INTEGER,   col_2    VARCHAR(50) ) PRIMARY INDEX ( col_1 ); Example 3: Checking data distribution of given columns on the AMP SELECT HASHAMP(HASHBUCKET(HASHROW([columnlist]))),COUNT(*) FROM [databasename].[t...

Hadoop Commands

hadoop archive The  hadoop archive  command creates a Hadoop archive, a file that contains other files. A Hadoop archive always has a  *.har  extension. hadoop classpath The  hadoop classpath  command prints the class path needed to access the Hadoop jar and the required libraries. hadoop daemonlog The  hadoop daemonlog  command gets and sets the log level for each daemon. hadoop distcp The  hadoop distcp  command is a tool used for large inter- and intra-cluster copying. hadoop fs The  hadoop fs  command runs a generic file system user client that interacts with the MapR file system. hadoop jar The  hadoop jar  command runs a program contained in a JAR file. Users can bundle their MapReduce code in a JAR file and execute it using this command. hadoop job The  hadoop job  command enables you to manage MapReduce jobs. hadoop mfs The  hadoop mfs  command displays directory inf...

Yarn Commands

  yarn applicationattempt   Syntax yarn application    [-list [<-appStates States>] [<-appTypes Types>] ]    [-status ApplicationId]    [-kill ApplicationId] ΓΌ   yarn cluster       The  yarn classpath  command prints the class path needed to access the Hadoop jar and the required libraries. Syntax yarn classpath yarn daemonlog Gets or sets the log level for each daemon. Syntax yarn daemonlog [-getlevel <host:port> <name>] | [-setlevel <host:port> <name> <level>] Parameters Parameter Description -getlevel <host:port> <name> Prints the log level of the daemon running at <host:port>. This command internally connects to  http://<host:port>/logLevel?log=<name> . -setlevel <host:port> <name> <level> Sets the log level of the daemon running...

Finding Index Details in Teradata

Finding Index Details in Teradata SELECT   IND.DatabaseName,         CASE TAB.TableKind              WHEN 'I' THEN 'Join index'              WHEN 'N' THEN 'Hash index'              WHEN 'T' THEN 'Table'              WHEN 'V' THEN 'View'              WHEN 'O' THEN 'NoPI Table'              WHEN 'Q' THEN 'Queue table'         END AS ObjectType,         IND.TableName AS ObjectName,         IND.IndexNumber,        ...