Online Python & Kafka Workshop im Vorlesungsstil
In einem Workshop Format, stundenweise verteilt über mehrere Wochen im Vorlesungsstil, gibt es ab 8.2.2021 ein super spannendes Online Training zum Thema Python & Kafka bei der Trivadis. Für alle BI-ler und Entwickler im DWH- und Big-Data Umfeld, aber auch Datenanalysten und -architekten, ist dieser…
Apache Kafka installation on Linux
In the following post, I will show how to install Apache Kafka on a Linux VM. Apache Kafka is an open-source stream processing platform developed by the Apache Software Foundation written in Scala and Java. First of all we need a Linux VM (e.g Centos 7) with…
Hadoop HDFS maximum number of files
Just a short blog about HDFS directory file and memory limits. I’ve recently faced an issue with Hive with the following error message org.apache.hadoop.ipc.RemoteException: The directory item limit of /tmp/hive/hive is exceeded: limit=1048576 items=1048576 The parameter which caused the error above is called dfs.namenode.fs-limits.max-directory-items dfs.namenode.fs-limits.max-directory-items…
Integrate haproxy service in MapR Control System
If you would like to create your own MapR Service under the control of warden here’s an example based on a haproxy configuration how to achieve this. The method should work for any other service as well create new role $ touch /opt/mapr/roles/haproxy create…