Hi All, Welcome back! This is something that I follow, and has become a handy practice for me while using HDFS. I will keep on updating this post with whatever becomes a good practice for me! HDFS Alias Linux provides this “alias”, which can be used to replace the entire command by a word. HDFS […]Read more "HDFS Best Practices"
After working with BigData for around 3+ years, I have come across a plethora of projects. Esp the Apache movement is very strong for the BigData domain. Each one coming up with a project which addresses to a particular solution. But, with so many projects coming up, I did not find a single place where […]Read more "Apache BigData Project Catalogue"
How to import Apache Arrow’s Java code in Eclipse Hello BigData Gurus, I recently came across Apache Arrow. For me, it seems to be a fantastic thing. There are so many formats out there, this really needed a standardization. And that is what Apache Arrow brings on to the table. Its […]Read more "Apache Arrow: Import Java code in Eclipse"
Kafka v0.11 Distributed Cluster Config Hi All, Welcome back to the bigdatagurus blog. Since I delved into the big data domain, I have been working with Kafka every now n then. Kafka is one of the best solutions to decouple your components. So, instead of components talking within themselves, kafka comes in as a broker […]Read more "Kafka – Which knobs to turn?"
All the important changes a java developer must knowRead more "Java 9 – Important Changes Every Java Developer Must Know"
Hi Java cum BigData Gurus, Its been some time for me to post something here. Thanks for liking and commenting on my post about Spark cluster setup. Today, we will look into executing a Spark Java WordCount example using maven. To execute the code, you will need eclipse, and the code. Code is available on […]Read more "SparkSession Example, using Java"
Friends, People think Splunk as a SIEM, but its not just SIEM. not just for logs. I think it can be used for anything that is machine generated, use it instead of R, Spark or python or anything. Yaa, it is costly, but if you already have it, you can use it for much much […]Read more "An encounter with Splunk : Boot up"