Hi All, In one of the use-cases, we needed only English data from twitter. And we wanted to use flume. Flume by default, gets data in all languages from twitter. And we are supposed to filter it for language, which may be complicated. Instead, why not filter the messages at Twitter side itself. Makes sense, […]Read more "How to start modifying twitter-flume code"
Hi BigData Gurus, it has been a long time since my last post. But the time period was interesting. Got my hands on a lot of machine learning and AI stuff. It is something very very interesting. More blogs will come this way as soon as I get some good stuff ready in Machine Learning / AI domains. Till then, stay tuned. But […]Read more "Quartz Clustering – A quick primer"
Hi All, Welcome back! This is something that I follow, and has become a handy practice for me while using HDFS. I will keep on updating this post with whatever becomes a good practice for me! HDFS Alias Linux provides this “alias”, which can be used to replace the entire command by a word. HDFS […]Read more "HDFS Best Practices"
After working with BigData for around 3+ years, I have come across a plethora of projects. Esp the Apache movement is very strong for the BigData domain. Each one coming up with a project which addresses to a particular solution. But, with so many projects coming up, I did not find a single place where […]Read more "Apache BigData Project Catalogue"
How to import Apache Arrow’s Java code in Eclipse Hello BigData Gurus, I recently came across Apache Arrow. For me, it seems to be a fantastic thing. There are so many formats out there, this really needed a standardization. And that is what Apache Arrow brings on to the table. Its […]Read more "Apache Arrow: Import Java code in Eclipse"
Kafka v0.11 Distributed Cluster Config Hi All, Welcome back to the bigdatagurus blog. Since I delved into the big data domain, I have been working with Kafka every now n then. Kafka is one of the best solutions to decouple your components. So, instead of components talking within themselves, kafka comes in as a broker […]Read more "Kafka – Which knobs to turn?"
All the important changes a java developer must knowRead more "Java 9 – Important Changes Every Java Developer Must Know"