[Solved-1 Solution] Pig programming to use split on group by having count(*) ?
What is group by ?
- The GROUP by operator is used to group the data in one or more relations. It collects the data having the same key.
What is count ?
- The COUNT() function of Pig Latin is used to get the number of elements in a bag. While counting the number of tuples in a bag, the COUNT() function ignores (will not count) the tuples having a NULL value in the FIRST FIELD.
Problem :
How to use split on group by operator in pig ?
Solution 1:
We can Group by item, get the count and then use filter on the count