Note: Inside the Hadoop API we only have access to Configuration
, not to
org.apache.spark.SparkContext, so we cannot use broadcasts to convey
the actual filter predicate.
Try to find the given expression in the tree of filters in order to determine whether it is safe to remove it from the higher level filters.
Try to find the given expression in the tree of filters in order to determine whether it is safe to remove it from the higher level filters. Note that strictly speaking we could stop the search whenever an expression is found that contains this expression as subexpression (e.g., when searching for "a" and "(a or c)" is found) but we don't care about optimizations here since the filter tree is assumed to be small.
The org.apache.spark.sql.parquet.CatalystFilter to expand and search
The expression to look for
An optional org.apache.spark.sql.parquet.CatalystFilter that contains the expression.
Note: Inside the Hadoop API we only have access to Configuration
, not to
org.apache.spark.SparkContext, so we cannot use broadcasts to convey
the actual filter predicate.