| Modifier and Type | Class and Description |
|---|---|
(package private) static class |
SimpleProfiler.Run
A run of the profiler.
|
(package private) static class |
SimpleProfiler.Space
Work space for a particular combination of columns.
|
Profiler.Column, Profiler.Distribution, Profiler.FunctionalDependency, Profiler.Profile, Profiler.RowCount, Profiler.Statistic, Profiler.Unique| Modifier and Type | Field and Description |
|---|---|
private static com.google.common.base.Function<List<Comparable>,Comparable> |
ONLY |
| Constructor and Description |
|---|
SimpleProfiler() |
| Modifier and Type | Method and Description |
|---|---|
Profiler.Profile |
profile(Iterable<List<Comparable>> rows,
List<Profiler.Column> columns,
Collection<ImmutableBitSet> initialGroups)
Creates a profile of a data set.
|
static double |
surprise(double expected,
double actual)
Returns a measure of how much an actual value differs from expected.
|
private static final com.google.common.base.Function<List<Comparable>,Comparable> ONLY
public Profiler.Profile profile(Iterable<List<Comparable>> rows, List<Profiler.Column> columns, Collection<ImmutableBitSet> initialGroups)
Profilerprofile in interface Profilerrows - List of rows. Can be iterated over more than once (maybe not
cheaply)columns - Column definitionsinitialGroups - List of combinations of columns that should be
profiled early, because they may be interestingpublic static double surprise(double expected,
double actual)
abs(expected - actual) / (expected + actual).
Examples:
expected - Expected valueactual - Actual valueCopyright © 2012–2018 The Apache Software Foundation. All rights reserved.