================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 17.0.6+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back map Long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                8913           9044         185         11.2          89.1       1.0X
DataFrame                                          1631           1650          27         61.3          16.3       5.5X
Dataset                                            2575           2608          47         38.8          25.7       3.5X

OpenJDK 64-Bit Server VM 17.0.6+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                               10597          10599           3          9.4         106.0       1.0X
DataFrame                                          3872           3893          29         25.8          38.7       2.7X
Dataset                                           11548          11571          33          8.7         115.5       0.9X

OpenJDK 64-Bit Server VM 17.0.6+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2617           2648          43         38.2          26.2       1.0X
DataFrame                                           990           1004          19        101.0           9.9       2.6X
Dataset                                            2380           2400          30         42.0          23.8       1.1X

OpenJDK 64-Bit Server VM 17.0.6+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2852           2862          14         35.1          28.5       1.0X
DataFrame                                           147            183          19        680.1           1.5      19.4X
Dataset                                            3347           3413          94         29.9          33.5       0.9X

OpenJDK 64-Bit Server VM 17.0.6+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            2823           2830          11         35.4          28.2       1.0X
DataFrame sum                                        61             78          10       1631.2           0.6      46.0X
Dataset sum using Aggregator                       2935           2943          12         34.1          29.3       1.0X
Dataset complex Aggregator                         6909           6966          81         14.5          69.1       0.4X


