================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                5736           5828         129         17.4          57.4       1.0X
DataFrame                                          1164           1168           6         85.9          11.6       4.9X
Dataset                                            1604           1701         137         62.4          16.0       3.6X

OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                6867           6977         155         14.6          68.7       1.0X
DataFrame                                          2745           2782          52         36.4          27.5       2.5X
Dataset                                            6475           6615         198         15.4          64.7       1.1X

OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4072           4132          84         24.6          40.7       1.0X
DataFrame                                           683            696          12        146.4           6.8       6.0X
Dataset                                            1724           1763          55         58.0          17.2       2.4X

OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2035           2090          78         49.1          20.3       1.0X
DataFrame                                           108            120          11        927.7           1.1      18.9X
Dataset                                            2467           2477          14         40.5          24.7       0.8X

OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1402           1422          29         71.3          14.0       1.0X
DataFrame sum                                        67             78           9       1499.3           0.7      21.0X
Dataset sum using Aggregator                       2099           2104           7         47.6          21.0       0.7X
Dataset complex Aggregator                         4956           5026          99         20.2          49.6       0.3X


