Release Notes
Last updated
Last updated
Support for window functions
Better support for databricks
Detecting resource configuraiton better in configuration tab
Bug and UI fixes
DataFusion Comet support
bug fixes
Nvidia RAPIDS for Spark support
bug fixes
Bug Fixes
New visibility & alerts - Driver's memory
Updated README
Bug Fixes
Driver memory monitoring & alert
Updated readme
Bug fixes
New alert - Large data Broadcast, for requesting to broadcast large data sets with the broadcast() function
New alert - Large filter conditions, for wiring long filter conditions instead of using join logic
UI Improvements
Support spark versions 2.4 logs in history server with version later than 3.2 Limited feature-set is available due to events having less data than spark 3.0 and up
Better Databricks stage to node support
Support spark.dataflint.runId in custom history server providers when appId is not the spark appId
Better support for Databricks Photon plans
Input nodes shows partitions filters and push down filters
Stage Breakdown - press the blue down arrow on sql node to see stage information
Apache Iceberg alerts improvements
Add avg file size in read/write
More information when hovering on stage
Apache Iceberg support
Better node naming
Read metrics and reading small files alerts
Write metrics and overwriting most of table alerts
Require enabling iceberg metric reporter, can be done for you by enabling spark.dataflint.iceberg.autoCatalogDiscovery to true, or setting the iceberg metric reporter manually for each catalog, for example:
Add support for history server with cluster-mode jobs (i.e. with attempt numbet)
Fix "wasted cores" calculation
Fix status tab SQL is flickering when there is SQL with sub queriers
Fix scala 2.13 support
DataFlint SaaS support
partition Skew Alert:
Scala 2.13 support
A spark flag to disable web app mixpanel telemetries - spark.dataflint.telemetry.enabled
(true/false)
Renamed Core Activity Rate to Wasted Cores Ratio (which is 100 - Core Activity Rate), and added an alert for wasted cores too high
Resources tab - see a graph of your cluster executors count over time, use it to tune your resource allocation settings and save cost!
Minor visual fixes
DataFlint Resource Tab:
Small fix to platform identification
Databricks support
Visual improvements
public release
Heat map
Flint Assistant, require OpenAI Key
Minor fix relates to spark operator and nginx
IO only, shows only input, joins and output:
Basic mode (default), shows also transformations like filters, aggregations and selects:
Also there is plans informations for:
Joins
Sorts
Selects
Repartitions
Support both http and https access with enabling mix-content only on https mode
Support for spark 3.5.X
Initial version, includes:
Status page
Summary page
Configuration Page
Alerts page
New alert - large number of small tasks (see )