0.1.2
Enhanced UI/UX in inventory and impact pages, lineage graph improvements, and new impact chart features.
What's Changed
Frontend Service
New Features and Enhancements:
Inventory Table Update: Minor update to the inventory table columns for improved data representation and user experience.
Grouped Bar Chart on Impact Page: Implemented a new grouped bar chart feature on the impact page, replacing the previous stacked bar chart for a clearer visualization of data.
Impact Page Developments: Integration of the impact API and inclusion of impact chart data from Redux for enhanced data handling and visualization.
Lineage Graph Enhancements: Significant improvements to the lineage graph, including:
Highlighting the full path on node click for better focus and understanding.
Custom styling for children and root nodes.
Addition of child nodes inside the parent node using ReactFlow subflow and dagre layout.
Workspace filter addition and default selection of the first workspace on the lineage graph page.
Customization of lineage node handles to match dagre layout.
UI/UX Improvements:
Column Width Adjustment: Fixed the column width on the issueTracking table for better readability.
Clock Icon Addition: Added a clock icon to the impact card for a more intuitive representation of time-related metrics.
Potential Dev Hours Savings: Included potential development hours savings on the impact tab, providing a clearer indication of efficiency gains.
Licensing Table Modifications: Made specific changes to the licensing table, including separate columns lists for PowerBI and Tableau.
Operational and Infrastructure Changes:
Enhanced Hover Functionality: Updated the hover-over mechanism on metric cards to reduce accidental triggers.
Workspace Filter and Input Field Enhancements: Improved workspace filter functionality and focus management on the lineage graph window.
Gateway Service
New Features and Enhancements:
Inventory Page Update: Minor update to inventory page columns for better data representation and user experience.
Recommendation Configuration Enhancement: Added
asset_id
,asset_name
, andasset_type
in Recommendation Config to provide more detailed recommendations.Snowflake Proxy Improvement: Updated the Snowflake proxy to deserialize JSON properly from the VARIANT column, enhancing data handling and accuracy.
Usage Impact Chart Development: Introduced a method for
get_usage_impact_chart
in the snowflake_bi_proxy, along with a refactor of the query to match the latest design. This feature aims to provide better insights into usage impact through comprehensive charting.
Bug Fixes and Refinements:
Bot User Creation Bug Fix: Resolved an issue causing new bot users not to be created on fresh builds due to a missing
account_id
foreign key in theUserOrmModel
.SQL Query Refinement for Impact Charts: Updated and cleansed the SQL query used to retrieve data for impact charts, ensuring more accurate and relevant information.
Operational and Infrastructure Changes:
Data Model Naming Correction: Fixed the naming of the database model used to gather data for usage impact charts.
API Addition for Impact Charts: Added a new API to return impact charts data, enhancing the ability to visualize usage impact effectively.
Column Additions for Inventory Page: Added
dev_usage_ratio
and link columns to the inventory page, providing more detailed and relevant information for inventory analysis.Engineering vs. Usage Hours Analysis: Introduced analysis for Engineering versus Usage Hours, enabling deeper insights into engineering efforts versus actual usage.
Diagnostics Service
New Features and Enhancements:
Enhanced Dataset Embedding: Updated
ml_app
to embed sentences of datasets instead of JSON itself, improving data matching accuracy.Duplicate Dataset Detection: Implemented a new approach for duplicate dataset detection using embeddings, which improves performance and accuracy. This includes changes in processing datasets for both Tableau and PowerBI.
Airflow Configuration Optimization: Aligned with Airflow production recommendations, now using LocalExecutor on a single VM configuration.
Improved Similarity Metrics: Updated precision on similarity scores to 3 significant figures and relaxed duplication rules for a more accurate detection of duplicate tables.
Enhanced Dataset Tree Maps: Added new types, similarity scores, and children to the top level of
dim_dataset_treemap
, enabling better visualization and understanding of dataset structures.Performance Enhancements: Numerous optimizations and refactors to improve the efficiency and speed of various processes, especially in similarity detection and embedding calculations.
Tableau and PowerBI Integration Improvements: Numerous updates and fixes to enhance integration with Tableau and PowerBI, including better handling of dataset similarities and license summaries.
Embedding Model Enhancements: Transitioned to a smaller FastText model for reduced memory footprint and updated Docker compose files for better service management.
CI/CD Pipeline Updates: Added
ml_app
to the CI/CD pipeline and updated Docker image names for consistency.
Bug Fixes and Refinements:
Syntax and Logic Fixes: Addressed various issues including fixing Snowflake syntax, resolving ambiguous alias names, and correcting incremental logic errors.
UI and UX Improvements: Adjustments to sorting and filtering logic in various models for a more intuitive user experience.
Data Handling Corrections: Fixed issues related to data embedding, including removing newline characters in sentences for embedding generation and adjusting data types and structures for better processing.
Last updated