#data-engineering
Read more stories on Hashnode
Articles with this tag
Use cases Suitable if the pipeline runs infrequently. Assume the scenario if files are loaded in same directory everyday. If not used processed and...
Relational Database Management Systems (RDBMS) works based on the ACID properties. Letβs dive into each of these properties: Transaction refers to a...
smaller size Acid properties (insert update delete) Columnar storage for fast retrieval Tables used select * from orc_table; -- stored as ORC...
In Hive, partitioning and bucketing are two techniques used for organizing and optimizing data storage and querying. Hive Partition with bucket...