Skip to content

Commit 0792be7

Browse files
authored
Merge pull request #7463 from MicrosoftDocs/JasonHQX-patch-7
Update export-data-lake-faq.yml
2 parents 420db24 + 8689e19 commit 0792be7

File tree

1 file changed

+10
-4
lines changed

1 file changed

+10
-4
lines changed

powerapps-docs/maker/data-platform/export-data-lake-faq.yml

Lines changed: 10 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -52,10 +52,16 @@ sections:
5252
- question: When should I use a yearly or monthly partition strategy?
5353
answer: |
5454
For Dataverse tables where data volume is high within a year, we recommend you use monthly partitions. Doing so results in smaller files and better performance. Additionally, if the rows in Dataverse tables are updated frequently, splitting into multiple smaller files help improve performance in the case of in-place update scenarios.
55-
- question: When do I use Append only mode for a historical view of changes?
55+
- question: What is append only mode and what is the difference between append only and in-place update mode?
56+
answer: |
57+
In append only mode, incremental data from Dataverse tables are appended to the corresponding file partition in the lake. For more information: [Advanced Configuration Options in Azure Synapse Link](azure-synapse-link-advanced-configuration.md)
58+
- question: When do I use append only mode for a historical view of changes?
5659
answer: |
5760
Append only mode is the recommended option for writing Dataverse table data to the lake, especially when the data volumes are high within a partition with frequently changing data. Again, this is a commonly used and highly recommended option for enterprise customers. Additionally, you can choose to use this mode for scenarios where the intent is to incrementally review changes from Dataverse and process the changes for ETL, AI, and ML scenarios. Append only mode provides a history of changes, instead of the latest change or in place update, and enables several time series from AI scenarios, such as prediction or forecasting analytics based on historical values.
58-
- question: Why do I see duplicated version numbers when I export data using Append only mode?
61+
- question: How do I retrieve the most up-to-date row of each record and exclude deleted rows when I export data in append only mode?
62+
answer: |
63+
In append only mode, you should identify the latest version of record with the same ID using **VersionNumber** and **SinkModifiedOn** then apply **isDeleted=0** on the latest version.
64+
- question: Why do I see duplicated version numbers when I export data using append only mode?
5965
answer: |
6066
For append only mode, if Azure Synapse Link for Dataverse does not get an acknowledgement from the Azure data lake that the data has been committed due to any reason such as network delays, Synapse Link will retry in those scenarios and commit the data again. The downstream consumption should be made resilient to this scenario by filtering data using SinkModifiedOn.
6167
- question: Which Dataverse tables are not supported for export?
@@ -67,9 +73,9 @@ sections:
6773
- question: Does Azure Synapse Link support calculated columns?
6874
answer: |
6975
Yes. In Dataverse, the calculated column keeps only the formula information and the real value depends on the base entity column so the calculated column will be updated if and only if all data (base tables) related to calculated columns are exported via Azure Synapse Link.
70-
- question: Which Dataverse tables use Append only by default?
76+
- question: Which Dataverse tables use append only mode by default?
7177
answer: |
72-
All tables that do not have a createdOn field will be synced using Append only mode by default. This includes relationship tables as well as the ActivityParty table.
78+
All tables that do not have a createdOn field will be synced using append only mode by default. This includes relationship tables as well as the ActivityParty table.
7379
- question: Why does Azure Synapse Link for Dataverse require all resources to be in the same region and what can I do about it?
7480
answer: |
7581
To ensure high performance and low latency in addition to preventing egress charges, Synapse Link requires all resources to be located in the same region. If you have a cross-region scenario, you can:

0 commit comments

Comments
 (0)