<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic reading parquet file using python sdk in Dropbox API Support &amp; Feedback</title>
    <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/674917#M30433</link>
    <description>&lt;P&gt;Hi, I am trying to read a parquet file using pandas and vaex. I can sucessfully read a .csv but I get the following error message when I try to download the parquet file with dbx.files_download :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;dropbox.exceptions.ApiError: ApiError('4ce7ef4f93544a4fa18c29478e1869a8', DownloadError('path', LookupError('not_file', None)))&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Full code:&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;ACCESS_TOKEN = &lt;SPAN&gt;"My_Token"&lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;# Initialize the Dropbox API client&lt;BR /&gt;&lt;/SPAN&gt;dbx = dropbox.Dropbox(ACCESS_TOKEN)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# download csv file from dropbox&lt;BR /&gt;&lt;/SPAN&gt;metadata&lt;SPAN&gt;, &lt;/SPAN&gt;f_csv = dbx.files_download(&lt;SPAN&gt;'/County_test.csv'&lt;/SPAN&gt;)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# this works for csv and pandas&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(f_csv.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;    df = pd.read_csv(stream&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;index_col&lt;/SPAN&gt;=&lt;SPAN&gt;0&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(df)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# this works for csv and vaex&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(f_csv.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;    df = vaex.read_csv(stream&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;index_col&lt;/SPAN&gt;=&lt;SPAN&gt;0&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(df)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# download parquet file from dropbox FAILS&lt;BR /&gt;&lt;/SPAN&gt;metadata&lt;SPAN&gt;, &lt;/SPAN&gt;f_parquet = dbx.files_download(&lt;SPAN&gt;'/County_test.parquet'&lt;/SPAN&gt;)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# this part NOT tested yet&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(f_parquet.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;    df = pd.read_parquet(stream&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;index_col&lt;/SPAN&gt;=&lt;SPAN&gt;0&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(df)&lt;/PRE&gt;&lt;/DIV&gt;</description>
    <pubDate>Thu, 06 Apr 2023 16:37:44 GMT</pubDate>
    <dc:creator>notoriusjack</dc:creator>
    <dc:date>2023-04-06T16:37:44Z</dc:date>
    <item>
      <title>reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/674917#M30433</link>
      <description>&lt;P&gt;Hi, I am trying to read a parquet file using pandas and vaex. I can sucessfully read a .csv but I get the following error message when I try to download the parquet file with dbx.files_download :&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;dropbox.exceptions.ApiError: ApiError('4ce7ef4f93544a4fa18c29478e1869a8', DownloadError('path', LookupError('not_file', None)))&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Full code:&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;ACCESS_TOKEN = &lt;SPAN&gt;"My_Token"&lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;# Initialize the Dropbox API client&lt;BR /&gt;&lt;/SPAN&gt;dbx = dropbox.Dropbox(ACCESS_TOKEN)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# download csv file from dropbox&lt;BR /&gt;&lt;/SPAN&gt;metadata&lt;SPAN&gt;, &lt;/SPAN&gt;f_csv = dbx.files_download(&lt;SPAN&gt;'/County_test.csv'&lt;/SPAN&gt;)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# this works for csv and pandas&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(f_csv.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;    df = pd.read_csv(stream&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;index_col&lt;/SPAN&gt;=&lt;SPAN&gt;0&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(df)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# this works for csv and vaex&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(f_csv.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;    df = vaex.read_csv(stream&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;index_col&lt;/SPAN&gt;=&lt;SPAN&gt;0&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(df)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# download parquet file from dropbox FAILS&lt;BR /&gt;&lt;/SPAN&gt;metadata&lt;SPAN&gt;, &lt;/SPAN&gt;f_parquet = dbx.files_download(&lt;SPAN&gt;'/County_test.parquet'&lt;/SPAN&gt;)&lt;BR /&gt;&lt;BR /&gt;&lt;SPAN&gt;# this part NOT tested yet&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(f_parquet.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;    df = pd.read_parquet(stream&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;index_col&lt;/SPAN&gt;=&lt;SPAN&gt;0&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(df)&lt;/PRE&gt;&lt;/DIV&gt;</description>
      <pubDate>Thu, 06 Apr 2023 16:37:44 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/674917#M30433</guid>
      <dc:creator>notoriusjack</dc:creator>
      <dc:date>2023-04-06T16:37:44Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/674921#M30434</link>
      <description>&lt;P&gt;I see you're getting a 'not_file' &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/files.html#dropbox.files.LookupError" target="_blank" rel="noopener"&gt;error&lt;/A&gt;, which means: "We were expecting a file, but the given path refers to something that isn’t a file."&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;It looks like the ".parquet" is not a file, but rather a sort of folder, possibly referred to as a "package" or "bundle" in some environments.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;That being the case, to download that you would instead need to use &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_zip" target="_blank" rel="noopener"&gt;files_download_zip&lt;/A&gt; (or&amp;nbsp;&lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_zip_to_file" target="_blank"&gt;files_download_zip_to_file&lt;/A&gt;) and then unzip the downloaded zip file, or walk through the contents using &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_list_folder" target="_blank" rel="noopener"&gt;files_list_folder&lt;/A&gt;/&lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_list_folder_continue" target="_blank" rel="noopener"&gt;files_list_folder_continue&lt;/A&gt; and then download each individual nested item using &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download" target="_blank" rel="noopener"&gt;files_download&lt;/A&gt; (or &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_to_file" target="_blank" rel="noopener"&gt;files_download_to_file&lt;/A&gt;). The first option of using &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_zip" target="_blank" rel="noopener"&gt;files_download_zip&lt;/A&gt; is probably better/faster.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 13:18:43 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/674921#M30434</guid>
      <dc:creator>Greg-DB</dc:creator>
      <dc:date>2023-04-07T13:18:43Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675084#M30438</link>
      <description>&lt;P&gt;Thanks for replying. I have tried with one of your solutions but it's really slow.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;file_list = []&lt;BR /&gt;&lt;SPAN&gt;for &lt;/SPAN&gt;entry &lt;SPAN&gt;in &lt;/SPAN&gt;dbx.files_list_folder(&lt;SPAN&gt;path&lt;/SPAN&gt;=&lt;SPAN&gt;"/County.parquet"&lt;/SPAN&gt;).entries:&lt;BR /&gt;    &lt;SPAN&gt;print&lt;/SPAN&gt;(entry.path_lower)&lt;BR /&gt;    _&lt;SPAN&gt;, &lt;/SPAN&gt;dwnld_file = dbx.files_download(entry.path_lower)&lt;BR /&gt;    &lt;SPAN&gt;with &lt;/SPAN&gt;io.BytesIO(dwnld_file.content) &lt;SPAN&gt;as &lt;/SPAN&gt;stream:&lt;BR /&gt;        pd_df = pd.read_parquet(stream) &lt;SPAN&gt;#this works&lt;BR /&gt;&lt;/SPAN&gt;        vdf = vaex.from_pandas(pd_df)&lt;BR /&gt;        &lt;SPAN&gt;del &lt;/SPAN&gt;pd_df&lt;BR /&gt;        file_list.append(vdf)&lt;BR /&gt;        conc_df = vaex.concat(file_list)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(conc_df)&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;I have tried with dbx.files_download_zip but I can't find a way to read the data it returns 'utf-8' codec can't decode byte 0x82 in position 12: invalid start byte.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 12:59:09 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675084#M30438</guid>
      <dc:creator>notoriusjack</dc:creator>
      <dc:date>2023-04-07T12:59:09Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675089#M30441</link>
      <description>&lt;P&gt;Yes, using &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_list_folder" target="_blank" rel="noopener nofollow noreferrer"&gt;files_list_folder&lt;/A&gt;/&lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_list_folder_continue" target="_blank" rel="noopener nofollow noreferrer"&gt;files_list_folder_continue&lt;/A&gt; and &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download" target="_blank" rel="noopener nofollow noreferrer"&gt;files_download&lt;/A&gt; (or &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_to_file" target="_blank" rel="noopener nofollow noreferrer"&gt;files_download_to_file&lt;/A&gt;) requires more API calls so it would be less performant.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_zip" target="_blank"&gt;files_download_zip&lt;/A&gt; method would return the requested data the same way &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download" target="_blank"&gt;files_download&lt;/A&gt; would, except that it would be zip data that you would need to unzip to access the original folder. Also, note that you can use &lt;A href="https://dropbox-sdk-python.readthedocs.io/en/latest/api/dropbox.html#dropbox.dropbox_client.Dropbox.files_download_zip_to_file" target="_blank"&gt;files_download_zip_to_file&lt;/A&gt; if you want to save the zip data to a file. Can you share the code you're having trouble with for that, and indicate which line fails with that error?&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 13:36:25 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675089#M30441</guid>
      <dc:creator>Greg-DB</dc:creator>
      <dc:date>2023-04-07T13:36:25Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675098#M30443</link>
      <description>&lt;P&gt;When I try:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;md&lt;SPAN&gt;, &lt;/SPAN&gt;zipFile = dbx.files_download_zip(&lt;SPAN&gt;'/County_test.parquet'&lt;/SPAN&gt;)&lt;BR /&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;ZipFile(zipFile&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;'r'&lt;/SPAN&gt;) &lt;SPAN&gt;as &lt;/SPAN&gt;zip:&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;ZipFile(zipFile&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;'r'&lt;/SPAN&gt;) &lt;SPAN&gt;as &lt;/SPAN&gt;zip: fails with AttributeError: 'Response' object has no attribute 'seek'&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can't understand how to solve this&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 14:15:06 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675098#M30443</guid>
      <dc:creator>notoriusjack</dc:creator>
      <dc:date>2023-04-07T14:15:06Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675100#M30445</link>
      <description>&lt;P&gt;The files_download_zip method works like the files_download method, in that the second value it returns is the response object. To access the data from the response object, you would access the 'content' field like you did in your other code snippet. So, in this case, it would be 'zipFile.content'.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Beyond that, refer to the documentation for ZipFile, BytesIO, pandas, etc., for information on using those. Those aren't made by Dropbox so I can't offer support for those in particular.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 14:19:39 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675100#M30445</guid>
      <dc:creator>Greg-DB</dc:creator>
      <dc:date>2023-04-07T14:19:39Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675108#M30446</link>
      <description>&lt;P&gt;Thank you for your support, I just managed to do it with dbx.files_download_zip but it takes more or less the same time to process.&lt;/P&gt;&lt;P&gt;Do you know if pandas or vaex support reading the data directly from a file in Dropbox?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;md&lt;SPAN&gt;, &lt;/SPAN&gt;zipFile = dbx.files_download_zip(&lt;SPAN&gt;'/County.parquet'&lt;/SPAN&gt;)&lt;BR /&gt;file_list = []&lt;BR /&gt;&lt;SPAN&gt;with &lt;/SPAN&gt;ZipFile(io.BytesIO(zipFile.content)&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;'r'&lt;/SPAN&gt;) &lt;SPAN&gt;as &lt;/SPAN&gt;zip_ref:&lt;BR /&gt;    &lt;SPAN&gt;for &lt;/SPAN&gt;file &lt;SPAN&gt;in &lt;/SPAN&gt;zip_ref.infolist():&lt;BR /&gt;        &lt;SPAN&gt;if &lt;/SPAN&gt;file.filename.endswith(&lt;SPAN&gt;'.parquet'&lt;/SPAN&gt;):&lt;BR /&gt;            pd_df = pd.read_parquet(zip_ref.open(file.filename))  &lt;SPAN&gt;# this works&lt;BR /&gt;&lt;/SPAN&gt;            vdf = vaex.from_pandas(pd_df)&lt;BR /&gt;            &lt;SPAN&gt;del &lt;/SPAN&gt;pd_df&lt;BR /&gt;            file_list.append(vdf)&lt;BR /&gt;            conc_df = vaex.concat(file_list)&lt;BR /&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;(conc_df)&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 15:28:50 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675108#M30446</guid>
      <dc:creator>notoriusjack</dc:creator>
      <dc:date>2023-04-07T15:28:50Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675109#M30447</link>
      <description>&lt;P&gt;I can't offer support for pandas or vaex themselves as they are not made by Dropbox. I suggest referring to the documentation for those for information on their capabilities.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 15:30:50 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675109#M30447</guid>
      <dc:creator>Greg-DB</dc:creator>
      <dc:date>2023-04-07T15:30:50Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675136#M30450</link>
      <description>&lt;P&gt;I understand, and I found out it's possible. I put this here as it might help someone else.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You can use copy link (set the permissions as you like) and use the URL inside pandas.read_csv or pandas.read_parquet to read the dataset.&lt;/P&gt;&lt;P&gt;However the copy link will have a 'dl' parameter equal to 0, you have to change it to 1 to make it work. Example:&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;# this does not work&lt;BR /&gt;df = pd.read_parquet(&lt;SPAN&gt;'https://www.dropbox.com/s/somecode/part.0.parquet?dl=0'&lt;/SPAN&gt;)&lt;BR /&gt;&lt;BR /&gt;# this works&lt;BR /&gt;df = pd.read_parquet(&lt;SPAN&gt;'https://www.dropbox.com/s/somecode/part.0.parquet?dl=1'&lt;/SPAN&gt;)&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;Thank you again for helping out&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 16:53:55 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675136#M30450</guid>
      <dc:creator>notoriusjack</dc:creator>
      <dc:date>2023-04-07T16:53:55Z</dc:date>
    </item>
    <item>
      <title>Re: reading parquet file using python sdk</title>
      <link>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675140#M30451</link>
      <description>&lt;P&gt;Thanks for following up and sharing that. For reference, the &lt;A href="https://help.dropbox.com/share/force-download" target="_blank"&gt;URL parameters for shared links like that are documented here&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Apr 2023 16:58:25 GMT</pubDate>
      <guid>https://www.dropboxforum.com/t5/Dropbox-API-Support-Feedback/reading-parquet-file-using-python-sdk/m-p/675140#M30451</guid>
      <dc:creator>Greg-DB</dc:creator>
      <dc:date>2023-04-07T16:58:25Z</dc:date>
    </item>
  </channel>
</rss>

