is there a way to have a tmp directory on the server?

Answered

Dutch Smoushond posted this in #help-forum

Dutch SmoushondOP

2024-03-18T23:28:34.000Z

Answered by Ray

we could use /tmp on vercel for temporary storage
https://github.com/vercel/vercel/discussions/5320#discussioncomment-110775

75 Replies

Dutch SmoushondOP

2024-03-19T15:58:10.000Z

[OfficeParser]: Error: ENOENT: no such file or directory, mkdir 'officeParserTemp/tempfiles'

Trying to use officeparser through langchain

Toyger

2024-03-19T16:07:24.000Z

you already have a tmp directory, it's the Linux /tmp folder
so if you want to create something, create it inside that, like
/tmp/officeParserTemp/tempfiles
but keep in mind that "temporary" may mean only for as long as the request is running: Vercel runs on ephemeral instances, so by the next invocation this folder may already have been deleted.
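A minimal sketch of creating that folder from Node, assuming the function runs on Linux where `os.tmpdir()` normally resolves to `/tmp`. The helper name `ensureTempDir` is made up for illustration. Because the instance is ephemeral, the folder is (re)created on every call instead of being assumed to exist.

```typescript
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

// Hypothetical helper: creates the directory under the OS temp dir
// if it is missing and returns its absolute path.
function ensureTempDir(...segments: string[]): string {
  const dir = path.join(os.tmpdir(), ...segments);
  // `recursive: true` creates missing parents and does not throw
  // when the directory already exists.
  fs.mkdirSync(dir, { recursive: true });
  return dir;
}

// On a typical Linux host this is /tmp/officeParserTemp/tempfiles.
const tempDir = ensureTempDir("officeParserTemp", "tempfiles");
console.log(tempDir);
```

Note that this only helps if the library writes to that absolute path. The error above shows a relative path, which is resolved against the current working directory, not against `/tmp`.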

Dutch SmoushondOP

2024-03-19T16:09:50.000Z

Thank you for the response. So what do you think would work in this instance? I'm using the package https://js.langchain.com/docs/integrations/document_loaders/file_loaders/pptx which I believe is built on top of this package https://github.com/harshankur/officeParser#readme which states the need for the tmp folder

Toyger

2024-03-19T16:20:55.000Z

it probably is using it, but they didn't expose a temp folder option. You can either use your own implementation that exposes it, or ask in the langchain issues whether they can add an option to customize the temp folder location

American Crow

2024-03-19T16:21:43.000Z

i just read over the docs (not carefully). Are you sure you need a temp folder? I don't see that part. Can you not simply read a file from public or whatever?

Dutch SmoushondOP

2024-03-19T16:22:47.000Z

I could maybe just use officeparser directly
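A rough sketch of what calling the parser directly could look like. It assumes the installed officeparser version exports `parseOfficeAsync(file, config)` and accepts a `tempFilesLocation` config field; check both against the README of the version you have, since older releases may differ. The parser is passed in as an argument so the sketch does not depend on the package being installed, and `extractOfficeText` is a made-up name.

```typescript
import * as os from "node:os";

// ASSUMPTION: modelled on officeparser's `parseOfficeAsync(file, config)`.
// Verify the signature and the `tempFilesLocation` option for your version.
type ParseOffice = (
  file: Buffer,
  config: { tempFilesLocation: string },
) => Promise<string>;

// Hypothetical wrapper: asks the parser to put its temp files under the
// OS temp dir (normally /tmp on Linux) instead of the working directory.
async function extractOfficeText(
  raw: Buffer,
  parseOffice: ParseOffice,
): Promise<string> {
  return parseOffice(raw, { tempFilesLocation: os.tmpdir() });
}
```

With the real package this would presumably be called as `extractOfficeText(buffer, parseOfficeAsync)` after importing `parseOfficeAsync` from `officeparser`.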

Dutch SmoushondOP

2024-03-19T16:23:54.000Z

This is in the officeparser github

Is AWS S3 a solution? I've read that some have used it

Dutch SmoushondOP

2024-03-19T16:26:32.000Z

Also, if you dig into the langchain node_modules, you'll find this doc comment:

A method that takes a `raw` buffer and `metadata` as parameters and returns a promise that resolves to an array of `Document` instances. It uses the `parseOfficeAsync` function from the `officeparser` module to extract the raw text content from the buffer. If the extracted powerpoint content is empty, it returns an empty array. Otherwise, it creates a new `Document` instance with the extracted powerpoint content and the provided metadata, and returns it as an array.
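
The behaviour that comment describes can be sketched roughly as below. This is not LangChain's actual source: `Doc`, `TextExtractor`, and `parseToDocs` are stand-in names for illustration, and the text extractor is injected so the snippet runs without `officeparser`.

```typescript
// Stand-in types for illustration only; NOT LangChain's real classes.
type Doc = { pageContent: string; metadata: Record<string, unknown> };
type TextExtractor = (raw: Buffer) => Promise<string>;

// Mirrors the described flow: extract text from the buffer, return an
// empty array if nothing came back, otherwise wrap it in one document.
async function parseToDocs(
  raw: Buffer,
  metadata: Record<string, unknown>,
  extract: TextExtractor,
): Promise<Doc[]> {
  const text = await extract(raw);
  if (!text) return [];
  return [{ pageContent: text, metadata }];
}
```

The point for this thread is that the loader hands the buffer straight to the parser, so any temp-file handling happens inside the parser and is not configurable from the loader call as described.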

American Crow

2024-03-19T16:26:54.000Z

you're right Duke, I found the issue:
https://github.com/langchain-ai/langchainjs/issues/4000

Dutch SmoushondOP

2024-03-19T16:28:30.000Z

The pdf loader works fine though. Is it writing to tmp without a problem? I never knew that

Dutch SmoushondOP

2024-03-19T16:38:49.000Z

Unfortunate that they still never addressed this

American Crow

2024-03-19T16:42:40.000Z

yea sorry can't really help