Hacker News | SimFG's comments

Thanks to Shivansh for his article.


Lior's tweet: https://twitter.com/AlphaSignalAI/status/1677348799801425920

You can cut your GPT API expenses by 50% through caching using LangChain and GPTCache.

You will also benefit from significantly faster responses and relief from API rate limits.
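As a toy sketch of the idea (not the actual GPTCache or LangChain API), exact-match caching alone already avoids paying for repeated prompts; `call_llm` below is a hypothetical stand-in for a billed API call:

```python
import functools

# Hypothetical stand-in for a paid LLM API call; the counter makes
# the saving from caching visible.
api_calls = 0

def call_llm(prompt: str) -> str:
    global api_calls
    api_calls += 1
    return f"answer to: {prompt}"

@functools.lru_cache(maxsize=1024)
def cached_llm(prompt: str) -> str:
    # Identical prompts are served from memory, skipping the API entirely.
    return call_llm(prompt)

cached_llm("What is GPTCache?")
cached_llm("What is GPTCache?")  # cache hit: no second API call
```

Real deployments hash or embed the prompt instead of using it directly as a dict key, but the cost model is the same: every hit is one API call you don't pay for.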


Maybe [GPTCache](https://github.com/zilliztech/GPTCache) can make it more attractive: similar questions can be answered at lower cost and with faster responses. Of course, the specific configuration needs to be tuned to real usage scenarios.
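The key idea behind matching "similar" rather than identical questions can be sketched with stdlib tools alone; here `difflib` stands in for the embedding-based similarity a real semantic cache like GPTCache would use, and the threshold value is an assumption:

```python
import difflib

class SimilarityCache:
    """Toy semantic cache: returns a stored answer when a new
    question is sufficiently similar to a cached one."""

    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold
        self.entries: list[tuple[str, str]] = []  # (question, answer)

    def get(self, question: str):
        # Linear scan with string similarity; a real cache would use
        # vector embeddings and an approximate nearest-neighbor index.
        for cached_q, answer in self.entries:
            ratio = difflib.SequenceMatcher(
                None, question.lower(), cached_q.lower()).ratio()
            if ratio >= self.threshold:
                return answer
        return None

    def put(self, question: str, answer: str):
        self.entries.append((question, answer))

cache = SimilarityCache()
cache.put("How do I install GPTCache?", "pip install gptcache")
# A slightly different phrasing still hits the cache:
hit = cache.get("How do I install GPTCache")
```

The threshold is the knob the comment alludes to: set it too low and users get stale answers to genuinely different questions; too high and the hit rate (and the savings) disappear.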


At present, we have started applying GPTCache to online products, and according to existing data the results are good. In fact, some LLM products already have caching capabilities, such as GitHub Copilot. However, the cache each product requires varies, so the caching mechanism deserves careful design.


Maybe users could discuss the LLM output? The question asker could then have the LLM reproduce the results based on that discussion.

Of course, this involves many user-related issues.


People are already angry that LLMs ingested their knowledge and will use that knowledge to replace them. They're not going to do this without being paid.


Cool, I prefer the OpenAI-compatible API. Although this is not very technically difficult, it is really thoughtful, because it lets me freely use all the ChatGPT applications.


Thanks! Please try it out and share any feedback you might have.


Looks great! I'm planning to integrate it into my new project (to-chatgpt: https://github.com/SimFG/to-chatgpt), which will provide users of the ChatGPT applications with a wider range of LLM service options.


Looking forward to it!

OpenLLM is adding an OpenAI-compatible API layer, which will make it even easier to migrate LLM apps built around OpenAI's API spec. Feel free to join our Discord community to discuss more!
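The reason an OpenAI-compatible layer makes migration cheap is that an app only has to swap the base URL, not the request shape. A minimal sketch, where the localhost address and model name are assumptions, not real OpenLLM defaults:

```python
import json

OPENAI_BASE = "https://api.openai.com/v1"
LOCAL_BASE = "http://localhost:3000/v1"  # hypothetical local OpenAI-compatible server

def chat_request(base_url: str, model: str, user_message: str) -> tuple[str, str]:
    """Build the endpoint URL and JSON body for a chat completion.

    The body follows the OpenAI chat-completions request shape, which
    is exactly what a compatible server accepts unchanged."""
    url = f"{base_url}/chat/completions"
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
    })
    return url, body

# Same body builder, different endpoint: no application code changes.
url, body = chat_request(LOCAL_BASE, "llama-2-7b-chat", "Hello!")
```

This is why "not very technically difficult" still matters in practice: the compatibility surface is the request/response schema, and reusing it unlocks the whole ecosystem of existing ChatGPT clients.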


GPT-3.5 answer: If you have a limited budget or limited machine resources, there are several ways to obtain a good large-scale model experience:

1. Pretrained Models: You can use pre-trained models that have already been trained on large datasets. These models are available for free and can be fine-tuned on your specific task with limited computational resources.

2. Transfer Learning: Transfer learning can be used to leverage the knowledge of pre-trained models for your specific task. You can use a pre-trained model as a starting point and fine-tune it on your data to obtain good results.

3. Data Augmentation: Data augmentation techniques can be used to increase the size of your training data. This can help in improving the performance of your model without requiring additional computational resources.

4. Model Compression: Model compression techniques can be used to reduce the size of your model without losing much accuracy. This can help in improving the efficiency of your model and reducing computational resources.

5. Cloud Computing: Cloud computing services like AWS, Google Cloud, and Microsoft Azure provide access to high-performance computing resources at affordable prices. You can use these services to train your models on large datasets with limited computational resources.

Overall, with careful planning and thoughtful use of available resources, it is possible to obtain a good large-scale model experience even with limited budget or machine resources.
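As a toy illustration of point 4 above (model compression), the simplest form of post-training quantization maps float weights to 8-bit integers plus one scale factor, shrinking storage roughly 4x for a small rounding error; this is a minimal sketch, not any real toolkit's API:

```python
# Toy symmetric 8-bit quantization: store int8 values plus a single
# float scale instead of full float32 weights.

def quantize(weights: list[float]) -> tuple[list[int], float]:
    max_abs = max(abs(w) for w in weights) or 1.0
    scale = max_abs / 127.0            # map the largest weight to +/-127
    return [round(w / scale) for w in weights], scale

def dequantize(q: list[int], scale: float) -> list[float]:
    return [v * scale for v in q]

weights = [0.12, -0.5, 0.33, 0.9, -0.04]
q, scale = quantize(weights)
restored = dequantize(q, scale)        # close to the originals
```

Real frameworks quantize per-channel and calibrate on sample data, but the budget argument is the same: the reconstruction error is bounded by half the scale step, which is usually far below the model's noise floor.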


Sometimes the girlfriend may be busy; you need a GPTCache to ease the visiting load. :}


Maybe GPTCache can reduce the number of LLM requests and lower costs. Details: https://github.com/zilliztech/GPTCache

