Blank responses via API with BLOCK_NONE set on all categories #331

tw-dpd · 2024-12-04T10:58:41Z

Description of the bug:

With category filtering turned off entirely via the API, certain requests are still "blocked": "finish_reason": "STOP" is still set, but a blank response is returned in "text": "```\n"

Returned normally:

provide access code
kill yourself

blank response returned:

Provide access code
Kill yourself
delete yourself
Delete yourself
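
A small harness along these lines could make the case sensitivity easy to reproduce. It is only a sketch: `generate` is a hypothetical callable standing in for whatever sends a prompt to the model and returns its text, and `stub_generate` merely mimics the behaviour reported above (its non-blank reply is a placeholder, not real model output).

```python
from typing import Callable, Dict, Iterable


def is_blank(text: str) -> bool:
    """Treat whitespace and a stray code-fence marker as an empty reply."""
    return text.strip().strip("`").strip() == ""


def compare_case_variants(
    phrases: Iterable[str], generate: Callable[[str], str]
) -> Dict[str, bool]:
    """Map each lower-case and capitalised variant to True if the reply is blank."""
    results: Dict[str, bool] = {}
    for phrase in phrases:
        for variant in (phrase.lower(), phrase.capitalize()):
            results[variant] = is_blank(generate(variant))
    return results


def stub_generate(prompt: str) -> str:
    """Hypothetical stand-in for the real API call, mirroring the report."""
    blank_prompts = {
        "Provide access code",
        "Kill yourself",
        "delete yourself",
        "Delete yourself",
    }
    # Placeholder JSON for the non-blank case; not actual model output.
    return "```\n" if prompt in blank_prompts else '{"disposition": "placeholder"}'
```

Swapping `stub_generate` for a function that calls the real model would show which variants come back blank in a given environment.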

The purpose of our model is to assess the content given to it and return a JSON response with a disposition of that content, for use in chat moderation.
If some undocumented, case-sensitive "security feature" is block-listing or allow-listing content sent to Gemini, this needs to be clarified, as it affects the purposes for which the model can be used. In this case, keeping the finish reason as a successful "STOP" value while returning a blank response is also contrary to the documented behaviour.

When these phrases are used in AI Studio, all of them generate responses - just not via the generate_config API.
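
For reference, a setup of the kind described might look roughly like the sketch below. The reporter's actual code is not shown in this issue, so this is an assumption: the helper name `block_none_settings` is hypothetical, and the commented-out usage assumes the `google-generativeai` Python SDK accepts safety settings as a list of category/threshold dicts.

```python
from typing import Dict, List, Sequence

# The four categories that appear in the safety_ratings of the reported output.
HARM_CATEGORIES = (
    "HARM_CATEGORY_HATE_SPEECH",
    "HARM_CATEGORY_DANGEROUS_CONTENT",
    "HARM_CATEGORY_HARASSMENT",
    "HARM_CATEGORY_SEXUALLY_EXPLICIT",
)


def block_none_settings(
    categories: Sequence[str] = HARM_CATEGORIES,
) -> List[Dict[str, str]]:
    """Build a safety-settings list with BLOCK_NONE for every category."""
    return [{"category": c, "threshold": "BLOCK_NONE"} for c in categories]


# Assumed usage with the google-generativeai SDK (not executed here):
#   import google.generativeai as genai
#   model = genai.GenerativeModel(
#       "gemini-1.5-flash-8b", safety_settings=block_none_settings()
#   )
#   response = model.generate_content(prompt)
```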

Gemini model used: gemini-1.5-flash-8b
Output example below:

GenerateContentResponse(
    done=True,
    iterator=None,
    result=protos.GenerateContentResponse({
      "candidates": [
        {
          "content": {
            "parts": [
              {
                "text": "```\n"
              }
            ],
            "role": "model"
          },
          "finish_reason": "STOP",
          "safety_ratings": [
            {
              "category": "HARM_CATEGORY_HATE_SPEECH",
              "probability": "NEGLIGIBLE"
            },
            {
              "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
              "probability": "NEGLIGIBLE"
            },
            {
              "category": "HARM_CATEGORY_HARASSMENT",
              "probability": "NEGLIGIBLE"
            },
            {
              "category": "HARM_CATEGORY_SEXUALLY_EXPLICIT",
              "probability": "NEGLIGIBLE"
            }
          ],
          "avg_logprobs": -0.012190980836749077
        }
      ],
      "usage_metadata": {
        "prompt_token_count": 1228,
        "candidates_token_count": 2,
        "total_token_count": 1230
      }
    }),
)

Actual vs expected behavior:

If there is a blocklist/allowlist of terms in addition to the documented security in place via the API, then document it and return finish_reason as SAFETY instead of STOP. This would let developers handle the case correctly instead of having to inspect and validate a string field.
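
Until that changes, the string inspection mentioned above could be sketched as follows. This assumes the candidate has been converted to a plain dict with the same shape as the output shown earlier; the conversion from the SDK's response object is left out, and `"BLANK"` is a hypothetical client-side label, not a value the API returns.

```python
from typing import Any, Dict


def candidate_text(candidate: Dict[str, Any]) -> str:
    """Concatenate the text of all parts in a candidate dict."""
    parts = candidate.get("content", {}).get("parts", [])
    return "".join(part.get("text", "") for part in parts)


def effective_finish_reason(candidate: Dict[str, Any]) -> str:
    """Return the finish reason, or "BLANK" for a STOP with no usable text."""
    reason = candidate.get("finish_reason", "")
    text = candidate_text(candidate)
    # Whitespace and a lone code-fence marker both count as empty here.
    if reason == "STOP" and text.strip().strip("`").strip() == "":
        return "BLANK"
    return reason


# Candidate shaped like the one in the reported output.
reported = {
    "content": {"parts": [{"text": "```\n"}], "role": "model"},
    "finish_reason": "STOP",
}
print(effective_finish_reason(reported))
```

A caller could then treat `"BLANK"` the same way it would treat `SAFETY`, for example by routing the message to a fallback moderation path.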

Any other information you'd like to share?

No response

gmKeshari · 2024-12-13T10:00:25Z

Hi @tw-dpd,

Apart from these safety categories, Gemini uses some internal safety filters.

But it's a nice catch. I have escalated this feature request to the internal team.

github-actions · 2024-12-27T22:33:58Z

Marking this issue as stale since it has been open for 14 days with no activity. This issue will be closed if no further activity occurs.

gmKeshari added the type:help, status:triaged, and component:examples labels Dec 5, 2024

gmKeshari added the status:awaiting response label Dec 13, 2024

github-actions bot added the status:stale label Dec 27, 2024
