paging the query result with multiple heads

We have a microservice for relationship modeling between objects. A relation is defined between primary and secondary objects with cardinality constraints like 1-1, 1-N, N-N, etc.
The microservice provides API like Create relation, Find relations, Get secondaries, Get primaries, etc.

The query API "Get secondaries" takes a primary object and returns back all the related secondary objects. Since the related secondary object could be large, the results are paginated.

We had another microservice which was making good use of this relation microservice to work with relations. This consuming service accepted a similar pagination options like page index and number and passed the same to the relation service, and returned back the calling application the page results as obtained from the relation service. It was so far so good.

We recently identified that the consuming microservice was a bit chatty with the relation microservice as it had to call "Get secondaries" API multiple times given that there were multiple primary objects for which secondary objects had to be fetched.

So we thought to make the "Get Secondaries" API a bulk API by making it accept multiple primary objects as input. But then we got stuck with how the pagination would work.
The API would return related secondary objects for each primary but limit the secondary objects to the page size like earlier.

This seemed fine for the first call, but we are unsure how would this behave for the subsequent calls. If there were lesser number of secondary objects than the page size for one or more primary objects, what should be the input for subsequent calls. Do I need to pass those primary objects again?

This is where we are looking for suggestion on how to design this bulk API. Any input is welcome.

asked Nov 18 '18 at 13:06

tyrion

171214

Did you ever decide on a solution? If so, please feel free to add your own answer explaining what you did so that others can learn from this too.

– Matthew Pope
Dec 10 '18 at 23:56

add a comment |

The query API "Get secondaries" takes a primary object and returns back all the related secondary objects. Since the related secondary object could be large, the results are paginated.

This is where we are looking for suggestion on how to design this bulk API. Any input is welcome.

asked Nov 18 '18 at 13:06

tyrion

171214

Did you ever decide on a solution? If so, please feel free to add your own answer explaining what you did so that others can learn from this too.

– Matthew Pope
Dec 10 '18 at 23:56

add a comment |

The query API "Get secondaries" takes a primary object and returns back all the related secondary objects. Since the related secondary object could be large, the results are paginated.

This is where we are looking for suggestion on how to design this bulk API. Any input is welcome.

asked Nov 18 '18 at 13:06

tyrion

171214

The query API "Get secondaries" takes a primary object and returns back all the related secondary objects. Since the related secondary object could be large, the results are paginated.

This is where we are looking for suggestion on how to design this bulk API. Any input is welcome.

algorithm pagination microservices relation

asked Nov 18 '18 at 13:06

tyrion

171214

asked Nov 18 '18 at 13:06

tyrion

171214

asked Nov 18 '18 at 13:06

tyrion

171214

asked Nov 18 '18 at 13:06

tyrion

171214

asked Nov 18 '18 at 13:06

tyrion

171214

Did you ever decide on a solution? If so, please feel free to add your own answer explaining what you did so that others can learn from this too.

– Matthew Pope
Dec 10 '18 at 23:56

add a comment |

Did you ever decide on a solution? If so, please feel free to add your own answer explaining what you did so that others can learn from this too.

– Matthew Pope
Dec 10 '18 at 23:56

Did you ever decide on a solution? If so, please feel free to add your own answer explaining what you did so that others can learn from this too.

– Matthew Pope
Dec 10 '18 at 23:56

add a comment |

1 Answer
1

active

oldest

votes

Basically, you should have some way to ensure that the relationship service knows what the original query was when receiving a paginated request.

A simple and maintainable way for your relationship service to handle this is to preprocess the request by sorting the requested primary objects in some way (ie. sort alphabetically by Id), and then simply iterating through the primary objects, adding secondary objects to the response, until the response is full.

The simplest thing for clients to do is to always use the same batch request and just add an index number or page token to the request.

I'd recommend a page token that mentions the last seen item (for example, lastSeen=primaryId,secondaryId (which you should obfuscate in some way to avoid a leaky abstraction)). Then, the service can look at the original request, and know where to resume iterating through all of the primary objects.

Alternately, you can encode enough information into a page token so that you can reconstruct whatever you need from the original request. This allows you to make some adjustments to the query on subsequent requests. (For example, if the client requests primaries A-Z, and you return secondary objects A1 - J5 in the first response, then you could modify the request to be J-Z; already seen J5, encode it so that you aren't leaking your implementation details, and return it to the client as the page token.) Then, instead of responding with the original request + page number, the client simply responds with the page token.

Either way, clients of the relationship service should never have to "figure out" what the request for the next page should be. The pagination should only require the consumer to increment a number or respond with a page token that was given to it by the relationship service.

Another consideration is the database that you are using. For example, in DynamoDB, the way to get the 100th item for a query like select * from secondaries where primaryId='ABC' requires you to read all items up to the 100th item. If you have a NoSQL database, or if you think you might move to a NoSQL database at some point in the future, you might find that a page token makes it much simpler to maintain where you are in the result set (as compared to an index number).

I found this article to be very helpful when I was learning about pagination myself, and I'd recommend reading it. It primarily deals with pagination concerns for UIs, but the fundamentals are the same.

TLDR: Don't make the consumer do any work. The consumer should repeat the original request with an added index number or page token, or the consumer should send a request containing only a page token.

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53361185%2fpaging-the-query-result-with-multiple-heads%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

Basically, you should have some way to ensure that the relationship service knows what the original query was when receiving a paginated request.

The simplest thing for clients to do is to always use the same batch request and just add an index number or page token to the request.

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

add a comment |

Basically, you should have some way to ensure that the relationship service knows what the original query was when receiving a paginated request.

The simplest thing for clients to do is to always use the same batch request and just add an index number or page token to the request.

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

add a comment |

Basically, you should have some way to ensure that the relationship service knows what the original query was when receiving a paginated request.

The simplest thing for clients to do is to always use the same batch request and just add an index number or page token to the request.

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

Basically, you should have some way to ensure that the relationship service knows what the original query was when receiving a paginated request.

The simplest thing for clients to do is to always use the same batch request and just add an index number or page token to the request.

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

answered Nov 24 '18 at 1:50

Matthew Pope

1,6741712

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

EJPvIix

搜尋此網誌

Ytukyg